Document (#36662)

Becks, D.
Schulz, J.M.
Domänenübergreifende Phrasenextraktion mithilfe einer lexikonunabhängigen Analysekomponente
Information und Wissen: global, sozial und frei? Proceedings des 12. Internationalen Symposiums für Informationswissenschaft (ISI 2011); Hildesheim, 9-11 March 2011. Eds.: J. Griesbaum, T. Mandl and C. Womser-Hacker
Boizenburg : VWH, Verl. W. Hülsbusch
Schriften zur Informationswissenschaft; Bd.58
This article describes a novel cross-domain approach to phrase extraction that can be implemented with little effort and without complex lexicons, and that can be transferred to other domains. The approach is tested on customer reviews and patent documents.

Similar documents (author)

  1. Schulz, H.: Zur Charakterisierung der BBK/A (1988) 4.66
    4.6581078 = sum of:
      4.6581078 = weight(author_txt:schulz in 90) [ClassicSimilarity], result of:
        4.6581078 = fieldWeight in 90, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4529724 = idf(docFreq=69, maxDocs=44421)
          0.625 = fieldNorm(doc=90)
  2. Schulz, U.: Was ist eine sinnvolle Schlagwortsyntax (eine Polemik) (1991) 4.66
    4.6581078 = sum of:
      4.6581078 = weight(author_txt:schulz in 129) [ClassicSimilarity], result of:
        4.6581078 = fieldWeight in 129, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4529724 = idf(docFreq=69, maxDocs=44421)
          0.625 = fieldNorm(doc=129)
  3. Schulz, U.: Einführung in die Grundlagen der inhaltlichen Erschließung mit BISMAS am Fachbereich BID der FHS Hannover (1991) 4.66
    4.6581078 = sum of:
      4.6581078 = weight(author_txt:schulz in 457) [ClassicSimilarity], result of:
        4.6581078 = fieldWeight in 457, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4529724 = idf(docFreq=69, maxDocs=44421)
          0.625 = fieldNorm(doc=457)
  4. Schulz, U.: Die niederländische Basisklassifikation: eine Alternative für die "Sachgruppen" im Fremddatenangebot der Deutschen Bibliothek (1991) 4.66
    4.6581078 = sum of:
      4.6581078 = weight(author_txt:schulz in 948) [ClassicSimilarity], result of:
        4.6581078 = fieldWeight in 948, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4529724 = idf(docFreq=69, maxDocs=44421)
          0.625 = fieldNorm(doc=948)
  5. Schulz, H.: Die Adaption der BBK : Ergänzungen zur Methodik (1983) 4.66
    4.6581078 = sum of:
      4.6581078 = weight(author_txt:schulz in 1124) [ClassicSimilarity], result of:
        4.6581078 = fieldWeight in 1124, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4529724 = idf(docFreq=69, maxDocs=44421)
          0.625 = fieldNorm(doc=1124)
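
The author scores above all follow the same pattern: fieldWeight = tf * idf * fieldNorm. The sketch below reproduces that arithmetic in Python, assuming the usual ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are illustrative and not part of any library; the fieldNorm value is copied from the listing rather than recomputed, since it is a lossily encoded length norm.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, assuming 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term frequency factor, assuming sqrt(freq)."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain trees above."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the author_txt:schulz entries (identical for all five hits).
author_idf = classic_idf(doc_freq=69, max_docs=44421)
author_score = field_weight(freq=1.0, doc_freq=69, max_docs=44421, field_norm=0.625)

print(f"idf   = {author_idf:.4f}")
print(f"score = {author_score:.4f}")
```

Because every hit matches the single term "schulz" once and has the same fieldNorm, all five documents receive the same score; small differences in the last digits against the listing would be expected, since the listed values appear to be single-precision floats.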

Similar documents (content)

  1. Hafner, R.; Schelling, B.: Automatisierung der Sacherschließung mit Semantic-Web-Technologie (2015) 0.20
    0.19545177 = sum of:
      0.19545177 = product of:
        0.6107868 = sum of:
          0.020710977 = weight(abstract_txt:wird in 3471) [ClassicSimilarity], result of:
            0.020710977 = score(doc=3471,freq=1.0), product of:
              0.05019149 = queryWeight, product of:
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.013303861 = queryNorm
              0.4126392 = fieldWeight in 3471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.109375 = fieldNorm(doc=3471)
          0.022574756 = weight(abstract_txt:einer in 3471) [ClassicSimilarity], result of:
            0.022574756 = score(doc=3471,freq=1.0), product of:
              0.0531592 = queryWeight, product of:
                1.0291393 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.013303861 = queryNorm
              0.42466322 = fieldWeight in 3471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.109375 = fieldNorm(doc=3471)
          0.02988237 = weight(abstract_txt:einen in 3471) [ClassicSimilarity], result of:
            0.02988237 = score(doc=3471,freq=1.0), product of:
              0.064087465 = queryWeight, product of:
                1.1299819 = boost
                4.263084 = idf(docFreq=1699, maxDocs=44421)
                0.013303861 = queryNorm
              0.4662748 = fieldWeight in 3471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.263084 = idf(docFreq=1699, maxDocs=44421)
                0.109375 = fieldNorm(doc=3471)
          0.068351686 = weight(abstract_txt:artikel in 3471) [ClassicSimilarity], result of:
            0.068351686 = score(doc=3471,freq=1.0), product of:
              0.11125747 = queryWeight, product of:
                1.4888452 = boost
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.013303861 = queryNorm
              0.6143559 = fieldWeight in 3471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.109375 = fieldNorm(doc=3471)
          0.098841526 = weight(abstract_txt:andere in 3471) [ClassicSimilarity], result of:
            0.098841526 = score(doc=3471,freq=2.0), product of:
              0.11292221 = queryWeight, product of:
                1.4999425 = boost
                5.658835 = idf(docFreq=420, maxDocs=44421)
                0.013303861 = queryNorm
              0.87530637 = fieldWeight in 3471, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.658835 = idf(docFreq=420, maxDocs=44421)
                0.109375 = fieldNorm(doc=3471)
          0.0852187 = weight(abstract_txt:vorliegende in 3471) [ClassicSimilarity], result of:
            0.0852187 = score(doc=3471,freq=1.0), product of:
              0.12888023 = queryWeight, product of:
                1.6024264 = boost
                6.045476 = idf(docFreq=285, maxDocs=44421)
                0.013303861 = queryNorm
              0.66122395 = fieldWeight in 3471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.045476 = idf(docFreq=285, maxDocs=44421)
                0.109375 = fieldNorm(doc=3471)
          0.10149483 = weight(abstract_txt:ansatz in 3471) [ClassicSimilarity], result of:
            0.10149483 = score(doc=3471,freq=1.0), product of:
              0.14480792 = queryWeight, product of:
                1.6985608 = boost
                6.4081626 = idf(docFreq=198, maxDocs=44421)
                0.013303861 = queryNorm
              0.7008928 = fieldWeight in 3471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4081626 = idf(docFreq=198, maxDocs=44421)
                0.109375 = fieldNorm(doc=3471)
          0.183712 = weight(abstract_txt:mithilfe in 3471) [ClassicSimilarity], result of:
            0.183712 = score(doc=3471,freq=1.0), product of:
              0.21507408 = queryWeight, product of:
                2.0700412 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.013303861 = queryNorm
              0.85418016 = fieldWeight in 3471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.109375 = fieldNorm(doc=3471)
        0.32 = coord(8/25)
  2. Lewandowski, D.; Krewinkel, A.; Gleissner, M.; Osterode, D.; Tolg, B.; Holle, M.; Sünkler, S.: Entwicklung und Anwendung einer Software zur automatisierten Kontrolle des Lebensmittelmarktes im Internet mit informationswissenschaftlichen Methoden (2019) 0.18
    0.1788976 = sum of:
      0.1788976 = product of:
        0.55905503 = sum of:
          0.014793555 = weight(abstract_txt:wird in 25) [ClassicSimilarity], result of:
            0.014793555 = score(doc=25,freq=1.0), product of:
              0.05019149 = queryWeight, product of:
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.013303861 = queryNorm
              0.2947423 = fieldWeight in 25, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.078125 = fieldNorm(doc=25)
          0.016124826 = weight(abstract_txt:einer in 25) [ClassicSimilarity], result of:
            0.016124826 = score(doc=25,freq=1.0), product of:
              0.0531592 = queryWeight, product of:
                1.0291393 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.013303861 = queryNorm
              0.30333087 = fieldWeight in 25, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.078125 = fieldNorm(doc=25)
          0.02134455 = weight(abstract_txt:einen in 25) [ClassicSimilarity], result of:
            0.02134455 = score(doc=25,freq=1.0), product of:
              0.064087465 = queryWeight, product of:
                1.1299819 = boost
                4.263084 = idf(docFreq=1699, maxDocs=44421)
                0.013303861 = queryNorm
              0.33305344 = fieldWeight in 25, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.263084 = idf(docFreq=1699, maxDocs=44421)
                0.078125 = fieldNorm(doc=25)
          0.04882263 = weight(abstract_txt:artikel in 25) [ClassicSimilarity], result of:
            0.04882263 = score(doc=25,freq=1.0), product of:
              0.11125747 = queryWeight, product of:
                1.4888452 = boost
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.013303861 = queryNorm
              0.43882564 = fieldWeight in 25, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.078125 = fieldNorm(doc=25)
          0.06899541 = weight(abstract_txt:lässt in 25) [ClassicSimilarity], result of:
            0.06899541 = score(doc=25,freq=1.0), product of:
              0.14010766 = queryWeight, product of:
                1.670767 = boost
                6.3033047 = idf(docFreq=220, maxDocs=44421)
                0.013303861 = queryNorm
              0.49244568 = fieldWeight in 25, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3033047 = idf(docFreq=220, maxDocs=44421)
                0.078125 = fieldNorm(doc=25)
          0.0956597 = weight(abstract_txt:aufwand in 25) [ClassicSimilarity], result of:
            0.0956597 = score(doc=25,freq=1.0), product of:
              0.17420787 = queryWeight, product of:
                1.8630255 = boost
                7.028639 = idf(docFreq=106, maxDocs=44421)
                0.013303861 = queryNorm
              0.54911244 = fieldWeight in 25, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.028639 = idf(docFreq=106, maxDocs=44421)
                0.078125 = fieldNorm(doc=25)
          0.10486046 = weight(abstract_txt:komplexe in 25) [ClassicSimilarity], result of:
            0.10486046 = score(doc=25,freq=1.0), product of:
              0.1852065 = queryWeight, product of:
                1.9209367 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.013303861 = queryNorm
              0.5661813 = fieldWeight in 25, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.078125 = fieldNorm(doc=25)
          0.1884539 = weight(abstract_txt:geringem in 25) [ClassicSimilarity], result of:
            0.1884539 = score(doc=25,freq=1.0), product of:
              0.27376956 = queryWeight, product of:
                2.3354874 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.013303861 = queryNorm
              0.6883669 = fieldWeight in 25, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.078125 = fieldNorm(doc=25)
        0.32 = coord(8/25)
  3. Witschel, H.F.: Text, Wörter, Morpheme : Möglichkeiten einer automatischen Terminologie-Extraktion (2004) 0.12
    0.12469658 = sum of:
      0.12469658 = product of:
        0.5195691 = sum of:
          0.012899861 = weight(abstract_txt:einer in 1126) [ClassicSimilarity], result of:
            0.012899861 = score(doc=1126,freq=1.0), product of:
              0.0531592 = queryWeight, product of:
                1.0291393 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.013303861 = queryNorm
              0.2426647 = fieldWeight in 1126, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.0625 = fieldNorm(doc=1126)
          0.04060579 = weight(abstract_txt:anhand in 1126) [ClassicSimilarity], result of:
            0.04060579 = score(doc=1126,freq=1.0), product of:
              0.114177465 = queryWeight, product of:
                1.5082563 = boost
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.013303861 = queryNorm
              0.35563752 = fieldWeight in 1126, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.0625 = fieldNorm(doc=1126)
          0.048696395 = weight(abstract_txt:vorliegende in 1126) [ClassicSimilarity], result of:
            0.048696395 = score(doc=1126,freq=1.0), product of:
              0.12888023 = queryWeight, product of:
                1.6024264 = boost
                6.045476 = idf(docFreq=285, maxDocs=44421)
                0.013303861 = queryNorm
              0.37784225 = fieldWeight in 1126, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.045476 = idf(docFreq=285, maxDocs=44421)
                0.0625 = fieldNorm(doc=1126)
          0.057997044 = weight(abstract_txt:ansatz in 1126) [ClassicSimilarity], result of:
            0.057997044 = score(doc=1126,freq=1.0), product of:
              0.14480792 = queryWeight, product of:
                1.6985608 = boost
                6.4081626 = idf(docFreq=198, maxDocs=44421)
                0.013303861 = queryNorm
              0.40051016 = fieldWeight in 1126, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4081626 = idf(docFreq=198, maxDocs=44421)
                0.0625 = fieldNorm(doc=1126)
          0.20565377 = weight(abstract_txt:extraktion in 1126) [ClassicSimilarity], result of:
            0.20565377 = score(doc=1126,freq=2.0), product of:
              0.2672614 = queryWeight, product of:
                2.3075602 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.013303861 = queryNorm
              0.76948553 = fieldWeight in 1126, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.0625 = fieldNorm(doc=1126)
          0.15371624 = weight(abstract_txt:domänen in 1126) [ClassicSimilarity], result of:
            0.15371624 = score(doc=1126,freq=1.0), product of:
              0.27733302 = queryWeight, product of:
                2.350638 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.013303861 = queryNorm
              0.5542659 = fieldWeight in 1126, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.0625 = fieldNorm(doc=1126)
        0.24 = coord(6/25)
  4. Jersek, T.: Automatische DDC-Klassifizierung mit Lingo : Vorgehensweise und Ergebnisse (2012) 0.12
    0.12454885 = sum of:
      0.12454885 = product of:
        0.51895356 = sum of:
          0.029289747 = weight(abstract_txt:wird in 1122) [ClassicSimilarity], result of:
            0.029289747 = score(doc=1122,freq=2.0), product of:
              0.05019149 = queryWeight, product of:
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.013303861 = queryNorm
              0.58356 = fieldWeight in 1122, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7727013 = idf(docFreq=2775, maxDocs=44421)
                0.109375 = fieldNorm(doc=1122)
          0.031925526 = weight(abstract_txt:einer in 1122) [ClassicSimilarity], result of:
            0.031925526 = score(doc=1122,freq=2.0), product of:
              0.0531592 = queryWeight, product of:
                1.0291393 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.013303861 = queryNorm
              0.6005645 = fieldWeight in 1122, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.109375 = fieldNorm(doc=1122)
          0.060942475 = weight(abstract_txt:dies in 1122) [ClassicSimilarity], result of:
            0.060942475 = score(doc=1122,freq=1.0), product of:
              0.103064656 = queryWeight, product of:
                1.432979 = boost
                5.406202 = idf(docFreq=541, maxDocs=44421)
                0.013303861 = queryNorm
              0.59130335 = fieldWeight in 1122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.406202 = idf(docFreq=541, maxDocs=44421)
                0.109375 = fieldNorm(doc=1122)
          0.100494206 = weight(abstract_txt:anhand in 1122) [ClassicSimilarity], result of:
            0.100494206 = score(doc=1122,freq=2.0), product of:
              0.114177465 = queryWeight, product of:
                1.5082563 = boost
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.013303861 = queryNorm
              0.88015795 = fieldWeight in 1122, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6902003 = idf(docFreq=407, maxDocs=44421)
                0.109375 = fieldNorm(doc=1122)
          0.10149483 = weight(abstract_txt:ansatz in 1122) [ClassicSimilarity], result of:
            0.10149483 = score(doc=1122,freq=1.0), product of:
              0.14480792 = queryWeight, product of:
                1.6985608 = boost
                6.4081626 = idf(docFreq=198, maxDocs=44421)
                0.013303861 = queryNorm
              0.7008928 = fieldWeight in 1122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4081626 = idf(docFreq=198, maxDocs=44421)
                0.109375 = fieldNorm(doc=1122)
          0.1948068 = weight(abstract_txt:getestet in 1122) [ClassicSimilarity], result of:
            0.1948068 = score(doc=1122,freq=1.0), product of:
              0.2236484 = queryWeight, product of:
                2.1109009 = boost
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.013303861 = queryNorm
              0.8710404 = fieldWeight in 1122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.109375 = fieldNorm(doc=1122)
        0.24 = coord(6/25)
  5. Behnert, C.; Plassmeier, K.; Borst, T.; Lewandowski, D.: Evaluierung von Rankingverfahren für bibliothekarische Informationssysteme (2019) 0.11
    0.11420158 = sum of:
      0.11420158 = product of:
        0.5710079 = sum of:
          0.027364738 = weight(abstract_txt:einer in 23) [ClassicSimilarity], result of:
            0.027364738 = score(doc=23,freq=2.0), product of:
              0.0531592 = queryWeight, product of:
                1.0291393 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.013303861 = queryNorm
              0.51476955 = fieldWeight in 23, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.09375 = fieldNorm(doc=23)
          0.07990846 = weight(abstract_txt:beschreibt in 23) [ClassicSimilarity], result of:
            0.07990846 = score(doc=23,freq=1.0), product of:
              0.13683255 = queryWeight, product of:
                1.6511239 = boost
                6.229197 = idf(docFreq=237, maxDocs=44421)
                0.013303861 = queryNorm
              0.58398724 = fieldWeight in 23, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.229197 = idf(docFreq=237, maxDocs=44421)
                0.09375 = fieldNorm(doc=23)
          0.13929002 = weight(abstract_txt:übertragen in 23) [ClassicSimilarity], result of:
            0.13929002 = score(doc=23,freq=1.0), product of:
              0.19818658 = queryWeight, product of:
                1.9871107 = boost
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.013303861 = queryNorm
              0.7028227 = fieldWeight in 23, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.09375 = fieldNorm(doc=23)
          0.15746744 = weight(abstract_txt:mithilfe in 23) [ClassicSimilarity], result of:
            0.15746744 = score(doc=23,freq=1.0), product of:
              0.21507408 = queryWeight, product of:
                2.0700412 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.013303861 = queryNorm
              0.7321544 = fieldWeight in 23, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.09375 = fieldNorm(doc=23)
          0.16697724 = weight(abstract_txt:getestet in 23) [ClassicSimilarity], result of:
            0.16697724 = score(doc=23,freq=1.0), product of:
              0.2236484 = queryWeight, product of:
                2.1109009 = boost
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.013303861 = queryNorm
              0.74660605 = fieldWeight in 23, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.09375 = fieldNorm(doc=23)
        0.2 = coord(5/25)
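
The content scores combine several matched terms. Each term contributes queryWeight * fieldWeight, where queryWeight = boost * idf * queryNorm, and the sum is scaled by coord (matched terms divided by the 25 query terms). The following sketch recomputes the score of hit 5 (doc=23) from the values in its explain tree, again assuming the standard ClassicSimilarity formulas for tf and idf. The helper names are hypothetical; boost, queryNorm and fieldNorm are copied from the listing.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, assuming 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, field_norm, query_norm, boost=1.0):
    """Score of one matched term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm          # query-side factor
    field_weight = math.sqrt(freq) * idf * field_norm  # document-side factor
    return query_weight * field_weight


MAX_DOCS = 44421
QUERY_NORM = 0.013303861
FIELD_NORM = 0.09375      # fieldNorm(doc=23), copied from the listing
QUERY_TERM_COUNT = 25     # denominator of coord(5/25)

# term -> (freq in doc 23, docFreq, boost), all read off the explain tree
matched_terms = {
    "einer": (2.0, 2486, 1.0291393),
    "beschreibt": (1.0, 237, 1.6511239),
    "übertragen": (1.0, 66, 1.9871107),
    "mithilfe": (1.0, 48, 2.0700412),
    "getestet": (1.0, 41, 2.1109009),
}

matched_sum = sum(
    term_score(freq, doc_freq, MAX_DOCS, FIELD_NORM, QUERY_NORM, boost)
    for freq, doc_freq, boost in matched_terms.values()
)
coord = len(matched_terms) / QUERY_TERM_COUNT
total = matched_sum * coord

print(f"sum = {matched_sum:.5f}, coord = {coord}, total = {total:.5f}")
```

The coord factor explains why documents that match rare terms strongly can still rank low: hit 5 has a larger raw sum than hits 3 and 4 but matches only 5 of the 25 query terms, so its final score is smaller.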