Document (#20529)

Author
Ruge, G.
Goeser, S.
Title
Information Retrieval ohne Linguistik
Source
nfd Information - Wissenschaft und Praxis. 49(1998) H.6, S.361-369
Year
1998
Abstract
Natürlicherweise sollte man erwarten, daß linguistische Textanalyseverfahren die Effektivität und Benutzerfreundlichkeit von Information Retrieval Systemen verbessern, da sowohl Dokumente als auch Suchanfragen die interessierenden Inhalte linguistisch enkodieren. Ein Retrievalabgleich auf der Ebene der linguistischen Inhaltsdarstellung müßte demzufolge zu besseren Retrievalsystemen führen als ein Abgleich auf Wort- oder gar Zeichenebene. Tatsächlich aber ist immer noch weitgehend unklar, inwieweit linguistische Textanalyseverfahren Retrievalsysteme verbessern können. Evaluationen von Retrievalsystemen mit linguistischen Komponenten führen nach wie vor zu unterschiedlichen, teils gegenläufigen Ergebnissen, obwohl die dazu erforderliche Computerlinguistik große Fortschritte gemacht hat. Wir gehen der Frage nach, wie es zu diesen kontraintuitiven Ergenissen kommt. Dazu wird der Stand der Kunst im linguistischen IR zusammengefaßt, so daß die Ergebnisse anhand des Vergleich verschiedener Evaluierungen diskutiert werden können.
Footnote
Vgl. auch die Erwiderung: Ladewig, C.: 'Information Retrieval ohne Linguistik?' in: nfd 49(1998) H.8, S.476-478
Theme
Computerlinguistik

Similar documents (author)

  1. Ruge, G.: Experiments on linguistically-based term associations (1992) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:ruge in 1809) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 1809, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=1809)
    
  2. Ruge, G.: ¬A spreading activation network for automatic generation of thesaurus relationships (1991) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:ruge in 4505) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 4505, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=4505)
    
  3. Ruge, G.: Sprache und Computer : Wortbedeutung und Termassoziation. Methoden zur automatischen semantischen Klassifikation (1995) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:ruge in 2534) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 2534, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=2534)
    
  4. Ruge, G.; Schwarz, C.: Term association and computational linguistics (1991) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:ruge in 2309) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 2309, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=2309)
    
  5. Ruge, G.; Schwarz, C.: Linguistically based term associations : a new semantic component for a hyperterm system (1990) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:ruge in 5543) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 5543, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=5543)
    

Similar documents (content)

  1. Bachfeld, S.: Möglichkeiten und Grenzen linguistischer Verfahren der automatischen Indexierung : Entwurf einer Simulation für den Einsatz im Grundstudium (2003) 0.12
    0.115949534 = sum of:
      0.115949534 = product of:
        0.5797477 = sum of:
          0.019166507 = weight(abstract_txt:nach in 3827) [ClassicSimilarity], result of:
            0.019166507 = score(doc=3827,freq=1.0), product of:
              0.08062221 = queryWeight, product of:
                1.1429039 = boost
                4.3471055 = idf(docFreq=1562, maxDocs=44421)
                0.016227245 = queryNorm
              0.23773234 = fieldWeight in 3827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3471055 = idf(docFreq=1562, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3827)
          0.020542027 = weight(abstract_txt:können in 3827) [ClassicSimilarity], result of:
            0.020542027 = score(doc=3827,freq=1.0), product of:
              0.08443483 = queryWeight, product of:
                1.1696157 = boost
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.016227245 = queryNorm
              0.24328856 = fieldWeight in 3827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3827)
          0.06499474 = weight(abstract_txt:führen in 3827) [ClassicSimilarity], result of:
            0.06499474 = score(doc=3827,freq=1.0), product of:
              0.18197492 = queryWeight, product of:
                1.7170706 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.016227245 = queryNorm
              0.35716316 = fieldWeight in 3827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3827)
          0.1539426 = weight(abstract_txt:linguistische in 3827) [ClassicSimilarity], result of:
            0.1539426 = score(doc=3827,freq=1.0), product of:
              0.32334435 = queryWeight, product of:
                2.288838 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.016227245 = queryNorm
              0.4760949 = fieldWeight in 3827, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3827)
          0.32110178 = weight(abstract_txt:linguistischen in 3827) [ClassicSimilarity], result of:
            0.32110178 = score(doc=3827,freq=2.0), product of:
              0.47959536 = queryWeight, product of:
                3.4140158 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.016227245 = queryNorm
              0.66952646 = fieldWeight in 3827, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3827)
        0.2 = coord(5/25)
    
  2. Fuhr, N.: Zur Überwindung der Diskrepanz zwischen Retrievalforschung und -praxis (1990) 0.11
    0.1103557 = sum of:
      0.1103557 = product of:
        0.5517785 = sum of:
          0.0937991 = weight(abstract_txt:besseren in 6624) [ClassicSimilarity], result of:
            0.0937991 = score(doc=6624,freq=1.0), product of:
              0.12877347 = queryWeight, product of:
                1.0213641 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.016227245 = queryNorm
              0.7284039 = fieldWeight in 6624, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.09375 = fieldNorm(doc=6624)
          0.03285687 = weight(abstract_txt:nach in 6624) [ClassicSimilarity], result of:
            0.03285687 = score(doc=6624,freq=1.0), product of:
              0.08062221 = queryWeight, product of:
                1.1429039 = boost
                4.3471055 = idf(docFreq=1562, maxDocs=44421)
                0.016227245 = queryNorm
              0.40754116 = fieldWeight in 6624, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3471055 = idf(docFreq=1562, maxDocs=44421)
                0.09375 = fieldNorm(doc=6624)
          0.049801394 = weight(abstract_txt:können in 6624) [ClassicSimilarity], result of:
            0.049801394 = score(doc=6624,freq=2.0), product of:
              0.08443483 = queryWeight, product of:
                1.1696157 = boost
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.016227245 = queryNorm
              0.5898205 = fieldWeight in 6624, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.09375 = fieldNorm(doc=6624)
          0.11141955 = weight(abstract_txt:führen in 6624) [ClassicSimilarity], result of:
            0.11141955 = score(doc=6624,freq=1.0), product of:
              0.18197492 = queryWeight, product of:
                1.7170706 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.016227245 = queryNorm
              0.6122797 = fieldWeight in 6624, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.09375 = fieldNorm(doc=6624)
          0.2639016 = weight(abstract_txt:linguistische in 6624) [ClassicSimilarity], result of:
            0.2639016 = score(doc=6624,freq=1.0), product of:
              0.32334435 = queryWeight, product of:
                2.288838 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.016227245 = queryNorm
              0.8161627 = fieldWeight in 6624, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.09375 = fieldNorm(doc=6624)
        0.2 = coord(5/25)
    
  3. Sprachtechnologie, mobile Kommunikation und linguistische Ressourcen : Beiträge zur GLDV Tagung 2005 in Bonn (2005) 0.09
    0.091746636 = sum of:
      0.091746636 = product of:
        0.76455534 = sum of:
          0.11141955 = weight(abstract_txt:führen in 4578) [ClassicSimilarity], result of:
            0.11141955 = score(doc=4578,freq=1.0), product of:
              0.18197492 = queryWeight, product of:
                1.7170706 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.016227245 = queryNorm
              0.6122797 = fieldWeight in 4578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.09375 = fieldNorm(doc=4578)
          0.2639016 = weight(abstract_txt:linguistische in 4578) [ClassicSimilarity], result of:
            0.2639016 = score(doc=4578,freq=1.0), product of:
              0.32334435 = queryWeight, product of:
                2.288838 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.016227245 = queryNorm
              0.8161627 = fieldWeight in 4578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.09375 = fieldNorm(doc=4578)
          0.38923416 = weight(abstract_txt:linguistischen in 4578) [ClassicSimilarity], result of:
            0.38923416 = score(doc=4578,freq=1.0), product of:
              0.47959536 = queryWeight, product of:
                3.4140158 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.016227245 = queryNorm
              0.81158864 = fieldWeight in 4578, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.09375 = fieldNorm(doc=4578)
        0.12 = coord(3/25)
    
  4. Luckhardt, H.-D.: Computerlinguistik und Informationswissenschaft : Facetten des wissenschaftlichen Wirkens von Harald H. Zimmermann (2006) 0.09
    0.08584557 = sum of:
      0.08584557 = product of:
        0.7153797 = sum of:
          0.116801344 = weight(abstract_txt:linguistik in 79) [ClassicSimilarity], result of:
            0.116801344 = score(doc=79,freq=1.0), product of:
              0.13449144 = queryWeight, product of:
                1.0437938 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.016227245 = queryNorm
              0.86846673 = fieldWeight in 79, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.109375 = fieldNorm(doc=79)
          0.14447182 = weight(abstract_txt:computerlinguistik in 79) [ClassicSimilarity], result of:
            0.14447182 = score(doc=79,freq=1.0), product of:
              0.15497139 = queryWeight, product of:
                1.1204517 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.016227245 = queryNorm
              0.93224835 = fieldWeight in 79, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.109375 = fieldNorm(doc=79)
          0.4541065 = weight(abstract_txt:linguistischen in 79) [ClassicSimilarity], result of:
            0.4541065 = score(doc=79,freq=1.0), product of:
              0.47959536 = queryWeight, product of:
                3.4140158 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.016227245 = queryNorm
              0.9468534 = fieldWeight in 79, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.109375 = fieldNorm(doc=79)
        0.12 = coord(3/25)
    
  5. Maschinelle Sprachsynthese (1996) 0.07
    0.071845464 = sum of:
      0.071845464 = product of:
        0.5987122 = sum of:
          0.1062727 = weight(abstract_txt:fortschritte in 5940) [ClassicSimilarity], result of:
            0.1062727 = score(doc=5940,freq=1.0), product of:
              0.1262827 = queryWeight, product of:
                1.0114381 = boost
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.016227245 = queryNorm
              0.84154594 = fieldWeight in 5940, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.109375 = fieldNorm(doc=5940)
          0.038333014 = weight(abstract_txt:nach in 5940) [ClassicSimilarity], result of:
            0.038333014 = score(doc=5940,freq=1.0), product of:
              0.08062221 = queryWeight, product of:
                1.1429039 = boost
                4.3471055 = idf(docFreq=1562, maxDocs=44421)
                0.016227245 = queryNorm
              0.47546467 = fieldWeight in 5940, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3471055 = idf(docFreq=1562, maxDocs=44421)
                0.109375 = fieldNorm(doc=5940)
          0.4541065 = weight(abstract_txt:linguistischen in 5940) [ClassicSimilarity], result of:
            0.4541065 = score(doc=5940,freq=1.0), product of:
              0.47959536 = queryWeight, product of:
                3.4140158 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.016227245 = queryNorm
              0.9468534 = fieldWeight in 5940, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.109375 = fieldNorm(doc=5940)
        0.12 = coord(3/25)