Document (#28651)

Author
Salton, G.
Title
Automatic processing of foreign language documents
Source
Theory of subject analysis: a sourcebook. Ed.: L.M. Chan, et al
Imprint
Littleton, CO : Libraries Unlimited
Year
1985
Pages
S.340-355
Abstract
The attempt to computerize a process, such as indexing, abstracting, classifying, or retrieving information, begins with an analysis of the process into its intellectual and nonintellectual components. That part of the process which is amenable to computerization is mechanical or algorithmic. What is not is intellectual or creative and requires human intervention. Gerard Salton has been an innovator, experimenter, and promoter in the area of mechanized information systems since the early 1960s. He has been particularly ingenious at analyzing the process of information retrieval into its algorithmic components. He received a doctorate in applied mathematics from Harvard University before moving to the computer science department at Cornell, where he developed a prototype automatic retrieval system called SMART. Working with this system he and his students contributed for over a decade to our theoretical understanding of the retrieval process. On a more practical level, they have contributed design criteria for operating retrieval systems. The following selection presents one of the early descriptions of the SMART system; it is valuable as it shows the direction automatic retrieval methods were to take beyond simple word-matching techniques. These include various word normalization techniques to improve recall, for instance, the separation of words into stems and affixes; the correlation and clustering, using statistical association measures, of related terms; and the identification, using a concept thesaurus, of synonymous, broader, narrower, and sibling terms. They include, as weIl, techniques, both linguistic and statistical, to deal with the thorny problem of how to automatically extract from texts index terms that consist of more than one word. They include weighting techniques and various documentrequest matching algorithms. Significant among the latter are those which produce a retrieval output of citations ranked in relevante order. During the 1970s, Salton and his students went an to further refine these various techniques, particularly the weighting and statistical association measures. Many of their early innovations seem commonplace today. Some of their later techniques are still ahead of their time and await technological developments for implementation. The particular focus of the selection that follows is an the evaluation of a particular component of the SMART system, a multilingual thesaurus. By mapping English language expressions and their German equivalents to a common concept number, the thesaurus permitted the automatic processing of German language documents against English language queries and vice versa. The results of the evaluation, as it turned out, were somewhat inconclusive. However, this SMART experiment suggested in a bold and optimistic way how one might proceed to answer such complex questions as What is meant by retrieval language compatability? How it is to be achieved, and how evaluated?
Footnote
Nachdruck des Originalartikels mit Kommentierung durch die Herausgeber
Original in: Journal of the American Society for Information Science 21(1970) no.3, S.187-194.
Theme
Automatisches Indexieren
Computerlinguistik
Object
SMART

Similar documents (author)

  1. Salton, G.: Another look at automatic text-retrieval systems (1986) 4.87
    4.8684025 = sum of:
      4.8684025 = weight(author_txt:salton in 1355) [ClassicSimilarity], result of:
        4.8684025 = score(doc=1355,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.12837885 = queryNorm
          4.868403 = fieldWeight in 1355, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.625 = fieldNorm(doc=1355)
    
  2. Salton, G.: ¬A new comparison between conventional indexing (MEDLARS) and automatic text processing (SMART) (1972) 4.87
    4.8684025 = sum of:
      4.8684025 = weight(author_txt:salton in 2324) [ClassicSimilarity], result of:
        4.8684025 = score(doc=2324,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.12837885 = queryNorm
          4.868403 = fieldWeight in 2324, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.625 = fieldNorm(doc=2324)
    
  3. Salton, G.: Future prospects for text-based information retrieval (1990) 4.87
    4.8684025 = sum of:
      4.8684025 = weight(author_txt:salton in 2326) [ClassicSimilarity], result of:
        4.8684025 = score(doc=2326,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.12837885 = queryNorm
          4.868403 = fieldWeight in 2326, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.625 = fieldNorm(doc=2326)
    
  4. Salton, G.: Fast document classification in automatic information retrieval (1978) 4.87
    4.8684025 = sum of:
      4.8684025 = weight(author_txt:salton in 2330) [ClassicSimilarity], result of:
        4.8684025 = score(doc=2330,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.12837885 = queryNorm
          4.868403 = fieldWeight in 2330, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.625 = fieldNorm(doc=2330)
    
  5. Salton, G.: Expert systems and information retrieval (1987) 4.87
    4.8684025 = sum of:
      4.8684025 = weight(author_txt:salton in 2836) [ClassicSimilarity], result of:
        4.8684025 = score(doc=2836,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.12837885 = queryNorm
          4.868403 = fieldWeight in 2836, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.625 = fieldNorm(doc=2836)
    

Similar documents (content)

  1. Lioma, C.; Ounis, I.: ¬A syntactically-based query reformulation technique for information retrieval (2008) 0.37
    0.3688793 = sum of:
      0.3688793 = product of:
        0.83836204 = sum of:
          0.057458833 = weight(abstract_txt:association in 3031) [ClassicSimilarity], result of:
            0.057458833 = score(doc=3031,freq=2.0), product of:
              0.13215965 = queryWeight, product of:
                1.0080243 = boost
                5.6215343 = idf(docFreq=436, maxDocs=44421)
                0.023322387 = queryNorm
              0.43476835 = fieldWeight in 3031, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6215343 = idf(docFreq=436, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.014329346 = weight(abstract_txt:their in 3031) [ClassicSimilarity], result of:
            0.014329346 = score(doc=3031,freq=1.0), product of:
              0.08311867 = queryWeight, product of:
                1.1305399 = boost
                3.1523883 = idf(docFreq=5161, maxDocs=44421)
                0.023322387 = queryNorm
              0.17239624 = fieldWeight in 3031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1523883 = idf(docFreq=5161, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.028976366 = weight(abstract_txt:various in 3031) [ClassicSimilarity], result of:
            0.028976366 = score(doc=3031,freq=1.0), product of:
              0.12076212 = queryWeight, product of:
                1.1801374 = boost
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.023322387 = queryNorm
              0.23994583 = fieldWeight in 3031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.017549686 = weight(abstract_txt:system in 3031) [ClassicSimilarity], result of:
            0.017549686 = score(doc=3031,freq=1.0), product of:
              0.09514674 = queryWeight, product of:
                1.2095771 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.023322387 = queryNorm
              0.18444863 = fieldWeight in 3031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.1097117 = weight(abstract_txt:weighting in 3031) [ClassicSimilarity], result of:
            0.1097117 = score(doc=3031,freq=2.0), product of:
              0.20340563 = queryWeight, product of:
                1.2505558 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.023322387 = queryNorm
              0.53937393 = fieldWeight in 3031, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.0585398 = weight(abstract_txt:statistical in 3031) [ClassicSimilarity], result of:
            0.0585398 = score(doc=3031,freq=1.0), product of:
              0.19299033 = queryWeight, product of:
                1.4918838 = boost
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.023322387 = queryNorm
              0.3033302 = fieldWeight in 3031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.166361 = weight(abstract_txt:salton in 3031) [ClassicSimilarity], result of:
            0.166361 = score(doc=3031,freq=1.0), product of:
              0.3382507 = queryWeight, product of:
                1.612653 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.023322387 = queryNorm
              0.49182755 = fieldWeight in 3031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.14345652 = weight(abstract_txt:automatic in 3031) [ClassicSimilarity], result of:
            0.14345652 = score(doc=3031,freq=5.0), product of:
              0.22578992 = queryWeight, product of:
                1.8633261 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.023322387 = queryNorm
              0.635354 = fieldWeight in 3031, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.08297819 = weight(abstract_txt:language in 3031) [ClassicSimilarity], result of:
            0.08297819 = score(doc=3031,freq=4.0), product of:
              0.18188924 = queryWeight, product of:
                1.8697997 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.023322387 = queryNorm
              0.45620176 = fieldWeight in 3031, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.09512989 = weight(abstract_txt:retrieval in 3031) [ClassicSimilarity], result of:
            0.09512989 = score(doc=3031,freq=8.0), product of:
              0.17690565 = queryWeight, product of:
                2.1818578 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.023322387 = queryNorm
              0.5377437 = fieldWeight in 3031, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
          0.06387073 = weight(abstract_txt:techniques in 3031) [ClassicSimilarity], result of:
            0.06387073 = score(doc=3031,freq=1.0), product of:
              0.25769895 = queryWeight, product of:
                2.4380271 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.023322387 = queryNorm
              0.24785016 = fieldWeight in 3031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3031)
        0.44 = coord(11/25)
    
  2. Salton, G.: SMART System: 1961-1976 (2009) 0.29
    0.28710744 = sum of:
      0.28710744 = product of:
        1.196281 = sum of:
          0.0662317 = weight(abstract_txt:various in 866) [ClassicSimilarity], result of:
            0.0662317 = score(doc=866,freq=1.0), product of:
              0.12076212 = queryWeight, product of:
                1.1801374 = boost
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.023322387 = queryNorm
              0.5484476 = fieldWeight in 866, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.125 = fieldNorm(doc=866)
          0.04011357 = weight(abstract_txt:system in 866) [ClassicSimilarity], result of:
            0.04011357 = score(doc=866,freq=1.0), product of:
              0.09514674 = queryWeight, product of:
                1.2095771 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.023322387 = queryNorm
              0.42159688 = fieldWeight in 866, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.125 = fieldNorm(doc=866)
          0.38025373 = weight(abstract_txt:salton in 866) [ClassicSimilarity], result of:
            0.38025373 = score(doc=866,freq=1.0), product of:
              0.3382507 = queryWeight, product of:
                1.612653 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.023322387 = queryNorm
              1.1241772 = fieldWeight in 866, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.125 = fieldNorm(doc=866)
          0.14664161 = weight(abstract_txt:automatic in 866) [ClassicSimilarity], result of:
            0.14664161 = score(doc=866,freq=1.0), product of:
              0.22578992 = queryWeight, product of:
                1.8633261 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.023322387 = queryNorm
              0.64946043 = fieldWeight in 866, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.125 = fieldNorm(doc=866)
          0.10871987 = weight(abstract_txt:retrieval in 866) [ClassicSimilarity], result of:
            0.10871987 = score(doc=866,freq=2.0), product of:
              0.17690565 = queryWeight, product of:
                2.1818578 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.023322387 = queryNorm
              0.6145642 = fieldWeight in 866, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.125 = fieldNorm(doc=866)
          0.45432055 = weight(abstract_txt:smart in 866) [ClassicSimilarity], result of:
            0.45432055 = score(doc=866,freq=1.0), product of:
              0.47985274 = queryWeight, product of:
                2.7163804 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.023322387 = queryNorm
              0.94679165 = fieldWeight in 866, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.125 = fieldNorm(doc=866)
        0.24 = coord(6/25)
    
  3. ¬The Fourth Text Retrieval Conference (TREC-4) (1996) 0.29
    0.28643757 = sum of:
      0.28643757 = product of:
        0.8951174 = sum of:
          0.10088957 = weight(abstract_txt:matching in 590) [ClassicSimilarity], result of:
            0.10088957 = score(doc=590,freq=1.0), product of:
              0.15266818 = queryWeight, product of:
                1.0834175 = boost
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.023322387 = queryNorm
              0.6608422 = fieldWeight in 590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.109375 = fieldNorm(doc=590)
          0.035099372 = weight(abstract_txt:system in 590) [ClassicSimilarity], result of:
            0.035099372 = score(doc=590,freq=1.0), product of:
              0.09514674 = queryWeight, product of:
                1.2095771 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.023322387 = queryNorm
              0.36889726 = fieldWeight in 590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.109375 = fieldNorm(doc=590)
          0.15515578 = weight(abstract_txt:weighting in 590) [ClassicSimilarity], result of:
            0.15515578 = score(doc=590,freq=1.0), product of:
              0.20340563 = queryWeight, product of:
                1.2505558 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.023322387 = queryNorm
              0.76278996 = fieldWeight in 590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.109375 = fieldNorm(doc=590)
          0.07629844 = weight(abstract_txt:include in 590) [ClassicSimilarity], result of:
            0.07629844 = score(doc=590,freq=1.0), product of:
              0.14506362 = queryWeight, product of:
                1.2934406 = boost
                4.808826 = idf(docFreq=984, maxDocs=44421)
                0.023322387 = queryNorm
              0.52596533 = fieldWeight in 590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.808826 = idf(docFreq=984, maxDocs=44421)
                0.109375 = fieldNorm(doc=590)
          0.12831143 = weight(abstract_txt:automatic in 590) [ClassicSimilarity], result of:
            0.12831143 = score(doc=590,freq=1.0), product of:
              0.22578992 = queryWeight, product of:
                1.8633261 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.023322387 = queryNorm
              0.5682779 = fieldWeight in 590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.109375 = fieldNorm(doc=590)
          0.08297819 = weight(abstract_txt:language in 590) [ClassicSimilarity], result of:
            0.08297819 = score(doc=590,freq=1.0), product of:
              0.18188924 = queryWeight, product of:
                1.8697997 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.023322387 = queryNorm
              0.45620176 = fieldWeight in 590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.109375 = fieldNorm(doc=590)
          0.09512989 = weight(abstract_txt:retrieval in 590) [ClassicSimilarity], result of:
            0.09512989 = score(doc=590,freq=2.0), product of:
              0.17690565 = queryWeight, product of:
                2.1818578 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.023322387 = queryNorm
              0.5377437 = fieldWeight in 590, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.109375 = fieldNorm(doc=590)
          0.22125469 = weight(abstract_txt:techniques in 590) [ClassicSimilarity], result of:
            0.22125469 = score(doc=590,freq=3.0), product of:
              0.25769895 = queryWeight, product of:
                2.4380271 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.023322387 = queryNorm
              0.85857815 = fieldWeight in 590, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.109375 = fieldNorm(doc=590)
        0.32 = coord(8/25)
    
  4. Chung, Y.M.; Lee, J.Y.: ¬A corpus-based approach to comparative evaluation of statistical term association measures (2001) 0.27
    0.26811442 = sum of:
      0.26811442 = product of:
        0.6093509 = sum of:
          0.113739 = weight(abstract_txt:association in 6769) [ClassicSimilarity], result of:
            0.113739 = score(doc=6769,freq=6.0), product of:
              0.13215965 = queryWeight, product of:
                1.0080243 = boost
                5.6215343 = idf(docFreq=436, maxDocs=44421)
                0.023322387 = queryNorm
              0.8606182 = fieldWeight in 6769, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.6215343 = idf(docFreq=436, maxDocs=44421)
                0.0625 = fieldNorm(doc=6769)
          0.02063901 = weight(abstract_txt:they in 6769) [ClassicSimilarity], result of:
            0.02063901 = score(doc=6769,freq=1.0), product of:
              0.08811152 = queryWeight, product of:
                1.0080534 = boost
                3.7477977 = idf(docFreq=2845, maxDocs=44421)
                0.023322387 = queryNorm
              0.23423736 = fieldWeight in 6769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7477977 = idf(docFreq=2845, maxDocs=44421)
                0.0625 = fieldNorm(doc=6769)
          0.057967704 = weight(abstract_txt:terms in 6769) [ClassicSimilarity], result of:
            0.057967704 = score(doc=6769,freq=5.0), product of:
              0.10257484 = queryWeight, product of:
                1.0876461 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.023322387 = queryNorm
              0.56512594 = fieldWeight in 6769, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.0625 = fieldNorm(doc=6769)
          0.016376395 = weight(abstract_txt:their in 6769) [ClassicSimilarity], result of:
            0.016376395 = score(doc=6769,freq=1.0), product of:
              0.08311867 = queryWeight, product of:
                1.1305399 = boost
                3.1523883 = idf(docFreq=5161, maxDocs=44421)
                0.023322387 = queryNorm
              0.19702427 = fieldWeight in 6769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1523883 = idf(docFreq=5161, maxDocs=44421)
                0.0625 = fieldNorm(doc=6769)
          0.03311585 = weight(abstract_txt:various in 6769) [ClassicSimilarity], result of:
            0.03311585 = score(doc=6769,freq=1.0), product of:
              0.12076212 = queryWeight, product of:
                1.1801374 = boost
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.023322387 = queryNorm
              0.2742238 = fieldWeight in 6769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.0625 = fieldNorm(doc=6769)
          0.06165845 = weight(abstract_txt:include in 6769) [ClassicSimilarity], result of:
            0.06165845 = score(doc=6769,freq=2.0), product of:
              0.14506362 = queryWeight, product of:
                1.2934406 = boost
                4.808826 = idf(docFreq=984, maxDocs=44421)
                0.023322387 = queryNorm
              0.42504418 = fieldWeight in 6769, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.808826 = idf(docFreq=984, maxDocs=44421)
                0.0625 = fieldNorm(doc=6769)
          0.05419768 = weight(abstract_txt:thesaurus in 6769) [ClassicSimilarity], result of:
            0.05419768 = score(doc=6769,freq=1.0), product of:
              0.16771063 = queryWeight, product of:
                1.390745 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.023322387 = queryNorm
              0.32316187 = fieldWeight in 6769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.0625 = fieldNorm(doc=6769)
          0.06690262 = weight(abstract_txt:statistical in 6769) [ClassicSimilarity], result of:
            0.06690262 = score(doc=6769,freq=1.0), product of:
              0.19299033 = queryWeight, product of:
                1.4918838 = boost
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.023322387 = queryNorm
              0.3466631 = fieldWeight in 6769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.0625 = fieldNorm(doc=6769)
          0.073320806 = weight(abstract_txt:automatic in 6769) [ClassicSimilarity], result of:
            0.073320806 = score(doc=6769,freq=1.0), product of:
              0.22578992 = queryWeight, product of:
                1.8633261 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.023322387 = queryNorm
              0.32473022 = fieldWeight in 6769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=6769)
          0.03843828 = weight(abstract_txt:retrieval in 6769) [ClassicSimilarity], result of:
            0.03843828 = score(doc=6769,freq=1.0), product of:
              0.17690565 = queryWeight, product of:
                2.1818578 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.023322387 = queryNorm
              0.21728125 = fieldWeight in 6769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=6769)
          0.07299512 = weight(abstract_txt:techniques in 6769) [ClassicSimilarity], result of:
            0.07299512 = score(doc=6769,freq=1.0), product of:
              0.25769895 = queryWeight, product of:
                2.4380271 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.023322387 = queryNorm
              0.28325734 = fieldWeight in 6769, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.0625 = fieldNorm(doc=6769)
        0.44 = coord(11/25)
    
  5. Harman, D.: Overview of the first Text Retrieval Conference (1993) 0.26
    0.2553112 = sum of:
      0.2553112 = product of:
        0.7978475 = sum of:
          0.08647678 = weight(abstract_txt:matching in 616) [ClassicSimilarity], result of:
            0.08647678 = score(doc=616,freq=1.0), product of:
              0.15266818 = queryWeight, product of:
                1.0834175 = boost
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.023322387 = queryNorm
              0.5664362 = fieldWeight in 616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.09375 = fieldNorm(doc=616)
          0.024564592 = weight(abstract_txt:their in 616) [ClassicSimilarity], result of:
            0.024564592 = score(doc=616,freq=1.0), product of:
              0.08311867 = queryWeight, product of:
                1.1305399 = boost
                3.1523883 = idf(docFreq=5161, maxDocs=44421)
                0.023322387 = queryNorm
              0.2955364 = fieldWeight in 616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1523883 = idf(docFreq=5161, maxDocs=44421)
                0.09375 = fieldNorm(doc=616)
          0.13299067 = weight(abstract_txt:weighting in 616) [ClassicSimilarity], result of:
            0.13299067 = score(doc=616,freq=1.0), product of:
              0.20340563 = queryWeight, product of:
                1.2505558 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.023322387 = queryNorm
              0.65382 = fieldWeight in 616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.09375 = fieldNorm(doc=616)
          0.10152327 = weight(abstract_txt:early in 616) [ClassicSimilarity], result of:
            0.10152327 = score(doc=616,freq=1.0), product of:
              0.1944866 = queryWeight, product of:
                1.497656 = boost
                5.5680695 = idf(docFreq=460, maxDocs=44421)
                0.023322387 = queryNorm
              0.5220065 = fieldWeight in 616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5680695 = idf(docFreq=460, maxDocs=44421)
                0.09375 = fieldNorm(doc=616)
          0.10998122 = weight(abstract_txt:automatic in 616) [ClassicSimilarity], result of:
            0.10998122 = score(doc=616,freq=1.0), product of:
              0.22578992 = queryWeight, product of:
                1.8633261 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.023322387 = queryNorm
              0.48709533 = fieldWeight in 616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.09375 = fieldNorm(doc=616)
          0.07112416 = weight(abstract_txt:language in 616) [ClassicSimilarity], result of:
            0.07112416 = score(doc=616,freq=1.0), product of:
              0.18188924 = queryWeight, product of:
                1.8697997 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.023322387 = queryNorm
              0.39103007 = fieldWeight in 616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.09375 = fieldNorm(doc=616)
          0.08153991 = weight(abstract_txt:retrieval in 616) [ClassicSimilarity], result of:
            0.08153991 = score(doc=616,freq=2.0), product of:
              0.17690565 = queryWeight, product of:
                2.1818578 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.023322387 = queryNorm
              0.46092314 = fieldWeight in 616, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=616)
          0.18964687 = weight(abstract_txt:techniques in 616) [ClassicSimilarity], result of:
            0.18964687 = score(doc=616,freq=3.0), product of:
              0.25769895 = queryWeight, product of:
                2.4380271 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.023322387 = queryNorm
              0.7359241 = fieldWeight in 616, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.09375 = fieldNorm(doc=616)
        0.32 = coord(8/25)