Document (#29533)

Author
Robertson, S.E.
Sparck Jones, K.
Title
Simple, proven approaches to text retrieval
Issue
May, 1997, Update of 1994 and 1996 versions.
Source
http://www.cl.cam.ac.uk/TechReports/UCAM-CL-TR-356.pdf
Year
1997
Series
Technical Report TR356, University of Cambridge, Computer Laboratory
Abstract
This technical note describes straightforward techniques for document indexing and retrieval that have been solidly established through extensive testing and are easy to apply. They are useful for many different types of text material, are viable for very large files, and have the advantage that they do not require special skills or training for searching, but are easy for end users. The document and text retrieval methods described here have a sound theoretical basis, are well established by extensive testing, and the ideas involved are now implemented in some commercial retrieval systems. Testing in the last few years has, in particular, shown that the methods presented here work very well with full texts, not only titles and abstracts, and with large files of texts containing three quarters of a million documents. These tests, the TREC Tests (see Harman 1993 - 1997; IP&M 1995), have been rigorous comparative evaluations involving many different approaches to information retrieval. These techniques depend on the use of simple terms for indexing both request and document texts; on term weighting exploiting statistical information about term occurrences; on scoring for request-document matching, using these weights, to obtain a ranked search output; and on relevance feedback to modify request weights or term sets in iterative searching. The normal implementation is via an inverted file organisation using a term list with linked document identifiers, plus counting data, and pointers to the actual texts. The user's request can be a word list, phrases, sentences or extended text.
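The pipeline the abstract outlines (simple terms, statistical term weights, summed scores giving a ranked output, an inverted file with counting data) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the abstract does not give the weighting formula, so a plain log(N/n) collection frequency weight is used, the tokenizer is a bare whitespace split, and the three sample texts are invented. Relevance feedback is left out.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # "Simple terms": lowercase alphabetic words, no stemming or stop list.
    return [w for w in text.lower().split() if w.isalpha()]


def build_inverted_file(docs):
    # term -> {doc_id: within-document frequency}; the frequencies are the
    # "counting data" stored alongside the linked document identifiers.
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term, freq in Counter(tokenize(text)).items():
            index[term][doc_id] = freq
    return index


def search(index, n_docs, request):
    # Score = sum of the weights of the request terms a document contains.
    # The weight log(N/n) is an assumed stand-in for the paper's weighting.
    scores = defaultdict(float)
    for term in set(tokenize(request)):
        postings = index.get(term, {})
        if not postings:
            continue
        weight = math.log(n_docs / len(postings))
        for doc_id in postings:
            scores[doc_id] += weight
    # Ranked output: best score first, ties broken by document id.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical three-document collection.
docs = {
    "d1": "term weighting for text retrieval",
    "d2": "relevance feedback in iterative searching",
    "d3": "text indexing and term weighting with relevance feedback",
}
index = build_inverted_file(docs)
ranking = search(index, len(docs), "term weighting relevance")
```

In this toy collection d3 matches all three request terms, d1 two and d2 one, so the ranking is d3, d1, d2. A term occurring in every document would get weight zero under this assumed formula, which is the intended effect of weighting by rarity.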
Footnote
Also available at: http://www.ftp.cl.cam.ac.uk/ftp/papers/reports/.
Theme
Retrieval algorithms
Retrieval studies

Similar documents (author)

  1. Robertson, S.E.; Sparck Jones, K.: Relevance weighting of search terms (1976) 5.65
    5.645015 = sum of:
      5.645015 = sum of:
        1.5019876 = weight(author_txt:jones in 139) [ClassicSimilarity], result of:
          1.5019876 = score(doc=139,freq=1.0), product of:
            0.49473542 = queryWeight, product of:
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.07129478 = queryNorm
            3.0359411 = fieldWeight in 139, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.4375 = fieldNorm(doc=139)
        1.7626017 = weight(author_txt:robertson in 139) [ClassicSimilarity], result of:
          1.7626017 = score(doc=139,freq=1.0), product of:
            0.5504251 = queryWeight, product of:
              1.0547818 = boost
              7.319441 = idf(docFreq=79, maxDocs=44421)
              0.07129478 = queryNorm
            3.2022552 = fieldWeight in 139, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.319441 = idf(docFreq=79, maxDocs=44421)
              0.4375 = fieldNorm(doc=139)
        2.3804252 = weight(author_txt:sparck in 139) [ClassicSimilarity], result of:
          2.3804252 = score(doc=139,freq=1.0), product of:
            0.67250955 = queryWeight, product of:
              1.1659038 = boost
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.07129478 = queryNorm
            3.5396154 = fieldWeight in 139, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.4375 = fieldNorm(doc=139)
    
  2. Sparck Jones, K.; Walker, S.; Robertson, S.E.: A probabilistic model of information retrieval : development and comparative experiments - part 1 (2000) 4.84
    4.838584 = sum of:
      4.838584 = sum of:
        1.287418 = weight(author_txt:jones in 4249) [ClassicSimilarity], result of:
          1.287418 = score(doc=4249,freq=1.0), product of:
            0.49473542 = queryWeight, product of:
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.07129478 = queryNorm
            2.6022353 = fieldWeight in 4249, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.375 = fieldNorm(doc=4249)
        1.5108016 = weight(author_txt:robertson in 4249) [ClassicSimilarity], result of:
          1.5108016 = score(doc=4249,freq=1.0), product of:
            0.5504251 = queryWeight, product of:
              1.0547818 = boost
              7.319441 = idf(docFreq=79, maxDocs=44421)
              0.07129478 = queryNorm
            2.7447903 = fieldWeight in 4249, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.319441 = idf(docFreq=79, maxDocs=44421)
              0.375 = fieldNorm(doc=4249)
        2.0403645 = weight(author_txt:sparck in 4249) [ClassicSimilarity], result of:
          2.0403645 = score(doc=4249,freq=1.0), product of:
            0.67250955 = queryWeight, product of:
              1.1659038 = boost
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.07129478 = queryNorm
            3.033956 = fieldWeight in 4249, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.375 = fieldNorm(doc=4249)
    
  3. Sparck Jones, K.; Walker, S.; Robertson, S.E.: A probabilistic model of information retrieval : development and comparative experiments - part 2 (2000) 4.84
    4.838584 = sum of:
      4.838584 = sum of:
        1.287418 = weight(author_txt:jones in 4354) [ClassicSimilarity], result of:
          1.287418 = score(doc=4354,freq=1.0), product of:
            0.49473542 = queryWeight, product of:
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.07129478 = queryNorm
            2.6022353 = fieldWeight in 4354, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.375 = fieldNorm(doc=4354)
        1.5108016 = weight(author_txt:robertson in 4354) [ClassicSimilarity], result of:
          1.5108016 = score(doc=4354,freq=1.0), product of:
            0.5504251 = queryWeight, product of:
              1.0547818 = boost
              7.319441 = idf(docFreq=79, maxDocs=44421)
              0.07129478 = queryNorm
            2.7447903 = fieldWeight in 4354, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.319441 = idf(docFreq=79, maxDocs=44421)
              0.375 = fieldNorm(doc=4354)
        2.0403645 = weight(author_txt:sparck in 4354) [ClassicSimilarity], result of:
          2.0403645 = score(doc=4354,freq=1.0), product of:
            0.67250955 = queryWeight, product of:
              1.1659038 = boost
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.07129478 = queryNorm
            3.033956 = fieldWeight in 4354, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.375 = fieldNorm(doc=4354)
    
  4. Sparck Jones, K.: Fashionable trends and feasible strategies in information management (1988) 2.96
    2.9580288 = sum of:
      2.9580288 = product of:
        4.437043 = sum of:
          1.7165573 = weight(author_txt:jones in 816) [ClassicSimilarity], result of:
            1.7165573 = score(doc=816,freq=1.0), product of:
              0.49473542 = queryWeight, product of:
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.07129478 = queryNorm
              3.469647 = fieldWeight in 816, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.5 = fieldNorm(doc=816)
          2.720486 = weight(author_txt:sparck in 816) [ClassicSimilarity], result of:
            2.720486 = score(doc=816,freq=1.0), product of:
              0.67250955 = queryWeight, product of:
                1.1659038 = boost
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.07129478 = queryNorm
              4.0452747 = fieldWeight in 816, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.5 = fieldNorm(doc=816)
        0.6666667 = coord(2/3)
    
  5. Sparck Jones, K.: Automatic classification (1976) 2.96
    2.9580288 = sum of:
      2.9580288 = product of:
        4.437043 = sum of:
          1.7165573 = weight(author_txt:jones in 2907) [ClassicSimilarity], result of:
            1.7165573 = score(doc=2907,freq=1.0), product of:
              0.49473542 = queryWeight, product of:
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.07129478 = queryNorm
              3.469647 = fieldWeight in 2907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.5 = fieldNorm(doc=2907)
          2.720486 = weight(author_txt:sparck in 2907) [ClassicSimilarity], result of:
            2.720486 = score(doc=2907,freq=1.0), product of:
              0.67250955 = queryWeight, product of:
                1.1659038 = boost
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.07129478 = queryNorm
              4.0452747 = fieldWeight in 2907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.5 = fieldNorm(doc=2907)
        0.6666667 = coord(2/3)
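
The score breakdowns above can be reproduced by hand. The sketch below recomputes the 2.96 score of entry 4 (doc 816) from the figures printed in its tree. The formulas idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq) are assumptions inferred from the numbers shown, not taken from the catalog's configuration; they agree with the printed idf and tf values. The function names are hypothetical, and queryNorm, fieldNorm and boost are copied from the listing rather than derived.

```python
import math


def idf(doc_freq, max_docs):
    # Assumed form; reproduces e.g. 6.939294 for docFreq=116, maxDocs=44421.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    # Assumed form; reproduces e.g. 2.236068 for freq=5.0.
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    # score = queryWeight * fieldWeight, as laid out in each explain tree.
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


# Constants copied from the author-similarity listing.
MAX_DOCS = 44421
QUERY_NORM = 0.07129478

# Entry 4 (doc 816): matches "jones" and "sparck", but not "robertson".
jones = term_score(1.0, 116, MAX_DOCS, QUERY_NORM, 0.5)
sparck = term_score(1.0, 36, MAX_DOCS, QUERY_NORM, 0.5, boost=1.1659038)
coord = 2 / 3  # two of the three query terms matched
total = (jones + sparck) * coord

print(round(jones, 4), round(sparck, 4), round(total, 4))
```

The results agree with the listing to about four decimal places; small differences in the last digits are expected because the engine works in single precision. The coord factor explains why entries 4 and 5 score well below entries 1 to 3 even though their per-term weights are higher: only two of the three author terms match.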
    

Similar documents (content)

  1. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: Beyond bag-of-words : bigram-enhanced context-dependent term weights (2014) 0.32
    0.3156907 = sum of:
      0.3156907 = product of:
        0.7892267 = sum of:
          0.024189638 = weight(abstract_txt:large in 2283) [ClassicSimilarity], result of:
            0.024189638 = score(doc=2283,freq=1.0), product of:
              0.09956998 = queryWeight, product of:
                1.0364088 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.02162641 = queryNorm
              0.24294108 = fieldWeight in 2283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2283)
          0.037936196 = weight(abstract_txt:approaches in 2283) [ClassicSimilarity], result of:
            0.037936196 = score(doc=2283,freq=2.0), product of:
              0.10667631 = queryWeight, product of:
                1.0727558 = boost
                4.5981455 = idf(docFreq=1215, maxDocs=44421)
                0.02162641 = queryNorm
              0.35561967 = fieldWeight in 2283, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5981455 = idf(docFreq=1215, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2283)
          0.046976186 = weight(abstract_txt:established in 2283) [ClassicSimilarity], result of:
            0.046976186 = score(doc=2283,freq=1.0), product of:
              0.15498655 = queryWeight, product of:
                1.2930458 = boost
                5.542372 = idf(docFreq=472, maxDocs=44421)
                0.02162641 = queryNorm
              0.30309847 = fieldWeight in 2283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.542372 = idf(docFreq=472, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2283)
          0.059333164 = weight(abstract_txt:extensive in 2283) [ClassicSimilarity], result of:
            0.059333164 = score(doc=2283,freq=1.0), product of:
              0.18109529 = queryWeight, product of:
                1.3977209 = boost
                5.9910407 = idf(docFreq=301, maxDocs=44421)
                0.02162641 = queryNorm
              0.32763505 = fieldWeight in 2283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9910407 = idf(docFreq=301, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2283)
          0.018072631 = weight(abstract_txt:have in 2283) [ClassicSimilarity], result of:
            0.018072631 = score(doc=2283,freq=1.0), product of:
              0.103291936 = queryWeight, product of:
                1.4928464 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.02162641 = queryNorm
              0.17496653 = fieldWeight in 2283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2283)
          0.23484 = weight(abstract_txt:weights in 2283) [ClassicSimilarity], result of:
            0.23484 = score(doc=2283,freq=5.0), product of:
              0.26499248 = queryWeight, product of:
                1.6907666 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.02162641 = queryNorm
              0.88621384 = fieldWeight in 2283, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2283)
          0.03641239 = weight(abstract_txt:text in 2283) [ClassicSimilarity], result of:
            0.03641239 = score(doc=2283,freq=1.0), product of:
              0.16477259 = queryWeight, product of:
                1.8854905 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02162641 = queryNorm
              0.22098574 = fieldWeight in 2283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2283)
          0.06481006 = weight(abstract_txt:retrieval in 2283) [ClassicSimilarity], result of:
            0.06481006 = score(doc=2283,freq=5.0), product of:
              0.15244988 = queryWeight, product of:
                2.0276847 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.02162641 = queryNorm
              0.42512372 = fieldWeight in 2283, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2283)
          0.17204952 = weight(abstract_txt:term in 2283) [ClassicSimilarity], result of:
            0.17204952 = score(doc=2283,freq=8.0), product of:
              0.23198387 = queryWeight, product of:
                2.2372308 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.02162641 = queryNorm
              0.7416443 = fieldWeight in 2283, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2283)
          0.09460692 = weight(abstract_txt:document in 2283) [ClassicSimilarity], result of:
            0.09460692 = score(doc=2283,freq=3.0), product of:
              0.23259321 = queryWeight, product of:
                2.504583 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.02162641 = queryNorm
              0.4067484 = fieldWeight in 2283, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2283)
        0.4 = coord(10/25)
    
  2. Maron, M.E.; Kuhns, I.L.: On relevance, probabilistic indexing and information retrieval (1960) 0.26
    0.26054162 = sum of:
      0.26054162 = product of:
        0.81419253 = sum of:
          0.024832934 = weight(abstract_txt:searching in 2928) [ClassicSimilarity], result of:
            0.024832934 = score(doc=2928,freq=1.0), product of:
              0.092697114 = queryWeight, product of:
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.02162641 = queryNorm
              0.26789328 = fieldWeight in 2928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.0625 = fieldNorm(doc=2928)
          0.036716226 = weight(abstract_txt:indexing in 2928) [ClassicSimilarity], result of:
            0.036716226 = score(doc=2928,freq=2.0), product of:
              0.0954867 = queryWeight, product of:
                1.0149353 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.02162641 = queryNorm
              0.38451666 = fieldWeight in 2928, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.0625 = fieldNorm(doc=2928)
          0.06982312 = weight(abstract_txt:list in 2928) [ClassicSimilarity], result of:
            0.06982312 = score(doc=2928,freq=2.0), product of:
              0.14656729 = queryWeight, product of:
                1.2574346 = boost
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.02162641 = queryNorm
              0.47638956 = fieldWeight in 2928, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.0625 = fieldNorm(doc=2928)
          0.020654436 = weight(abstract_txt:have in 2928) [ClassicSimilarity], result of:
            0.020654436 = score(doc=2928,freq=1.0), product of:
              0.103291936 = queryWeight, product of:
                1.4928464 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.02162641 = queryNorm
              0.19996175 = fieldWeight in 2928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0625 = fieldNorm(doc=2928)
          0.0331245 = weight(abstract_txt:retrieval in 2928) [ClassicSimilarity], result of:
            0.0331245 = score(doc=2928,freq=1.0), product of:
              0.15244988 = queryWeight, product of:
                2.0276847 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.02162641 = queryNorm
              0.21728125 = fieldWeight in 2928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=2928)
          0.06951851 = weight(abstract_txt:term in 2928) [ClassicSimilarity], result of:
            0.06951851 = score(doc=2928,freq=1.0), product of:
              0.23198387 = queryWeight, product of:
                2.2372308 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.02162641 = queryNorm
              0.29966956 = fieldWeight in 2928, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0625 = fieldNorm(doc=2928)
          0.08828141 = weight(abstract_txt:document in 2928) [ClassicSimilarity], result of:
            0.08828141 = score(doc=2928,freq=2.0), product of:
              0.23259321 = queryWeight, product of:
                2.504583 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.02162641 = queryNorm
              0.3795528 = fieldWeight in 2928, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=2928)
          0.4712414 = weight(abstract_txt:request in 2928) [ClassicSimilarity], result of:
            0.4712414 = score(doc=2928,freq=5.0), product of:
              0.48591816 = queryWeight, product of:
                3.2379003 = boost
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.02162641 = queryNorm
              0.9697958 = fieldWeight in 2928, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.0625 = fieldNorm(doc=2928)
        0.32 = coord(8/25)
    
  3. Dumais, S.T.: Latent semantic analysis (2003) 0.25
    0.24539538 = sum of:
      0.24539538 = product of:
        0.4719142 = sum of:
          0.017559536 = weight(abstract_txt:searching in 3462) [ClassicSimilarity], result of:
            0.017559536 = score(doc=3462,freq=2.0), product of:
              0.092697114 = queryWeight, product of:
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.02162641 = queryNorm
              0.18942915 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.018358113 = weight(abstract_txt:indexing in 3462) [ClassicSimilarity], result of:
            0.018358113 = score(doc=3462,freq=2.0), product of:
              0.0954867 = queryWeight, product of:
                1.0149353 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.02162641 = queryNorm
              0.19225833 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.030908387 = weight(abstract_txt:large in 3462) [ClassicSimilarity], result of:
            0.030908387 = score(doc=3462,freq=5.0), product of:
              0.09956998 = queryWeight, product of:
                1.0364088 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.02162641 = queryNorm
              0.31041875 = fieldWeight in 3462, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.020757308 = weight(abstract_txt:techniques in 3462) [ClassicSimilarity], result of:
            0.020757308 = score(doc=3462,freq=2.0), product of:
              0.103634626 = queryWeight, product of:
                1.0573514 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.02162641 = queryNorm
              0.20029318 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.026549805 = weight(abstract_txt:approaches in 3462) [ClassicSimilarity], result of:
            0.026549805 = score(doc=3462,freq=3.0), product of:
              0.10667631 = queryWeight, product of:
                1.0727558 = boost
                4.5981455 = idf(docFreq=1215, maxDocs=44421)
                0.02162641 = queryNorm
              0.24888192 = fieldWeight in 3462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5981455 = idf(docFreq=1215, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.013208802 = weight(abstract_txt:these in 3462) [ClassicSimilarity], result of:
            0.013208802 = score(doc=3462,freq=3.0), product of:
              0.076671354 = queryWeight, product of:
                1.1138561 = boost
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.02162641 = queryNorm
              0.17227818 = fieldWeight in 3462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.024686204 = weight(abstract_txt:list in 3462) [ClassicSimilarity], result of:
            0.024686204 = score(doc=3462,freq=1.0), product of:
              0.14656729 = queryWeight, product of:
                1.2574346 = boost
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.02162641 = queryNorm
              0.16842915 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.02732325 = weight(abstract_txt:have in 3462) [ClassicSimilarity], result of:
            0.02732325 = score(doc=3462,freq=7.0), product of:
              0.103291936 = queryWeight, product of:
                1.4928464 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.02162641 = queryNorm
              0.26452452 = fieldWeight in 3462, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.050966736 = weight(abstract_txt:text in 3462) [ClassicSimilarity], result of:
            0.050966736 = score(doc=3462,freq=6.0), product of:
              0.16477259 = queryWeight, product of:
                1.8854905 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02162641 = queryNorm
              0.30931562 = fieldWeight in 3462, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.049686752 = weight(abstract_txt:retrieval in 3462) [ClassicSimilarity], result of:
            0.049686752 = score(doc=3462,freq=9.0), product of:
              0.15244988 = queryWeight, product of:
                2.0276847 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.02162641 = queryNorm
              0.3259219 = fieldWeight in 3462, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.034759253 = weight(abstract_txt:term in 3462) [ClassicSimilarity], result of:
            0.034759253 = score(doc=3462,freq=1.0), product of:
              0.23198387 = queryWeight, product of:
                2.2372308 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.02162641 = queryNorm
              0.14983478 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.044140704 = weight(abstract_txt:document in 3462) [ClassicSimilarity], result of:
            0.044140704 = score(doc=3462,freq=2.0), product of:
              0.23259321 = queryWeight, product of:
                2.504583 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.02162641 = queryNorm
              0.1897764 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.11300936 = weight(abstract_txt:texts in 3462) [ClassicSimilarity], result of:
            0.11300936 = score(doc=3462,freq=4.0), product of:
              0.32072574 = queryWeight, product of:
                2.6305635 = boost
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.02162641 = queryNorm
              0.35235515 = fieldWeight in 3462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
        0.52 = coord(13/25)
    
  4. Patrick, T.B.; Sievert, M.C.; Popescu, M.: Text indexing of images based on graphical image content (1999) 0.21
    0.21400176 = sum of:
      0.21400176 = product of:
        0.66875553 = sum of:
          0.058053453 = weight(abstract_txt:indexing in 680) [ClassicSimilarity], result of:
            0.058053453 = score(doc=680,freq=5.0), product of:
              0.0954867 = queryWeight, product of:
                1.0149353 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.02162641 = queryNorm
              0.60797423 = fieldWeight in 680, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.0625 = fieldNorm(doc=680)
          0.027645301 = weight(abstract_txt:large in 680) [ClassicSimilarity], result of:
            0.027645301 = score(doc=680,freq=1.0), product of:
              0.09956998 = queryWeight, product of:
                1.0364088 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.02162641 = queryNorm
              0.27764696 = fieldWeight in 680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0625 = fieldNorm(doc=680)
          0.037617348 = weight(abstract_txt:very in 680) [ClassicSimilarity], result of:
            0.037617348 = score(doc=680,freq=1.0), product of:
              0.122266166 = queryWeight, product of:
                1.148471 = boost
                4.922683 = idf(docFreq=878, maxDocs=44421)
                0.02162641 = queryNorm
              0.30766767 = fieldWeight in 680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.922683 = idf(docFreq=878, maxDocs=44421)
                0.0625 = fieldNorm(doc=680)
          0.12002702 = weight(abstract_txt:weights in 680) [ClassicSimilarity], result of:
            0.12002702 = score(doc=680,freq=1.0), product of:
              0.26499248 = queryWeight, product of:
                1.6907666 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.02162641 = queryNorm
              0.45294502 = fieldWeight in 680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=680)
          0.08322833 = weight(abstract_txt:text in 680) [ClassicSimilarity], result of:
            0.08322833 = score(doc=680,freq=4.0), product of:
              0.16477259 = queryWeight, product of:
                1.8854905 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02162641 = queryNorm
              0.50511026 = fieldWeight in 680, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=680)
          0.0331245 = weight(abstract_txt:retrieval in 680) [ClassicSimilarity], result of:
            0.0331245 = score(doc=680,freq=1.0), product of:
              0.15244988 = queryWeight, product of:
                2.0276847 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.02162641 = queryNorm
              0.21728125 = fieldWeight in 680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=680)
          0.09831401 = weight(abstract_txt:term in 680) [ClassicSimilarity], result of:
            0.09831401 = score(doc=680,freq=2.0), product of:
              0.23198387 = queryWeight, product of:
                2.2372308 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.02162641 = queryNorm
              0.42379674 = fieldWeight in 680, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0625 = fieldNorm(doc=680)
          0.21074556 = weight(abstract_txt:request in 680) [ClassicSimilarity], result of:
            0.21074556 = score(doc=680,freq=1.0), product of:
              0.48591816 = queryWeight, product of:
                3.2379003 = boost
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.02162641 = queryNorm
              0.43370587 = fieldWeight in 680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.0625 = fieldNorm(doc=680)
        0.32 = coord(8/25)
    
  5. Can, F.; Kocberber, S.; Balcik, E.; Kaynak, C.; Ocalan, H.C.: Information retrieval on Turkish texts (2008) 0.21
    0.20789182 = sum of:
      0.20789182 = product of:
        0.64966196 = sum of:
          0.03894344 = weight(abstract_txt:indexing in 2373) [ClassicSimilarity], result of:
            0.03894344 = score(doc=2373,freq=1.0), product of:
              0.0954867 = queryWeight, product of:
                1.0149353 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.02162641 = queryNorm
              0.4078415 = fieldWeight in 2373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.09375 = fieldNorm(doc=2373)
          0.041467953 = weight(abstract_txt:large in 2373) [ClassicSimilarity], result of:
            0.041467953 = score(doc=2373,freq=1.0), product of:
              0.09956998 = queryWeight, product of:
                1.0364088 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.02162641 = queryNorm
              0.41647044 = fieldWeight in 2373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.09375 = fieldNorm(doc=2373)
          0.022878315 = weight(abstract_txt:these in 2373) [ClassicSimilarity], result of:
            0.022878315 = score(doc=2373,freq=1.0), product of:
              0.076671354 = queryWeight, product of:
                1.1138561 = boost
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.02162641 = queryNorm
              0.29839456 = fieldWeight in 2373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.09375 = fieldNorm(doc=2373)
          0.07100397 = weight(abstract_txt:simple in 2373) [ClassicSimilarity], result of:
            0.07100397 = score(doc=2373,freq=1.0), product of:
              0.14250883 = queryWeight, product of:
                1.2399032 = boost
                5.314588 = idf(docFreq=593, maxDocs=44421)
                0.02162641 = queryNorm
              0.49824262 = fieldWeight in 2373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.314588 = idf(docFreq=593, maxDocs=44421)
                0.09375 = fieldNorm(doc=2373)
          0.074058615 = weight(abstract_txt:list in 2373) [ClassicSimilarity], result of:
            0.074058615 = score(doc=2373,freq=1.0), product of:
              0.14656729 = queryWeight, product of:
                1.2574346 = boost
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.02162641 = queryNorm
              0.50528747 = fieldWeight in 2373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.389733 = idf(docFreq=550, maxDocs=44421)
                0.09375 = fieldNorm(doc=2373)
          0.099373505 = weight(abstract_txt:retrieval in 2373) [ClassicSimilarity], result of:
            0.099373505 = score(doc=2373,freq=4.0), product of:
              0.15244988 = queryWeight, product of:
                2.0276847 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.02162641 = queryNorm
              0.6518438 = fieldWeight in 2373, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=2373)
          0.1324221 = weight(abstract_txt:document in 2373) [ClassicSimilarity], result of:
            0.1324221 = score(doc=2373,freq=2.0), product of:
              0.23259321 = queryWeight, product of:
                2.504583 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.02162641 = queryNorm
              0.5693292 = fieldWeight in 2373, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.09375 = fieldNorm(doc=2373)
          0.16951406 = weight(abstract_txt:texts in 2373) [ClassicSimilarity], result of:
            0.16951406 = score(doc=2373,freq=1.0), product of:
              0.32072574 = queryWeight, product of:
                2.6305635 = boost
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.02162641 = queryNorm
              0.52853274 = fieldWeight in 2373, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.09375 = fieldNorm(doc=2373)
        0.32 = coord(8/25)
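The arithmetic in the explanation tree for doc 2373 can be re-derived by hand. The sketch below is a minimal illustration, not the search engine's own code. It assumes the usual ClassicSimilarity definitions, idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(termFreq). The boost, docFreq, termFreq, queryNorm, fieldNorm and coord values are copied from the listing above. All function and variable names are hypothetical.

```python
import math

# Constants copied from the explanation tree for doc 2373.
MAX_DOCS = 44421
QUERY_NORM = 0.02162641
FIELD_NORM = 0.09375      # fieldNorm(doc=2373)
COORD = 8 / 25            # 8 of the 25 query terms matched

# (term, boost, docFreq, termFreq), one tuple per weight(...) node above.
TERMS = [
    ("indexing",  1.0149353, 1557, 1),
    ("large",     1.0364088, 1420, 1),
    ("these",     1.1138561, 5006, 1),
    ("simple",    1.2399032,  593, 1),
    ("list",      1.2574346,  550, 1),
    ("retrieval", 2.0276847, 3732, 4),
    ("document",  2.504583,  1647, 2),
    ("texts",     2.6305635,  429, 1),
]


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency, assumed form: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq):
    """Score of one term: queryWeight * fieldWeight, as in the tree."""
    i = idf(doc_freq)
    query_weight = boost * i * QUERY_NORM            # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * i * FIELD_NORM  # tf * idf * fieldNorm
    return query_weight * field_weight


def document_score(terms=TERMS):
    """Sum of the per-term scores, scaled by the coordination factor."""
    return COORD * sum(term_score(b, df, f) for _, b, df, f in terms)


if __name__ == "__main__":
    for term, boost, doc_freq, freq in TERMS:
        print(f"{term:10s} {term_score(boost, doc_freq, freq):.6f}")
    print(f"{'total':10s} {document_score():.6f}")
```

Because idf enters both queryWeight and fieldWeight, each term contributes in proportion to idf squared, which is why the rare term "texts" (docFreq=429) outweighs "retrieval" (docFreq=3732) even though "retrieval" occurs four times. The listing's values come from single-precision arithmetic, so a recomputation in double precision may differ in the last printed digits.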