Document (#32978)

Author
Lam-Adesina, A.M.
Jones, G.J.F.
Title
Examining and improving the effectiveness of relevance feedback for retrieval of scanned text documents
Source
Information processing and management. 42(2006) no.3, p.633-649
Year
2006
Abstract
Important legacy paper documents are digitized and collected in online accessible archives. This enables the preservation, sharing, and, significantly, the searching of these documents. The text contents of these document images can be transcribed automatically using OCR systems and then stored in an information retrieval system. However, OCR systems make errors in character recognition, which have previously been shown to affect document retrieval behaviour. In particular, relevance feedback query-expansion methods, which are often effective for improving electronic text retrieval, are observed to be less reliable for retrieval of scanned document images. Our experimental examination of the effects of character recognition errors on an ad hoc OCR retrieval task demonstrates that, while baseline information retrieval can remain relatively unaffected by transcription errors, relevance feedback via query expansion becomes highly unstable. This paper examines the reason for this behaviour and introduces novel modifications to standard relevance feedback methods. These methods are shown experimentally to improve the effectiveness of relevance feedback for errorful OCR transcriptions. The new methods combine similar recognised character strings based on term collection frequency and a string edit-distance measure. The techniques are domain independent and make no use of external resources such as dictionaries or training data.
Theme
Document management
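
The abstract mentions combining similar recognised character strings using term collection frequency and a string edit-distance measure. The record gives no further detail, so the following is only a minimal sketch of that general idea, not the authors' method. The function names, the greedy "most frequent form wins" rule, the `max_distance` threshold, and the sample frequencies are all assumptions made for illustration.

```python
from collections import Counter


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with the standard row-by-row dynamic programme."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def merge_variants(collection_freq, max_distance=1):
    """Map each term to the most frequent term within max_distance edits.

    Hypothetical merging rule: terms are visited from most to least frequent,
    so a frequent spelling becomes the canonical form for rarer near-duplicates.
    """
    ordered = sorted(collection_freq, key=lambda t: (-collection_freq[t], t))
    canonical = []
    mapping = {}
    for term in ordered:
        target = next(
            (c for c in canonical if edit_distance(term, c) <= max_distance),
            None,
        )
        if target is None:
            canonical.append(term)
            target = term
        mapping[term] = target
    return mapping


# Invented collection frequencies with OCR-style misrecognitions.
freqs = Counter({"retrieval": 120, "retrieva1": 3, "retneval": 2,
                 "feedback": 80, "feedbach": 1})
mapping = merge_variants(freqs, max_distance=2)
```

With these invented counts, the rare strings are folded into the frequent spellings, so an expansion step would see one term instead of several low-frequency variants. No dictionary or training data is involved, which is in keeping with the abstract's description.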

Similar documents (author)

  1. Jones, M.H.: Year's work in cataloging and classification : 1978 (1979) 4.34
    4.3370585 = sum of:
      4.3370585 = weight(author_txt:jones in 307) [ClassicSimilarity], result of:
        4.3370585 = fieldWeight in 307, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.939294 = idf(docFreq=116, maxDocs=44421)
          0.625 = fieldNorm(doc=307)
    
  2. Jones, K.P.: Natural-language processing and automatic indexing : a reply (1990) 4.34
    4.3370585 = sum of:
      4.3370585 = weight(author_txt:jones in 393) [ClassicSimilarity], result of:
        4.3370585 = fieldWeight in 393, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.939294 = idf(docFreq=116, maxDocs=44421)
          0.625 = fieldNorm(doc=393)
    
  3. Jones, R.M.: Online catalogue research in Europe (1989) 4.34
    4.3370585 = sum of:
      4.3370585 = weight(author_txt:jones in 795) [ClassicSimilarity], result of:
        4.3370585 = fieldWeight in 795, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.939294 = idf(docFreq=116, maxDocs=44421)
          0.625 = fieldNorm(doc=795)
    
  4. Jones, R.L.: Automatic document content analysis : the AIDA project (1992) 4.34
    4.3370585 = sum of:
      4.3370585 = weight(author_txt:jones in 2606) [ClassicSimilarity], result of:
        4.3370585 = fieldWeight in 2606, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.939294 = idf(docFreq=116, maxDocs=44421)
          0.625 = fieldNorm(doc=2606)
    
  5. Jones, K.P.: How do we index? : a report of some Aslib Information Group activity (1983) 4.34
    4.3370585 = sum of:
      4.3370585 = weight(author_txt:jones in 2735) [ClassicSimilarity], result of:
        4.3370585 = fieldWeight in 2735, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          6.939294 = idf(docFreq=116, maxDocs=44421)
          0.625 = fieldNorm(doc=2735)
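
Each author match above has the same score because the explain tree reduces to one product: tf × idf × fieldNorm. The sketch below reproduces that arithmetic, assuming the classic TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the figures in the listing. The exact idf formula differs between Lucene versions, and the function names here are illustrative rather than Lucene API calls.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency in the form that matches the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the author_txt:jones entries above.
idf = classic_idf(doc_freq=116, max_docs=44421)
score = field_weight(freq=1.0, doc_freq=116, max_docs=44421, field_norm=0.625)
print(f"idf={idf:.6f} score={score:.6f}")
```

The result should agree with the listed 6.939294 and 4.3370585 up to single-precision rounding. Since term frequency, document frequency, and field norm are identical for all five records, their scores are identical too.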
    

Similar documents (content)

  1. Taghva, K.; Borsack, J.; Condit, A.: Effects of OCR errors on ranking and feedback using the vector space model (1996) 0.36
    0.36286217 = sum of:
      0.36286217 = product of:
        1.1339443 = sum of:
          0.050245374 = weight(abstract_txt:text in 5019) [ClassicSimilarity], result of:
            0.050245374 = score(doc=5019,freq=1.0), product of:
              0.113684654 = queryWeight, product of:
                1.3669709 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.020580996 = queryNorm
              0.44197148 = fieldWeight in 5019, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.109375 = fieldNorm(doc=5019)
          0.11603774 = weight(abstract_txt:recognition in 5019) [ClassicSimilarity], result of:
            0.11603774 = score(doc=5019,freq=1.0), product of:
              0.17351627 = queryWeight, product of:
                1.3789002 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.020580996 = queryNorm
              0.6687427 = fieldWeight in 5019, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.109375 = fieldNorm(doc=5019)
          0.05338378 = weight(abstract_txt:documents in 5019) [ClassicSimilarity], result of:
            0.05338378 = score(doc=5019,freq=1.0), product of:
              0.11837064 = queryWeight, product of:
                1.3948592 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.020580996 = queryNorm
              0.45098835 = fieldWeight in 5019, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.109375 = fieldNorm(doc=5019)
          0.060297478 = weight(abstract_txt:document in 5019) [ClassicSimilarity], result of:
            0.060297478 = score(doc=5019,freq=1.0), product of:
              0.12838192 = queryWeight, product of:
                1.4526477 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.020580996 = queryNorm
              0.46967265 = fieldWeight in 5019, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.109375 = fieldNorm(doc=5019)
          0.21103144 = weight(abstract_txt:character in 5019) [ClassicSimilarity], result of:
            0.21103144 = score(doc=5019,freq=1.0), product of:
              0.29593924 = queryWeight, product of:
                2.205513 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.020580996 = queryNorm
              0.7130904 = fieldWeight in 5019, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.109375 = fieldNorm(doc=5019)
          0.30237338 = weight(abstract_txt:errors in 5019) [ClassicSimilarity], result of:
            0.30237338 = score(doc=5019,freq=2.0), product of:
              0.2985315 = queryWeight, product of:
                2.2151515 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.020580996 = queryNorm
              1.0128692 = fieldWeight in 5019, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.109375 = fieldNorm(doc=5019)
          0.07465709 = weight(abstract_txt:retrieval in 5019) [ClassicSimilarity], result of:
            0.07465709 = score(doc=5019,freq=1.0), product of:
              0.19634089 = queryWeight, product of:
                2.744114 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.020580996 = queryNorm
              0.3802422 = fieldWeight in 5019, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.109375 = fieldNorm(doc=5019)
          0.26591796 = weight(abstract_txt:feedback in 5019) [ClassicSimilarity], result of:
            0.26591796 = score(doc=5019,freq=1.0), product of:
              0.4093415 = queryWeight, product of:
                3.3486953 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.020580996 = queryNorm
              0.6496237 = fieldWeight in 5019, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.109375 = fieldNorm(doc=5019)
        0.32 = coord(8/25)
    
  2. Salton, G.; Buckley, C.: Improving retrieval performance by relevance feedback (1990) 0.35
    0.35316247 = sum of:
      0.35316247 = product of:
        1.2612945 = sum of:
          0.062356334 = weight(abstract_txt:query in 5441) [ClassicSimilarity], result of:
            0.062356334 = score(doc=5441,freq=1.0), product of:
              0.104921974 = queryWeight, product of:
                1.0722498 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.020580996 = queryNorm
              0.5943115 = fieldWeight in 5441, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.125 = fieldNorm(doc=5441)
          0.07678914 = weight(abstract_txt:effectiveness in 5441) [ClassicSimilarity], result of:
            0.07678914 = score(doc=5441,freq=1.0), product of:
              0.1205441 = queryWeight, product of:
                1.149306 = boost
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.020580996 = queryNorm
              0.6370212 = fieldWeight in 5441, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.125 = fieldNorm(doc=5441)
          0.057423286 = weight(abstract_txt:text in 5441) [ClassicSimilarity], result of:
            0.057423286 = score(doc=5441,freq=1.0), product of:
              0.113684654 = queryWeight, product of:
                1.3669709 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.020580996 = queryNorm
              0.50511026 = fieldWeight in 5441, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.125 = fieldNorm(doc=5441)
          0.116736494 = weight(abstract_txt:methods in 5441) [ClassicSimilarity], result of:
            0.116736494 = score(doc=5441,freq=2.0), product of:
              0.1593739 = queryWeight, product of:
                1.8689011 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.020580996 = queryNorm
              0.7324694 = fieldWeight in 5441, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.125 = fieldNorm(doc=5441)
          0.120664075 = weight(abstract_txt:retrieval in 5441) [ClassicSimilarity], result of:
            0.120664075 = score(doc=5441,freq=2.0), product of:
              0.19634089 = queryWeight, product of:
                2.744114 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.020580996 = queryNorm
              0.6145642 = fieldWeight in 5441, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.125 = fieldNorm(doc=5441)
          0.30094415 = weight(abstract_txt:relevance in 5441) [ClassicSimilarity], result of:
            0.30094415 = score(doc=5441,freq=3.0), product of:
              0.28197435 = queryWeight, product of:
                2.7793133 = boost
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.020580996 = queryNorm
              1.0672749 = fieldWeight in 5441, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.125 = fieldNorm(doc=5441)
          0.526381 = weight(abstract_txt:feedback in 5441) [ClassicSimilarity], result of:
            0.526381 = score(doc=5441,freq=3.0), product of:
              0.4093415 = queryWeight, product of:
                3.3486953 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.020580996 = queryNorm
              1.2859213 = fieldWeight in 5441, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.125 = fieldNorm(doc=5441)
        0.28 = coord(7/25)
    
  3. Ye, Z.; Huang, J.X.: A learning to rank approach for quality-aware pseudo-relevance feedback (2016) 0.35
    0.3509432 = sum of:
      0.3509432 = product of:
        0.877358 = sum of:
          0.029776936 = weight(abstract_txt:make in 3855) [ClassicSimilarity], result of:
            0.029776936 = score(doc=3855,freq=1.0), product of:
              0.10175429 = queryWeight, product of:
                1.0559397 = boost
                4.682171 = idf(docFreq=1117, maxDocs=44421)
                0.020580996 = queryNorm
              0.29263568 = fieldWeight in 3855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.682171 = idf(docFreq=1117, maxDocs=44421)
                0.0625 = fieldNorm(doc=3855)
          0.014030992 = weight(abstract_txt:these in 3855) [ClassicSimilarity], result of:
            0.014030992 = score(doc=3855,freq=1.0), product of:
              0.07053241 = queryWeight, product of:
                1.0767199 = boost
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.020580996 = queryNorm
              0.19892971 = fieldWeight in 3855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.0625 = fieldNorm(doc=3855)
          0.05067266 = weight(abstract_txt:shown in 3855) [ClassicSimilarity], result of:
            0.05067266 = score(doc=3855,freq=1.0), product of:
              0.14503802 = queryWeight, product of:
                1.2606765 = boost
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.020580996 = queryNorm
              0.349375 = fieldWeight in 3855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.0625 = fieldNorm(doc=3855)
          0.028711643 = weight(abstract_txt:text in 3855) [ClassicSimilarity], result of:
            0.028711643 = score(doc=3855,freq=1.0), product of:
              0.113684654 = queryWeight, product of:
                1.3669709 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.020580996 = queryNorm
              0.25255513 = fieldWeight in 3855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=3855)
          0.05283624 = weight(abstract_txt:documents in 3855) [ClassicSimilarity], result of:
            0.05283624 = score(doc=3855,freq=3.0), product of:
              0.11837064 = queryWeight, product of:
                1.3948592 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.020580996 = queryNorm
              0.4463627 = fieldWeight in 3855, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=3855)
          0.077045284 = weight(abstract_txt:document in 3855) [ClassicSimilarity], result of:
            0.077045284 = score(doc=3855,freq=5.0), product of:
              0.12838192 = queryWeight, product of:
                1.4526477 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.020580996 = queryNorm
              0.6001257 = fieldWeight in 3855, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=3855)
          0.041272584 = weight(abstract_txt:methods in 3855) [ClassicSimilarity], result of:
            0.041272584 = score(doc=3855,freq=1.0), product of:
              0.1593739 = queryWeight, product of:
                1.8689011 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.020580996 = queryNorm
              0.25896704 = fieldWeight in 3855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=3855)
          0.060332038 = weight(abstract_txt:retrieval in 3855) [ClassicSimilarity], result of:
            0.060332038 = score(doc=3855,freq=2.0), product of:
              0.19634089 = queryWeight, product of:
                2.744114 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.020580996 = queryNorm
              0.3072821 = fieldWeight in 3855, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=3855)
          0.15047207 = weight(abstract_txt:relevance in 3855) [ClassicSimilarity], result of:
            0.15047207 = score(doc=3855,freq=3.0), product of:
              0.28197435 = queryWeight, product of:
                2.7793133 = boost
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.020580996 = queryNorm
              0.53363746 = fieldWeight in 3855, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.0625 = fieldNorm(doc=3855)
          0.37220758 = weight(abstract_txt:feedback in 3855) [ClassicSimilarity], result of:
            0.37220758 = score(doc=3855,freq=6.0), product of:
              0.4093415 = queryWeight, product of:
                3.3486953 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.020580996 = queryNorm
              0.90928376 = fieldWeight in 3855, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.0625 = fieldNorm(doc=3855)
        0.4 = coord(10/25)
    
  4. He, D.; Wu, D.: Enhancing query translation with relevance feedback in translingual information retrieval : a study of the medication process (2011) 0.34
    0.343665 = sum of:
      0.343665 = product of:
        0.954625 = sum of:
          0.0763706 = weight(abstract_txt:query in 244) [ClassicSimilarity], result of:
            0.0763706 = score(doc=244,freq=6.0), product of:
              0.104921974 = queryWeight, product of:
                1.0722498 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.020580996 = queryNorm
              0.72787994 = fieldWeight in 244, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=244)
          0.03839457 = weight(abstract_txt:effectiveness in 244) [ClassicSimilarity], result of:
            0.03839457 = score(doc=244,freq=1.0), product of:
              0.1205441 = queryWeight, product of:
                1.149306 = boost
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.020580996 = queryNorm
              0.3185106 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.0625 = fieldNorm(doc=244)
          0.059286572 = weight(abstract_txt:improving in 244) [ClassicSimilarity], result of:
            0.059286572 = score(doc=244,freq=1.0), product of:
              0.16104119 = queryWeight, product of:
                1.3284072 = boost
                5.8903265 = idf(docFreq=333, maxDocs=44421)
                0.020580996 = queryNorm
              0.3681454 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8903265 = idf(docFreq=333, maxDocs=44421)
                0.0625 = fieldNorm(doc=244)
          0.11442755 = weight(abstract_txt:expansion in 244) [ClassicSimilarity], result of:
            0.11442755 = score(doc=244,freq=3.0), product of:
              0.17309296 = queryWeight, product of:
                1.3772172 = boost
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.020580996 = queryNorm
              0.6610757 = fieldWeight in 244, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.0625 = fieldNorm(doc=244)
          0.030505016 = weight(abstract_txt:documents in 244) [ClassicSimilarity], result of:
            0.030505016 = score(doc=244,freq=1.0), product of:
              0.11837064 = queryWeight, product of:
                1.3948592 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.020580996 = queryNorm
              0.25770763 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=244)
          0.041272584 = weight(abstract_txt:methods in 244) [ClassicSimilarity], result of:
            0.041272584 = score(doc=244,freq=1.0), product of:
              0.1593739 = queryWeight, product of:
                1.8689011 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.020580996 = queryNorm
              0.25896704 = fieldWeight in 244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=244)
          0.060332038 = weight(abstract_txt:retrieval in 244) [ClassicSimilarity], result of:
            0.060332038 = score(doc=244,freq=2.0), product of:
              0.19634089 = queryWeight, product of:
                2.744114 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.020580996 = queryNorm
              0.3072821 = fieldWeight in 244, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=244)
          0.19425863 = weight(abstract_txt:relevance in 244) [ClassicSimilarity], result of:
            0.19425863 = score(doc=244,freq=5.0), product of:
              0.28197435 = queryWeight, product of:
                2.7793133 = boost
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.020580996 = queryNorm
              0.68892306 = fieldWeight in 244, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.0625 = fieldNorm(doc=244)
          0.33977747 = weight(abstract_txt:feedback in 244) [ClassicSimilarity], result of:
            0.33977747 = score(doc=244,freq=5.0), product of:
              0.4093415 = queryWeight, product of:
                3.3486953 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.020580996 = queryNorm
              0.8300587 = fieldWeight in 244, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.0625 = fieldNorm(doc=244)
        0.36 = coord(9/25)
    
  5. Colace, F.; Santo, M. De; Greco, L.; Napoletano, P.: Weighted word pairs for query expansion (2015) 0.31
    0.30538845 = sum of:
      0.30538845 = product of:
        0.9543389 = sum of:
          0.081003256 = weight(abstract_txt:query in 3687) [ClassicSimilarity], result of:
            0.081003256 = score(doc=3687,freq=3.0), product of:
              0.104921974 = queryWeight, product of:
                1.0722498 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.020580996 = queryNorm
              0.7720333 = fieldWeight in 3687, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.09375 = fieldNorm(doc=3687)
          0.05759186 = weight(abstract_txt:effectiveness in 3687) [ClassicSimilarity], result of:
            0.05759186 = score(doc=3687,freq=1.0), product of:
              0.1205441 = queryWeight, product of:
                1.149306 = boost
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.020580996 = queryNorm
              0.4777659 = fieldWeight in 3687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.09375 = fieldNorm(doc=3687)
          0.043067463 = weight(abstract_txt:text in 3687) [ClassicSimilarity], result of:
            0.043067463 = score(doc=3687,freq=1.0), product of:
              0.113684654 = queryWeight, product of:
                1.3669709 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.020580996 = queryNorm
              0.3788327 = fieldWeight in 3687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=3687)
          0.14014456 = weight(abstract_txt:expansion in 3687) [ClassicSimilarity], result of:
            0.14014456 = score(doc=3687,freq=2.0), product of:
              0.17309296 = queryWeight, product of:
                1.3772172 = boost
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.020580996 = queryNorm
              0.8096491 = fieldWeight in 3687, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.09375 = fieldNorm(doc=3687)
          0.06190888 = weight(abstract_txt:methods in 3687) [ClassicSimilarity], result of:
            0.06190888 = score(doc=3687,freq=1.0), product of:
              0.1593739 = queryWeight, product of:
                1.8689011 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.020580996 = queryNorm
              0.38845056 = fieldWeight in 3687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.09375 = fieldNorm(doc=3687)
          0.06399179 = weight(abstract_txt:retrieval in 3687) [ClassicSimilarity], result of:
            0.06399179 = score(doc=3687,freq=1.0), product of:
              0.19634089 = queryWeight, product of:
                2.744114 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.020580996 = queryNorm
              0.3259219 = fieldWeight in 3687, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=3687)
          0.1842899 = weight(abstract_txt:relevance in 3687) [ClassicSimilarity], result of:
            0.1842899 = score(doc=3687,freq=2.0), product of:
              0.28197435 = queryWeight, product of:
                2.7793133 = boost
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.020580996 = queryNorm
              0.65356976 = fieldWeight in 3687, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.09375 = fieldNorm(doc=3687)
          0.3223412 = weight(abstract_txt:feedback in 3687) [ClassicSimilarity], result of:
            0.3223412 = score(doc=3687,freq=2.0), product of:
              0.4093415 = queryWeight, product of:
                3.3486953 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.020580996 = queryNorm
              0.7874628 = fieldWeight in 3687, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.09375 = fieldNorm(doc=3687)
        0.32 = coord(8/25)
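
The content-similarity trees above all follow one pattern. Each matched term contributes queryWeight × fieldWeight, the contributions are summed, and the sum is scaled by coord (matched terms divided by the 25 query terms). The sketch below recomputes the first hit (doc 5019) from the boost, idf, and freq values printed in its tree. It is an illustration of the arithmetic only, and the function and constant names are assumptions rather than Lucene API calls.

```python
import math

QUERY_NORM = 0.020580996   # queryNorm, shared by every term in the listing
FIELD_NORM = 0.109375      # fieldNorm(doc=5019)
TOTAL_QUERY_TERMS = 25     # denominator of coord(8/25)

# (term, boost, idf, freq) copied from the explain tree for doc 5019.
MATCHED_TERMS = [
    ("text",        1.3669709, 4.040882,  1.0),
    ("recognition", 1.3789002, 6.114219,  1.0),
    ("documents",   1.3948592, 4.123322,  1.0),
    ("document",    1.4526477, 4.29415,   1.0),
    ("character",   2.205513,  6.519684,  1.0),
    ("errors",      2.2151515, 6.548176,  2.0),
    ("retrieval",   2.744114,  3.4765,    1.0),
    ("feedback",    3.3486953, 5.9394164, 1.0),
]


def term_score(boost, idf, freq, query_norm, field_norm):
    """score = queryWeight * fieldWeight for a single matched term."""
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


def document_score(terms, query_norm, field_norm, total_query_terms):
    """Sum the per-term scores, then scale by the coordination factor."""
    raw = sum(term_score(b, i, f, query_norm, field_norm) for _, b, i, f in terms)
    coord = len(terms) / total_query_terms
    return raw, coord, raw * coord


raw, coord, final = document_score(MATCHED_TERMS, QUERY_NORM, FIELD_NORM,
                                   TOTAL_QUERY_TERMS)
print(f"sum={raw:.6f} coord={coord:.2f} score={final:.6f}")
```

The computed values should match the listed sum 1.1339443, coord 0.32, and final score 0.36286217 up to rounding. The same function applies to the other four hits once their own fieldNorm and term rows are substituted.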