Document (#36120)

Author
Moura, E.S. de
Fernandes, D.
Ribeiro-Neto, B.
Silva, A.S. da
Gonçalves, M.A.
Title
Using structural information to improve search in Web collections
Source
Journal of the American Society for Information Science and Technology. 61(2010) no.12, S.2503-2513
Year
2010
Abstract
In this work, we investigate the problem of using the block structure of Web pages to improve ranking results. Starting with basic intuitions provided by the concepts of term frequency (TF) and inverse document frequency (IDF), we propose nine block-weight functions to distinguish the impact of term occurrences inside page blocks, instead of inside whole pages. These are then used to compute a modified BM25 ranking function. Using four distinct Web collections, we ran extensive experiments to compare our block-weight ranking formulas with two other baselines: (a) a BM25 ranking applied to full pages, and (b) a BM25 ranking that takes into account best blocks. Our methods suggest that our block-weighting ranking method is superior to all baselines across all collections we used and that average gain in precision figures from 5 to 20% are generated.
Theme
Retrievalalgorithmen

Similar documents (author)

  1. Calado, P.; Cristo, M.; Gonçalves, M.A.; Moura, E.S. de; Ribeiro-Neto, B.; Ziviani, N.: Link-based similarity measures for the classification of Web documents (2006) 2.43
    2.4286134 = sum of:
      2.4286134 = product of:
        3.64292 = sum of:
          0.8124124 = weight(author_txt:gonçalves in 5921) [ClassicSimilarity], result of:
            0.8124124 = score(doc=5921,freq=1.0), product of:
              0.37936723 = queryWeight, product of:
                1.1403338 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.03883749 = queryNorm
              2.1414933 = fieldWeight in 5921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.25 = fieldNorm(doc=5921)
          0.8528306 = weight(author_txt:moura in 5921) [ClassicSimilarity], result of:
            0.8528306 = score(doc=5921,freq=1.0), product of:
              0.3918477 = queryWeight, product of:
                1.1589394 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.03883749 = queryNorm
              2.1764338 = fieldWeight in 5921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.25 = fieldNorm(doc=5921)
          0.867994 = weight(author_txt:ribeiro in 5921) [ClassicSimilarity], result of:
            0.867994 = score(doc=5921,freq=1.0), product of:
              0.39647877 = queryWeight, product of:
                1.1657677 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.03883749 = queryNorm
              2.1892571 = fieldWeight in 5921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.25 = fieldNorm(doc=5921)
          1.1096832 = weight(author_txt:neto in 5921) [ClassicSimilarity], result of:
            1.1096832 = score(doc=5921,freq=1.0), product of:
              0.4670264 = queryWeight, product of:
                1.2652396 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.03883749 = queryNorm
              2.3760607 = fieldWeight in 5921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.25 = fieldNorm(doc=5921)
        0.6666667 = coord(4/6)
    
  2. Couto, T.; Cristo, M.; Gonçalves, M.A.; Calado, P.; Ziviani, N.; Moura, E.; Ribeiro-Neto, B.: ¬A comparative study of citations and links in document classification (2006) 2.43
    2.4286134 = sum of:
      2.4286134 = product of:
        3.64292 = sum of:
          0.8124124 = weight(author_txt:gonçalves in 3531) [ClassicSimilarity], result of:
            0.8124124 = score(doc=3531,freq=1.0), product of:
              0.37936723 = queryWeight, product of:
                1.1403338 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.03883749 = queryNorm
              2.1414933 = fieldWeight in 3531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.25 = fieldNorm(doc=3531)
          0.8528306 = weight(author_txt:moura in 3531) [ClassicSimilarity], result of:
            0.8528306 = score(doc=3531,freq=1.0), product of:
              0.3918477 = queryWeight, product of:
                1.1589394 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.03883749 = queryNorm
              2.1764338 = fieldWeight in 3531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.25 = fieldNorm(doc=3531)
          0.867994 = weight(author_txt:ribeiro in 3531) [ClassicSimilarity], result of:
            0.867994 = score(doc=3531,freq=1.0), product of:
              0.39647877 = queryWeight, product of:
                1.1657677 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.03883749 = queryNorm
              2.1892571 = fieldWeight in 3531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.25 = fieldNorm(doc=3531)
          1.1096832 = weight(author_txt:neto in 3531) [ClassicSimilarity], result of:
            1.1096832 = score(doc=3531,freq=1.0), product of:
              0.4670264 = queryWeight, product of:
                1.2652396 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.03883749 = queryNorm
              2.3760607 = fieldWeight in 3531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.25 = fieldNorm(doc=3531)
        0.6666667 = coord(4/6)
    
  3. Pereira, D.A.; Ribeiro-Neto, B.; Ziviani, N.; Laender, A.H.F.; Gonçalves, M.A.: ¬A generic Web-based entity resolution framework (2011) 1.40
    1.3950448 = sum of:
      1.3950448 = product of:
        2.7900896 = sum of:
          0.8124124 = weight(author_txt:gonçalves in 450) [ClassicSimilarity], result of:
            0.8124124 = score(doc=450,freq=1.0), product of:
              0.37936723 = queryWeight, product of:
                1.1403338 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.03883749 = queryNorm
              2.1414933 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.25 = fieldNorm(doc=450)
          0.867994 = weight(author_txt:ribeiro in 450) [ClassicSimilarity], result of:
            0.867994 = score(doc=450,freq=1.0), product of:
              0.39647877 = queryWeight, product of:
                1.1657677 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.03883749 = queryNorm
              2.1892571 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.25 = fieldNorm(doc=450)
          1.1096832 = weight(author_txt:neto in 450) [ClassicSimilarity], result of:
            1.1096832 = score(doc=450,freq=1.0), product of:
              0.4670264 = queryWeight, product of:
                1.2652396 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.03883749 = queryNorm
              2.3760607 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.25 = fieldNorm(doc=450)
        0.5 = coord(3/6)
    
  4. Costa Carvalho, A. da; Rossi, C.; Moura, E.S. de; Silva, A.S. da; Fernandes, D.: LePrEF: Learn to precompute evidence fusion for efficient query evaluation (2012) 1.30
    1.3003819 = sum of:
      1.3003819 = product of:
        2.6007638 = sum of:
          0.547874 = weight(author_txt:silva in 1278) [ClassicSimilarity], result of:
            0.547874 = score(doc=1278,freq=1.0), product of:
              0.29173994 = queryWeight, product of:
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.03883749 = queryNorm
              1.8779532 = fieldWeight in 1278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.25 = fieldNorm(doc=1278)
          0.8528306 = weight(author_txt:moura in 1278) [ClassicSimilarity], result of:
            0.8528306 = score(doc=1278,freq=1.0), product of:
              0.3918477 = queryWeight, product of:
                1.1589394 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.03883749 = queryNorm
              2.1764338 = fieldWeight in 1278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.25 = fieldNorm(doc=1278)
          1.200059 = weight(author_txt:fernandes in 1278) [ClassicSimilarity], result of:
            1.200059 = score(doc=1278,freq=1.0), product of:
              0.49205145 = queryWeight, product of:
                1.2986954 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.03883749 = queryNorm
              2.4388893 = fieldWeight in 1278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.25 = fieldNorm(doc=1278)
        0.5 = coord(3/6)
    
  5. Silveira, M.; Ribeiro-Neto, B.: Concept-based ranking : a case study in the juridical domain (2004) 1.15
    1.153645 = sum of:
      1.153645 = product of:
        3.460935 = sum of:
          1.5189896 = weight(author_txt:ribeiro in 3339) [ClassicSimilarity], result of:
            1.5189896 = score(doc=3339,freq=1.0), product of:
              0.39647877 = queryWeight, product of:
                1.1657677 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.03883749 = queryNorm
              3.8312001 = fieldWeight in 3339, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.4375 = fieldNorm(doc=3339)
          1.9419454 = weight(author_txt:neto in 3339) [ClassicSimilarity], result of:
            1.9419454 = score(doc=3339,freq=1.0), product of:
              0.4670264 = queryWeight, product of:
                1.2652396 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.03883749 = queryNorm
              4.1581063 = fieldWeight in 3339, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.4375 = fieldNorm(doc=3339)
        0.33333334 = coord(2/6)
    

Similar documents (content)

  1. Fersini, E.; Messina, E.; Archetti, F.: Enhancing web page classification through image-block importance analysis (2008) 0.41
    0.40680405 = sum of:
      0.40680405 = product of:
        1.1300112 = sum of:
          0.047786698 = weight(abstract_txt:modified in 3102) [ClassicSimilarity], result of:
            0.047786698 = score(doc=3102,freq=1.0), product of:
              0.08920004 = queryWeight, product of:
                6.8572807 = idf(docFreq=126, maxDocs=44421)
                0.013008078 = queryNorm
              0.53572506 = fieldWeight in 3102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8572807 = idf(docFreq=126, maxDocs=44421)
                0.078125 = fieldNorm(doc=3102)
          0.05027036 = weight(abstract_txt:weighting in 3102) [ClassicSimilarity], result of:
            0.05027036 = score(doc=3102,freq=1.0), product of:
              0.092264585 = queryWeight, product of:
                1.0170329 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.013008078 = queryNorm
              0.54485 = fieldWeight in 3102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.078125 = fieldNorm(doc=3102)
          0.0083165895 = weight(abstract_txt:that in 3102) [ClassicSimilarity], result of:
            0.0083165895 = score(doc=3102,freq=2.0), product of:
              0.03182885 = queryWeight, product of:
                1.0346384 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.013008078 = queryNorm
              0.2612909 = fieldWeight in 3102, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=3102)
          0.06750416 = weight(abstract_txt:inverse in 3102) [ClassicSimilarity], result of:
            0.06750416 = score(doc=3102,freq=1.0), product of:
              0.112300254 = queryWeight, product of:
                1.1220387 = boost
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.013008078 = queryNorm
              0.60110426 = fieldWeight in 3102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.078125 = fieldNorm(doc=3102)
          0.065343 = weight(abstract_txt:term in 3102) [ClassicSimilarity], result of:
            0.065343 = score(doc=3102,freq=4.0), product of:
              0.08722007 = queryWeight, product of:
                1.3984299 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.013008078 = queryNorm
              0.7491739 = fieldWeight in 3102, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.078125 = fieldNorm(doc=3102)
          0.01836631 = weight(abstract_txt:using in 3102) [ClassicSimilarity], result of:
            0.01836631 = score(doc=3102,freq=1.0), product of:
              0.06800624 = queryWeight, product of:
                1.5123506 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.013008078 = queryNorm
              0.27006802 = fieldWeight in 3102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.078125 = fieldNorm(doc=3102)
          0.11998277 = weight(abstract_txt:weight in 3102) [ClassicSimilarity], result of:
            0.11998277 = score(doc=3102,freq=1.0), product of:
              0.20761065 = queryWeight, product of:
                2.1575322 = boost
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.013008078 = queryNorm
              0.57792205 = fieldWeight in 3102, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.078125 = fieldNorm(doc=3102)
          0.23384126 = weight(abstract_txt:blocks in 3102) [ClassicSimilarity], result of:
            0.23384126 = score(doc=3102,freq=3.0), product of:
              0.22460051 = queryWeight, product of:
                2.2440774 = boost
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.013008078 = queryNorm
              1.0411431 = fieldWeight in 3102, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.078125 = fieldNorm(doc=3102)
          0.51860005 = weight(abstract_txt:block in 3102) [ClassicSimilarity], result of:
            0.51860005 = score(doc=3102,freq=3.0), product of:
              0.4812399 = queryWeight, product of:
                4.6454554 = boost
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.013008078 = queryNorm
              1.077633 = fieldWeight in 3102, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.078125 = fieldNorm(doc=3102)
        0.36 = coord(9/25)
    
  2. Wan, X.; Yang, J.; Xiao, J.: Towards a unified approach to document similarity search using manifold-ranking of blocks (2008) 0.36
    0.36033502 = sum of:
      0.36033502 = product of:
        1.126047 = sum of:
          0.004704573 = weight(abstract_txt:that in 3081) [ClassicSimilarity], result of:
            0.004704573 = score(doc=3081,freq=1.0), product of:
              0.03182885 = queryWeight, product of:
                1.0346384 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.013008078 = queryNorm
              0.14780845 = fieldWeight in 3081, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=3081)
          0.05325472 = weight(abstract_txt:compute in 3081) [ClassicSimilarity], result of:
            0.05325472 = score(doc=3081,freq=1.0), product of:
              0.11126002 = queryWeight, product of:
                1.1168299 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.013008078 = queryNorm
              0.47865102 = fieldWeight in 3081, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0625 = fieldNorm(doc=3081)
          0.028889187 = weight(abstract_txt:improve in 3081) [ClassicSimilarity], result of:
            0.028889187 = score(doc=3081,freq=1.0), product of:
              0.09323964 = queryWeight, product of:
                1.4458817 = boost
                4.9574084 = idf(docFreq=848, maxDocs=44421)
                0.013008078 = queryNorm
              0.30983803 = fieldWeight in 3081, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9574084 = idf(docFreq=848, maxDocs=44421)
                0.0625 = fieldNorm(doc=3081)
          0.014693049 = weight(abstract_txt:using in 3081) [ClassicSimilarity], result of:
            0.014693049 = score(doc=3081,freq=1.0), product of:
              0.06800624 = queryWeight, product of:
                1.5123506 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.013008078 = queryNorm
              0.21605442 = fieldWeight in 3081, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=3081)
          0.24151021 = weight(abstract_txt:blocks in 3081) [ClassicSimilarity], result of:
            0.24151021 = score(doc=3081,freq=5.0), product of:
              0.22460051 = queryWeight, product of:
                2.2440774 = boost
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.013008078 = queryNorm
              1.0752879 = fieldWeight in 3081, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.0625 = fieldNorm(doc=3081)
          0.06265256 = weight(abstract_txt:pages in 3081) [ClassicSimilarity], result of:
            0.06265256 = score(doc=3081,freq=1.0), product of:
              0.1788271 = queryWeight, product of:
                2.4524195 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.013008078 = queryNorm
              0.3503527 = fieldWeight in 3081, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.0625 = fieldNorm(doc=3081)
          0.41488 = weight(abstract_txt:block in 3081) [ClassicSimilarity], result of:
            0.41488 = score(doc=3081,freq=3.0), product of:
              0.4812399 = queryWeight, product of:
                4.6454554 = boost
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.013008078 = queryNorm
              0.8621064 = fieldWeight in 3081, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.0625 = fieldNorm(doc=3081)
          0.30546272 = weight(abstract_txt:ranking in 3081) [ClassicSimilarity], result of:
            0.30546272 = score(doc=3081,freq=6.0), product of:
              0.35651067 = queryWeight, product of:
                4.8969917 = boost
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.013008078 = queryNorm
              0.8568123 = fieldWeight in 3081, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.0625 = fieldNorm(doc=3081)
        0.32 = coord(8/25)
    
  3. Dang, E.K.F.; Luk, R.W.P.; Allan, J.; Ho, K.S.; Chung, K.F.L.; Lee, D.L.: ¬A new context-dependent term weight computed by boost and discount using relevance information (2010) 0.24
    0.2375758 = sum of:
      0.2375758 = product of:
        0.74242437 = sum of:
          0.040216286 = weight(abstract_txt:weighting in 120) [ClassicSimilarity], result of:
            0.040216286 = score(doc=120,freq=1.0), product of:
              0.092264585 = queryWeight, product of:
                1.0170329 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.013008078 = queryNorm
              0.43587998 = fieldWeight in 120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.0625 = fieldNorm(doc=120)
          0.05325472 = weight(abstract_txt:compute in 120) [ClassicSimilarity], result of:
            0.05325472 = score(doc=120,freq=1.0), product of:
              0.11126002 = queryWeight, product of:
                1.1168299 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.013008078 = queryNorm
              0.47865102 = fieldWeight in 120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0625 = fieldNorm(doc=120)
          0.054003328 = weight(abstract_txt:inverse in 120) [ClassicSimilarity], result of:
            0.054003328 = score(doc=120,freq=1.0), product of:
              0.112300254 = queryWeight, product of:
                1.1220387 = boost
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.013008078 = queryNorm
              0.4808834 = fieldWeight in 120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.0625 = fieldNorm(doc=120)
          0.06915253 = weight(abstract_txt:term in 120) [ClassicSimilarity], result of:
            0.06915253 = score(doc=120,freq=7.0), product of:
              0.08722007 = queryWeight, product of:
                1.3984299 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.013008078 = queryNorm
              0.7928511 = fieldWeight in 120, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0625 = fieldNorm(doc=120)
          0.025449108 = weight(abstract_txt:using in 120) [ClassicSimilarity], result of:
            0.025449108 = score(doc=120,freq=3.0), product of:
              0.06800624 = queryWeight, product of:
                1.5123506 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.013008078 = queryNorm
              0.37421724 = fieldWeight in 120, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=120)
          0.07059844 = weight(abstract_txt:frequency in 120) [ClassicSimilarity], result of:
            0.07059844 = score(doc=120,freq=2.0), product of:
              0.1342653 = queryWeight, product of:
                1.7350595 = boost
                5.948895 = idf(docFreq=314, maxDocs=44421)
                0.013008078 = queryNorm
              0.525813 = fieldWeight in 120, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.948895 = idf(docFreq=314, maxDocs=44421)
                0.0625 = fieldNorm(doc=120)
          0.063858934 = weight(abstract_txt:collections in 120) [ClassicSimilarity], result of:
            0.063858934 = score(doc=120,freq=3.0), product of:
              0.12557837 = queryWeight, product of:
                2.0551121 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.013008078 = queryNorm
              0.5085186 = fieldWeight in 120, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.0625 = fieldNorm(doc=120)
          0.365891 = weight(abstract_txt:bm25 in 120) [ClassicSimilarity], result of:
            0.365891 = score(doc=120,freq=2.0), product of:
              0.46029046 = queryWeight, product of:
                3.9345412 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.013008078 = queryNorm
              0.7949133 = fieldWeight in 120, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0625 = fieldNorm(doc=120)
        0.32 = coord(8/25)
    
  4. Trotman, A.: Choosing document structure weights (2005) 0.23
    0.23304036 = sum of:
      0.23304036 = product of:
        0.9710015 = sum of:
          0.07109302 = weight(abstract_txt:weighting in 2016) [ClassicSimilarity], result of:
            0.07109302 = score(doc=2016,freq=2.0), product of:
              0.092264585 = queryWeight, product of:
                1.0170329 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.013008078 = queryNorm
              0.7705342 = fieldWeight in 2016, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.078125 = fieldNorm(doc=2016)
          0.06066047 = weight(abstract_txt:occurrences in 2016) [ClassicSimilarity], result of:
            0.06066047 = score(doc=2016,freq=1.0), product of:
              0.10457572 = queryWeight, product of:
                1.0827618 = boost
                7.4248013 = idf(docFreq=71, maxDocs=44421)
                0.013008078 = queryNorm
              0.5800626 = fieldWeight in 2016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4248013 = idf(docFreq=71, maxDocs=44421)
                0.078125 = fieldNorm(doc=2016)
          0.0326715 = weight(abstract_txt:term in 2016) [ClassicSimilarity], result of:
            0.0326715 = score(doc=2016,freq=1.0), product of:
              0.08722007 = queryWeight, product of:
                1.3984299 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.013008078 = queryNorm
              0.37458694 = fieldWeight in 2016, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.078125 = fieldNorm(doc=2016)
          0.025973886 = weight(abstract_txt:using in 2016) [ClassicSimilarity], result of:
            0.025973886 = score(doc=2016,freq=2.0), product of:
              0.06800624 = queryWeight, product of:
                1.5123506 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.013008078 = queryNorm
              0.38193387 = fieldWeight in 2016, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.078125 = fieldNorm(doc=2016)
          0.5601539 = weight(abstract_txt:bm25 in 2016) [ClassicSimilarity], result of:
            0.5601539 = score(doc=2016,freq=3.0), product of:
              0.46029046 = queryWeight, product of:
                3.9345412 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.013008078 = queryNorm
              1.2169574 = fieldWeight in 2016, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.078125 = fieldNorm(doc=2016)
          0.22044872 = weight(abstract_txt:ranking in 2016) [ClassicSimilarity], result of:
            0.22044872 = score(doc=2016,freq=2.0), product of:
              0.35651067 = queryWeight, product of:
                4.8969917 = boost
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.013008078 = queryNorm
              0.618351 = fieldWeight in 2016, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5966744 = idf(docFreq=447, maxDocs=44421)
                0.078125 = fieldNorm(doc=2016)
        0.24 = coord(6/25)
    
  5. Alzahrani, S.; Palade, V.; Salim, N.; Abraham, A.: Using structural information and citation evidence to detect significant plagiarism cases in scientific publications (2012) 0.22
    0.21613194 = sum of:
      0.21613194 = product of:
        0.6003665 = sum of:
          0.049765114 = weight(abstract_txt:weighting in 982) [ClassicSimilarity], result of:
            0.049765114 = score(doc=982,freq=2.0), product of:
              0.092264585 = queryWeight, product of:
                1.0170329 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.013008078 = queryNorm
              0.53937393 = fieldWeight in 982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.0058216117 = weight(abstract_txt:that in 982) [ClassicSimilarity], result of:
            0.0058216117 = score(doc=982,freq=2.0), product of:
              0.03182885 = queryWeight, product of:
                1.0346384 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.013008078 = queryNorm
              0.18290362 = fieldWeight in 982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.047252912 = weight(abstract_txt:inverse in 982) [ClassicSimilarity], result of:
            0.047252912 = score(doc=982,freq=1.0), product of:
              0.112300254 = queryWeight, product of:
                1.1220387 = boost
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.013008078 = queryNorm
              0.42077297 = fieldWeight in 982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.02287005 = weight(abstract_txt:term in 982) [ClassicSimilarity], result of:
            0.02287005 = score(doc=982,freq=1.0), product of:
              0.08722007 = queryWeight, product of:
                1.3984299 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.013008078 = queryNorm
              0.26221088 = fieldWeight in 982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.035748545 = weight(abstract_txt:improve in 982) [ClassicSimilarity], result of:
            0.035748545 = score(doc=982,freq=2.0), product of:
              0.09323964 = queryWeight, product of:
                1.4458817 = boost
                4.9574084 = idf(docFreq=848, maxDocs=44421)
                0.013008078 = queryNorm
              0.383405 = fieldWeight in 982, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9574084 = idf(docFreq=848, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.028747825 = weight(abstract_txt:using in 982) [ClassicSimilarity], result of:
            0.028747825 = score(doc=982,freq=5.0), product of:
              0.06800624 = queryWeight, product of:
                1.5123506 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.013008078 = queryNorm
              0.42272335 = fieldWeight in 982, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.043680556 = weight(abstract_txt:frequency in 982) [ClassicSimilarity], result of:
            0.043680556 = score(doc=982,freq=1.0), product of:
              0.1342653 = queryWeight, product of:
                1.7350595 = boost
                5.948895 = idf(docFreq=314, maxDocs=44421)
                0.013008078 = queryNorm
              0.3253302 = fieldWeight in 982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.948895 = idf(docFreq=314, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.16797587 = weight(abstract_txt:weight in 982) [ClassicSimilarity], result of:
            0.16797587 = score(doc=982,freq=4.0), product of:
              0.20761065 = queryWeight, product of:
                2.1575322 = boost
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.013008078 = queryNorm
              0.80909085 = fieldWeight in 982, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
          0.19850399 = weight(abstract_txt:baselines in 982) [ClassicSimilarity], result of:
            0.19850399 = score(doc=982,freq=3.0), product of:
              0.25541365 = queryWeight, product of:
                2.3930652 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.013008078 = queryNorm
              0.7771863 = fieldWeight in 982, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.0546875 = fieldNorm(doc=982)
        0.36 = coord(9/25)