Document (#34532)

Author
Couto, T.
Cristo, M.
Gonçalves, M.A.
Calado, P.
Ziviani, N.
Moura, E.
Ribeiro-Neto, B.
Title
¬A comparative study of citations and links in document classification
Source
International Conference on Digital Libraries: Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries, Chapel Hill, NC, USA
Imprint
New York : ACM
Year
2006
Pages
S.75-84
Abstract
It is well known that links are an important source of information when dealing with Web collections. However, the question remains on whether the same techniques that are used on the Web can be applied to collections of documents containing citations between scientific papers. In this work we present a comparative study of digital library citations and Web links, in the context of automatic text classification. We show that there are in fact differences between citations and links in this context. For the comparison, we run a series of experiments using a digital library of computer science papers and a Web directory. In our reference collections, measures based on co-citation tend to perform better for pages in the Web directory, with gains up to 37% over text based classifiers, while measures based on bibliographic coupling perform better in a digital library. We also propose a simple and effective way of combining a traditional text based classifier with a citation-link based classifier. This combination is based on the notion of classifier reliability and presented gains of up to 14% in micro-averaged F1 in the Web collection. However, no significant gain was obtained in the digital library. Finally, a user study was performed to further investigate the causes for these results. We discovered that misclassifications by the citation-link based classifiers are in fact difficult cases, hard to classify even for humans.
Theme
Informetrie

Similar documents (author)

  1. Calado, P.; Cristo, M.; Gonçalves, M.A.; Moura, E.S. de; Ribeiro-Neto, B.; Ziviani, N.: Link-based similarity measures for the classification of Web documents (2006) 5.62
    5.6180835 = sum of:
      5.6180835 = sum of:
        0.75528246 = weight(author_txt:gonçalves in 5921) [ClassicSimilarity], result of:
          0.75528246 = score(doc=5921,freq=1.0), product of:
            0.35268962 = queryWeight, product of:
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.041173328 = queryNorm
            2.1414933 = fieldWeight in 5921, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.565973 = idf(docFreq=22, maxDocs=44421)
              0.25 = fieldNorm(doc=5921)
        0.79285836 = weight(author_txt:moura in 5921) [ClassicSimilarity], result of:
          0.79285836 = score(doc=5921,freq=1.0), product of:
            0.36429244 = queryWeight, product of:
              1.0163159 = boost
              8.705735 = idf(docFreq=19, maxDocs=44421)
              0.041173328 = queryNorm
            2.1764338 = fieldWeight in 5921, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.705735 = idf(docFreq=19, maxDocs=44421)
              0.25 = fieldNorm(doc=5921)
        0.80695546 = weight(author_txt:ribeiro in 5921) [ClassicSimilarity], result of:
          0.80695546 = score(doc=5921,freq=1.0), product of:
            0.36859784 = queryWeight, product of:
              1.0223039 = boost
              8.757029 = idf(docFreq=18, maxDocs=44421)
              0.041173328 = queryNorm
            2.1892571 = fieldWeight in 5921, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.757029 = idf(docFreq=18, maxDocs=44421)
              0.25 = fieldNorm(doc=5921)
        1.0316488 = weight(author_txt:neto in 5921) [ClassicSimilarity], result of:
          1.0316488 = score(doc=5921,freq=1.0), product of:
            0.4341845 = queryWeight, product of:
              1.1095345 = boost
              9.504243 = idf(docFreq=8, maxDocs=44421)
              0.041173328 = queryNorm
            2.3760607 = fieldWeight in 5921, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.504243 = idf(docFreq=8, maxDocs=44421)
              0.25 = fieldNorm(doc=5921)
        1.1156694 = weight(author_txt:cristo in 5921) [ClassicSimilarity], result of:
          1.1156694 = score(doc=5921,freq=1.0), product of:
            0.4574498 = queryWeight, product of:
              1.1388732 = boost
              9.755557 = idf(docFreq=6, maxDocs=44421)
              0.041173328 = queryNorm
            2.4388893 = fieldWeight in 5921, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.755557 = idf(docFreq=6, maxDocs=44421)
              0.25 = fieldNorm(doc=5921)
        1.1156694 = weight(author_txt:ziviani in 5921) [ClassicSimilarity], result of:
          1.1156694 = score(doc=5921,freq=1.0), product of:
            0.4574498 = queryWeight, product of:
              1.1388732 = boost
              9.755557 = idf(docFreq=6, maxDocs=44421)
              0.041173328 = queryNorm
            2.4388893 = fieldWeight in 5921, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.755557 = idf(docFreq=6, maxDocs=44421)
              0.25 = fieldNorm(doc=5921)
    
  2. Pereira, D.A.; Ribeiro-Neto, B.; Ziviani, N.; Laender, A.H.F.; Gonçalves, M.A.: ¬A generic Web-based entity resolution framework (2011) 2.47
    2.4730375 = sum of:
      2.4730375 = product of:
        3.709556 = sum of:
          0.75528246 = weight(author_txt:gonçalves in 450) [ClassicSimilarity], result of:
            0.75528246 = score(doc=450,freq=1.0), product of:
              0.35268962 = queryWeight, product of:
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.041173328 = queryNorm
              2.1414933 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.25 = fieldNorm(doc=450)
          0.80695546 = weight(author_txt:ribeiro in 450) [ClassicSimilarity], result of:
            0.80695546 = score(doc=450,freq=1.0), product of:
              0.36859784 = queryWeight, product of:
                1.0223039 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.041173328 = queryNorm
              2.1892571 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.25 = fieldNorm(doc=450)
          1.0316488 = weight(author_txt:neto in 450) [ClassicSimilarity], result of:
            1.0316488 = score(doc=450,freq=1.0), product of:
              0.4341845 = queryWeight, product of:
                1.1095345 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.041173328 = queryNorm
              2.3760607 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.25 = fieldNorm(doc=450)
          1.1156694 = weight(author_txt:ziviani in 450) [ClassicSimilarity], result of:
            1.1156694 = score(doc=450,freq=1.0), product of:
              0.4574498 = queryWeight, product of:
                1.1388732 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.041173328 = queryNorm
              2.4388893 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.25 = fieldNorm(doc=450)
        0.6666667 = coord(4/6)
    
  3. Moura, E.S. de; Fernandes, D.; Ribeiro-Neto, B.; Silva, A.S. da; Gonçalves, M.A.: Using structural information to improve search in Web collections (2010) 2.26
    2.2578301 = sum of:
      2.2578301 = product of:
        3.386745 = sum of:
          0.75528246 = weight(author_txt:gonçalves in 119) [ClassicSimilarity], result of:
            0.75528246 = score(doc=119,freq=1.0), product of:
              0.35268962 = queryWeight, product of:
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.041173328 = queryNorm
              2.1414933 = fieldWeight in 119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.25 = fieldNorm(doc=119)
          0.79285836 = weight(author_txt:moura in 119) [ClassicSimilarity], result of:
            0.79285836 = score(doc=119,freq=1.0), product of:
              0.36429244 = queryWeight, product of:
                1.0163159 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.041173328 = queryNorm
              2.1764338 = fieldWeight in 119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.25 = fieldNorm(doc=119)
          0.80695546 = weight(author_txt:ribeiro in 119) [ClassicSimilarity], result of:
            0.80695546 = score(doc=119,freq=1.0), product of:
              0.36859784 = queryWeight, product of:
                1.0223039 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.041173328 = queryNorm
              2.1892571 = fieldWeight in 119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.25 = fieldNorm(doc=119)
          1.0316488 = weight(author_txt:neto in 119) [ClassicSimilarity], result of:
            1.0316488 = score(doc=119,freq=1.0), product of:
              0.4341845 = queryWeight, product of:
                1.1095345 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.041173328 = queryNorm
              2.3760607 = fieldWeight in 119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.25 = fieldNorm(doc=119)
        0.6666667 = coord(4/6)
    
  4. Silva, A.J.C.; Gonçalves, M.A.; Laender, A.H.F.; Modesto, M.A.B.; Cristo, M.; Ziviani, N.: Finding what is missing from a digital library : a case study in the computer science field (2009) 1.49
    1.4933107 = sum of:
      1.4933107 = product of:
        2.9866214 = sum of:
          0.75528246 = weight(author_txt:gonçalves in 219) [ClassicSimilarity], result of:
            0.75528246 = score(doc=219,freq=1.0), product of:
              0.35268962 = queryWeight, product of:
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.041173328 = queryNorm
              2.1414933 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.25 = fieldNorm(doc=219)
          1.1156694 = weight(author_txt:cristo in 219) [ClassicSimilarity], result of:
            1.1156694 = score(doc=219,freq=1.0), product of:
              0.4574498 = queryWeight, product of:
                1.1388732 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.041173328 = queryNorm
              2.4388893 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.25 = fieldNorm(doc=219)
          1.1156694 = weight(author_txt:ziviani in 219) [ClassicSimilarity], result of:
            1.1156694 = score(doc=219,freq=1.0), product of:
              0.4574498 = queryWeight, product of:
                1.1388732 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.041173328 = queryNorm
              2.4388893 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.25 = fieldNorm(doc=219)
        0.5 = coord(3/6)
    
  5. Silveira, M.; Ribeiro-Neto, B.: Concept-based ranking : a case study in the juridical domain (2004) 1.07
    1.0725192 = sum of:
      1.0725192 = product of:
        3.2175574 = sum of:
          1.4121721 = weight(author_txt:ribeiro in 3339) [ClassicSimilarity], result of:
            1.4121721 = score(doc=3339,freq=1.0), product of:
              0.36859784 = queryWeight, product of:
                1.0223039 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.041173328 = queryNorm
              3.8312001 = fieldWeight in 3339, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.4375 = fieldNorm(doc=3339)
          1.8053852 = weight(author_txt:neto in 3339) [ClassicSimilarity], result of:
            1.8053852 = score(doc=3339,freq=1.0), product of:
              0.4341845 = queryWeight, product of:
                1.1095345 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.041173328 = queryNorm
              4.1581063 = fieldWeight in 3339, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.4375 = fieldNorm(doc=3339)
        0.33333334 = coord(2/6)
    

Similar documents (content)

  1. Calado, P.; Cristo, M.; Gonçalves, M.A.; Moura, E.S. de; Ribeiro-Neto, B.; Ziviani, N.: Link-based similarity measures for the classification of Web documents (2006) 0.48
    0.4840267 = sum of:
      0.4840267 = product of:
        1.008389 = sum of:
          0.040714048 = weight(abstract_txt:classification in 5921) [ClassicSimilarity], result of:
            0.040714048 = score(doc=5921,freq=4.0), product of:
              0.08158802 = queryWeight, product of:
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.020437066 = queryNorm
              0.49901992 = fieldWeight in 5921, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=5921)
          0.023767726 = weight(abstract_txt:however in 5921) [ClassicSimilarity], result of:
            0.023767726 = score(doc=5921,freq=1.0), product of:
              0.0904639 = queryWeight, product of:
                1.0529904 = boost
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.020437066 = queryNorm
              0.2627316 = fieldWeight in 5921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.0625 = fieldNorm(doc=5921)
          0.01466017 = weight(abstract_txt:that in 5921) [ClassicSimilarity], result of:
            0.01466017 = score(doc=5921,freq=3.0), product of:
              0.05726367 = queryWeight, product of:
                1.18479 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.020437066 = queryNorm
              0.25601172 = fieldWeight in 5921, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=5921)
          0.07223749 = weight(abstract_txt:measures in 5921) [ClassicSimilarity], result of:
            0.07223749 = score(doc=5921,freq=2.0), product of:
              0.15065446 = queryWeight, product of:
                1.3588697 = boost
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.020437066 = queryNorm
              0.47949123 = fieldWeight in 5921, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.0625 = fieldNorm(doc=5921)
          0.10303635 = weight(abstract_txt:link in 5921) [ClassicSimilarity], result of:
            0.10303635 = score(doc=5921,freq=3.0), product of:
              0.16676444 = queryWeight, product of:
                1.4296789 = boost
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.020437066 = queryNorm
              0.61785567 = fieldWeight in 5921, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.0625 = fieldNorm(doc=5921)
          0.054849304 = weight(abstract_txt:text in 5921) [ClassicSimilarity], result of:
            0.054849304 = score(doc=5921,freq=3.0), product of:
              0.12538752 = queryWeight, product of:
                1.518307 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.020437066 = queryNorm
              0.4374383 = fieldWeight in 5921, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=5921)
          0.07353925 = weight(abstract_txt:perform in 5921) [ClassicSimilarity], result of:
            0.07353925 = score(doc=5921,freq=1.0), product of:
              0.19208628 = queryWeight, product of:
                1.5343872 = boost
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.020437066 = queryNorm
              0.3828449 = fieldWeight in 5921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.0625 = fieldNorm(doc=5921)
          0.10177499 = weight(abstract_txt:directory in 5921) [ClassicSimilarity], result of:
            0.10177499 = score(doc=5921,freq=1.0), product of:
              0.238549 = queryWeight, product of:
                1.7099192 = boost
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.020437066 = queryNorm
              0.42664188 = fieldWeight in 5921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.0625 = fieldNorm(doc=5921)
          0.13644901 = weight(abstract_txt:classifiers in 5921) [ClassicSimilarity], result of:
            0.13644901 = score(doc=5921,freq=1.0), product of:
              0.29004395 = queryWeight, product of:
                1.885466 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.020437066 = queryNorm
              0.47044253 = fieldWeight in 5921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=5921)
          0.15361027 = weight(abstract_txt:gains in 5921) [ClassicSimilarity], result of:
            0.15361027 = score(doc=5921,freq=1.0), product of:
              0.31388006 = queryWeight, product of:
                1.9614112 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.020437066 = queryNorm
              0.48939165 = fieldWeight in 5921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0625 = fieldNorm(doc=5921)
          0.18267469 = weight(abstract_txt:classifier in 5921) [ClassicSimilarity], result of:
            0.18267469 = score(doc=5921,freq=1.0), product of:
              0.40330434 = queryWeight, product of:
                2.7230077 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.020437066 = queryNorm
              0.45294502 = fieldWeight in 5921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=5921)
          0.051075704 = weight(abstract_txt:based in 5921) [ClassicSimilarity], result of:
            0.051075704 = score(doc=5921,freq=2.0), product of:
              0.18154006 = queryWeight, product of:
                2.7906609 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.020437066 = queryNorm
              0.28134674 = fieldWeight in 5921, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=5921)
        0.48 = coord(12/25)
    
  2. Sun, A.; Lim, E.-P.; Ng, W.-K.: Performance measurement framework for hierarchical text classification (2003) 0.32
    0.31699336 = sum of:
      0.31699336 = product of:
        0.88053703 = sum of:
          0.049864326 = weight(abstract_txt:classification in 2808) [ClassicSimilarity], result of:
            0.049864326 = score(doc=2808,freq=6.0), product of:
              0.08158802 = queryWeight, product of:
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.020437066 = queryNorm
              0.61117214 = fieldWeight in 2808, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.016928108 = weight(abstract_txt:that in 2808) [ClassicSimilarity], result of:
            0.016928108 = score(doc=2808,freq=4.0), product of:
              0.05726367 = queryWeight, product of:
                1.18479 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.020437066 = queryNorm
              0.2956169 = fieldWeight in 2808, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.034534436 = weight(abstract_txt:better in 2808) [ClassicSimilarity], result of:
            0.034534436 = score(doc=2808,freq=1.0), product of:
              0.11605177 = queryWeight, product of:
                1.1926491 = boost
                4.7612453 = idf(docFreq=1032, maxDocs=44421)
                0.020437066 = queryNorm
              0.29757783 = fieldWeight in 2808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7612453 = idf(docFreq=1032, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.13514398 = weight(abstract_txt:measures in 2808) [ClassicSimilarity], result of:
            0.13514398 = score(doc=2808,freq=7.0), product of:
              0.15065446 = queryWeight, product of:
                1.3588697 = boost
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.020437066 = queryNorm
              0.89704597 = fieldWeight in 2808, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.031667262 = weight(abstract_txt:text in 2808) [ClassicSimilarity], result of:
            0.031667262 = score(doc=2808,freq=1.0), product of:
              0.12538752 = queryWeight, product of:
                1.518307 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.020437066 = queryNorm
              0.25255513 = fieldWeight in 2808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.07353925 = weight(abstract_txt:perform in 2808) [ClassicSimilarity], result of:
            0.07353925 = score(doc=2808,freq=1.0), product of:
              0.19208628 = queryWeight, product of:
                1.5343872 = boost
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.020437066 = queryNorm
              0.3828449 = fieldWeight in 2808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.30510926 = weight(abstract_txt:classifiers in 2808) [ClassicSimilarity], result of:
            0.30510926 = score(doc=2808,freq=5.0), product of:
              0.29004395 = queryWeight, product of:
                1.885466 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.020437066 = queryNorm
              1.0519415 = fieldWeight in 2808, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.18267469 = weight(abstract_txt:classifier in 2808) [ClassicSimilarity], result of:
            0.18267469 = score(doc=2808,freq=1.0), product of:
              0.40330434 = queryWeight, product of:
                2.7230077 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.020437066 = queryNorm
              0.45294502 = fieldWeight in 2808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.051075704 = weight(abstract_txt:based in 2808) [ClassicSimilarity], result of:
            0.051075704 = score(doc=2808,freq=2.0), product of:
              0.18154006 = queryWeight, product of:
                2.7906609 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.020437066 = queryNorm
              0.28134674 = fieldWeight in 2808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
        0.36 = coord(9/25)
    
  3. Liu, R.-L.: ¬A passage extractor for classification of disease aspect information (2013) 0.23
    0.22894754 = sum of:
      0.22894754 = product of:
        0.81766975 = sum of:
          0.0352594 = weight(abstract_txt:classification in 2107) [ClassicSimilarity], result of:
            0.0352594 = score(doc=2107,freq=3.0), product of:
              0.08158802 = queryWeight, product of:
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.020437066 = queryNorm
              0.43216392 = fieldWeight in 2107, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=2107)
          0.01466017 = weight(abstract_txt:that in 2107) [ClassicSimilarity], result of:
            0.01466017 = score(doc=2107,freq=3.0), product of:
              0.05726367 = queryWeight, product of:
                1.18479 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.020437066 = queryNorm
              0.25601172 = fieldWeight in 2107, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2107)
          0.034534436 = weight(abstract_txt:better in 2107) [ClassicSimilarity], result of:
            0.034534436 = score(doc=2107,freq=1.0), product of:
              0.11605177 = queryWeight, product of:
                1.1926491 = boost
                4.7612453 = idf(docFreq=1032, maxDocs=44421)
                0.020437066 = queryNorm
              0.29757783 = fieldWeight in 2107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7612453 = idf(docFreq=1032, maxDocs=44421)
                0.0625 = fieldNorm(doc=2107)
          0.08956854 = weight(abstract_txt:text in 2107) [ClassicSimilarity], result of:
            0.08956854 = score(doc=2107,freq=8.0), product of:
              0.12538752 = queryWeight, product of:
                1.518307 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.020437066 = queryNorm
              0.7143338 = fieldWeight in 2107, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=2107)
          0.33423048 = weight(abstract_txt:classifiers in 2107) [ClassicSimilarity], result of:
            0.33423048 = score(doc=2107,freq=6.0), product of:
              0.29004395 = queryWeight, product of:
                1.885466 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.020437066 = queryNorm
              1.1523442 = fieldWeight in 2107, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=2107)
          0.258341 = weight(abstract_txt:classifier in 2107) [ClassicSimilarity], result of:
            0.258341 = score(doc=2107,freq=2.0), product of:
              0.40330434 = queryWeight, product of:
                2.7230077 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.020437066 = queryNorm
              0.640561 = fieldWeight in 2107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=2107)
          0.051075704 = weight(abstract_txt:based in 2107) [ClassicSimilarity], result of:
            0.051075704 = score(doc=2107,freq=2.0), product of:
              0.18154006 = queryWeight, product of:
                2.7906609 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.020437066 = queryNorm
              0.28134674 = fieldWeight in 2107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=2107)
        0.28 = coord(7/25)
    
  4. Mengle, S.S.R.; Goharian, N.: Ambiguity measure feature-selection algorithm (2009) 0.23
    0.22552286 = sum of:
      0.22552286 = product of:
        0.70475894 = sum of:
          0.020357024 = weight(abstract_txt:classification in 3804) [ClassicSimilarity], result of:
            0.020357024 = score(doc=3804,freq=1.0), product of:
              0.08158802 = queryWeight, product of:
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.020437066 = queryNorm
              0.24950996 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.023767726 = weight(abstract_txt:however in 3804) [ClassicSimilarity], result of:
            0.023767726 = score(doc=3804,freq=1.0), product of:
              0.0904639 = queryWeight, product of:
                1.0529904 = boost
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.020437066 = queryNorm
              0.2627316 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.01466017 = weight(abstract_txt:that in 3804) [ClassicSimilarity], result of:
            0.01466017 = score(doc=3804,freq=3.0), product of:
              0.05726367 = queryWeight, product of:
                1.18479 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.020437066 = queryNorm
              0.25601172 = fieldWeight in 3804, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.034534436 = weight(abstract_txt:better in 3804) [ClassicSimilarity], result of:
            0.034534436 = score(doc=3804,freq=1.0), product of:
              0.11605177 = queryWeight, product of:
                1.1926491 = boost
                4.7612453 = idf(docFreq=1032, maxDocs=44421)
                0.020437066 = queryNorm
              0.29757783 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7612453 = idf(docFreq=1032, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.077568635 = weight(abstract_txt:text in 3804) [ClassicSimilarity], result of:
            0.077568635 = score(doc=3804,freq=6.0), product of:
              0.12538752 = queryWeight, product of:
                1.518307 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.020437066 = queryNorm
              0.61863124 = fieldWeight in 3804, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.07353925 = weight(abstract_txt:perform in 3804) [ClassicSimilarity], result of:
            0.07353925 = score(doc=3804,freq=1.0), product of:
              0.19208628 = queryWeight, product of:
                1.5343872 = boost
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.020437066 = queryNorm
              0.3828449 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.05185869 = weight(abstract_txt:digital in 3804) [ClassicSimilarity], result of:
            0.05185869 = score(doc=3804,freq=1.0), product of:
              0.19173788 = queryWeight, product of:
                2.1679823 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.020437066 = queryNorm
              0.2704666 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.408473 = weight(abstract_txt:classifier in 3804) [ClassicSimilarity], result of:
            0.408473 = score(doc=3804,freq=5.0), product of:
              0.40330434 = queryWeight, product of:
                2.7230077 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.020437066 = queryNorm
              1.0128158 = fieldWeight in 3804, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
        0.32 = coord(8/25)
    
  5. Safder, I.; Ali, M.; Aljohani, N.R.; Nawaz, R.; Hassan, S.-U.: Neural machine translation for in-text citation classification (2023) 0.21
    0.21445116 = sum of:
      0.21445116 = product of:
        0.59569764 = sum of:
          0.040714048 = weight(abstract_txt:classification in 2055) [ClassicSimilarity], result of:
            0.040714048 = score(doc=2055,freq=4.0), product of:
              0.08158802 = queryWeight, product of:
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.020437066 = queryNorm
              0.49901992 = fieldWeight in 2055, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
          0.03361264 = weight(abstract_txt:however in 2055) [ClassicSimilarity], result of:
            0.03361264 = score(doc=2055,freq=2.0), product of:
              0.0904639 = queryWeight, product of:
                1.0529904 = boost
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.020437066 = queryNorm
              0.3715586 = fieldWeight in 2055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
          0.073627524 = weight(abstract_txt:context in 2055) [ClassicSimilarity], result of:
            0.073627524 = score(doc=2055,freq=8.0), product of:
              0.096119985 = queryWeight, product of:
                1.0854095 = boost
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.020437066 = queryNorm
              0.76599604 = fieldWeight in 2055, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
          0.008464054 = weight(abstract_txt:that in 2055) [ClassicSimilarity], result of:
            0.008464054 = score(doc=2055,freq=1.0), product of:
              0.05726367 = queryWeight, product of:
                1.18479 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.020437066 = queryNorm
              0.14780845 = fieldWeight in 2055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
          0.051079623 = weight(abstract_txt:measures in 2055) [ClassicSimilarity], result of:
            0.051079623 = score(doc=2055,freq=1.0), product of:
              0.15065446 = queryWeight, product of:
                1.3588697 = boost
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.020437066 = queryNorm
              0.3390515 = fieldWeight in 2055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
          0.031667262 = weight(abstract_txt:text in 2055) [ClassicSimilarity], result of:
            0.031667262 = score(doc=2055,freq=1.0), product of:
              0.12538752 = queryWeight, product of:
                1.518307 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.020437066 = queryNorm
              0.25255513 = fieldWeight in 2055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
          0.1374813 = weight(abstract_txt:citation in 2055) [ClassicSimilarity], result of:
            0.1374813 = score(doc=2055,freq=6.0), product of:
              0.18363662 = queryWeight, product of:
                1.8374354 = boost
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.020437066 = queryNorm
              0.7486595 = fieldWeight in 2055, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
          0.16797547 = weight(abstract_txt:citations in 2055) [ClassicSimilarity], result of:
            0.16797547 = score(doc=2055,freq=3.0), product of:
              0.29103845 = queryWeight, product of:
                2.671019 = boost
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.020437066 = queryNorm
              0.57715905 = fieldWeight in 2055, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
          0.051075704 = weight(abstract_txt:based in 2055) [ClassicSimilarity], result of:
            0.051075704 = score(doc=2055,freq=2.0), product of:
              0.18154006 = queryWeight, product of:
                2.7906609 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.020437066 = queryNorm
              0.28134674 = fieldWeight in 2055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=2055)
        0.36 = coord(9/25)