Document (#34532)

Author
Couto, T.
Cristo, M.
Gonçalves, M.A.
Calado, P.
Ziviani, N.
Moura, E.
Ribeiro-Neto, B.
Title
A comparative study of citations and links in document classification
Source
International Conference on Digital Libraries: Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries, Chapel Hill, NC, USA
Imprint
New York : ACM
Year
2006
Pages
pp. 75-84
Abstract
It is well known that links are an important source of information when dealing with Web collections. However, the question remains whether the same techniques that are used on the Web can be applied to collections of documents containing citations between scientific papers. In this work we present a comparative study of digital library citations and Web links in the context of automatic text classification. We show that there are in fact differences between citations and links in this context. For the comparison, we run a series of experiments using a digital library of computer science papers and a Web directory. In our reference collections, measures based on co-citation tend to perform better for pages in the Web directory, with gains of up to 37% over text-based classifiers, while measures based on bibliographic coupling perform better in the digital library. We also propose a simple and effective way of combining a traditional text-based classifier with a citation-link-based classifier. This combination is based on the notion of classifier reliability and yielded gains of up to 14% in micro-averaged F1 in the Web collection. However, no significant gain was obtained in the digital library. Finally, a user study was performed to further investigate the causes of these results. We found that the documents misclassified by the citation-link-based classifiers are in fact difficult cases, hard to classify even for humans.
Theme
Informetrics
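
The abstract contrasts measures based on co-citation with measures based on bibliographic coupling. As a rough illustration of the difference, the sketch below computes the plain, unnormalized counts on a toy citation graph. The graph, the function names, and the use of raw counts are assumptions made for this example; the paper's own measures may be normalized or defined differently.

```python
# Hypothetical toy citation graph: each key cites the documents in its set.
cites = {
    "A": {"C", "D"},
    "B": {"C", "D", "E"},
    "C": {"E"},
    "D": set(),
    "E": set(),
}


def bibliographic_coupling(graph, x, y):
    """Count the documents that both x and y cite (shared references)."""
    return len(graph.get(x, set()) & graph.get(y, set()))


def co_citation(graph, x, y):
    """Count the documents that cite both x and y."""
    return sum(1 for refs in graph.values() if x in refs and y in refs)


if __name__ == "__main__":
    # A and B share the references C and D.
    print(bibliographic_coupling(cites, "A", "B"))
    # C and D are cited together by A and by B.
    print(co_citation(cites, "C", "D"))
```

Bibliographic coupling looks at outgoing references, so it is available as soon as a document is written, whereas co-citation depends on incoming citations or links accumulating over time.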

Similar documents (author)

  1. Calado, P.; Cristo, M.; Gonçalves, M.A.; Moura, E.S. de; Ribeiro-Neto, B.; Ziviani, N.: Link-based similarity measures for the classification of Web documents (2006) 5.62
    5.6152787 = sum of:
      5.6152787 = sum of:
        0.7548182 = weight(author_txt:gonçalves in 4921) [ClassicSimilarity], result of:
          0.7548182 = score(doc=4921,freq=1.0), product of:
            0.3526614 = queryWeight, product of:
              8.561393 = idf(docFreq=22, maxDocs=44218)
              0.04119206 = queryNorm
            2.1403482 = fieldWeight in 4921, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.561393 = idf(docFreq=22, maxDocs=44218)
              0.25 = fieldNorm(doc=4921)
        0.79239136 = weight(author_txt:moura in 4921) [ClassicSimilarity], result of:
          0.79239136 = score(doc=4921,freq=1.0), product of:
            0.36426952 = queryWeight, product of:
              1.0163246 = boost
              8.701155 = idf(docFreq=19, maxDocs=44218)
              0.04119206 = queryNorm
            2.1752887 = fieldWeight in 4921, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.701155 = idf(docFreq=19, maxDocs=44218)
              0.25 = fieldNorm(doc=4921)
        0.80648756 = weight(author_txt:ribeiro in 4921) [ClassicSimilarity], result of:
          0.80648756 = score(doc=4921,freq=1.0), product of:
            0.3685769 = queryWeight, product of:
              1.0223159 = boost
              8.752448 = idf(docFreq=18, maxDocs=44218)
              0.04119206 = queryNorm
            2.188112 = fieldWeight in 4921, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.752448 = idf(docFreq=18, maxDocs=44218)
              0.25 = fieldNorm(doc=4921)
        1.0311779 = weight(author_txt:neto in 4921) [ClassicSimilarity], result of:
          1.0311779 = score(doc=4921,freq=1.0), product of:
            0.4341956 = queryWeight, product of:
              1.1095932 = boost
              9.499662 = idf(docFreq=8, maxDocs=44218)
              0.04119206 = queryNorm
            2.3749156 = fieldWeight in 4921, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.499662 = idf(docFreq=8, maxDocs=44218)
              0.25 = fieldNorm(doc=4921)
        1.1152021 = weight(author_txt:cristo in 4921) [ClassicSimilarity], result of:
          1.1152021 = score(doc=4921,freq=1.0), product of:
            0.45747292 = queryWeight, product of:
              1.1389476 = boost
              9.7509775 = idf(docFreq=6, maxDocs=44218)
              0.04119206 = queryNorm
            2.4377444 = fieldWeight in 4921, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.7509775 = idf(docFreq=6, maxDocs=44218)
              0.25 = fieldNorm(doc=4921)
        1.1152021 = weight(author_txt:ziviani in 4921) [ClassicSimilarity], result of:
          1.1152021 = score(doc=4921,freq=1.0), product of:
            0.45747292 = queryWeight, product of:
              1.1389476 = boost
              9.7509775 = idf(docFreq=6, maxDocs=44218)
              0.04119206 = queryNorm
            2.4377444 = fieldWeight in 4921, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.7509775 = idf(docFreq=6, maxDocs=44218)
              0.25 = fieldNorm(doc=4921)
    
  2. Pereira, D.A.; Ribeiro-Neto, B.; Ziviani, N.; Laender, A.H.F.; Gonçalves, M.A.: A generic Web-based entity resolution framework (2011) 2.47
    2.4717903 = sum of:
      2.4717903 = product of:
        3.7076855 = sum of:
          0.7548182 = weight(author_txt:gonçalves in 4450) [ClassicSimilarity], result of:
            0.7548182 = score(doc=4450,freq=1.0), product of:
              0.3526614 = queryWeight, product of:
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.04119206 = queryNorm
              2.1403482 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.25 = fieldNorm(doc=4450)
          0.80648756 = weight(author_txt:ribeiro in 4450) [ClassicSimilarity], result of:
            0.80648756 = score(doc=4450,freq=1.0), product of:
              0.3685769 = queryWeight, product of:
                1.0223159 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.04119206 = queryNorm
              2.188112 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.25 = fieldNorm(doc=4450)
          1.0311779 = weight(author_txt:neto in 4450) [ClassicSimilarity], result of:
            1.0311779 = score(doc=4450,freq=1.0), product of:
              0.4341956 = queryWeight, product of:
                1.1095932 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.04119206 = queryNorm
              2.3749156 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.25 = fieldNorm(doc=4450)
          1.1152021 = weight(author_txt:ziviani in 4450) [ClassicSimilarity], result of:
            1.1152021 = score(doc=4450,freq=1.0), product of:
              0.45747292 = queryWeight, product of:
                1.1389476 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.04119206 = queryNorm
              2.4377444 = fieldWeight in 4450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.25 = fieldNorm(doc=4450)
        0.6666667 = coord(4/6)
    
  3. Moura, E.S. de; Fernandes, D.; Ribeiro-Neto, B.; Silva, A.S. da; Gonçalves, M.A.: Using structural information to improve search in Web collections (2010) 2.26
    2.2565832 = sum of:
      2.2565832 = product of:
        3.3848748 = sum of:
          0.7548182 = weight(author_txt:gonçalves in 4119) [ClassicSimilarity], result of:
            0.7548182 = score(doc=4119,freq=1.0), product of:
              0.3526614 = queryWeight, product of:
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.04119206 = queryNorm
              2.1403482 = fieldWeight in 4119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.25 = fieldNorm(doc=4119)
          0.79239136 = weight(author_txt:moura in 4119) [ClassicSimilarity], result of:
            0.79239136 = score(doc=4119,freq=1.0), product of:
              0.36426952 = queryWeight, product of:
                1.0163246 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.04119206 = queryNorm
              2.1752887 = fieldWeight in 4119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.25 = fieldNorm(doc=4119)
          0.80648756 = weight(author_txt:ribeiro in 4119) [ClassicSimilarity], result of:
            0.80648756 = score(doc=4119,freq=1.0), product of:
              0.3685769 = queryWeight, product of:
                1.0223159 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.04119206 = queryNorm
              2.188112 = fieldWeight in 4119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.25 = fieldNorm(doc=4119)
          1.0311779 = weight(author_txt:neto in 4119) [ClassicSimilarity], result of:
            1.0311779 = score(doc=4119,freq=1.0), product of:
              0.4341956 = queryWeight, product of:
                1.1095932 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.04119206 = queryNorm
              2.3749156 = fieldWeight in 4119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.25 = fieldNorm(doc=4119)
        0.6666667 = coord(4/6)
    
  4. Silva, A.J.C.; Gonçalves, M.A.; Laender, A.H.F.; Modesto, M.A.B.; Cristo, M.; Ziviani, N.: Finding what is missing from a digital library : a case study in the computer science field (2009) 1.49
    1.4926112 = sum of:
      1.4926112 = product of:
        2.9852223 = sum of:
          0.7548182 = weight(author_txt:gonçalves in 4219) [ClassicSimilarity], result of:
            0.7548182 = score(doc=4219,freq=1.0), product of:
              0.3526614 = queryWeight, product of:
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.04119206 = queryNorm
              2.1403482 = fieldWeight in 4219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.25 = fieldNorm(doc=4219)
          1.1152021 = weight(author_txt:cristo in 4219) [ClassicSimilarity], result of:
            1.1152021 = score(doc=4219,freq=1.0), product of:
              0.45747292 = queryWeight, product of:
                1.1389476 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.04119206 = queryNorm
              2.4377444 = fieldWeight in 4219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.25 = fieldNorm(doc=4219)
          1.1152021 = weight(author_txt:ziviani in 4219) [ClassicSimilarity], result of:
            1.1152021 = score(doc=4219,freq=1.0), product of:
              0.45747292 = queryWeight, product of:
                1.1389476 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.04119206 = queryNorm
              2.4377444 = fieldWeight in 4219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.25 = fieldNorm(doc=4219)
        0.5 = coord(3/6)
    
  5. Silveira, M.; Ribeiro-Neto, B.: Concept-based ranking : a case study in the juridical domain (2004) 1.07
    1.0719715 = sum of:
      1.0719715 = product of:
        3.2159145 = sum of:
          1.4113532 = weight(author_txt:ribeiro in 2339) [ClassicSimilarity], result of:
            1.4113532 = score(doc=2339,freq=1.0), product of:
              0.3685769 = queryWeight, product of:
                1.0223159 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.04119206 = queryNorm
              3.829196 = fieldWeight in 2339, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.4375 = fieldNorm(doc=2339)
          1.8045613 = weight(author_txt:neto in 2339) [ClassicSimilarity], result of:
            1.8045613 = score(doc=2339,freq=1.0), product of:
              0.4341956 = queryWeight, product of:
                1.1095932 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.04119206 = queryNorm
              4.156102 = fieldWeight in 2339, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.4375 = fieldNorm(doc=2339)
        0.33333334 = coord(2/6)
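
The score breakdowns above follow the classic TF-IDF pattern reported by Lucene's ClassicSimilarity explain output: each matching term contributes queryWeight × fieldWeight, and the sum is scaled by the coord factor (matched terms / query terms). The sketch below recomputes the score of entry 5 (doc 2339) from the values listed in its breakdown. The function names are made up for this example, and the idf form `1 + ln(maxDocs / (docFreq + 1))` is the one that reproduces the numbers shown here; other Lucene versions may compute idf slightly differently.

```python
import math


def idf(doc_freq, max_docs):
    """Inverse document frequency, in the form that matches the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    """Term frequency factor: square root of the raw term count."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """Score of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(term_scores, query_term_count):
    """Sum of matching term scores, scaled by coord(matched / total)."""
    coord = len(term_scores) / query_term_count
    return sum(term_scores) * coord


if __name__ == "__main__":
    MAX_DOCS = 44218
    QUERY_NORM = 0.04119206
    # Values copied from the breakdown of entry 5 (doc 2339).
    ribeiro = term_score(1.0, 18, MAX_DOCS, QUERY_NORM, 0.4375, boost=1.0223159)
    neto = term_score(1.0, 8, MAX_DOCS, QUERY_NORM, 0.4375, boost=1.1095932)
    # Two of the six author query terms match, hence coord(2/6).
    print(document_score([ribeiro, neto], 6))
```

Small differences in the last decimal places are expected, because Lucene computes these values in single-precision floats while Python uses doubles.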
    

Similar documents (content)

  1. Calado, P.; Cristo, M.; Gonçalves, M.A.; Moura, E.S. de; Ribeiro-Neto, B.; Ziviani, N.: Link-based similarity measures for the classification of Web documents (2006) 0.49
    0.4850153 = sum of:
      0.4850153 = product of:
        1.0104486 = sum of:
          0.04062373 = weight(abstract_txt:classification in 4921) [ClassicSimilarity], result of:
            0.04062373 = score(doc=4921,freq=4.0), product of:
              0.08140875 = queryWeight, product of:
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.02039259 = queryNorm
              0.4990094 = fieldWeight in 4921, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=4921)
          0.023933006 = weight(abstract_txt:however in 4921) [ClassicSimilarity], result of:
            0.023933006 = score(doc=4921,freq=1.0), product of:
              0.09081747 = queryWeight, product of:
                1.0562073 = boost
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.02039259 = queryNorm
              0.26352867 = fieldWeight in 4921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.0625 = fieldNorm(doc=4921)
          0.014712984 = weight(abstract_txt:that in 4921) [ClassicSimilarity], result of:
            0.014712984 = score(doc=4921,freq=3.0), product of:
              0.057359844 = queryWeight, product of:
                1.1870894 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.02039259 = queryNorm
              0.2565032 = fieldWeight in 4921, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=4921)
          0.072504036 = weight(abstract_txt:measures in 4921) [ClassicSimilarity], result of:
            0.072504036 = score(doc=4921,freq=2.0), product of:
              0.15091625 = queryWeight, product of:
                1.3615464 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.02039259 = queryNorm
              0.48042563 = fieldWeight in 4921, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=4921)
          0.10283696 = weight(abstract_txt:link in 4921) [ClassicSimilarity], result of:
            0.10283696 = score(doc=4921,freq=3.0), product of:
              0.16642949 = queryWeight, product of:
                1.4298142 = boost
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.02039259 = queryNorm
              0.6179011 = fieldWeight in 4921, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707926 = idf(docFreq=398, maxDocs=44218)
                0.0625 = fieldNorm(doc=4921)
          0.054852452 = weight(abstract_txt:text in 4921) [ClassicSimilarity], result of:
            0.054852452 = score(doc=4921,freq=3.0), product of:
              0.12530217 = queryWeight, product of:
                1.5194603 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.02039259 = queryNorm
              0.4377614 = fieldWeight in 4921, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=4921)
          0.073765524 = weight(abstract_txt:perform in 4921) [ClassicSimilarity], result of:
            0.073765524 = score(doc=4921,freq=1.0), product of:
              0.19234172 = queryWeight, product of:
                1.5370967 = boost
                6.1362057 = idf(docFreq=259, maxDocs=44218)
                0.02039259 = queryNorm
              0.38351285 = fieldWeight in 4921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1362057 = idf(docFreq=259, maxDocs=44218)
                0.0625 = fieldNorm(doc=4921)
          0.10135137 = weight(abstract_txt:directory in 4921) [ClassicSimilarity], result of:
            0.10135137 = score(doc=4921,freq=1.0), product of:
              0.23771557 = queryWeight, product of:
                1.7088081 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.02039259 = queryNorm
              0.42635563 = fieldWeight in 4921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.0625 = fieldNorm(doc=4921)
          0.13590652 = weight(abstract_txt:classifiers in 4921) [ClassicSimilarity], result of:
            0.13590652 = score(doc=4921,freq=1.0), product of:
              0.2890667 = queryWeight, product of:
                1.8843583 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.02039259 = queryNorm
              0.47015625 = fieldWeight in 4921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0625 = fieldNorm(doc=4921)
          0.15682736 = weight(abstract_txt:gains in 4921) [ClassicSimilarity], result of:
            0.15682736 = score(doc=4921,freq=1.0), product of:
              0.31801853 = queryWeight, product of:
                1.976472 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.02039259 = queryNorm
              0.49313906 = fieldWeight in 4921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=4921)
          0.18193556 = weight(abstract_txt:classifier in 4921) [ClassicSimilarity], result of:
            0.18193556 = score(doc=4921,freq=1.0), product of:
              0.40192655 = queryWeight, product of:
                2.721344 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.02039259 = queryNorm
              0.45265874 = fieldWeight in 4921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.0625 = fieldNorm(doc=4921)
          0.05119908 = weight(abstract_txt:based in 4921) [ClassicSimilarity], result of:
            0.05119908 = score(doc=4921,freq=2.0), product of:
              0.18170157 = queryWeight, product of:
                2.7949743 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.02039259 = queryNorm
              0.28177565 = fieldWeight in 4921, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=4921)
        0.48 = coord(12/25)
    
  2. Sun, A.; Lim, E.-P.; Ng, W.-K.: Performance measurement framework for hierarchical text classification (2003) 0.32
    0.31656167 = sum of:
      0.31656167 = product of:
        0.8793379 = sum of:
          0.049753703 = weight(abstract_txt:classification in 1808) [ClassicSimilarity], result of:
            0.049753703 = score(doc=1808,freq=6.0), product of:
              0.08140875 = queryWeight, product of:
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.02039259 = queryNorm
              0.6111592 = fieldWeight in 1808, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.016989091 = weight(abstract_txt:that in 1808) [ClassicSimilarity], result of:
            0.016989091 = score(doc=1808,freq=4.0), product of:
              0.057359844 = queryWeight, product of:
                1.1870894 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.02039259 = queryNorm
              0.2961844 = fieldWeight in 1808, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.034487035 = weight(abstract_txt:better in 1808) [ClassicSimilarity], result of:
            0.034487035 = score(doc=1808,freq=1.0), product of:
              0.115862206 = queryWeight, product of:
                1.192986 = boost
                4.76249 = idf(docFreq=1026, maxDocs=44218)
                0.02039259 = queryNorm
              0.2976556 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.76249 = idf(docFreq=1026, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.13564263 = weight(abstract_txt:measures in 1808) [ClassicSimilarity], result of:
            0.13564263 = score(doc=1808,freq=7.0), product of:
              0.15091625 = queryWeight, product of:
                1.3615464 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.02039259 = queryNorm
              0.89879405 = fieldWeight in 1808, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.031669077 = weight(abstract_txt:text in 1808) [ClassicSimilarity], result of:
            0.031669077 = score(doc=1808,freq=1.0), product of:
              0.12530217 = queryWeight, product of:
                1.5194603 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.02039259 = queryNorm
              0.25274166 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.073765524 = weight(abstract_txt:perform in 1808) [ClassicSimilarity], result of:
            0.073765524 = score(doc=1808,freq=1.0), product of:
              0.19234172 = queryWeight, product of:
                1.5370967 = boost
                6.1362057 = idf(docFreq=259, maxDocs=44218)
                0.02039259 = queryNorm
              0.38351285 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1362057 = idf(docFreq=259, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.30389622 = weight(abstract_txt:classifiers in 1808) [ClassicSimilarity], result of:
            0.30389622 = score(doc=1808,freq=5.0), product of:
              0.2890667 = queryWeight, product of:
                1.8843583 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.02039259 = queryNorm
              1.0513014 = fieldWeight in 1808, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.18193556 = weight(abstract_txt:classifier in 1808) [ClassicSimilarity], result of:
            0.18193556 = score(doc=1808,freq=1.0), product of:
              0.40192655 = queryWeight, product of:
                2.721344 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.02039259 = queryNorm
              0.45265874 = fieldWeight in 1808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
          0.05119908 = weight(abstract_txt:based in 1808) [ClassicSimilarity], result of:
            0.05119908 = score(doc=1808,freq=2.0), product of:
              0.18170157 = queryWeight, product of:
                2.7949743 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.02039259 = queryNorm
              0.28177565 = fieldWeight in 1808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=1808)
        0.36 = coord(9/25)
    
  3. Liu, R.-L.: A passage extractor for classification of disease aspect information (2013) 0.23
    0.22829838 = sum of:
      0.22829838 = product of:
        0.81535137 = sum of:
          0.03518118 = weight(abstract_txt:classification in 1107) [ClassicSimilarity], result of:
            0.03518118 = score(doc=1107,freq=3.0), product of:
              0.08140875 = queryWeight, product of:
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.02039259 = queryNorm
              0.4321548 = fieldWeight in 1107, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=1107)
          0.014712984 = weight(abstract_txt:that in 1107) [ClassicSimilarity], result of:
            0.014712984 = score(doc=1107,freq=3.0), product of:
              0.057359844 = queryWeight, product of:
                1.1870894 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.02039259 = queryNorm
              0.2565032 = fieldWeight in 1107, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1107)
          0.034487035 = weight(abstract_txt:better in 1107) [ClassicSimilarity], result of:
            0.034487035 = score(doc=1107,freq=1.0), product of:
              0.115862206 = queryWeight, product of:
                1.192986 = boost
                4.76249 = idf(docFreq=1026, maxDocs=44218)
                0.02039259 = queryNorm
              0.2976556 = fieldWeight in 1107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.76249 = idf(docFreq=1026, maxDocs=44218)
                0.0625 = fieldNorm(doc=1107)
          0.08957368 = weight(abstract_txt:text in 1107) [ClassicSimilarity], result of:
            0.08957368 = score(doc=1107,freq=8.0), product of:
              0.12530217 = queryWeight, product of:
                1.5194603 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.02039259 = queryNorm
              0.7148614 = fieldWeight in 1107, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1107)
          0.33290163 = weight(abstract_txt:classifiers in 1107) [ClassicSimilarity], result of:
            0.33290163 = score(doc=1107,freq=6.0), product of:
              0.2890667 = queryWeight, product of:
                1.8843583 = boost
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.02039259 = queryNorm
              1.1516429 = fieldWeight in 1107, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.5225 = idf(docFreq=64, maxDocs=44218)
                0.0625 = fieldNorm(doc=1107)
          0.25729576 = weight(abstract_txt:classifier in 1107) [ClassicSimilarity], result of:
            0.25729576 = score(doc=1107,freq=2.0), product of:
              0.40192655 = queryWeight, product of:
                2.721344 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.02039259 = queryNorm
              0.64015615 = fieldWeight in 1107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.0625 = fieldNorm(doc=1107)
          0.05119908 = weight(abstract_txt:based in 1107) [ClassicSimilarity], result of:
            0.05119908 = score(doc=1107,freq=2.0), product of:
              0.18170157 = queryWeight, product of:
                2.7949743 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.02039259 = queryNorm
              0.28177565 = fieldWeight in 1107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=1107)
        0.28 = coord(7/25)
    
  4. Mengle, S.S.R.; Goharian, N.: Ambiguity measure feature-selection algorithm (2009) 0.23
    0.22513554 = sum of:
      0.22513554 = product of:
        0.70354855 = sum of:
          0.020311864 = weight(abstract_txt:classification in 2804) [ClassicSimilarity], result of:
            0.020311864 = score(doc=2804,freq=1.0), product of:
              0.08140875 = queryWeight, product of:
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.02039259 = queryNorm
              0.2495047 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.023933006 = weight(abstract_txt:however in 2804) [ClassicSimilarity], result of:
            0.023933006 = score(doc=2804,freq=1.0), product of:
              0.09081747 = queryWeight, product of:
                1.0562073 = boost
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.02039259 = queryNorm
              0.26352867 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.014712984 = weight(abstract_txt:that in 2804) [ClassicSimilarity], result of:
            0.014712984 = score(doc=2804,freq=3.0), product of:
              0.057359844 = queryWeight, product of:
                1.1870894 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.02039259 = queryNorm
              0.2565032 = fieldWeight in 2804, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.034487035 = weight(abstract_txt:better in 2804) [ClassicSimilarity], result of:
            0.034487035 = score(doc=2804,freq=1.0), product of:
              0.115862206 = queryWeight, product of:
                1.192986 = boost
                4.76249 = idf(docFreq=1026, maxDocs=44218)
                0.02039259 = queryNorm
              0.2976556 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.76249 = idf(docFreq=1026, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.07757308 = weight(abstract_txt:text in 2804) [ClassicSimilarity], result of:
            0.07757308 = score(doc=2804,freq=6.0), product of:
              0.12530217 = queryWeight, product of:
                1.5194603 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.02039259 = queryNorm
              0.6190881 = fieldWeight in 2804, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.073765524 = weight(abstract_txt:perform in 2804) [ClassicSimilarity], result of:
            0.073765524 = score(doc=2804,freq=1.0), product of:
              0.19234172 = queryWeight, product of:
                1.5370967 = boost
                6.1362057 = idf(docFreq=259, maxDocs=44218)
                0.02039259 = queryNorm
              0.38351285 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1362057 = idf(docFreq=259, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.05194478 = weight(abstract_txt:digital in 2804) [ClassicSimilarity], result of:
            0.05194478 = score(doc=2804,freq=1.0), product of:
              0.19181202 = queryWeight, product of:
                2.1707878 = boost
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.02039259 = queryNorm
              0.27081087 = fieldWeight in 2804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.332974 = idf(docFreq=1577, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
          0.40682027 = weight(abstract_txt:classifier in 2804) [ClassicSimilarity], result of:
            0.40682027 = score(doc=2804,freq=5.0), product of:
              0.40192655 = queryWeight, product of:
                2.721344 = boost
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.02039259 = queryNorm
              1.0121757 = fieldWeight in 2804, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.24254 = idf(docFreq=85, maxDocs=44218)
                0.0625 = fieldNorm(doc=2804)
        0.32 = coord(8/25)
    
  5. Safder, I.; Ali, M.; Aljohani, N.R.; Nawaz, R.; Hassan, S.-U.: Neural machine translation for in-text citation classification (2023) 0.21
    0.2149093 = sum of:
      0.2149093 = product of:
        0.59697026 = sum of:
          0.04062373 = weight(abstract_txt:classification in 1053) [ClassicSimilarity], result of:
            0.04062373 = score(doc=1053,freq=4.0), product of:
              0.08140875 = queryWeight, product of:
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.02039259 = queryNorm
              0.4990094 = fieldWeight in 1053, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9920752 = idf(docFreq=2218, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
          0.033846382 = weight(abstract_txt:however in 1053) [ClassicSimilarity], result of:
            0.033846382 = score(doc=1053,freq=2.0), product of:
              0.09081747 = queryWeight, product of:
                1.0562073 = boost
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.02039259 = queryNorm
              0.37268582 = fieldWeight in 1053, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.216459 = idf(docFreq=1772, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
          0.07381737 = weight(abstract_txt:context in 1053) [ClassicSimilarity], result of:
            0.07381737 = score(doc=1053,freq=8.0), product of:
              0.09621592 = queryWeight, product of:
                1.0871462 = boost
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.02039259 = queryNorm
              0.7672054 = fieldWeight in 1053, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.339969 = idf(docFreq=1566, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
          0.008494546 = weight(abstract_txt:that in 1053) [ClassicSimilarity], result of:
            0.008494546 = score(doc=1053,freq=1.0), product of:
              0.057359844 = queryWeight, product of:
                1.1870894 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.02039259 = queryNorm
              0.1480922 = fieldWeight in 1053, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
          0.051268097 = weight(abstract_txt:measures in 1053) [ClassicSimilarity], result of:
            0.051268097 = score(doc=1053,freq=1.0), product of:
              0.15091625 = queryWeight, product of:
                1.3615464 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.02039259 = queryNorm
              0.33971223 = fieldWeight in 1053, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
          0.031669077 = weight(abstract_txt:text in 1053) [ClassicSimilarity], result of:
            0.031669077 = score(doc=1053,freq=1.0), product of:
              0.12530217 = queryWeight, product of:
                1.5194603 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.02039259 = queryNorm
              0.25274166 = fieldWeight in 1053, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
          0.13773224 = weight(abstract_txt:citation in 1053) [ClassicSimilarity], result of:
            0.13773224 = score(doc=1053,freq=6.0), product of:
              0.18372782 = queryWeight, product of:
                1.8399141 = boost
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.02039259 = queryNorm
              0.7496537 = fieldWeight in 1053, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.896717 = idf(docFreq=897, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
          0.16831975 = weight(abstract_txt:citations in 1053) [ClassicSimilarity], result of:
            0.16831975 = score(doc=1053,freq=3.0), product of:
              0.29122645 = queryWeight, product of:
                2.6748219 = boost
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.02039259 = queryNorm
              0.5779686 = fieldWeight in 1053, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
          0.05119908 = weight(abstract_txt:based in 1053) [ClassicSimilarity], result of:
            0.05119908 = score(doc=1053,freq=2.0), product of:
              0.18170157 = queryWeight, product of:
                2.7949743 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.02039259 = queryNorm
              0.28177565 = fieldWeight in 1053, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=1053)
        0.36 = coord(9/25)
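
The arithmetic in the score trees above can be retraced by hand. The following is a minimal sketch, assuming the usual TF-IDF form that the printed factors suggest: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, with the document score equal to the sum of term scores times coord. The function names (`idf`, `term_score`) are hypothetical helpers for illustration, not part of any search library; all input numbers are copied from the listing.

```python
import math

# Constants copied from the explain output above.
MAX_DOCS = 44218
QUERY_NORM = 0.02039259
FIELD_NORM = 0.0625


def idf(doc_freq: int, max_docs: int) -> float:
    # Assumed idf form; it reproduces the printed idf values to about 5 digits.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, boost: float = 1.0) -> float:
    # score = queryWeight * fieldWeight, as in each "product of" node.
    i = idf(doc_freq, MAX_DOCS)
    query_weight = boost * i * QUERY_NORM
    field_weight = math.sqrt(freq) * i * FIELD_NORM
    return query_weight * field_weight


# "classifier" in doc 2804: freq=5.0, docFreq=85, boost=2.721344
classifier_2804 = term_score(5.0, 85, boost=2.721344)

# Per-term scores listed for doc 1053 (entry 5), 9 of 25 query terms matched.
term_scores_1053 = [
    0.04062373,   # classification
    0.033846382,  # however
    0.07381737,   # context
    0.008494546,  # that
    0.051268097,  # measures
    0.031669077,  # text
    0.13773224,   # citation
    0.16831975,   # citations
    0.05119908,   # based
]
coord_1053 = 9 / 25
total_1053 = sum(term_scores_1053) * coord_1053

print(f"classifier in 2804: {classifier_2804:.6f}")
print(f"doc 1053 total:     {total_1053:.6f}")
```

Under these assumptions the first value should land close to the 0.40682027 shown for "classifier" in doc 2804, and the second close to the 0.2149093 shown for entry 5. Small differences in the last digits are expected, since the listing appears to use single-precision floats.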