Document (#35987)

Author
Cota, R.G.
Ferreira, A.A.
Nascimento, C.
Gonçalves, M.A.
Laender, A.H.F.
Title
¬An unsupervised heuristic-based hierarchical method for name disambiguation in bibliographic citations
Source
Journal of the American Society for Information Science and Technology. 61(2010) no.9, pp. 1853-1870
Year
2010
Abstract
Name ambiguity in the context of bibliographic citations is a difficult problem which, despite the many efforts from the research community, still has a lot of room for improvement. In this article, we present a heuristic-based hierarchical clustering method to deal with this problem. The method successively fuses clusters of citations of similar author names based on several heuristics and similarity measures on the components of the citations (e.g., coauthor names, work title, and publication venue title). During the disambiguation task, the information about fused clusters is aggregated, providing more information for the next round of fusion. In order to demonstrate the effectiveness of our method, we ran a series of experiments on two different collections extracted from real-world digital libraries and compared it, under two metrics, with four representative methods described in the literature. We present comparisons of results using each considered attribute separately (i.e., coauthor names, work title, and publication venue title) with the author name attribute and using all attributes together. These results show that our unsupervised method, when using all attributes, performs competitively against all other methods, under both metrics, losing only in one case against a supervised method, whose result was very close to ours. Moreover, such results are achieved without the burden of any training and without using any privileged information such as knowing a priori the correct number of clusters.
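
The fusion idea in the abstract can be illustrated with a very small sketch. It is not the authors' algorithm: the single heuristic used here (two clusters are fused when they share at least one coauthor name), the function name fuse_clusters, and the sample citations are all assumptions made for illustration. The sketch only shows how aggregating the coauthors of fused clusters can enable further fusions in later rounds.

```python
def fuse_clusters(citations):
    """Toy hierarchical fusion: merge clusters that share a coauthor name.

    `citations` is a list of dicts with a "coauthors" set (hypothetical schema).
    Returns a list of clusters, each a sorted list of citation indexes.
    """
    # Start with one cluster per citation.
    clusters = [
        {"ids": {i}, "coauthors": set(c["coauthors"])}
        for i, c in enumerate(citations)
    ]
    merged = True
    while merged:  # one pass of this loop is one "round of fusion"
        merged = False
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                if clusters[a]["coauthors"] & clusters[b]["coauthors"]:
                    # Aggregate the information of both clusters so that the
                    # next round sees the union of their coauthors.
                    clusters[a]["ids"] |= clusters[b]["ids"]
                    clusters[a]["coauthors"] |= clusters[b]["coauthors"]
                    del clusters[b]
                    merged = True
                    break
            if merged:
                break
    return [sorted(c["ids"]) for c in clusters]


# Invented citations that all carry the same ambiguous author name.
sample = [
    {"coauthors": {"B. Liu"}},
    {"coauthors": {"B. Liu", "C. Wong"}},
    {"coauthors": {"C. Wong"}},
    {"coauthors": {"D. Silva"}},
]
result = fuse_clusters(sample)
print(result)  # [[0, 1, 2], [3]]
```

Citations 0 and 2 share no coauthor directly; they end up together only because the cluster formed from 0 and 1 carries the aggregated coauthor set into the next round. The method described in the abstract additionally uses work title and publication venue title, which this sketch omits.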

Similar documents (author)

  1. Ferreira, A.A.; Veloso, A.; Gonçalves, M.A.; Laender, A.H.F.: Self-training author name disambiguation for information scarce scenarios (2014) 4.02
    4.017217 = sum of:
      4.017217 = product of:
        5.021521 = sum of:
          1.0059247 = weight(author_txt:gonçalves in 2292) [ClassicSimilarity], result of:
            1.0059247 = score(doc=2292,freq=1.0), product of:
              0.3757844 = queryWeight, product of:
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.043869432 = queryNorm
              2.6768665 = fieldWeight in 2292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.3125 = fieldNorm(doc=2292)
          1.1641518 = weight(author_txt:ferreira in 2292) [ClassicSimilarity], result of:
            1.1641518 = score(doc=2292,freq=1.0), product of:
              0.4142236 = queryWeight, product of:
                1.0499003 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.043869432 = queryNorm
              2.810443 = fieldWeight in 2292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.3125 = fieldNorm(doc=2292)
          1.4257224 = weight(author_txt:laender in 2292) [ClassicSimilarity], result of:
            1.4257224 = score(doc=2292,freq=1.0), product of:
              0.4741529 = queryWeight, product of:
                1.1232847 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.043869432 = queryNorm
              3.0068831 = fieldWeight in 2292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.3125 = fieldNorm(doc=2292)
          1.4257224 = weight(author_txt:a.h.f in 2292) [ClassicSimilarity], result of:
            1.4257224 = score(doc=2292,freq=1.0), product of:
              0.4741529 = queryWeight, product of:
                1.1232847 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.043869432 = queryNorm
              3.0068831 = fieldWeight in 2292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.3125 = fieldNorm(doc=2292)
        0.8 = coord(4/5)
    
  2. Santana, A.F.; Gonçalves, M.A.; Laender, A.H.F.; Ferreira, A.A.: Incremental author name disambiguation by exploiting domain-specific heuristics (2017) 4.02
    4.017217 = sum of:
      4.017217 = product of:
        5.021521 = sum of:
          1.0059247 = weight(author_txt:gonçalves in 4587) [ClassicSimilarity], result of:
            1.0059247 = score(doc=4587,freq=1.0), product of:
              0.3757844 = queryWeight, product of:
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.043869432 = queryNorm
              2.6768665 = fieldWeight in 4587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.3125 = fieldNorm(doc=4587)
          1.1641518 = weight(author_txt:ferreira in 4587) [ClassicSimilarity], result of:
            1.1641518 = score(doc=4587,freq=1.0), product of:
              0.4142236 = queryWeight, product of:
                1.0499003 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.043869432 = queryNorm
              2.810443 = fieldWeight in 4587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.3125 = fieldNorm(doc=4587)
          1.4257224 = weight(author_txt:laender in 4587) [ClassicSimilarity], result of:
            1.4257224 = score(doc=4587,freq=1.0), product of:
              0.4741529 = queryWeight, product of:
                1.1232847 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.043869432 = queryNorm
              3.0068831 = fieldWeight in 4587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.3125 = fieldNorm(doc=4587)
          1.4257224 = weight(author_txt:a.h.f in 4587) [ClassicSimilarity], result of:
            1.4257224 = score(doc=4587,freq=1.0), product of:
              0.4741529 = queryWeight, product of:
                1.1232847 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.043869432 = queryNorm
              3.0068831 = fieldWeight in 4587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.3125 = fieldNorm(doc=4587)
        0.8 = coord(4/5)
    
  3. Silva, A.J.C.; Gonçalves, M.A.; Laender, A.H.F.; Modesto, M.A.B.; Cristo, M.; Ziviani, N.: Finding what is missing from a digital library : a case study in the computer science field (2009) 1.85
    1.8515373 = sum of:
      1.8515373 = product of:
        3.0858955 = sum of:
          0.8047398 = weight(author_txt:gonçalves in 219) [ClassicSimilarity], result of:
            0.8047398 = score(doc=219,freq=1.0), product of:
              0.3757844 = queryWeight, product of:
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.043869432 = queryNorm
              2.1414933 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.25 = fieldNorm(doc=219)
          1.1405779 = weight(author_txt:laender in 219) [ClassicSimilarity], result of:
            1.1405779 = score(doc=219,freq=1.0), product of:
              0.4741529 = queryWeight, product of:
                1.1232847 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.043869432 = queryNorm
              2.4055066 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.25 = fieldNorm(doc=219)
          1.1405779 = weight(author_txt:a.h.f in 219) [ClassicSimilarity], result of:
            1.1405779 = score(doc=219,freq=1.0), product of:
              0.4741529 = queryWeight, product of:
                1.1232847 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.043869432 = queryNorm
              2.4055066 = fieldWeight in 219, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.25 = fieldNorm(doc=219)
        0.6 = coord(3/5)
    
  4. Pereira, D.A.; Ribeiro-Neto, B.; Ziviani, N.; Laender, A.H.F.; Gonçalves, M.A.: ¬A generic Web-based entity resolution framework (2011) 1.85
    1.8515373 = sum of:
      1.8515373 = product of:
        3.0858955 = sum of:
          0.8047398 = weight(author_txt:gonçalves in 450) [ClassicSimilarity], result of:
            0.8047398 = score(doc=450,freq=1.0), product of:
              0.3757844 = queryWeight, product of:
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.043869432 = queryNorm
              2.1414933 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.25 = fieldNorm(doc=450)
          1.1405779 = weight(author_txt:laender in 450) [ClassicSimilarity], result of:
            1.1405779 = score(doc=450,freq=1.0), product of:
              0.4741529 = queryWeight, product of:
                1.1232847 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.043869432 = queryNorm
              2.4055066 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.25 = fieldNorm(doc=450)
          1.1405779 = weight(author_txt:a.h.f in 450) [ClassicSimilarity], result of:
            1.1405779 = score(doc=450,freq=1.0), product of:
              0.4741529 = queryWeight, product of:
                1.1232847 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.043869432 = queryNorm
              2.4055066 = fieldWeight in 450, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.25 = fieldNorm(doc=450)
        0.6 = coord(3/5)
    
  5. Ribeiro-Neto, B.; Laender, A.H.F.; Lima, L.R.S. de: ¬An experimental study in automatically categorizing medical documents (2001) 1.14
    1.1405779 = sum of:
      1.1405779 = product of:
        2.8514447 = sum of:
          1.4257224 = weight(author_txt:laender in 6702) [ClassicSimilarity], result of:
            1.4257224 = score(doc=6702,freq=1.0), product of:
              0.4741529 = queryWeight, product of:
                1.1232847 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.043869432 = queryNorm
              3.0068831 = fieldWeight in 6702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.3125 = fieldNorm(doc=6702)
          1.4257224 = weight(author_txt:a.h.f in 6702) [ClassicSimilarity], result of:
            1.4257224 = score(doc=6702,freq=1.0), product of:
              0.4741529 = queryWeight, product of:
                1.1232847 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.043869432 = queryNorm
              3.0068831 = fieldWeight in 6702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.3125 = fieldNorm(doc=6702)
        0.4 = coord(2/5)
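
The score trees above can be reproduced by hand. The following is a minimal sketch, assuming the TF-IDF formulas that match the listed numbers: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and a final multiplication by coord. The function names are hypothetical; the input values are copied from entry 1 (doc 2292) of the listing.

```python
import math


def classic_idf(doc_freq, max_docs):
    # Inverse document frequency in the form that reproduces the listed values.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm          # "queryWeight" node
    field_weight = math.sqrt(freq) * idf * field_norm  # "fieldWeight" node
    return query_weight * field_weight


MAX_DOCS = 44421
QUERY_NORM = 0.043869432
FIELD_NORM = 0.3125  # fieldNorm(doc=2292)

# (docFreq, boost) for the four matching author terms of doc 2292.
matched_terms = [
    (22, 1.0),        # gonçalves (no boost line in the listing)
    (14, 1.0499003),  # ferreira
    (7, 1.1232847),   # laender
    (7, 1.1232847),   # a.h.f
]

term_sum = sum(
    term_score(1.0, df, MAX_DOCS, QUERY_NORM, FIELD_NORM, boost)
    for df, boost in matched_terms
)
coord = 4 / 5  # 4 of the 5 query terms matched
score = coord * term_sum
print(round(score, 3))  # 4.017
```

Small differences in the last digits are expected, because the listing was presumably computed in single precision while Python uses double precision.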
    

Similar documents (content)

  1. Ferreira, A.A.; Veloso, A.; Gonçalves, M.A.; Laender, A.H.F.: Self-training author name disambiguation for information scarce scenarios (2014) 0.52
    0.5209967 = sum of:
      0.5209967 = product of:
        1.1840833 = sum of:
          0.07113226 = weight(abstract_txt:author in 2292) [ClassicSimilarity], result of:
            0.07113226 = score(doc=2292,freq=5.0), product of:
              0.10220416 = queryWeight, product of:
                1.0705655 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.01917001 = queryNorm
              0.69598204 = fieldWeight in 2292, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.0625 = fieldNorm(doc=2292)
          0.05930866 = weight(abstract_txt:attributes in 2292) [ClassicSimilarity], result of:
            0.05930866 = score(doc=2292,freq=1.0), product of:
              0.15481971 = queryWeight, product of:
                1.3176258 = boost
                6.1293135 = idf(docFreq=262, maxDocs=44421)
                0.01917001 = queryNorm
              0.3830821 = fieldWeight in 2292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1293135 = idf(docFreq=262, maxDocs=44421)
                0.0625 = fieldNorm(doc=2292)
          0.02127953 = weight(abstract_txt:using in 2292) [ClassicSimilarity], result of:
            0.02127953 = score(doc=2292,freq=1.0), product of:
              0.09849153 = queryWeight, product of:
                1.4862552 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.01917001 = queryNorm
              0.21605442 = fieldWeight in 2292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=2292)
          0.17583854 = weight(abstract_txt:disambiguation in 2292) [ClassicSimilarity], result of:
            0.17583854 = score(doc=2292,freq=3.0), product of:
              0.22153881 = queryWeight, product of:
                1.576173 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.01917001 = queryNorm
              0.7937144 = fieldWeight in 2292, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0625 = fieldNorm(doc=2292)
          0.114135124 = weight(abstract_txt:unsupervised in 2292) [ClassicSimilarity], result of:
            0.114135124 = score(doc=2292,freq=1.0), product of:
              0.23953027 = queryWeight, product of:
                1.6389252 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.01917001 = queryNorm
              0.47649562 = fieldWeight in 2292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.0625 = fieldNorm(doc=2292)
          0.10363392 = weight(abstract_txt:name in 2292) [ClassicSimilarity], result of:
            0.10363392 = score(doc=2292,freq=2.0), product of:
              0.20406532 = queryWeight, product of:
                1.8527174 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.01917001 = queryNorm
              0.5078468 = fieldWeight in 2292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.0625 = fieldNorm(doc=2292)
          0.07686517 = weight(abstract_txt:names in 2292) [ClassicSimilarity], result of:
            0.07686517 = score(doc=2292,freq=1.0), product of:
              0.21066755 = queryWeight, product of:
                1.8824497 = boost
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.01917001 = queryNorm
              0.36486477 = fieldWeight in 2292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.0625 = fieldNorm(doc=2292)
          0.19169632 = weight(abstract_txt:venue in 2292) [ClassicSimilarity], result of:
            0.19169632 = score(doc=2292,freq=1.0), product of:
              0.3384465 = queryWeight, product of:
                1.9481571 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.01917001 = queryNorm
              0.56640065 = fieldWeight in 2292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.0625 = fieldNorm(doc=2292)
          0.15141484 = weight(abstract_txt:clusters in 2292) [ClassicSimilarity], result of:
            0.15141484 = score(doc=2292,freq=2.0), product of:
              0.26275253 = queryWeight, product of:
                2.1023161 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.01917001 = queryNorm
              0.5762641 = fieldWeight in 2292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.0625 = fieldNorm(doc=2292)
          0.07806874 = weight(abstract_txt:citations in 2292) [ClassicSimilarity], result of:
            0.07806874 = score(doc=2292,freq=1.0), product of:
              0.23428382 = queryWeight, product of:
                2.2922664 = boost
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.01917001 = queryNorm
              0.33322293 = fieldWeight in 2292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.0625 = fieldNorm(doc=2292)
          0.14071028 = weight(abstract_txt:method in 2292) [ClassicSimilarity], result of:
            0.14071028 = score(doc=2292,freq=4.0), product of:
              0.25021797 = queryWeight, product of:
                2.9013414 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.01917001 = queryNorm
              0.5623508 = fieldWeight in 2292, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=2292)
        0.44 = coord(11/25)
    
  2. Liu, Y.; Li, W.; Huang, Z.; Fang, Q.: ¬A fast method based on multiple clustering for name disambiguation in bibliographic citations (2015) 0.47
    0.47208482 = sum of:
      0.47208482 = product of:
        1.0729201 = sum of:
          0.0124599 = weight(abstract_txt:based in 2672) [ClassicSimilarity], result of:
            0.0124599 = score(doc=2672,freq=1.0), product of:
              0.06263076 = queryWeight, product of:
                1.0264041 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.01917001 = queryNorm
              0.1989422 = fieldWeight in 2672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=2672)
          0.016263139 = weight(abstract_txt:results in 2672) [ClassicSimilarity], result of:
            0.016263139 = score(doc=2672,freq=1.0), product of:
              0.07480217 = queryWeight, product of:
                1.1217128 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.01917001 = queryNorm
              0.21741535 = fieldWeight in 2672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.0625 = fieldNorm(doc=2672)
          0.038884636 = weight(abstract_txt:publication in 2672) [ClassicSimilarity], result of:
            0.038884636 = score(doc=2672,freq=1.0), product of:
              0.11684215 = queryWeight, product of:
                1.1446658 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.01917001 = queryNorm
              0.3327963 = fieldWeight in 2672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.0625 = fieldNorm(doc=2672)
          0.08387511 = weight(abstract_txt:attributes in 2672) [ClassicSimilarity], result of:
            0.08387511 = score(doc=2672,freq=2.0), product of:
              0.15481971 = queryWeight, product of:
                1.3176258 = boost
                6.1293135 = idf(docFreq=262, maxDocs=44421)
                0.01917001 = queryNorm
              0.5417599 = fieldWeight in 2672, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1293135 = idf(docFreq=262, maxDocs=44421)
                0.0625 = fieldNorm(doc=2672)
          0.101520434 = weight(abstract_txt:disambiguation in 2672) [ClassicSimilarity], result of:
            0.101520434 = score(doc=2672,freq=1.0), product of:
              0.22153881 = queryWeight, product of:
                1.576173 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.01917001 = queryNorm
              0.45825124 = fieldWeight in 2672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0625 = fieldNorm(doc=2672)
          0.1269251 = weight(abstract_txt:name in 2672) [ClassicSimilarity], result of:
            0.1269251 = score(doc=2672,freq=3.0), product of:
              0.20406532 = queryWeight, product of:
                1.8527174 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.01917001 = queryNorm
              0.6219827 = fieldWeight in 2672, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.0625 = fieldNorm(doc=2672)
          0.18735139 = weight(abstract_txt:coauthor in 2672) [ClassicSimilarity], result of:
            0.18735139 = score(doc=2672,freq=1.0), product of:
              0.3333129 = queryWeight, product of:
                1.9333256 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.01917001 = queryNorm
              0.5620886 = fieldWeight in 2672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0625 = fieldNorm(doc=2672)
          0.19169632 = weight(abstract_txt:venue in 2672) [ClassicSimilarity], result of:
            0.19169632 = score(doc=2672,freq=1.0), product of:
              0.3384465 = queryWeight, product of:
                1.9481571 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.01917001 = queryNorm
              0.56640065 = fieldWeight in 2672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.0625 = fieldNorm(doc=2672)
          0.10706647 = weight(abstract_txt:clusters in 2672) [ClassicSimilarity], result of:
            0.10706647 = score(doc=2672,freq=1.0), product of:
              0.26275253 = queryWeight, product of:
                2.1023161 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.01917001 = queryNorm
              0.40748024 = fieldWeight in 2672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.0625 = fieldNorm(doc=2672)
          0.13652232 = weight(abstract_txt:title in 2672) [ClassicSimilarity], result of:
            0.13652232 = score(doc=2672,freq=2.0), product of:
              0.26990855 = queryWeight, product of:
                2.4603803 = boost
                5.722582 = idf(docFreq=394, maxDocs=44421)
                0.01917001 = queryNorm
              0.50580955 = fieldWeight in 2672, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.722582 = idf(docFreq=394, maxDocs=44421)
                0.0625 = fieldNorm(doc=2672)
          0.07035514 = weight(abstract_txt:method in 2672) [ClassicSimilarity], result of:
            0.07035514 = score(doc=2672,freq=1.0), product of:
              0.25021797 = queryWeight, product of:
                2.9013414 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.01917001 = queryNorm
              0.2811754 = fieldWeight in 2672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=2672)
        0.44 = coord(11/25)
    
  3. Pellack, L.J.; Kappmeyer, L.O.: ¬The ripple effect of women's name changes in indexing, citation, and authority control (2011) 0.27
    0.26523522 = sum of:
      0.26523522 = product of:
        0.73676443 = sum of:
          0.07113226 = weight(abstract_txt:author in 347) [ClassicSimilarity], result of:
            0.07113226 = score(doc=347,freq=5.0), product of:
              0.10220416 = queryWeight, product of:
                1.0705655 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.01917001 = queryNorm
              0.69598204 = fieldWeight in 347, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.0625 = fieldNorm(doc=347)
          0.03527673 = weight(abstract_txt:under in 347) [ClassicSimilarity], result of:
            0.03527673 = score(doc=347,freq=1.0), product of:
              0.109498054 = queryWeight, product of:
                1.1081082 = boost
                5.154682 = idf(docFreq=696, maxDocs=44421)
                0.01917001 = queryNorm
              0.32216763 = fieldWeight in 347, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.154682 = idf(docFreq=696, maxDocs=44421)
                0.0625 = fieldNorm(doc=347)
          0.016263139 = weight(abstract_txt:results in 347) [ClassicSimilarity], result of:
            0.016263139 = score(doc=347,freq=1.0), product of:
              0.07480217 = queryWeight, product of:
                1.1217128 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.01917001 = queryNorm
              0.21741535 = fieldWeight in 347, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.0625 = fieldNorm(doc=347)
          0.038884636 = weight(abstract_txt:publication in 347) [ClassicSimilarity], result of:
            0.038884636 = score(doc=347,freq=1.0), product of:
              0.11684215 = queryWeight, product of:
                1.1446658 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.01917001 = queryNorm
              0.3327963 = fieldWeight in 347, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.0625 = fieldNorm(doc=347)
          0.02127953 = weight(abstract_txt:using in 347) [ClassicSimilarity], result of:
            0.02127953 = score(doc=347,freq=1.0), product of:
              0.09849153 = queryWeight, product of:
                1.4862552 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.01917001 = queryNorm
              0.21605442 = fieldWeight in 347, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=347)
          0.10363392 = weight(abstract_txt:name in 347) [ClassicSimilarity], result of:
            0.10363392 = score(doc=347,freq=2.0), product of:
              0.20406532 = queryWeight, product of:
                1.8527174 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.01917001 = queryNorm
              0.5078468 = fieldWeight in 347, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.0625 = fieldNorm(doc=347)
          0.2033661 = weight(abstract_txt:names in 347) [ClassicSimilarity], result of:
            0.2033661 = score(doc=347,freq=7.0), product of:
              0.21066755 = queryWeight, product of:
                1.8824497 = boost
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.01917001 = queryNorm
              0.9653414 = fieldWeight in 347, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.0625 = fieldNorm(doc=347)
          0.11040586 = weight(abstract_txt:citations in 347) [ClassicSimilarity], result of:
            0.11040586 = score(doc=347,freq=2.0), product of:
              0.23428382 = queryWeight, product of:
                2.2922664 = boost
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.01917001 = queryNorm
              0.47124836 = fieldWeight in 347, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.0625 = fieldNorm(doc=347)
          0.13652232 = weight(abstract_txt:title in 347) [ClassicSimilarity], result of:
            0.13652232 = score(doc=347,freq=2.0), product of:
              0.26990855 = queryWeight, product of:
                2.4603803 = boost
                5.722582 = idf(docFreq=394, maxDocs=44421)
                0.01917001 = queryNorm
              0.50580955 = fieldWeight in 347, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.722582 = idf(docFreq=394, maxDocs=44421)
                0.0625 = fieldNorm(doc=347)
        0.36 = coord(9/25)
    
  4. Kim, J.; Kim, J.; Owen-Smith, J.: Ethnicity-based name partitioning for author name disambiguation using supervised machine learning (2021) 0.25
    0.25215513 = sum of:
      0.25215513 = product of:
        0.900554 = sum of:
          0.01762096 = weight(abstract_txt:based in 1312) [ClassicSimilarity], result of:
            0.01762096 = score(doc=1312,freq=2.0), product of:
              0.06263076 = queryWeight, product of:
                1.0264041 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.01917001 = queryNorm
              0.28134674 = fieldWeight in 1312, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.044987988 = weight(abstract_txt:author in 1312) [ClassicSimilarity], result of:
            0.044987988 = score(doc=1312,freq=2.0), product of:
              0.10220416 = queryWeight, product of:
                1.0705655 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.01917001 = queryNorm
              0.44017768 = fieldWeight in 1312, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.016263139 = weight(abstract_txt:results in 1312) [ClassicSimilarity], result of:
            0.016263139 = score(doc=1312,freq=1.0), product of:
              0.07480217 = queryWeight, product of:
                1.1217128 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.01917001 = queryNorm
              0.21741535 = fieldWeight in 1312, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.2270066 = weight(abstract_txt:disambiguation in 1312) [ClassicSimilarity], result of:
            0.2270066 = score(doc=1312,freq=5.0), product of:
              0.22153881 = queryWeight, product of:
                1.576173 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.01917001 = queryNorm
              1.024681 = fieldWeight in 1312, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.27418956 = weight(abstract_txt:name in 1312) [ClassicSimilarity], result of:
            0.27418956 = score(doc=1312,freq=14.0), product of:
              0.20406532 = queryWeight, product of:
                1.8527174 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.01917001 = queryNorm
              1.3436363 = fieldWeight in 1312, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.13313438 = weight(abstract_txt:names in 1312) [ClassicSimilarity], result of:
            0.13313438 = score(doc=1312,freq=3.0), product of:
              0.21066755 = queryWeight, product of:
                1.8824497 = boost
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.01917001 = queryNorm
              0.6319643 = fieldWeight in 1312, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.18735139 = weight(abstract_txt:coauthor in 1312) [ClassicSimilarity], result of:
            0.18735139 = score(doc=1312,freq=1.0), product of:
              0.3333129 = queryWeight, product of:
                1.9333256 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.01917001 = queryNorm
              0.5620886 = fieldWeight in 1312, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
        0.28 = coord(7/25)
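
The per-term lines above follow the usual ClassicSimilarity (TF-IDF) arithmetic: tf is the square root of the term frequency, idf is 1 + ln(maxDocs / (docFreq + 1)), queryWeight is boost × idf × queryNorm, fieldWeight is tf × idf × fieldNorm, and the term score is their product. The sketch below recomputes the "coauthor" weight for doc 1312 from the numbers printed in the listing. The function and variable names are illustrative assumptions, not part of the catalog system, and the inputs are the rounded values shown above, so agreement is only expected to roughly five decimal places.

```python
import math

def classic_idf(doc_freq, max_docs):
    # ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # Hypothetical helper mirroring one "weight(...)" subtree of the explain output.
    idf = classic_idf(doc_freq, max_docs)
    tf = math.sqrt(freq)                      # tf(freq) = sqrt(termFreq)
    query_weight = boost * idf * query_norm   # "queryWeight, product of:"
    field_weight = tf * idf * field_norm      # "fieldWeight in <doc>, product of:"
    return query_weight * field_weight

# Values copied from the "abstract_txt:coauthor in 1312" subtree.
coauthor_score = term_score(
    freq=1.0, doc_freq=14, max_docs=44421,
    boost=1.9333256, query_norm=0.01917001, field_norm=0.0625,
)
print(coauthor_score)
```

The printed value should be close to the 0.18735139 reported in the listing. Small differences are expected because Lucene computes in single precision while this sketch uses Python floats.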
    
  5. Cortez, E.; Silva, A.S. da; Gonçalves, M.A.; Mesquita, F.; Moura, E.S. de: ¬A flexible approach for extracting metadata from bibliographic citations (2009) 0.24
    0.24158223 = sum of:
      0.24158223 = product of:
        0.60395557 = sum of:
          0.09074233 = weight(abstract_txt:ours in 3848) [ClassicSimilarity], result of:
            0.09074233 = score(doc=3848,freq=1.0), product of:
              0.17834958 = queryWeight, product of:
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.01917001 = queryNorm
              0.5087891 = fieldWeight in 3848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
          0.010902413 = weight(abstract_txt:based in 3848) [ClassicSimilarity], result of:
            0.010902413 = score(doc=3848,freq=1.0), product of:
              0.06263076 = queryWeight, product of:
                1.0264041 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.01917001 = queryNorm
              0.17407443 = fieldWeight in 3848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
          0.0278349 = weight(abstract_txt:author in 3848) [ClassicSimilarity], result of:
            0.0278349 = score(doc=3848,freq=1.0), product of:
              0.10220416 = queryWeight, product of:
                1.0705655 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.01917001 = queryNorm
              0.27234605 = fieldWeight in 3848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
          0.02464751 = weight(abstract_txt:results in 3848) [ClassicSimilarity], result of:
            0.02464751 = score(doc=3848,freq=3.0), product of:
              0.07480217 = queryWeight, product of:
                1.1217128 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.01917001 = queryNorm
              0.3295026 = fieldWeight in 3848, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
          0.032025274 = weight(abstract_txt:without in 3848) [ClassicSimilarity], result of:
            0.032025274 = score(doc=3848,freq=1.0), product of:
              0.112220116 = queryWeight, product of:
                1.1217971 = boost
                5.2183604 = idf(docFreq=653, maxDocs=44421)
                0.01917001 = queryNorm
              0.28537908 = fieldWeight in 3848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2183604 = idf(docFreq=653, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
          0.044513814 = weight(abstract_txt:against in 3848) [ClassicSimilarity], result of:
            0.044513814 = score(doc=3848,freq=1.0), product of:
              0.13976723 = queryWeight, product of:
                1.2519345 = boost
                5.823732 = idf(docFreq=356, maxDocs=44421)
                0.01917001 = queryNorm
              0.31848535 = fieldWeight in 3848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.823732 = idf(docFreq=356, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
          0.01861959 = weight(abstract_txt:using in 3848) [ClassicSimilarity], result of:
            0.01861959 = score(doc=3848,freq=1.0), product of:
              0.09849153 = queryWeight, product of:
                1.4862552 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.01917001 = queryNorm
              0.18904762 = fieldWeight in 3848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
          0.06725702 = weight(abstract_txt:names in 3848) [ClassicSimilarity], result of:
            0.06725702 = score(doc=3848,freq=1.0), product of:
              0.21066755 = queryWeight, product of:
                1.8824497 = boost
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.01917001 = queryNorm
              0.31925666 = fieldWeight in 3848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
          0.1366203 = weight(abstract_txt:citations in 3848) [ClassicSimilarity], result of:
            0.1366203 = score(doc=3848,freq=4.0), product of:
              0.23428382 = queryWeight, product of:
                2.2922664 = boost
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.01917001 = queryNorm
              0.58314013 = fieldWeight in 3848, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.331567 = idf(docFreq=583, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
          0.15079242 = weight(abstract_txt:method in 3848) [ClassicSimilarity], result of:
            0.15079242 = score(doc=3848,freq=6.0), product of:
              0.25021797 = queryWeight, product of:
                2.9013414 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.01917001 = queryNorm
              0.60264426 = fieldWeight in 3848, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3848)
        0.4 = coord(10/25)
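
The top-level score of each entry is the sum of its matching term weights multiplied by the coordination factor coord(matched/total), where the total of 25 query terms is read from the coord line. As a minimal sketch, the snippet below redoes that aggregation for doc 3848 using the ten term weights printed above. The variable names are illustrative assumptions.

```python
# Term weights copied from the explain tree for doc 3848 (entry 5).
term_weights = [
    0.09074233,   # ours
    0.010902413,  # based
    0.0278349,    # author
    0.02464751,   # results
    0.032025274,  # without
    0.044513814,  # against
    0.01861959,   # using
    0.06725702,   # names
    0.1366203,    # citations
    0.15079242,   # method
]

total_query_terms = 25                 # denominator of coord(10/25)
matched_terms = len(term_weights)      # numerator of coord(10/25)

coord = matched_terms / total_query_terms
raw_sum = sum(term_weights)            # the "sum of:" line
final_score = raw_sum * coord          # the "product of:" line

print(raw_sum, coord, final_score)
```

The same calculation applied to entry 4 (seven matching terms, coord(7/25)) gives its reported 0.25215513 up to rounding.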