Document (#43885)

Author
Zhang, L.
Lu, W.
Yang, J.
Title
LAGOS-AND : a large gold standard dataset for scholarly author name disambiguation
Source
Journal of the Association for Information Science and Technology. 74(2023) no.2, pp. 168-185
Year
2023
Abstract
In this article, we present a method to automatically build large labeled datasets for the author ambiguity problem in the academic world by leveraging two authoritative academic resources, ORCID and DOI. Using the method, we built LAGOS-AND, two large, gold-standard sub-datasets for author name disambiguation (AND): LAGOS-AND-BLOCK is created for clustering-based AND research and LAGOS-AND-PAIRWISE is created for classification-based AND research. Our LAGOS-AND datasets are substantially different from the existing ones. The initial versions of the datasets (v1.0, released in February 2021) include 7.5 M citations authored by 798 K unique authors (LAGOS-AND-BLOCK) and close to 1 M instances (LAGOS-AND-PAIRWISE). Both datasets show close similarities to the whole Microsoft Academic Graph (MAG) across validations of six facets. In building the datasets, we reveal the degree of variation of last names in three literature databases, PubMed, MAG, and Semantic Scholar, by comparing the author names they host with the authors' official last names shown on the ORCID pages. Furthermore, we evaluate several baseline disambiguation methods as well as MAG's author ID system on our datasets, and the evaluation yields several interesting findings. We hope the datasets and findings will bring new insights for future studies. The code and datasets are publicly available.
Content
Cf.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24720.
Theme
Descriptive cataloging
Object
ORCID

Similar documents (author)

  1. Zhang, M.; Yang, C.C.: Using content and network analysis to understand the social support exchange patterns and user behaviors of an online smoking cessation intervention program (2015) 4.80
    4.7986236 = sum of:
      4.7986236 = sum of:
        1.996627 = weight(author_txt:zhang in 2668) [ClassicSimilarity], result of:
          1.996627 = score(doc=2668,freq=1.0), product of:
            0.623639 = queryWeight, product of:
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.09739565 = queryNorm
            3.201575 = fieldWeight in 2668, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.5 = fieldNorm(doc=2668)
        2.8019967 = weight(author_txt:yang in 2668) [ClassicSimilarity], result of:
          2.8019967 = score(doc=2668,freq=1.0), product of:
            0.7817125 = queryWeight, product of:
              1.1195846 = boost
              7.168868 = idf(docFreq=92, maxDocs=44421)
              0.09739565 = queryNorm
            3.584434 = fieldWeight in 2668, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.168868 = idf(docFreq=92, maxDocs=44421)
              0.5 = fieldNorm(doc=2668)
    
  2. Yang, F.; Zhang, X.: Focal fields in literature on the information divide : the USA, China, UK and India (2020) 4.80
    4.7986236 = sum of:
      4.7986236 = sum of:
        1.996627 = weight(author_txt:zhang in 835) [ClassicSimilarity], result of:
          1.996627 = score(doc=835,freq=1.0), product of:
            0.623639 = queryWeight, product of:
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.09739565 = queryNorm
            3.201575 = fieldWeight in 835, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.5 = fieldNorm(doc=835)
        2.8019967 = weight(author_txt:yang in 835) [ClassicSimilarity], result of:
          2.8019967 = score(doc=835,freq=1.0), product of:
            0.7817125 = queryWeight, product of:
              1.1195846 = boost
              7.168868 = idf(docFreq=92, maxDocs=44421)
              0.09739565 = queryNorm
            3.584434 = fieldWeight in 835, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.168868 = idf(docFreq=92, maxDocs=44421)
              0.5 = fieldNorm(doc=835)
    
  3. Shen, D.; Chen, Z.; Yang, Q.; Zeng, H.J.; Zhang, B.; Lu, Y.; Ma, W.Y.: Web page classification through summarization (2004) 2.40
    2.3993118 = sum of:
      2.3993118 = sum of:
        0.9983135 = weight(author_txt:zhang in 5132) [ClassicSimilarity], result of:
          0.9983135 = score(doc=5132,freq=1.0), product of:
            0.623639 = queryWeight, product of:
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.09739565 = queryNorm
            1.6007875 = fieldWeight in 5132, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.40315 = idf(docFreq=199, maxDocs=44421)
              0.25 = fieldNorm(doc=5132)
        1.4009984 = weight(author_txt:yang in 5132) [ClassicSimilarity], result of:
          1.4009984 = score(doc=5132,freq=1.0), product of:
            0.7817125 = queryWeight, product of:
              1.1195846 = boost
              7.168868 = idf(docFreq=92, maxDocs=44421)
              0.09739565 = queryNorm
            1.792217 = fieldWeight in 5132, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.168868 = idf(docFreq=92, maxDocs=44421)
              0.25 = fieldNorm(doc=5132)
    
  4. Yang, S.C.: An interpretive and situated approach to an evaluation of Perseus digital libraries (2001) 1.75
    1.751248 = sum of:
      1.751248 = product of:
        3.502496 = sum of:
          3.502496 = weight(author_txt:yang in 933) [ClassicSimilarity], result of:
            3.502496 = score(doc=933,freq=1.0), product of:
              0.7817125 = queryWeight, product of:
                1.1195846 = boost
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.09739565 = queryNorm
              4.4805427 = fieldWeight in 933, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.625 = fieldNorm(doc=933)
        0.5 = coord(1/2)
    
  5. Yang, K.: Information retrieval on the Web (2004) 1.75
    1.751248 = sum of:
      1.751248 = product of:
        3.502496 = sum of:
          3.502496 = weight(author_txt:yang in 5278) [ClassicSimilarity], result of:
            3.502496 = score(doc=5278,freq=1.0), product of:
              0.7817125 = queryWeight, product of:
                1.1195846 = boost
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.09739565 = queryNorm
              4.4805427 = fieldWeight in 5278, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.625 = fieldNorm(doc=5278)
        0.5 = coord(1/2)
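
The score trees above can be recomputed by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity relations that the trees themselves display: tf is the square root of the term frequency, idf is 1 + ln(maxDocs / (docFreq + 1)), queryWeight is boost * idf * queryNorm, and fieldWeight is tf * idf * fieldNorm. The function names are illustrative only and are not part of any library; the numeric inputs are copied from the entries for doc 2668 and doc 933.

```python
import math

def idf(doc_freq, max_docs):
    # Inverse document frequency as shown in the trees,
    # e.g. idf(docFreq=199, maxDocs=44421) is about 6.40315.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def tf(freq):
    # Term frequency factor: square root of the raw count.
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    # score = queryWeight * fieldWeight
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight

MAX_DOCS = 44421
QUERY_NORM = 0.09739565  # queryNorm reported for the author query

# Doc 2668 matches both author terms, so the two term scores are simply summed.
zhang_2668 = term_score(1.0, 199, MAX_DOCS, QUERY_NORM, 0.5)
yang_2668 = term_score(1.0, 92, MAX_DOCS, QUERY_NORM, 0.5, boost=1.1195846)
total_2668 = zhang_2668 + yang_2668

# Doc 933 matches only "yang", so the sum is scaled by coord(1/2) = 0.5.
yang_933 = term_score(1.0, 92, MAX_DOCS, QUERY_NORM, 0.625, boost=1.1195846)
total_933 = yang_933 * (1 / 2)

print(round(total_2668, 4), round(total_933, 4))
```

Small differences in the last digits relative to the listing are expected, since the listing uses single-precision floats while this sketch uses doubles.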
    

Similar documents (content)

  1. Liu, W.; Doğan, R.I.; Kim, S.; Comeau, D.C.; Kim, W.; Yeganova, L.; Lu, Z.; Wilbur, W.J.: Author name disambiguation for PubMed (2014) 0.36
    0.35935998 = sum of:
      0.35935998 = product of:
        0.89839995 = sum of:
          0.08742718 = weight(abstract_txt:pubmed in 2240) [ClassicSimilarity], result of:
            0.08742718 = score(doc=2240,freq=4.0), product of:
              0.10339102 = queryWeight, product of:
                7.731176 = idf(docFreq=52, maxDocs=44421)
                0.013373259 = queryNorm
              0.8455974 = fieldWeight in 2240, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.731176 = idf(docFreq=52, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2240)
          0.029837493 = weight(abstract_txt:method in 2240) [ClassicSimilarity], result of:
            0.029837493 = score(doc=2240,freq=3.0), product of:
              0.07001907 = queryWeight, product of:
                1.1638091 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.013373259 = queryNorm
              0.42613384 = fieldWeight in 2240, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2240)
          0.020006886 = weight(abstract_txt:standard in 2240) [ClassicSimilarity], result of:
            0.020006886 = score(doc=2240,freq=1.0), product of:
              0.077363275 = queryWeight, product of:
                1.2233226 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.013373259 = queryNorm
              0.2586096 = fieldWeight in 2240, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2240)
          0.09494495 = weight(abstract_txt:name in 2240) [ClassicSimilarity], result of:
            0.09494495 = score(doc=2240,freq=7.0), product of:
              0.11420815 = queryWeight, product of:
                1.4863535 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.013373259 = queryNorm
              0.8313325 = fieldWeight in 2240, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2240)
          0.02487939 = weight(abstract_txt:large in 2240) [ClassicSimilarity], result of:
            0.02487939 = score(doc=2240,freq=1.0), product of:
              0.102409154 = queryWeight, product of:
                1.7238069 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.013373259 = queryNorm
              0.24294108 = fieldWeight in 2240, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2240)
          0.09156675 = weight(abstract_txt:gold in 2240) [ClassicSimilarity], result of:
            0.09156675 = score(doc=2240,freq=1.0), product of:
              0.21325885 = queryWeight, product of:
                2.0310805 = boost
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.013373259 = queryNorm
              0.42936906 = fieldWeight in 2240, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2240)
          0.11091016 = weight(abstract_txt:pairwise in 2240) [ClassicSimilarity], result of:
            0.11091016 = score(doc=2240,freq=1.0), product of:
              0.2423238 = queryWeight, product of:
                2.1650684 = boost
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.013373259 = queryNorm
              0.45769405 = fieldWeight in 2240, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2240)
          0.07984942 = weight(abstract_txt:names in 2240) [ClassicSimilarity], result of:
            0.07984942 = score(doc=2240,freq=2.0), product of:
              0.17685477 = queryWeight, product of:
                2.2653098 = boost
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.013373259 = queryNorm
              0.4514971 = fieldWeight in 2240, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2240)
          0.19374591 = weight(abstract_txt:disambiguation in 2240) [ClassicSimilarity], result of:
            0.19374591 = score(doc=2240,freq=3.0), product of:
              0.27897176 = queryWeight, product of:
                2.8451118 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.013373259 = queryNorm
              0.6945001 = fieldWeight in 2240, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2240)
          0.16523185 = weight(abstract_txt:author in 2240) [ClassicSimilarity], result of:
            0.16523185 = score(doc=2240,freq=8.0), product of:
              0.2145002 = queryWeight, product of:
                3.2207532 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.013373259 = queryNorm
              0.77031094 = fieldWeight in 2240, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2240)
        0.4 = coord(10/25)
    
  2. Zhao, D.; Strotmann, A.: Counting first, last, or all authors in citation analysis : a comprehensive comparison in the highly collaborative stem cell research field (2011) 0.21
    0.20748396 = sum of:
      0.20748396 = product of:
        0.5763443 = sum of:
          0.049958386 = weight(abstract_txt:pubmed in 368) [ClassicSimilarity], result of:
            0.049958386 = score(doc=368,freq=1.0), product of:
              0.10339102 = queryWeight, product of:
                7.731176 = idf(docFreq=52, maxDocs=44421)
                0.013373259 = queryNorm
              0.4831985 = fieldWeight in 368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.731176 = idf(docFreq=52, maxDocs=44421)
                0.0625 = fieldNorm(doc=368)
          0.015329244 = weight(abstract_txt:findings in 368) [ClassicSimilarity], result of:
            0.015329244 = score(doc=368,freq=1.0), product of:
              0.059260827 = queryWeight, product of:
                1.0706744 = boost
                4.1387863 = idf(docFreq=1924, maxDocs=44421)
                0.013373259 = queryNorm
              0.25867414 = fieldWeight in 368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1387863 = idf(docFreq=1924, maxDocs=44421)
                0.0625 = fieldNorm(doc=368)
          0.01968764 = weight(abstract_txt:method in 368) [ClassicSimilarity], result of:
            0.01968764 = score(doc=368,freq=1.0), product of:
              0.07001907 = queryWeight, product of:
                1.1638091 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.013373259 = queryNorm
              0.2811754 = fieldWeight in 368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=368)
          0.020346154 = weight(abstract_txt:several in 368) [ClassicSimilarity], result of:
            0.020346154 = score(doc=368,freq=1.0), product of:
              0.07157183 = queryWeight, product of:
                1.1766428 = boost
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.013373259 = queryNorm
              0.284276 = fieldWeight in 368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.0625 = fieldNorm(doc=368)
          0.03065179 = weight(abstract_txt:authors in 368) [ClassicSimilarity], result of:
            0.03065179 = score(doc=368,freq=2.0), product of:
              0.07465309 = queryWeight, product of:
                1.2017039 = boost
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.013373259 = queryNorm
              0.4105897 = fieldWeight in 368, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.0625 = fieldNorm(doc=368)
          0.04101236 = weight(abstract_txt:name in 368) [ClassicSimilarity], result of:
            0.04101236 = score(doc=368,freq=1.0), product of:
              0.11420815 = queryWeight, product of:
                1.4863535 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.013373259 = queryNorm
              0.3591019 = fieldWeight in 368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.0625 = fieldNorm(doc=368)
          0.07122833 = weight(abstract_txt:last in 368) [ClassicSimilarity], result of:
            0.07122833 = score(doc=368,freq=3.0), product of:
              0.114414744 = queryWeight, product of:
                1.4876974 = boost
                5.750825 = idf(docFreq=383, maxDocs=44421)
                0.013373259 = queryNorm
              0.62254506 = fieldWeight in 368, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.750825 = idf(docFreq=383, maxDocs=44421)
                0.0625 = fieldNorm(doc=368)
          0.12783915 = weight(abstract_txt:disambiguation in 368) [ClassicSimilarity], result of:
            0.12783915 = score(doc=368,freq=1.0), product of:
              0.27897176 = queryWeight, product of:
                2.8451118 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.013373259 = queryNorm
              0.45825124 = fieldWeight in 368, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0625 = fieldNorm(doc=368)
          0.20029126 = weight(abstract_txt:author in 368) [ClassicSimilarity], result of:
            0.20029126 = score(doc=368,freq=9.0), product of:
              0.2145002 = queryWeight, product of:
                3.2207532 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.013373259 = queryNorm
              0.9337579 = fieldWeight in 368, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.0625 = fieldNorm(doc=368)
        0.36 = coord(9/25)
    
  3. Kim, J.; Diesner, J.: Distortive effects of initial-based name disambiguation on measurements of large-scale coauthorship networks (2016) 0.20
    0.20297827 = sum of:
      0.20297827 = product of:
        0.6343071 = sum of:
          0.015329244 = weight(abstract_txt:findings in 3936) [ClassicSimilarity], result of:
            0.015329244 = score(doc=3936,freq=1.0), product of:
              0.059260827 = queryWeight, product of:
                1.0706744 = boost
                4.1387863 = idf(docFreq=1924, maxDocs=44421)
                0.013373259 = queryNorm
              0.25867414 = fieldWeight in 3936, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1387863 = idf(docFreq=1924, maxDocs=44421)
                0.0625 = fieldNorm(doc=3936)
          0.03065179 = weight(abstract_txt:authors in 3936) [ClassicSimilarity], result of:
            0.03065179 = score(doc=3936,freq=2.0), product of:
              0.07465309 = queryWeight, product of:
                1.2017039 = boost
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.013373259 = queryNorm
              0.4105897 = fieldWeight in 3936, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.0625 = fieldNorm(doc=3936)
          0.08202472 = weight(abstract_txt:name in 3936) [ClassicSimilarity], result of:
            0.08202472 = score(doc=3936,freq=4.0), product of:
              0.11420815 = queryWeight, product of:
                1.4863535 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.013373259 = queryNorm
              0.7182038 = fieldWeight in 3936, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.0625 = fieldNorm(doc=3936)
          0.028433591 = weight(abstract_txt:large in 3936) [ClassicSimilarity], result of:
            0.028433591 = score(doc=3936,freq=1.0), product of:
              0.102409154 = queryWeight, product of:
                1.7238069 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.013373259 = queryNorm
              0.27764696 = fieldWeight in 3936, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0625 = fieldNorm(doc=3936)
          0.033064388 = weight(abstract_txt:academic in 3936) [ClassicSimilarity], result of:
            0.033064388 = score(doc=3936,freq=1.0), product of:
              0.11324646 = queryWeight, product of:
                1.8127234 = boost
                4.6714945 = idf(docFreq=1129, maxDocs=44421)
                0.013373259 = queryNorm
              0.2919684 = fieldWeight in 3936, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6714945 = idf(docFreq=1129, maxDocs=44421)
                0.0625 = fieldNorm(doc=3936)
          0.06452808 = weight(abstract_txt:names in 3936) [ClassicSimilarity], result of:
            0.06452808 = score(doc=3936,freq=1.0), product of:
              0.17685477 = queryWeight, product of:
                2.2653098 = boost
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.013373259 = queryNorm
              0.36486477 = fieldWeight in 3936, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.0625 = fieldNorm(doc=3936)
          0.28585705 = weight(abstract_txt:disambiguation in 3936) [ClassicSimilarity], result of:
            0.28585705 = score(doc=3936,freq=5.0), product of:
              0.27897176 = queryWeight, product of:
                2.8451118 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.013373259 = queryNorm
              1.024681 = fieldWeight in 3936, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0625 = fieldNorm(doc=3936)
          0.094418205 = weight(abstract_txt:author in 3936) [ClassicSimilarity], result of:
            0.094418205 = score(doc=3936,freq=2.0), product of:
              0.2145002 = queryWeight, product of:
                3.2207532 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.013373259 = queryNorm
              0.44017768 = fieldWeight in 3936, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.0625 = fieldNorm(doc=3936)
        0.32 = coord(8/25)
    
  4. Kim, J.: Scale-free collaboration networks : an author name disambiguation perspective (2019) 0.20
    0.20236415 = sum of:
      0.20236415 = product of:
        0.7227291 = sum of:
          0.020346154 = weight(abstract_txt:several in 297) [ClassicSimilarity], result of:
            0.020346154 = score(doc=297,freq=1.0), product of:
              0.07157183 = queryWeight, product of:
                1.1766428 = boost
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.013373259 = queryNorm
              0.284276 = fieldWeight in 297, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.0625 = fieldNorm(doc=297)
          0.031448014 = weight(abstract_txt:created in 297) [ClassicSimilarity], result of:
            0.031448014 = score(doc=297,freq=1.0), product of:
              0.09567887 = queryWeight, product of:
                1.3604469 = boost
                5.2589273 = idf(docFreq=627, maxDocs=44421)
                0.013373259 = queryNorm
              0.32868296 = fieldWeight in 297, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2589273 = idf(docFreq=627, maxDocs=44421)
                0.0625 = fieldNorm(doc=297)
          0.05800024 = weight(abstract_txt:name in 297) [ClassicSimilarity], result of:
            0.05800024 = score(doc=297,freq=2.0), product of:
              0.11420815 = queryWeight, product of:
                1.4863535 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.013373259 = queryNorm
              0.5078468 = fieldWeight in 297, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.0625 = fieldNorm(doc=297)
          0.06452808 = weight(abstract_txt:names in 297) [ClassicSimilarity], result of:
            0.06452808 = score(doc=297,freq=1.0), product of:
              0.17685477 = queryWeight, product of:
                2.2653098 = boost
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.013373259 = queryNorm
              0.36486477 = fieldWeight in 297, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.0625 = fieldNorm(doc=297)
          0.18079185 = weight(abstract_txt:disambiguation in 297) [ClassicSimilarity], result of:
            0.18079185 = score(doc=297,freq=2.0), product of:
              0.27897176 = queryWeight, product of:
                2.8451118 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.013373259 = queryNorm
              0.6480651 = fieldWeight in 297, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0625 = fieldNorm(doc=297)
          0.094418205 = weight(abstract_txt:author in 297) [ClassicSimilarity], result of:
            0.094418205 = score(doc=297,freq=2.0), product of:
              0.2145002 = queryWeight, product of:
                3.2207532 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.013373259 = queryNorm
              0.44017768 = fieldWeight in 297, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.0625 = fieldNorm(doc=297)
          0.27319658 = weight(abstract_txt:datasets in 297) [ClassicSimilarity], result of:
            0.27319658 = score(doc=297,freq=1.0), product of:
              0.6675363 = queryWeight, product of:
                7.6228485 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.013373259 = queryNorm
              0.409261 = fieldWeight in 297, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0625 = fieldNorm(doc=297)
        0.28 = coord(7/25)
    
  5. Sandberg, J.; Jin, Q.: How should catalogers provide authority control for journal article authors? : Name identifiers in the linked data world (2016) 0.18
    0.18205938 = sum of:
      0.18205938 = product of:
        0.65021205 = sum of:
          0.03051923 = weight(abstract_txt:several in 138) [ClassicSimilarity], result of:
            0.03051923 = score(doc=138,freq=1.0), product of:
              0.07157183 = queryWeight, product of:
                1.1766428 = boost
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.013373259 = queryNorm
              0.426414 = fieldWeight in 138, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.09375 = fieldNorm(doc=138)
          0.045977682 = weight(abstract_txt:authors in 138) [ClassicSimilarity], result of:
            0.045977682 = score(doc=138,freq=2.0), product of:
              0.07465309 = queryWeight, product of:
                1.2017039 = boost
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.013373259 = queryNorm
              0.61588454 = fieldWeight in 138, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.09375 = fieldNorm(doc=138)
          0.03429752 = weight(abstract_txt:standard in 138) [ClassicSimilarity], result of:
            0.03429752 = score(doc=138,freq=1.0), product of:
              0.077363275 = queryWeight, product of:
                1.2233226 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.013373259 = queryNorm
              0.44333076 = fieldWeight in 138, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.09375 = fieldNorm(doc=138)
          0.087000355 = weight(abstract_txt:name in 138) [ClassicSimilarity], result of:
            0.087000355 = score(doc=138,freq=2.0), product of:
              0.11420815 = queryWeight, product of:
                1.4863535 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.013373259 = queryNorm
              0.7617701 = fieldWeight in 138, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.09375 = fieldNorm(doc=138)
          0.21399781 = weight(abstract_txt:orcid in 138) [ClassicSimilarity], result of:
            0.21399781 = score(doc=138,freq=1.0), product of:
              0.26219994 = queryWeight, product of:
                2.2521114 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.013373259 = queryNorm
              0.8161627 = fieldWeight in 138, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.09375 = fieldNorm(doc=138)
          0.09679211 = weight(abstract_txt:names in 138) [ClassicSimilarity], result of:
            0.09679211 = score(doc=138,freq=1.0), product of:
              0.17685477 = queryWeight, product of:
                2.2653098 = boost
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.013373259 = queryNorm
              0.5472971 = fieldWeight in 138, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.09375 = fieldNorm(doc=138)
          0.1416273 = weight(abstract_txt:author in 138) [ClassicSimilarity], result of:
            0.1416273 = score(doc=138,freq=2.0), product of:
              0.2145002 = queryWeight, product of:
                3.2207532 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.013373259 = queryNorm
              0.6602665 = fieldWeight in 138, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.09375 = fieldNorm(doc=138)
        0.28 = coord(7/25)
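
For the content-based list, the final score of each hit is the sum of its per-term scores multiplied by the coord factor, that is, the share of the 25 query terms that the document matches. A minimal sketch of that last step for the top hit (doc 2240), using the ten per-term scores exactly as listed in its tree; the variable names are illustrative only:

```python
# Per-term scores for doc 2240, copied from its score tree.
term_scores_2240 = {
    "pubmed": 0.08742718,
    "method": 0.029837493,
    "standard": 0.020006886,
    "name": 0.09494495,
    "large": 0.02487939,
    "gold": 0.09156675,
    "pairwise": 0.11091016,
    "names": 0.07984942,
    "disambiguation": 0.19374591,
    "author": 0.16523185,
}

QUERY_TERM_COUNT = 25  # denominator of coord(10/25) in the listing

raw_sum = sum(term_scores_2240.values())
coord = len(term_scores_2240) / QUERY_TERM_COUNT  # 10 of 25 terms matched
final_score = raw_sum * coord

print(round(raw_sum, 6), coord, round(final_score, 6))
```

The same recombination applies to the other hits, with coord(9/25), coord(8/25), and coord(7/25) respectively, which is why a document with a larger raw sum (such as doc 297) can still rank below one that matches more query terms.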