Document (#33618)

Author
Aksnes, D.W.
Title
When different persons have an identical author name : how frequent are homonyms?
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.5, pp.838-841
Year
2008
Abstract
The phenomenon that different persons may have the same author name (homonymy) represents a major problem for publication analysis at the individual level and, more generally, for retrieving publications based on author names. In such cases, all publications from the persons sharing the name will be collected in search results. This makes it difficult to provide a true picture of a researcher's publication output. The present study examines how frequently homonyms occur in a population of more than 30,000 individuals. The population represents the entire set of research personnel in Norway. It is found that 14% of the persons share their author name with one or more other individuals. For the remaining 86% there is a one-to-one correspondence. Thus, for the large majority of persons, homonyms do not represent a problem. In the final part of the article, potential practical applications of these findings are given particular attention.
Theme
Informetrics

Similar documents (content)

  1. Kang, I.-S.; Na, S.-H.; Lee, S.; Jung, H.; Kim, P.; Sung, W.-K.; Lee, J.-H.: On co-authorship for author disambiguation (2009) 0.18
    0.17847912 = sum of:
      0.17847912 = product of:
        0.55774724 = sum of:
          0.011035985 = weight(abstract_txt:have in 3453) [ClassicSimilarity], result of:
            0.011035985 = score(doc=3453,freq=1.0), product of:
              0.044152383 = queryWeight, product of:
                1.0201932 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.013527103 = queryNorm
              0.2499522 = fieldWeight in 3453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.078125 = fieldNorm(doc=3453)
          0.01651804 = weight(abstract_txt:different in 3453) [ClassicSimilarity], result of:
            0.01651804 = score(doc=3453,freq=1.0), product of:
              0.057772223 = queryWeight, product of:
                1.166984 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.013527103 = queryNorm
              0.28591663 = fieldWeight in 3453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.078125 = fieldNorm(doc=3453)
          0.029883768 = weight(abstract_txt:problem in 3453) [ClassicSimilarity], result of:
            0.029883768 = score(doc=3453,freq=1.0), product of:
              0.085776895 = queryWeight, product of:
                1.4219702 = boost
                4.4593854 = idf(docFreq=1396, maxDocs=44421)
                0.013527103 = queryNorm
              0.34838948 = fieldWeight in 3453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4593854 = idf(docFreq=1396, maxDocs=44421)
                0.078125 = fieldNorm(doc=3453)
          0.019801272 = weight(abstract_txt:more in 3453) [ClassicSimilarity], result of:
            0.019801272 = score(doc=3453,freq=1.0), product of:
              0.0746287 = queryWeight, product of:
                1.624441 = boost
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.013527103 = queryNorm
              0.26533052 = fieldWeight in 3453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.078125 = fieldNorm(doc=3453)
          0.049056504 = weight(abstract_txt:publications in 3453) [ClassicSimilarity], result of:
            0.049056504 = score(doc=3453,freq=1.0), product of:
              0.11936522 = queryWeight, product of:
                1.6774294 = boost
                5.260521 = idf(docFreq=626, maxDocs=44421)
                0.013527103 = queryNorm
              0.4109782 = fieldWeight in 3453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.260521 = idf(docFreq=626, maxDocs=44421)
                0.078125 = fieldNorm(doc=3453)
          0.06453075 = weight(abstract_txt:individuals in 3453) [ClassicSimilarity], result of:
            0.06453075 = score(doc=3453,freq=1.0), product of:
              0.14330387 = queryWeight, product of:
                1.8379526 = boost
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.013527103 = queryNorm
              0.45030713 = fieldWeight in 3453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.078125 = fieldNorm(doc=3453)
          0.18613341 = weight(abstract_txt:author in 3453) [ClassicSimilarity], result of:
            0.18613341 = score(doc=3453,freq=5.0), product of:
              0.21395198 = queryWeight, product of:
                3.1759853 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.013527103 = queryNorm
              0.86997753 = fieldWeight in 3453, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.078125 = fieldNorm(doc=3453)
          0.18078747 = weight(abstract_txt:name in 3453) [ClassicSimilarity], result of:
            0.18078747 = score(doc=3453,freq=2.0), product of:
              0.28479058 = queryWeight, product of:
                3.6642337 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.013527103 = queryNorm
              0.6348085 = fieldWeight in 3453, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.078125 = fieldNorm(doc=3453)
        0.32 = coord(8/25)
    
  2. Kim, J.; Kim, J.; Owen-Smith, J.: Ethnicity-based name partitioning for author name disambiguation using supervised machine learning (2021) 0.17
    0.1704338 = sum of:
      0.1704338 = product of:
        0.60869217 = sum of:
          0.038997564 = weight(abstract_txt:entire in 1312) [ClassicSimilarity], result of:
            0.038997564 = score(doc=1312,freq=1.0), product of:
              0.09434128 = queryWeight, product of:
                1.054487 = boost
                6.613871 = idf(docFreq=161, maxDocs=44421)
                0.013527103 = queryNorm
              0.41336694 = fieldWeight in 1312, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.613871 = idf(docFreq=161, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.039900284 = weight(abstract_txt:occur in 1312) [ClassicSimilarity], result of:
            0.039900284 = score(doc=1312,freq=1.0), product of:
              0.0957916 = queryWeight, product of:
                1.0625615 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.013527103 = queryNorm
              0.4165322 = fieldWeight in 1312, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.013214432 = weight(abstract_txt:different in 1312) [ClassicSimilarity], result of:
            0.013214432 = score(doc=1312,freq=1.0), product of:
              0.057772223 = queryWeight, product of:
                1.166984 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.013527103 = queryNorm
              0.2287333 = fieldWeight in 1312, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.023907015 = weight(abstract_txt:problem in 1312) [ClassicSimilarity], result of:
            0.023907015 = score(doc=1312,freq=1.0), product of:
              0.085776895 = queryWeight, product of:
                1.4219702 = boost
                4.4593854 = idf(docFreq=1396, maxDocs=44421)
                0.013527103 = queryNorm
              0.2787116 = fieldWeight in 1312, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4593854 = idf(docFreq=1396, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.015841018 = weight(abstract_txt:more in 1312) [ClassicSimilarity], result of:
            0.015841018 = score(doc=1312,freq=1.0), product of:
              0.0746287 = queryWeight, product of:
                1.624441 = boost
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.013527103 = queryNorm
              0.21226442 = fieldWeight in 1312, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.09417688 = weight(abstract_txt:author in 1312) [ClassicSimilarity], result of:
            0.09417688 = score(doc=1312,freq=2.0), product of:
              0.21395198 = queryWeight, product of:
                3.1759853 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.013527103 = queryNorm
              0.44017768 = fieldWeight in 1312, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.38265494 = weight(abstract_txt:name in 1312) [ClassicSimilarity], result of:
            0.38265494 = score(doc=1312,freq=14.0), product of:
              0.28479058 = queryWeight, product of:
                3.6642337 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.013527103 = queryNorm
              1.3436363 = fieldWeight in 1312, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
        0.28 = coord(7/25)
    
  3. Wang, Y.: ¬A look into Chinese persons' names in bibliography practice (2000) 0.15
    0.14901839 = sum of:
      0.14901839 = product of:
        0.931365 = sum of:
          0.013243181 = weight(abstract_txt:have in 401) [ClassicSimilarity], result of:
            0.013243181 = score(doc=401,freq=1.0), product of:
              0.044152383 = queryWeight, product of:
                1.0201932 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.013527103 = queryNorm
              0.2999426 = fieldWeight in 401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.09375 = fieldNorm(doc=401)
          0.028032042 = weight(abstract_txt:different in 401) [ClassicSimilarity], result of:
            0.028032042 = score(doc=401,freq=2.0), product of:
              0.057772223 = queryWeight, product of:
                1.166984 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.013527103 = queryNorm
              0.48521662 = fieldWeight in 401, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.09375 = fieldNorm(doc=401)
          0.21694495 = weight(abstract_txt:name in 401) [ClassicSimilarity], result of:
            0.21694495 = score(doc=401,freq=2.0), product of:
              0.28479058 = queryWeight, product of:
                3.6642337 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.013527103 = queryNorm
              0.7617701 = fieldWeight in 401, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.09375 = fieldNorm(doc=401)
          0.6731448 = weight(abstract_txt:persons in 401) [ClassicSimilarity], result of:
            0.6731448 = score(doc=401,freq=4.0), product of:
              0.5179942 = queryWeight, product of:
                5.5250707 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.013527103 = queryNorm
              1.2995218 = fieldWeight in 401, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.09375 = fieldNorm(doc=401)
        0.16 = coord(4/25)
    
  4. D'Angelo, C.A.; Giuffrida, C.; Abramo, G.: ¬A heuristic approach to author name disambiguation in bibliometrics databases for large-scale research assessments (2011) 0.12
    0.11500909 = sum of:
      0.11500909 = product of:
        0.41074675 = sum of:
          0.015607239 = weight(abstract_txt:have in 190) [ClassicSimilarity], result of:
            0.015607239 = score(doc=190,freq=2.0), product of:
              0.044152383 = queryWeight, product of:
                1.0201932 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.013527103 = queryNorm
              0.35348576 = fieldWeight in 190, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.078125 = fieldNorm(doc=190)
          0.049022153 = weight(abstract_txt:true in 190) [ClassicSimilarity], result of:
            0.049022153 = score(doc=190,freq=1.0), product of:
              0.09469601 = queryWeight, product of:
                1.0564677 = boost
                6.6262937 = idf(docFreq=159, maxDocs=44421)
                0.013527103 = queryNorm
              0.5176792 = fieldWeight in 190, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6262937 = idf(docFreq=159, maxDocs=44421)
                0.078125 = fieldNorm(doc=190)
          0.029883768 = weight(abstract_txt:problem in 190) [ClassicSimilarity], result of:
            0.029883768 = score(doc=190,freq=1.0), product of:
              0.085776895 = queryWeight, product of:
                1.4219702 = boost
                4.4593854 = idf(docFreq=1396, maxDocs=44421)
                0.013527103 = queryNorm
              0.34838948 = fieldWeight in 190, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4593854 = idf(docFreq=1396, maxDocs=44421)
                0.078125 = fieldNorm(doc=190)
          0.019801272 = weight(abstract_txt:more in 190) [ClassicSimilarity], result of:
            0.019801272 = score(doc=190,freq=1.0), product of:
              0.0746287 = queryWeight, product of:
                1.624441 = boost
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.013527103 = queryNorm
              0.26533052 = fieldWeight in 190, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.078125 = fieldNorm(doc=190)
          0.050875157 = weight(abstract_txt:publication in 190) [ClassicSimilarity], result of:
            0.050875157 = score(doc=190,freq=1.0), product of:
              0.122297406 = queryWeight, product of:
                1.6979073 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.013527103 = queryNorm
              0.4159954 = fieldWeight in 190, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.078125 = fieldNorm(doc=190)
          0.1177211 = weight(abstract_txt:author in 190) [ClassicSimilarity], result of:
            0.1177211 = score(doc=190,freq=2.0), product of:
              0.21395198 = queryWeight, product of:
                3.1759853 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.013527103 = queryNorm
              0.5502221 = fieldWeight in 190, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.078125 = fieldNorm(doc=190)
          0.12783605 = weight(abstract_txt:name in 190) [ClassicSimilarity], result of:
            0.12783605 = score(doc=190,freq=1.0), product of:
              0.28479058 = queryWeight, product of:
                3.6642337 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.013527103 = queryNorm
              0.44887736 = fieldWeight in 190, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.078125 = fieldNorm(doc=190)
        0.28 = coord(7/25)
    
  5. Moulaison, H.L.; Dykas, F.; Budd, J.M.: Foucault, the author, and intellectual debt : capturing the author-function through attributes, relationships, and events in Knowledge Organization Systems (2014) 0.11
    0.11273273 = sum of:
      0.11273273 = product of:
        0.7045796 = sum of:
          0.02240258 = weight(abstract_txt:more in 2368) [ClassicSimilarity], result of:
            0.02240258 = score(doc=2368,freq=2.0), product of:
              0.0746287 = queryWeight, product of:
                1.624441 = boost
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.013527103 = queryNorm
              0.3001872 = fieldWeight in 2368, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3962307 = idf(docFreq=4044, maxDocs=44421)
                0.0625 = fieldNorm(doc=2368)
          0.14890674 = weight(abstract_txt:author in 2368) [ClassicSimilarity], result of:
            0.14890674 = score(doc=2368,freq=5.0), product of:
              0.21395198 = queryWeight, product of:
                3.1759853 = boost
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.013527103 = queryNorm
              0.69598204 = fieldWeight in 2368, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.980042 = idf(docFreq=829, maxDocs=44421)
                0.0625 = fieldNorm(doc=2368)
          0.14462997 = weight(abstract_txt:name in 2368) [ClassicSimilarity], result of:
            0.14462997 = score(doc=2368,freq=2.0), product of:
              0.28479058 = queryWeight, product of:
                3.6642337 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.013527103 = queryNorm
              0.5078468 = fieldWeight in 2368, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.0625 = fieldNorm(doc=2368)
          0.3886403 = weight(abstract_txt:persons in 2368) [ClassicSimilarity], result of:
            0.3886403 = score(doc=2368,freq=3.0), product of:
              0.5179942 = queryWeight, product of:
                5.5250707 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.013527103 = queryNorm
              0.75027925 = fieldWeight in 2368, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.0625 = fieldNorm(doc=2368)
        0.16 = coord(4/25)
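
The score trees above all follow the same pattern: each matching term contributes queryWeight x fieldWeight, the contributions are summed, and the sum is scaled by the coord factor. The following is a minimal sketch of that arithmetic, not the search engine's own code. It assumes the classic TF-IDF definitions that the printed numbers are consistent with (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the function and constant names are hypothetical, and the input values are copied from entry 3 (Wang, doc=401).

```python
import math

# Values read off the explain output above.
MAX_DOCS = 44421          # maxDocs
QUERY_NORM = 0.013527103  # queryNorm
QUERY_TERMS = 25          # denominator of coord(n/25)


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency, assuming idf = 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq, field_norm):
    """Contribution of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    # tf is assumed to be the square root of the raw term frequency.
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm, query_terms=QUERY_TERMS):
    """Sum the term contributions and scale by coord = matched / query terms."""
    total = sum(term_score(boost, df, freq, field_norm) for boost, df, freq in terms)
    coord = len(terms) / query_terms
    return coord * total


# (boost, docFreq, termFreq) for the four matching terms of entry 3.
WANG_TERMS = [
    (1.0201932, 4924, 1.0),  # have
    (1.166984, 3107, 2.0),   # different
    (3.6642337, 385, 2.0),   # name
    (5.5250707, 117, 4.0),   # persons
]

score = document_score(WANG_TERMS, field_norm=0.09375)
print(score)
```

Under these assumptions the result should land very close to the 0.14901839 listed for entry 3; small differences in the last digits are expected because the listing uses single-precision arithmetic.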