Document (#33618)

Author
Aksnes, D.W.
Title
When different persons have an identical author name : how frequent are homonyms?
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.5, p.838-841
Year
2008
Abstract
The phenomenon that different persons may have the same author name (homonymy) represents a major problem for publication analysis at individual levels and for retrieving publications based on author names more generally. In such cases, all publications from the persons sharing the name will be collected in search results. This makes it difficult to provide a true picture of a researcher's publication output. The present study examines how frequently homonyms occur in a population of more than 30,000 individuals. The population represents the entire set of research personnel in Norway. It is found that 14% of the persons share their author name with one or more other individuals. For the remaining 86% there is a one-to-one correspondence. Thus, for the large majority of persons, homonyms do not represent a problem. In the final part of the article, potential practical applications of these findings are given particular attention.
Theme
Informetrics

Similar documents (content)

  1. Kang, I.-S.; Na, S.-H.; Lee, S.; Jung, H.; Kim, P.; Sung, W.-K.; Lee, J.-H.: On co-authorship for author disambiguation (2009) 0.18
    0.17873992 = sum of:
      0.17873992 = product of:
        0.5585623 = sum of:
          0.011090755 = weight(abstract_txt:have in 2453) [ClassicSimilarity], result of:
            0.011090755 = score(doc=2453,freq=1.0), product of:
              0.04429932 = queryWeight, product of:
                1.0182698 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.013575635 = queryNorm
              0.2503595 = fieldWeight in 2453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.078125 = fieldNorm(doc=2453)
          0.01659737 = weight(abstract_txt:different in 2453) [ClassicSimilarity], result of:
            0.01659737 = score(doc=2453,freq=1.0), product of:
              0.057958324 = queryWeight, product of:
                1.1647218 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.013575635 = queryNorm
              0.28636733 = fieldWeight in 2453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.078125 = fieldNorm(doc=2453)
          0.029909115 = weight(abstract_txt:problem in 2453) [ClassicSimilarity], result of:
            0.029909115 = score(doc=2453,freq=1.0), product of:
              0.085827276 = queryWeight, product of:
                1.4173496 = boost
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.013575635 = queryNorm
              0.3484803 = fieldWeight in 2453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.078125 = fieldNorm(doc=2453)
          0.019905211 = weight(abstract_txt:more in 2453) [ClassicSimilarity], result of:
            0.019905211 = score(doc=2453,freq=1.0), product of:
              0.07489128 = queryWeight, product of:
                1.6215321 = boost
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.013575635 = queryNorm
              0.2657881 = fieldWeight in 2453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.078125 = fieldNorm(doc=2453)
          0.04929123 = weight(abstract_txt:publications in 2453) [ClassicSimilarity], result of:
            0.04929123 = score(doc=2453,freq=1.0), product of:
              0.11974831 = queryWeight, product of:
                1.6741679 = boost
                5.268782 = idf(docFreq=618, maxDocs=44218)
                0.013575635 = queryNorm
              0.4116236 = fieldWeight in 2453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.268782 = idf(docFreq=618, maxDocs=44218)
                0.078125 = fieldNorm(doc=2453)
          0.06500848 = weight(abstract_txt:individuals in 2453) [ClassicSimilarity], result of:
            0.06500848 = score(doc=2453,freq=1.0), product of:
              0.14401342 = queryWeight, product of:
                1.8359709 = boost
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.013575635 = queryNorm
              0.4514057 = fieldWeight in 2453, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.777993 = idf(docFreq=371, maxDocs=44218)
                0.078125 = fieldNorm(doc=2453)
          0.1859027 = weight(abstract_txt:author in 2453) [ClassicSimilarity], result of:
            0.1859027 = score(doc=2453,freq=5.0), product of:
              0.21377984 = queryWeight, product of:
                3.1634624 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.013575635 = queryNorm
              0.86959887 = fieldWeight in 2453, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.078125 = fieldNorm(doc=2453)
          0.18085742 = weight(abstract_txt:name in 2453) [ClassicSimilarity], result of:
            0.18085742 = score(doc=2453,freq=2.0), product of:
              0.2848703 = queryWeight, product of:
                3.6517656 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.013575635 = queryNorm
              0.6348764 = fieldWeight in 2453, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.078125 = fieldNorm(doc=2453)
        0.32 = coord(8/25)
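
The per-term figures in the breakdown above can be reproduced by hand. The following is a minimal Python sketch, not the search engine's own code. It assumes a square-root term frequency and the idf form 1 + ln(maxDocs / (docFreq + 1)), both of which match the numbers shown for document 2453. The function names are illustrative only.

```python
import math


def classic_idf(doc_freq, max_docs):
    # Assumed idf form; it reproduces e.g. 3.2046018 for docFreq=4876, maxDocs=44218.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq):
    # Assumed tf form; it reproduces 2.236068 for freq=5.0.
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm            # "queryWeight" line
    field_weight = classic_tf(freq) * idf * field_norm  # "fieldWeight" line
    return query_weight * field_weight                 # "score" line


# Values copied from the listing for document 2453.
score_have = term_score(1.0, 4876, 44218, 1.0182698, 0.013575635, 0.078125)
score_author = term_score(5.0, 827, 44218, 3.1634624, 0.013575635, 0.078125)

print(round(score_have, 6), round(score_author, 6))
```

The results agree with the listed 0.011090755 and 0.1859027 to about five decimal places. Any remaining difference in the last digits is probably due to single-precision arithmetic in the original output.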
    
  2. Kim, J.; Kim, J.; Owen-Smith, J.: Ethnicity-based name partitioning for author name disambiguation using supervised machine learning (2021) 0.17
    0.17057241 = sum of:
      0.17057241 = product of:
        0.6091872 = sum of:
          0.03913904 = weight(abstract_txt:entire in 311) [ClassicSimilarity], result of:
            0.03913904 = score(doc=311,freq=1.0), product of:
              0.09457138 = queryWeight, product of:
                1.0520326 = boost
                6.6217136 = idf(docFreq=159, maxDocs=44218)
                0.013575635 = queryNorm
              0.4138571 = fieldWeight in 311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6217136 = idf(docFreq=159, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.040055633 = weight(abstract_txt:occur in 311) [ClassicSimilarity], result of:
            0.040055633 = score(doc=311,freq=1.0), product of:
              0.096042186 = queryWeight, product of:
                1.0601817 = boost
                6.6730065 = idf(docFreq=151, maxDocs=44218)
                0.013575635 = queryNorm
              0.4170629 = fieldWeight in 311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6730065 = idf(docFreq=151, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.013277897 = weight(abstract_txt:different in 311) [ClassicSimilarity], result of:
            0.013277897 = score(doc=311,freq=1.0), product of:
              0.057958324 = queryWeight, product of:
                1.1647218 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.013575635 = queryNorm
              0.22909386 = fieldWeight in 311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.023927292 = weight(abstract_txt:problem in 311) [ClassicSimilarity], result of:
            0.023927292 = score(doc=311,freq=1.0), product of:
              0.085827276 = queryWeight, product of:
                1.4173496 = boost
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.013575635 = queryNorm
              0.27878425 = fieldWeight in 311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.015924169 = weight(abstract_txt:more in 311) [ClassicSimilarity], result of:
            0.015924169 = score(doc=311,freq=1.0), product of:
              0.07489128 = queryWeight, product of:
                1.6215321 = boost
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.013575635 = queryNorm
              0.2126305 = fieldWeight in 311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.09406015 = weight(abstract_txt:author in 311) [ClassicSimilarity], result of:
            0.09406015 = score(doc=311,freq=2.0), product of:
              0.21377984 = queryWeight, product of:
                3.1634624 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.013575635 = queryNorm
              0.43998608 = fieldWeight in 311, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
          0.38280302 = weight(abstract_txt:name in 311) [ClassicSimilarity], result of:
            0.38280302 = score(doc=311,freq=14.0), product of:
              0.2848703 = queryWeight, product of:
                3.6517656 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.013575635 = queryNorm
              1.34378 = fieldWeight in 311, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=311)
        0.28 = coord(7/25)
    
  3. Wang, Y.: A look into Chinese persons' names in bibliography practice (2000) 0.15
    0.14885756 = sum of:
      0.14885756 = product of:
        0.93035984 = sum of:
          0.013308908 = weight(abstract_txt:have in 5401) [ClassicSimilarity], result of:
            0.013308908 = score(doc=5401,freq=1.0), product of:
              0.04429932 = queryWeight, product of:
                1.0182698 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.013575635 = queryNorm
              0.30043143 = fieldWeight in 5401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.09375 = fieldNorm(doc=5401)
          0.02816667 = weight(abstract_txt:different in 5401) [ClassicSimilarity], result of:
            0.02816667 = score(doc=5401,freq=2.0), product of:
              0.057958324 = queryWeight, product of:
                1.1647218 = boost
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.013575635 = queryNorm
              0.48598146 = fieldWeight in 5401, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6655018 = idf(docFreq=3075, maxDocs=44218)
                0.09375 = fieldNorm(doc=5401)
          0.2170289 = weight(abstract_txt:name in 5401) [ClassicSimilarity], result of:
            0.2170289 = score(doc=5401,freq=2.0), product of:
              0.2848703 = queryWeight, product of:
                3.6517656 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.013575635 = queryNorm
              0.7618516 = fieldWeight in 5401, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.09375 = fieldNorm(doc=5401)
          0.6718554 = weight(abstract_txt:persons in 5401) [ClassicSimilarity], result of:
            0.6718554 = score(doc=5401,freq=4.0), product of:
              0.5173439 = queryWeight, product of:
                5.502043 = boost
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.013575635 = queryNorm
              1.298663 = fieldWeight in 5401, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.09375 = fieldNorm(doc=5401)
        0.16 = coord(4/25)
    
  4. D'Angelo, C.A.; Giuffrida, C.; Abramo, G.: A heuristic approach to author name disambiguation in bibliometrics databases for large-scale research assessments (2011) 0.12
    0.11515102 = sum of:
      0.11515102 = product of:
        0.41125363 = sum of:
          0.015684696 = weight(abstract_txt:have in 4190) [ClassicSimilarity], result of:
            0.015684696 = score(doc=4190,freq=2.0), product of:
              0.04429932 = queryWeight, product of:
                1.0182698 = boost
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.013575635 = queryNorm
              0.35406178 = fieldWeight in 4190, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.2046018 = idf(docFreq=4876, maxDocs=44218)
                0.078125 = fieldNorm(doc=4190)
          0.049203124 = weight(abstract_txt:true in 4190) [ClassicSimilarity], result of:
            0.049203124 = score(doc=4190,freq=1.0), product of:
              0.09493101 = queryWeight, product of:
                1.0540309 = boost
                6.634292 = idf(docFreq=157, maxDocs=44218)
                0.013575635 = queryNorm
              0.51830405 = fieldWeight in 4190, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.634292 = idf(docFreq=157, maxDocs=44218)
                0.078125 = fieldNorm(doc=4190)
          0.029909115 = weight(abstract_txt:problem in 4190) [ClassicSimilarity], result of:
            0.029909115 = score(doc=4190,freq=1.0), product of:
              0.085827276 = queryWeight, product of:
                1.4173496 = boost
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.013575635 = queryNorm
              0.3484803 = fieldWeight in 4190, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.460548 = idf(docFreq=1388, maxDocs=44218)
                0.078125 = fieldNorm(doc=4190)
          0.019905211 = weight(abstract_txt:more in 4190) [ClassicSimilarity], result of:
            0.019905211 = score(doc=4190,freq=1.0), product of:
              0.07489128 = queryWeight, product of:
                1.6215321 = boost
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.013575635 = queryNorm
              0.2657881 = fieldWeight in 4190, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.078125 = fieldNorm(doc=4190)
          0.051090807 = weight(abstract_txt:publication in 4190) [ClassicSimilarity], result of:
            0.051090807 = score(doc=4190,freq=1.0), product of:
              0.12264546 = queryWeight, product of:
                1.694299 = boost
                5.3321366 = idf(docFreq=580, maxDocs=44218)
                0.013575635 = queryNorm
              0.41657317 = fieldWeight in 4190, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3321366 = idf(docFreq=580, maxDocs=44218)
                0.078125 = fieldNorm(doc=4190)
          0.11757519 = weight(abstract_txt:author in 4190) [ClassicSimilarity], result of:
            0.11757519 = score(doc=4190,freq=2.0), product of:
              0.21377984 = queryWeight, product of:
                3.1634624 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.013575635 = queryNorm
              0.5499826 = fieldWeight in 4190, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.078125 = fieldNorm(doc=4190)
          0.1278855 = weight(abstract_txt:name in 4190) [ClassicSimilarity], result of:
            0.1278855 = score(doc=4190,freq=1.0), product of:
              0.2848703 = queryWeight, product of:
                3.6517656 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.013575635 = queryNorm
              0.44892538 = fieldWeight in 4190, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.078125 = fieldNorm(doc=4190)
        0.28 = coord(7/25)
    
  5. Moulaison, H.L.; Dykas, F.; Budd, J.M.: Foucault, the author, and intellectual debt : capturing the author-function through attributes, relationships, and events in Knowledge Organization Systems (2014) 0.11
    0.11261186 = sum of:
      0.11261186 = product of:
        0.70382416 = sum of:
          0.022520175 = weight(abstract_txt:more in 1368) [ClassicSimilarity], result of:
            0.022520175 = score(doc=1368,freq=2.0), product of:
              0.07489128 = queryWeight, product of:
                1.6215321 = boost
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.013575635 = queryNorm
              0.30070493 = fieldWeight in 1368, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.402088 = idf(docFreq=4002, maxDocs=44218)
                0.0625 = fieldNorm(doc=1368)
          0.14872216 = weight(abstract_txt:author in 1368) [ClassicSimilarity], result of:
            0.14872216 = score(doc=1368,freq=5.0), product of:
              0.21377984 = queryWeight, product of:
                3.1634624 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.013575635 = queryNorm
              0.69567907 = fieldWeight in 1368, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=1368)
          0.14468592 = weight(abstract_txt:name in 1368) [ClassicSimilarity], result of:
            0.14468592 = score(doc=1368,freq=2.0), product of:
              0.2848703 = queryWeight, product of:
                3.6517656 = boost
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.013575635 = queryNorm
              0.5079011 = fieldWeight in 1368, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.746245 = idf(docFreq=383, maxDocs=44218)
                0.0625 = fieldNorm(doc=1368)
          0.38789588 = weight(abstract_txt:persons in 1368) [ClassicSimilarity], result of:
            0.38789588 = score(doc=1368,freq=3.0), product of:
              0.5173439 = queryWeight, product of:
                5.502043 = boost
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.013575635 = queryNorm
              0.74978346 = fieldWeight in 1368, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.926203 = idf(docFreq=117, maxDocs=44218)
                0.0625 = fieldNorm(doc=1368)
        0.16 = coord(4/25)
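
The final score of each entry is the sum of its term scores multiplied by the coord factor, which is the share of the 25 query terms that the document matches. The sketch below is a hedged Python illustration using the term scores listed for documents 1368 and 5401. The helper name `document_score` is hypothetical and not part of any library.

```python
def document_score(term_scores, query_term_count):
    # coord = matched query terms / total query terms, e.g. 4/25 = 0.16
    coord = len(term_scores) / query_term_count
    return sum(term_scores) * coord


# Term scores copied from the listing (more, author, name, persons).
scores_1368 = [0.022520175, 0.14872216, 0.14468592, 0.38789588]
# Term scores copied from the listing (have, different, name, persons).
scores_5401 = [0.013308908, 0.02816667, 0.2170289, 0.6718554]

total_1368 = document_score(scores_1368, 25)
total_5401 = document_score(scores_5401, 25)

print(round(total_1368, 6), round(total_5401, 6))
```

The coord factor explains why document 5401, whose raw sum is 0.93035984, ranks below document 2453, whose raw sum is only 0.5585623. The first matches 4 of 25 query terms (coord 0.16) and the second matches 8 of 25 (coord 0.32).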