Document (#38891)

Author
Thomas, B.
Title
Name disambiguation : learning from more user-friendly models
Source
Cataloging and classification quarterly. 49(2011) no.3, S.223-232
Year
2011
Abstract
Library catalogs do not provide catalog users with the assistance they need to easily and confidently select the person they are interested in. Examples are provided of Web services that do a better job of helping information seekers differentiate the person they are seeking from those with similar names. Some of the reasons for this failure in library catalogs are examined. This article then looks at how much information is necessary to help users disambiguate names, how that information could be captured and shared, and some ways the information could be displayed in library catalogs.
Theme
Formalerschließung

Similar documents (author)

  1. Thomas, D.: Book indexing principles and standards (1989) 4.66
    4.6581078 = sum of:
      4.6581078 = weight(author_txt:thomas in 864) [ClassicSimilarity], result of:
        4.6581078 = fieldWeight in 864, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4529724 = idf(docFreq=69, maxDocs=44421)
          0.625 = fieldNorm(doc=864)
    
  2. Thomas, A.R.: Options in the arrangement of library materials and the new edition of the Bliss Bibliographic Classification (1992) 4.66
    4.6581078 = sum of:
      4.6581078 = weight(author_txt:thomas in 3933) [ClassicSimilarity], result of:
        4.6581078 = fieldWeight in 3933, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4529724 = idf(docFreq=69, maxDocs=44421)
          0.625 = fieldNorm(doc=3933)
    
  3. Thomas, A.: Bliss regained : the second edition of the Bliss Bibliographic Classification (1993) 4.66
    4.6581078 = sum of:
      4.6581078 = weight(author_txt:thomas in 5076) [ClassicSimilarity], result of:
        4.6581078 = fieldWeight in 5076, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4529724 = idf(docFreq=69, maxDocs=44421)
          0.625 = fieldNorm(doc=5076)
    
  4. Thomas, S.E.: CatTutor: a prototypical hypertext tutorial for catalogers (1992) 4.66
    4.6581078 = sum of:
      4.6581078 = weight(author_txt:thomas in 1452) [ClassicSimilarity], result of:
        4.6581078 = fieldWeight in 1452, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4529724 = idf(docFreq=69, maxDocs=44421)
          0.625 = fieldNorm(doc=1452)
    
  5. Thomas, A.R.: CAPS (Counseling and Personnel Services Clearinghouse) : the work of ERIC Clearinghouse. (1989) 4.66
    4.6581078 = sum of:
      4.6581078 = weight(author_txt:thomas in 1605) [ClassicSimilarity], result of:
        4.6581078 = fieldWeight in 1605, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4529724 = idf(docFreq=69, maxDocs=44421)
          0.625 = fieldNorm(doc=1605)
    

Similar documents (content)

  1. Vu, Q.M.; Takasu, A.; Adachi, J.: Improving the performance of personal name disambiguation using web directories (2008) 0.24
    0.24224146 = sum of:
      0.24224146 = product of:
        0.86514807 = sum of:
          0.09814086 = weight(abstract_txt:name in 3108) [ClassicSimilarity], result of:
            0.09814086 = score(doc=3108,freq=3.0), product of:
              0.1262297 = queryWeight, product of:
                1.0004505 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.021959795 = queryNorm
              0.77747834 = fieldWeight in 3108, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.078125 = fieldNorm(doc=3108)
          0.038356386 = weight(abstract_txt:users in 3108) [ClassicSimilarity], result of:
            0.038356386 = score(doc=3108,freq=2.0), product of:
              0.09731815 = queryWeight, product of:
                1.2423007 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.021959795 = queryNorm
              0.39413396 = fieldWeight in 3108, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.078125 = fieldNorm(doc=3108)
          0.16651836 = weight(abstract_txt:disambiguation in 3108) [ClassicSimilarity], result of:
            0.16651836 = score(doc=3108,freq=2.0), product of:
              0.20555757 = queryWeight, product of:
                1.2766786 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.021959795 = queryNorm
              0.81008136 = fieldWeight in 3108, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.078125 = fieldNorm(doc=3108)
          0.274085 = weight(abstract_txt:disambiguate in 3108) [ClassicSimilarity], result of:
            0.274085 = score(doc=3108,freq=2.0), product of:
              0.28656 = queryWeight, product of:
                1.5073795 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.021959795 = queryNorm
              0.9564663 = fieldWeight in 3108, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.078125 = fieldNorm(doc=3108)
          0.016911797 = weight(abstract_txt:information in 3108) [ClassicSimilarity], result of:
            0.016911797 = score(doc=3108,freq=1.0), product of:
              0.089491524 = queryWeight, product of:
                1.6847512 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.021959795 = queryNorm
              0.18897653 = fieldWeight in 3108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.078125 = fieldNorm(doc=3108)
          0.11886718 = weight(abstract_txt:names in 3108) [ClassicSimilarity], result of:
            0.11886718 = score(doc=3108,freq=1.0), product of:
              0.26062736 = queryWeight, product of:
                2.0330114 = boost
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.021959795 = queryNorm
              0.45608097 = fieldWeight in 3108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.078125 = fieldNorm(doc=3108)
          0.1522685 = weight(abstract_txt:person in 3108) [ClassicSimilarity], result of:
            0.1522685 = score(doc=3108,freq=1.0), product of:
              0.30741054 = queryWeight, product of:
                2.2079499 = boost
                6.3401756 = idf(docFreq=212, maxDocs=44421)
                0.021959795 = queryNorm
              0.49532622 = fieldWeight in 3108, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3401756 = idf(docFreq=212, maxDocs=44421)
                0.078125 = fieldNorm(doc=3108)
        0.28 = coord(7/25)
    
  2. Delgado, A.D.; Martínez, R.; Montalvo, S.; Fresno, V.: Person name disambiguation in the Web using adaptive threshold clustering (2017) 0.17
    0.16525732 = sum of:
      0.16525732 = product of:
        0.68857217 = sum of:
          0.056661658 = weight(abstract_txt:name in 4694) [ClassicSimilarity], result of:
            0.056661658 = score(doc=4694,freq=1.0), product of:
              0.1262297 = queryWeight, product of:
                1.0004505 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.021959795 = queryNorm
              0.44887736 = fieldWeight in 4694, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.078125 = fieldNorm(doc=4694)
          0.117746264 = weight(abstract_txt:disambiguation in 4694) [ClassicSimilarity], result of:
            0.117746264 = score(doc=4694,freq=1.0), product of:
              0.20555757 = queryWeight, product of:
                1.2766786 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.021959795 = queryNorm
              0.57281405 = fieldWeight in 4694, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.078125 = fieldNorm(doc=4694)
          0.06484257 = weight(abstract_txt:could in 4694) [ClassicSimilarity], result of:
            0.06484257 = score(doc=4694,freq=1.0), product of:
              0.17400117 = queryWeight, product of:
                1.6611388 = boost
                4.7699957 = idf(docFreq=1023, maxDocs=44421)
                0.021959795 = queryNorm
              0.37265593 = fieldWeight in 4694, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7699957 = idf(docFreq=1023, maxDocs=44421)
                0.078125 = fieldNorm(doc=4694)
          0.06671767 = weight(abstract_txt:they in 4694) [ClassicSimilarity], result of:
            0.06671767 = score(doc=4694,freq=2.0), product of:
              0.16112381 = queryWeight, product of:
                1.9577414 = boost
                3.7477977 = idf(docFreq=2845, maxDocs=44421)
                0.021959795 = queryNorm
              0.41407704 = fieldWeight in 4694, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7477977 = idf(docFreq=2845, maxDocs=44421)
                0.078125 = fieldNorm(doc=4694)
          0.11886718 = weight(abstract_txt:names in 4694) [ClassicSimilarity], result of:
            0.11886718 = score(doc=4694,freq=1.0), product of:
              0.26062736 = queryWeight, product of:
                2.0330114 = boost
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.021959795 = queryNorm
              0.45608097 = fieldWeight in 4694, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.078125 = fieldNorm(doc=4694)
          0.26373678 = weight(abstract_txt:person in 4694) [ClassicSimilarity], result of:
            0.26373678 = score(doc=4694,freq=3.0), product of:
              0.30741054 = queryWeight, product of:
                2.2079499 = boost
                6.3401756 = idf(docFreq=212, maxDocs=44421)
                0.021959795 = queryNorm
              0.8579302 = fieldWeight in 4694, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.3401756 = idf(docFreq=212, maxDocs=44421)
                0.078125 = fieldNorm(doc=4694)
        0.24 = coord(6/25)
    
  3. Crane, G.; Jones, A.: Text, information, knowledge and the evolving record of humanity (2006) 0.16
    0.16272159 = sum of:
      0.16272159 = product of:
        0.40680397 = sum of:
          0.04434476 = weight(abstract_txt:name in 2182) [ClassicSimilarity], result of:
            0.04434476 = score(doc=2182,freq=5.0), product of:
              0.1262297 = queryWeight, product of:
                1.0004505 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.021959795 = queryNorm
              0.35130212 = fieldWeight in 2182, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.02734375 = fieldNorm(doc=2182)
          0.013424736 = weight(abstract_txt:users in 2182) [ClassicSimilarity], result of:
            0.013424736 = score(doc=2182,freq=2.0), product of:
              0.09731815 = queryWeight, product of:
                1.2423007 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.021959795 = queryNorm
              0.13794689 = fieldWeight in 2182, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.02734375 = fieldNorm(doc=2182)
          0.0147206355 = weight(abstract_txt:some in 2182) [ClassicSimilarity], result of:
            0.0147206355 = score(doc=2182,freq=2.0), product of:
              0.10348429 = queryWeight, product of:
                1.2810528 = boost
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.021959795 = queryNorm
              0.14224996 = fieldWeight in 2182, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.02734375 = fieldNorm(doc=2182)
          0.041645095 = weight(abstract_txt:captured in 2182) [ClassicSimilarity], result of:
            0.041645095 = score(doc=2182,freq=1.0), product of:
              0.2069979 = queryWeight, product of:
                1.2811435 = boost
                7.357662 = idf(docFreq=76, maxDocs=44421)
                0.021959795 = queryNorm
              0.20118608 = fieldWeight in 2182, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.357662 = idf(docFreq=76, maxDocs=44421)
                0.02734375 = fieldNorm(doc=2182)
          0.032095432 = weight(abstract_txt:could in 2182) [ClassicSimilarity], result of:
            0.032095432 = score(doc=2182,freq=2.0), product of:
              0.17400117 = queryWeight, product of:
                1.6611388 = boost
                4.7699957 = idf(docFreq=1023, maxDocs=44421)
                0.021959795 = queryNorm
              0.18445526 = fieldWeight in 2182, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7699957 = idf(docFreq=1023, maxDocs=44421)
                0.02734375 = fieldNorm(doc=2182)
          0.020342864 = weight(abstract_txt:library in 2182) [ClassicSimilarity], result of:
            0.020342864 = score(doc=2182,freq=4.0), product of:
              0.11665012 = queryWeight, product of:
                1.6657816 = boost
                3.188885 = idf(docFreq=4976, maxDocs=44421)
                0.021959795 = queryNorm
              0.17439215 = fieldWeight in 2182, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.188885 = idf(docFreq=4976, maxDocs=44421)
                0.02734375 = fieldNorm(doc=2182)
          0.014498846 = weight(abstract_txt:information in 2182) [ClassicSimilarity], result of:
            0.014498846 = score(doc=2182,freq=6.0), product of:
              0.089491524 = queryWeight, product of:
                1.6847512 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.021959795 = queryNorm
              0.16201362 = fieldWeight in 2182, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.02734375 = fieldNorm(doc=2182)
          0.023351185 = weight(abstract_txt:they in 2182) [ClassicSimilarity], result of:
            0.023351185 = score(doc=2182,freq=2.0), product of:
              0.16112381 = queryWeight, product of:
                1.9577414 = boost
                3.7477977 = idf(docFreq=2845, maxDocs=44421)
                0.021959795 = queryNorm
              0.14492697 = fieldWeight in 2182, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7477977 = idf(docFreq=2845, maxDocs=44421)
                0.02734375 = fieldNorm(doc=2182)
          0.11007254 = weight(abstract_txt:names in 2182) [ClassicSimilarity], result of:
            0.11007254 = score(doc=2182,freq=7.0), product of:
              0.26062736 = queryWeight, product of:
                2.0330114 = boost
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.021959795 = queryNorm
              0.42233685 = fieldWeight in 2182, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.02734375 = fieldNorm(doc=2182)
          0.09230787 = weight(abstract_txt:person in 2182) [ClassicSimilarity], result of:
            0.09230787 = score(doc=2182,freq=3.0), product of:
              0.30741054 = queryWeight, product of:
                2.2079499 = boost
                6.3401756 = idf(docFreq=212, maxDocs=44421)
                0.021959795 = queryNorm
              0.30027556 = fieldWeight in 2182, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.3401756 = idf(docFreq=212, maxDocs=44421)
                0.02734375 = fieldNorm(doc=2182)
        0.4 = coord(10/25)
    
  4. Sardo, L.: Multiple names (2004) 0.15
    0.15466098 = sum of:
      0.15466098 = product of:
        0.7733049 = sum of:
          0.09065865 = weight(abstract_txt:name in 6116) [ClassicSimilarity], result of:
            0.09065865 = score(doc=6116,freq=1.0), product of:
              0.1262297 = queryWeight, product of:
                1.0004505 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.021959795 = queryNorm
              0.7182038 = fieldWeight in 6116, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.125 = fieldNorm(doc=6116)
          0.047584284 = weight(abstract_txt:some in 6116) [ClassicSimilarity], result of:
            0.047584284 = score(doc=6116,freq=1.0), product of:
              0.10348429 = queryWeight, product of:
                1.2810528 = boost
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.021959795 = queryNorm
              0.45982134 = fieldWeight in 6116, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.125 = fieldNorm(doc=6116)
          0.04649798 = weight(abstract_txt:library in 6116) [ClassicSimilarity], result of:
            0.04649798 = score(doc=6116,freq=1.0), product of:
              0.11665012 = queryWeight, product of:
                1.6657816 = boost
                3.188885 = idf(docFreq=4976, maxDocs=44421)
                0.021959795 = queryNorm
              0.39861062 = fieldWeight in 6116, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.188885 = idf(docFreq=4976, maxDocs=44421)
                0.125 = fieldNorm(doc=6116)
          0.26896572 = weight(abstract_txt:names in 6116) [ClassicSimilarity], result of:
            0.26896572 = score(doc=6116,freq=2.0), product of:
              0.26062736 = queryWeight, product of:
                2.0330114 = boost
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.021959795 = queryNorm
              1.0319934 = fieldWeight in 6116, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.125 = fieldNorm(doc=6116)
          0.31959826 = weight(abstract_txt:catalogs in 6116) [ClassicSimilarity], result of:
            0.31959826 = score(doc=6116,freq=1.0), product of:
              0.4216953 = queryWeight, product of:
                3.1671953 = boost
                6.0631127 = idf(docFreq=280, maxDocs=44421)
                0.021959795 = queryNorm
              0.7578891 = fieldWeight in 6116, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0631127 = idf(docFreq=280, maxDocs=44421)
                0.125 = fieldNorm(doc=6116)
        0.2 = coord(5/25)
    
  5. Kim, J.; Kim, J.; Owen-Smith, J.: Ethnicity-based name partitioning for author name disambiguation using supervised machine learning (2021) 0.14
    0.1447566 = sum of:
      0.1447566 = product of:
        0.72378296 = sum of:
          0.1696068 = weight(abstract_txt:name in 1312) [ClassicSimilarity], result of:
            0.1696068 = score(doc=1312,freq=14.0), product of:
              0.1262297 = queryWeight, product of:
                1.0004505 = boost
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.021959795 = queryNorm
              1.3436363 = fieldWeight in 1312, product of:
                3.7416575 = tf(freq=14.0), with freq of:
                  14.0 = termFreq=14.0
                5.7456303 = idf(docFreq=385, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.21063092 = weight(abstract_txt:disambiguation in 1312) [ClassicSimilarity], result of:
            0.21063092 = score(doc=1312,freq=5.0), product of:
              0.20555757 = queryWeight, product of:
                1.2766786 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.021959795 = queryNorm
              1.024681 = fieldWeight in 1312, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.023792142 = weight(abstract_txt:some in 1312) [ClassicSimilarity], result of:
            0.023792142 = score(doc=1312,freq=1.0), product of:
              0.10348429 = queryWeight, product of:
                1.2810528 = boost
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.021959795 = queryNorm
              0.22991067 = fieldWeight in 1312, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.15504588 = weight(abstract_txt:disambiguate in 1312) [ClassicSimilarity], result of:
            0.15504588 = score(doc=1312,freq=1.0), product of:
              0.28656 = queryWeight, product of:
                1.5073795 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.021959795 = queryNorm
              0.5410591 = fieldWeight in 1312, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
          0.1647072 = weight(abstract_txt:names in 1312) [ClassicSimilarity], result of:
            0.1647072 = score(doc=1312,freq=3.0), product of:
              0.26062736 = queryWeight, product of:
                2.0330114 = boost
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.021959795 = queryNorm
              0.6319643 = fieldWeight in 1312, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8378363 = idf(docFreq=351, maxDocs=44421)
                0.0625 = fieldNorm(doc=1312)
        0.2 = coord(5/25)