Document (#42143)

Author
Carlson, S.
Seely, A.
Title
Using OpenRefine's reconciliation to validate local authority headings
Source
Cataloging and classification quarterly. 55(2017) no.1, pp. 1-11
Year
2017
Abstract
In 2015, the Cataloging and Metadata Services department of Rice University's Fondren Library developed a process to reconcile four years of authority headings against an internally developed thesaurus. With a goal of immediate cleanup as well as an ongoing maintenance procedure, staff developed a "hack" of OpenRefine's normal Reconciliation function that ultimately yielded 99.6% authority reconciliation and a stable process for monthly data verification.
Content
Cf.: https://doi.org/10.1080/01639374.2016.1245693.
Theme
Metadata

Similar documents (author)

  1. Carlson, P.A.: The rhetoric of hypertext (1990) 6.01
    6.0137663 = sum of:
      6.0137663 = weight(author_txt:carlson in 4913) [ClassicSimilarity], result of:
        6.0137663 = fieldWeight in 4913, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.625 = fieldNorm(doc=4913)
    
  2. Carlson, C.: Perspectives of a hypermedia film sequence database (1993) 6.01
    6.0137663 = sum of:
      6.0137663 = weight(author_txt:carlson in 5320) [ClassicSimilarity], result of:
        6.0137663 = fieldWeight in 5320, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.625 = fieldNorm(doc=5320)
    
  3. Carlson, C.N.; Süllow, K.: AMPHORE, ein standardbasiertes Werkzeug zur Sequenzerschließung (1996) 4.81
    4.811013 = sum of:
      4.811013 = weight(author_txt:carlson in 5319) [ClassicSimilarity], result of:
        4.811013 = fieldWeight in 5319, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.5 = fieldNorm(doc=5319)
    
  4. Carlson, J.R.; Kacmar, C.J.: Increasing link marker effectiveness for WWW and other hypermedia interfaces : an examination of end-user preferences (1999) 4.81
    4.811013 = sum of:
      4.811013 = weight(author_txt:carlson in 5301) [ClassicSimilarity], result of:
        4.811013 = fieldWeight in 5301, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.5 = fieldNorm(doc=5301)
    
  5. Banach, P.; Carlson Jr., M.: Cataloging at the University of Massachusetts Amherst Library (2000) 4.21
    4.2096367 = sum of:
      4.2096367 = weight(author_txt:carlson in 381) [ClassicSimilarity], result of:
        4.2096367 = fieldWeight in 381, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.4375 = fieldNorm(doc=381)
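
The author scores above each reduce to one product: fieldWeight = tf * idf * fieldNorm. The sketch below recomputes the first entry (doc 4913) as a check on that arithmetic. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the tf and idf values printed in the listing; the function names are illustrative and not part of any library.

```python
import math

def classic_idf(doc_freq, max_docs):
    # Assumed idf form; it reproduces idf(docFreq=7, maxDocs=44421) = 9.622026.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq, doc_freq, max_docs, field_norm):
    # tf is taken as the square root of the raw term frequency.
    tf = math.sqrt(freq)
    return tf * classic_idf(doc_freq, max_docs) * field_norm

# Values copied from the listing for "carlson" in doc 4913.
score = field_weight(freq=1.0, doc_freq=7, max_docs=44421, field_norm=0.625)
print(f"{score:.4f}")
```

Because tf and idf are identical for all five hits, the ranking in this list is driven entirely by fieldNorm, which is larger for shorter author fields.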
    

Similar documents (content)

  1. Heng, G.; Cole, T.W.; Tian, T.(C.); Han, M.-J.: Rethinking authority reconciliation process (2022) 0.18
    0.17747284 = sum of:
      0.17747284 = product of:
        1.4789404 = sum of:
          0.047461096 = weight(abstract_txt:process in 1728) [ClassicSimilarity], result of:
            0.047461096 = score(doc=1728,freq=2.0), product of:
              0.08841217 = queryWeight, product of:
                1.4532013 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.0150261205 = queryNorm
              0.5368163 = fieldWeight in 1728, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.09375 = fieldNorm(doc=1728)
          0.1979341 = weight(abstract_txt:authority in 1728) [ClassicSimilarity], result of:
            0.1979341 = score(doc=1728,freq=3.0), product of:
              0.22906952 = queryWeight, product of:
                2.8648312 = boost
                5.321345 = idf(docFreq=589, maxDocs=44421)
                0.0150261205 = queryNorm
              0.86407876 = fieldWeight in 1728, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.321345 = idf(docFreq=589, maxDocs=44421)
                0.09375 = fieldNorm(doc=1728)
          1.2335452 = weight(abstract_txt:reconciliation in 1728) [ClassicSimilarity], result of:
            1.2335452 = score(doc=1728,freq=5.0), product of:
              0.6542956 = queryWeight, product of:
                4.84175 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0150261205 = queryNorm
              1.8853025 = fieldWeight in 1728, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.09375 = fieldNorm(doc=1728)
        0.12 = coord(3/25)
    
  2. Vellucci, S.L.: Commercial services for providing authority control : outsourcing the process (2004) 0.15
    0.15395884 = sum of:
      0.15395884 = product of:
        0.7697942 = sum of:
          0.07767055 = weight(abstract_txt:ongoing in 681) [ClassicSimilarity], result of:
            0.07767055 = score(doc=681,freq=2.0), product of:
              0.110044576 = queryWeight, product of:
                1.1464076 = boost
                6.388262 = idf(docFreq=202, maxDocs=44421)
                0.0150261205 = queryNorm
              0.7058099 = fieldWeight in 681, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.388262 = idf(docFreq=202, maxDocs=44421)
                0.078125 = fieldNorm(doc=681)
          0.027966717 = weight(abstract_txt:process in 681) [ClassicSimilarity], result of:
            0.027966717 = score(doc=681,freq=1.0), product of:
              0.08841217 = queryWeight, product of:
                1.4532013 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.0150261205 = queryNorm
              0.31632203 = fieldWeight in 681, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.078125 = fieldNorm(doc=681)
          0.27660784 = weight(abstract_txt:cleanup in 681) [ClassicSimilarity], result of:
            0.27660784 = score(doc=681,freq=2.0), product of:
              0.25662997 = queryWeight, product of:
                1.7506868 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0150261205 = queryNorm
              1.077847 = fieldWeight in 681, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.078125 = fieldNorm(doc=681)
          0.057658914 = weight(abstract_txt:headings in 681) [ClassicSimilarity], result of:
            0.057658914 = score(doc=681,freq=1.0), product of:
              0.14321727 = queryWeight, product of:
                1.8495559 = boost
                5.1532483 = idf(docFreq=697, maxDocs=44421)
                0.0150261205 = queryNorm
              0.40259752 = fieldWeight in 681, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1532483 = idf(docFreq=697, maxDocs=44421)
                0.078125 = fieldNorm(doc=681)
          0.32989016 = weight(abstract_txt:authority in 681) [ClassicSimilarity], result of:
            0.32989016 = score(doc=681,freq=12.0), product of:
              0.22906952 = queryWeight, product of:
                2.8648312 = boost
                5.321345 = idf(docFreq=589, maxDocs=44421)
                0.0150261205 = queryNorm
              1.4401312 = fieldWeight in 681, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                5.321345 = idf(docFreq=589, maxDocs=44421)
                0.078125 = fieldNorm(doc=681)
        0.2 = coord(5/25)
    
  3. Lougheed, B.; Moran, R.; Callison, C.: Reconciliation through description : using metadata to realize the vision of the National Research Centre for Truth and Reconciliation (2015) 0.13
    0.1296206 = sum of:
      0.1296206 = product of:
        1.0801717 = sum of:
          0.091111645 = weight(abstract_txt:ultimately in 3181) [ClassicSimilarity], result of:
            0.091111645 = score(doc=3181,freq=1.0), product of:
              0.13656399 = queryWeight, product of:
                1.2770939 = boost
                7.1165 = idf(docFreq=97, maxDocs=44421)
                0.0150261205 = queryNorm
              0.66717184 = fieldWeight in 3181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1165 = idf(docFreq=97, maxDocs=44421)
                0.09375 = fieldNorm(doc=3181)
          0.033560064 = weight(abstract_txt:process in 3181) [ClassicSimilarity], result of:
            0.033560064 = score(doc=3181,freq=1.0), product of:
              0.08841217 = queryWeight, product of:
                1.4532013 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.0150261205 = queryNorm
              0.37958646 = fieldWeight in 3181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.09375 = fieldNorm(doc=3181)
          0.95549995 = weight(abstract_txt:reconciliation in 3181) [ClassicSimilarity], result of:
            0.95549995 = score(doc=3181,freq=3.0), product of:
              0.6542956 = queryWeight, product of:
                4.84175 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0150261205 = queryNorm
              1.460349 = fieldWeight in 3181, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.09375 = fieldNorm(doc=3181)
        0.12 = coord(3/25)
    
  4. Hooland, S. van; Verborgh, R.; Wilde, M. De; Hercher, J.; Mannens, E.; Wa, R.Van de: Evaluating the success of vocabulary reconciliation for cultural heritage collections (2013) 0.13
    0.12704544 = sum of:
      0.12704544 = product of:
        0.794034 = sum of:
          0.03955091 = weight(abstract_txt:process in 1662) [ClassicSimilarity], result of:
            0.03955091 = score(doc=1662,freq=2.0), product of:
              0.08841217 = queryWeight, product of:
                1.4532013 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.0150261205 = queryNorm
              0.4473469 = fieldWeight in 1662, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.078125 = fieldNorm(doc=1662)
          0.057658914 = weight(abstract_txt:headings in 1662) [ClassicSimilarity], result of:
            0.057658914 = score(doc=1662,freq=1.0), product of:
              0.14321727 = queryWeight, product of:
                1.8495559 = boost
                5.1532483 = idf(docFreq=697, maxDocs=44421)
                0.0150261205 = queryNorm
              0.40259752 = fieldWeight in 1662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1532483 = idf(docFreq=697, maxDocs=44421)
                0.078125 = fieldNorm(doc=1662)
          0.046688866 = weight(abstract_txt:developed in 1662) [ClassicSimilarity], result of:
            0.046688866 = score(doc=1662,freq=1.0), product of:
              0.14242636 = queryWeight, product of:
                2.2589705 = boost
                4.1959753 = idf(docFreq=1817, maxDocs=44421)
                0.0150261205 = queryNorm
              0.3278106 = fieldWeight in 1662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1959753 = idf(docFreq=1817, maxDocs=44421)
                0.078125 = fieldNorm(doc=1662)
          0.65013534 = weight(abstract_txt:reconciliation in 1662) [ClassicSimilarity], result of:
            0.65013534 = score(doc=1662,freq=2.0), product of:
              0.6542956 = queryWeight, product of:
                4.84175 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0150261205 = queryNorm
              0.9936416 = fieldWeight in 1662, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.078125 = fieldNorm(doc=1662)
        0.16 = coord(4/25)
    
  5. Mugridge, R.L.; Furniss, K.A.: Education for authority control : whose responsibility is it? (2002) 0.06
    0.06270676 = sum of:
      0.06270676 = product of:
        0.3919173 = sum of:
          0.075761616 = weight(abstract_txt:maintenance in 459) [ClassicSimilarity], result of:
            0.075761616 = score(doc=459,freq=2.0), product of:
              0.10823404 = queryWeight, product of:
                1.1369377 = boost
                6.3354917 = idf(docFreq=213, maxDocs=44421)
                0.0150261205 = queryNorm
              0.69997954 = fieldWeight in 459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.3354917 = idf(docFreq=213, maxDocs=44421)
                0.078125 = fieldNorm(doc=459)
          0.054921374 = weight(abstract_txt:ongoing in 459) [ClassicSimilarity], result of:
            0.054921374 = score(doc=459,freq=1.0), product of:
              0.110044576 = queryWeight, product of:
                1.1464076 = boost
                6.388262 = idf(docFreq=202, maxDocs=44421)
                0.0150261205 = queryNorm
              0.49908295 = fieldWeight in 459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.388262 = idf(docFreq=202, maxDocs=44421)
                0.078125 = fieldNorm(doc=459)
          0.027966717 = weight(abstract_txt:process in 459) [ClassicSimilarity], result of:
            0.027966717 = score(doc=459,freq=1.0), product of:
              0.08841217 = queryWeight, product of:
                1.4532013 = boost
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.0150261205 = queryNorm
              0.31632203 = fieldWeight in 459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.048922 = idf(docFreq=2105, maxDocs=44421)
                0.078125 = fieldNorm(doc=459)
          0.23326756 = weight(abstract_txt:authority in 459) [ClassicSimilarity], result of:
            0.23326756 = score(doc=459,freq=6.0), product of:
              0.22906952 = queryWeight, product of:
                2.8648312 = boost
                5.321345 = idf(docFreq=589, maxDocs=44421)
                0.0150261205 = queryNorm
              1.0183265 = fieldWeight in 459, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.321345 = idf(docFreq=589, maxDocs=44421)
                0.078125 = fieldNorm(doc=459)
        0.16 = coord(4/25)
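
The content scores combine several terms: each term contributes queryWeight * fieldWeight, the contributions are summed, and the sum is scaled by coord (matched terms over the 25 query terms). The sketch below recomputes the first entry (doc 1728) from the numbers in its tree. It is a minimal illustration under the same assumed tf and idf forms that match the printed values; the helper names are hypothetical.

```python
import math

QUERY_NORM = 0.0150261205  # queryNorm as printed in the listing
MAX_DOCS = 44421

def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed idf form; matches the idf values printed in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, boost, field_norm):
    i = idf(doc_freq)
    query_weight = boost * i * QUERY_NORM          # query-side factor
    field_weight = math.sqrt(freq) * i * field_norm  # document-side factor
    return query_weight * field_weight

# (freq, docFreq, boost) copied from the tree for doc 1728.
terms = [
    (2.0, 2105, 1.4532013),  # process
    (3.0, 589, 2.8648312),   # authority
    (5.0, 14, 4.84175),      # reconciliation
]
field_norm = 0.09375
coord = 3 / 25  # 3 of the 25 query terms matched

total = coord * sum(term_score(f, df, b, field_norm) for f, df, b in terms)
print(f"{total:.4f}")
```

The rare term "reconciliation" (docFreq=14) carries most of the score, which is why the top hits all concern reconciliation even where few other abstract terms match.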