Document (#44181)

Author
McElfresh, L.K.
Title
Creator name standardization using faceted vocabularies in the BTAA geoportal : Michigan State University libraries digital repository case study
Source
Cataloging and classification quarterly. 61(2023) no.5-6, S.605-625
Year
2023
Abstract
Digital libraries incorporate metadata from varied sources, ranging from traditional catalog data to author-supplied descriptions. The Big Ten Academic Alliance (BTAA) Geoportal unites geospatial resources from the libraries of the BTAA, compounding the variability of metadata. The BTAA Geospatial Information Network's (BTAA GIN) Metadata Committee works to ensure completeness and consistency of metadata in the Geoportal, including a project to standardize the contents of the Creator field. The project comprises an OpenRefine data cleaning phase; evaluation of controlled vocabularies for semiautomated matching via OpenRefine reconciliation; and development and testing of a best practices guide for application of a controlled vocabulary.
Content
Vgl.: https://www.tandfonline.com/doi/full/10.1080/01639374.2023.2200430.
Footnote
Beitrag in Themenheft: Implementation of Faceted Vocabularies.
Field
Geowissenschaften
Location
USA

Similar documents (content)

  1. Hooland, S. van; Verborgh, R.; Wilde, M. De; Hercher, J.; Mannens, E.; Wa, R.Van de: Evaluating the success of vocabulary reconciliation for cultural heritage collections (2013) 0.21
    0.21451667 = sum of:
      0.21451667 = product of:
        0.89381945 = sum of:
          0.012097777 = weight(abstract_txt:from in 1662) [ClassicSimilarity], result of:
            0.012097777 = score(doc=1662,freq=1.0), product of:
              0.05611785 = queryWeight, product of:
                1.2140493 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.016751366 = queryNorm
              0.21557805 = fieldWeight in 1662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.078125 = fieldNorm(doc=1662)
          0.19743772 = weight(abstract_txt:reconciliation in 1662) [ClassicSimilarity], result of:
            0.19743772 = score(doc=1662,freq=2.0), product of:
              0.19870114 = queryWeight, product of:
                1.318941 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.016751366 = queryNorm
              0.9936416 = fieldWeight in 1662, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.078125 = fieldNorm(doc=1662)
          0.08794924 = weight(abstract_txt:controlled in 1662) [ClassicSimilarity], result of:
            0.08794924 = score(doc=1662,freq=2.0), product of:
              0.14601977 = queryWeight, product of:
                1.598991 = boost
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.016751366 = queryNorm
              0.6023105 = fieldWeight in 1662, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4514923 = idf(docFreq=517, maxDocs=44421)
                0.078125 = fieldNorm(doc=1662)
          0.11231957 = weight(abstract_txt:vocabularies in 1662) [ClassicSimilarity], result of:
            0.11231957 = score(doc=1662,freq=2.0), product of:
              0.17188074 = queryWeight, product of:
                1.7348175 = boost
                5.9145703 = idf(docFreq=325, maxDocs=44421)
                0.016751366 = queryNorm
              0.65347385 = fieldWeight in 1662, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9145703 = idf(docFreq=325, maxDocs=44421)
                0.078125 = fieldNorm(doc=1662)
          0.3295516 = weight(abstract_txt:openrefine in 1662) [ClassicSimilarity], result of:
            0.3295516 = score(doc=1662,freq=1.0), product of:
              0.4438292 = queryWeight, product of:
                2.7877135 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.016751366 = queryNorm
              0.74251896 = fieldWeight in 1662, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.078125 = fieldNorm(doc=1662)
          0.15446356 = weight(abstract_txt:metadata in 1662) [ClassicSimilarity], result of:
            0.15446356 = score(doc=1662,freq=3.0), product of:
              0.23394866 = queryWeight, product of:
                2.862302 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.016751366 = queryNorm
              0.66024554 = fieldWeight in 1662, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.078125 = fieldNorm(doc=1662)
        0.24 = coord(6/25)
    
  2. Lynch, J.D.; Gibson, J.; Han, M.-J.: Analyzing and normalizing type metadata for a large aggregated digital library (2020) 0.16
    0.15592568 = sum of:
      0.15592568 = product of:
        0.7796284 = sum of:
          0.014517331 = weight(abstract_txt:from in 720) [ClassicSimilarity], result of:
            0.014517331 = score(doc=720,freq=1.0), product of:
              0.05611785 = queryWeight, product of:
                1.2140493 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.016751366 = queryNorm
              0.25869364 = fieldWeight in 720, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.09375 = fieldNorm(doc=720)
          0.05279198 = weight(abstract_txt:digital in 720) [ClassicSimilarity], result of:
            0.05279198 = score(doc=720,freq=2.0), product of:
              0.09201276 = queryWeight, product of:
                1.2692999 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.016751366 = queryNorm
              0.57374626 = fieldWeight in 720, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.09375 = fieldNorm(doc=720)
          0.05472373 = weight(abstract_txt:project in 720) [ClassicSimilarity], result of:
            0.05472373 = score(doc=720,freq=2.0), product of:
              0.09424389 = queryWeight, product of:
                1.2845967 = boost
                4.3796177 = idf(docFreq=1512, maxDocs=44421)
                0.016751366 = queryNorm
              0.58066076 = fieldWeight in 720, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3796177 = idf(docFreq=1512, maxDocs=44421)
                0.09375 = fieldNorm(doc=720)
          0.39546195 = weight(abstract_txt:openrefine in 720) [ClassicSimilarity], result of:
            0.39546195 = score(doc=720,freq=1.0), product of:
              0.4438292 = queryWeight, product of:
                2.7877135 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.016751366 = queryNorm
              0.8910228 = fieldWeight in 720, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.09375 = fieldNorm(doc=720)
          0.2621334 = weight(abstract_txt:metadata in 720) [ClassicSimilarity], result of:
            0.2621334 = score(doc=720,freq=6.0), product of:
              0.23394866 = queryWeight, product of:
                2.862302 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.016751366 = queryNorm
              1.120474 = fieldWeight in 720, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.09375 = fieldNorm(doc=720)
        0.2 = coord(5/25)
    
  3. Integrating multiple overlapping metadata standards (1999) 0.14
    0.13636187 = sum of:
      0.13636187 = product of:
        0.85226166 = sum of:
          0.062215947 = weight(abstract_txt:digital in 5052) [ClassicSimilarity], result of:
            0.062215947 = score(doc=5052,freq=1.0), product of:
              0.09201276 = queryWeight, product of:
                1.2692999 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.016751366 = queryNorm
              0.6761665 = fieldWeight in 5052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.15625 = fieldNorm(doc=5052)
          0.062427036 = weight(abstract_txt:libraries in 5052) [ClassicSimilarity], result of:
            0.062427036 = score(doc=5052,freq=1.0), product of:
              0.10556643 = queryWeight, product of:
                1.6651323 = boost
                3.78466 = idf(docFreq=2742, maxDocs=44421)
                0.016751366 = queryNorm
              0.5913531 = fieldWeight in 5052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.78466 = idf(docFreq=2742, maxDocs=44421)
                0.15625 = fieldNorm(doc=5052)
          0.47538072 = weight(abstract_txt:geospatial in 5052) [ClassicSimilarity], result of:
            0.47538072 = score(doc=5052,freq=1.0), product of:
              0.35695046 = queryWeight, product of:
                2.500024 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.016751366 = queryNorm
              1.3317834 = fieldWeight in 5052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.15625 = fieldNorm(doc=5052)
          0.25223795 = weight(abstract_txt:metadata in 5052) [ClassicSimilarity], result of:
            0.25223795 = score(doc=5052,freq=2.0), product of:
              0.23394866 = queryWeight, product of:
                2.862302 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.016751366 = queryNorm
              1.0781765 = fieldWeight in 5052, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.15625 = fieldNorm(doc=5052)
        0.16 = coord(4/25)
    
  4. Hooland, S. van; Verborgh, R.: Linked data for Lilibraries, archives and museums : how to clean, link, and publish your metadata (2014) 0.13
    0.13026397 = sum of:
      0.13026397 = product of:
        0.5427666 = sum of:
          0.010265304 = weight(abstract_txt:from in 153) [ClassicSimilarity], result of:
            0.010265304 = score(doc=153,freq=2.0), product of:
              0.05611785 = queryWeight, product of:
                1.2140493 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.016751366 = queryNorm
              0.18292403 = fieldWeight in 153, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.046875 = fieldNorm(doc=153)
          0.018664785 = weight(abstract_txt:digital in 153) [ClassicSimilarity], result of:
            0.018664785 = score(doc=153,freq=1.0), product of:
              0.09201276 = queryWeight, product of:
                1.2692999 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.016751366 = queryNorm
              0.20284995 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.046875 = fieldNorm(doc=153)
          0.12940371 = weight(abstract_txt:cleaning in 153) [ClassicSimilarity], result of:
            0.12940371 = score(doc=153,freq=3.0), product of:
              0.1841112 = queryWeight, product of:
                1.2695953 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.016751366 = queryNorm
              0.7028563 = fieldWeight in 153, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.046875 = fieldNorm(doc=153)
          0.1450865 = weight(abstract_txt:reconciliation in 153) [ClassicSimilarity], result of:
            0.1450865 = score(doc=153,freq=3.0), product of:
              0.19870114 = queryWeight, product of:
                1.318941 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.016751366 = queryNorm
              0.7301745 = fieldWeight in 153, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.046875 = fieldNorm(doc=153)
          0.018728111 = weight(abstract_txt:libraries in 153) [ClassicSimilarity], result of:
            0.018728111 = score(doc=153,freq=1.0), product of:
              0.10556643 = queryWeight, product of:
                1.6651323 = boost
                3.78466 = idf(docFreq=2742, maxDocs=44421)
                0.016751366 = queryNorm
              0.17740594 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.78466 = idf(docFreq=2742, maxDocs=44421)
                0.046875 = fieldNorm(doc=153)
          0.22061813 = weight(abstract_txt:metadata in 153) [ClassicSimilarity], result of:
            0.22061813 = score(doc=153,freq=17.0), product of:
              0.23394866 = queryWeight, product of:
                2.862302 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.016751366 = queryNorm
              0.9430194 = fieldWeight in 153, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.046875 = fieldNorm(doc=153)
        0.24 = coord(6/25)
    
  5. Gilliland, A.J.: Contemplating co-creator rights in archival description (2012) 0.11
    0.110443056 = sum of:
      0.110443056 = product of:
        0.5522153 = sum of:
          0.009678221 = weight(abstract_txt:from in 1415) [ClassicSimilarity], result of:
            0.009678221 = score(doc=1415,freq=1.0), product of:
              0.05611785 = queryWeight, product of:
                1.2140493 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.016751366 = queryNorm
              0.17246243 = fieldWeight in 1415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=1415)
          0.025797013 = weight(abstract_txt:project in 1415) [ClassicSimilarity], result of:
            0.025797013 = score(doc=1415,freq=1.0), product of:
              0.09424389 = queryWeight, product of:
                1.2845967 = boost
                4.3796177 = idf(docFreq=1512, maxDocs=44421)
                0.016751366 = queryNorm
              0.2737261 = fieldWeight in 1415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3796177 = idf(docFreq=1512, maxDocs=44421)
                0.0625 = fieldNorm(doc=1415)
          0.11168765 = weight(abstract_txt:reconciliation in 1415) [ClassicSimilarity], result of:
            0.11168765 = score(doc=1415,freq=1.0), product of:
              0.19870114 = queryWeight, product of:
                1.318941 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.016751366 = queryNorm
              0.5620886 = fieldWeight in 1415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0625 = fieldNorm(doc=1415)
          0.30415723 = weight(abstract_txt:creator in 1415) [ClassicSimilarity], result of:
            0.30415723 = score(doc=1415,freq=3.0), product of:
              0.33850515 = queryWeight, product of:
                2.4345732 = boost
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.016751366 = queryNorm
              0.8985306 = fieldWeight in 1415, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.0625 = fieldNorm(doc=1415)
          0.10089518 = weight(abstract_txt:metadata in 1415) [ClassicSimilarity], result of:
            0.10089518 = score(doc=1415,freq=2.0), product of:
              0.23394866 = queryWeight, product of:
                2.862302 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.016751366 = queryNorm
              0.4312706 = fieldWeight in 1415, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.0625 = fieldNorm(doc=1415)
        0.2 = coord(5/25)