Document (#37044)

Author
Margaritopoulos, M.
Margaritopoulos, T.
Mavridis, I.
Manitsaris, A.
Title
Quantifying and measuring metadata completeness
Source
Journal of the American Society for Information Science and Technology. 63(2012) no.4, S.724-737
Year
2012
Abstract
Completeness of metadata is one of the most essential characteristics of their quality. An incomplete metadata record is a record of degraded quality. Existing approaches to measure metadata completeness limit their scope in counting the existence of values in fields, regardless of the metadata hierarchy as defined in international standards. Such a traditional approach overlooks several issues that need to be taken into account. This paper presents a fine-grained metrics system for measuring metadata completeness, based on field completeness. A metadata field is considered to be a container of multiple pieces of information. In this regard, the proposed system is capable of following the hierarchy of metadata as it is set by the metadata schema and admeasuring the effect of multiple values of multivalued fields. An application of the proposed metrics system, after being configured according to specific user requirements, to measure completeness of a real-world set of metadata is demonstrated. The results prove its ability to assess the sufficiency of metadata to describe a resource and provide targeted measures of completeness throughout the metadata hierarchy.
Theme
Metadaten

Similar documents (content)

  1. Park, J.-r.: Metadata quality in digital repositories : a survey of the current state of the art (2009) 0.17
    0.17158374 = sum of:
      0.17158374 = product of:
        1.0723984 = sum of:
          0.06861527 = weight(abstract_txt:quality in 2982) [ClassicSimilarity], result of:
            0.06861527 = score(doc=2982,freq=7.0), product of:
              0.07134479 = queryWeight, product of:
                1.2909801 = boost
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.011877452 = queryNorm
              0.9617418 = fieldWeight in 2982, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.078125 = fieldNorm(doc=2982)
          0.073714994 = weight(abstract_txt:measuring in 2982) [ClassicSimilarity], result of:
            0.073714994 = score(doc=2982,freq=1.0), product of:
              0.1431589 = queryWeight, product of:
                1.8287215 = boost
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.011877452 = queryNorm
              0.5149173 = fieldWeight in 2982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.078125 = fieldNorm(doc=2982)
          0.4219064 = weight(abstract_txt:completeness in 2982) [ClassicSimilarity], result of:
            0.4219064 = score(doc=2982,freq=1.0), product of:
              0.6954745 = queryWeight, product of:
                7.540724 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.011877452 = queryNorm
              0.6066454 = fieldWeight in 2982, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.078125 = fieldNorm(doc=2982)
          0.5081618 = weight(abstract_txt:metadata in 2982) [ClassicSimilarity], result of:
            0.5081618 = score(doc=2982,freq=8.0), product of:
              0.47112507 = queryWeight, product of:
                8.1261 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.011877452 = queryNorm
              1.0786134 = fieldWeight in 2982, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.078125 = fieldNorm(doc=2982)
        0.16 = coord(4/25)
    
  2. Foulonneau, M.: Information redundancy across metadata collections (2007) 0.16
    0.16137092 = sum of:
      0.16137092 = product of:
        1.0085683 = sum of:
          0.020747308 = weight(abstract_txt:quality in 915) [ClassicSimilarity], result of:
            0.020747308 = score(doc=915,freq=1.0), product of:
              0.07134479 = queryWeight, product of:
                1.2909801 = boost
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.011877452 = queryNorm
              0.2908034 = fieldWeight in 915, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.0625 = fieldNorm(doc=915)
          0.033790283 = weight(abstract_txt:record in 915) [ClassicSimilarity], result of:
            0.033790283 = score(doc=915,freq=1.0), product of:
              0.098760284 = queryWeight, product of:
                1.5189013 = boost
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.011877452 = queryNorm
              0.34214443 = fieldWeight in 915, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.0625 = fieldNorm(doc=915)
          0.47733262 = weight(abstract_txt:completeness in 915) [ClassicSimilarity], result of:
            0.47733262 = score(doc=915,freq=2.0), product of:
              0.6954745 = queryWeight, product of:
                7.540724 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.011877452 = queryNorm
              0.6863409 = fieldWeight in 915, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=915)
          0.47669807 = weight(abstract_txt:metadata in 915) [ClassicSimilarity], result of:
            0.47669807 = score(doc=915,freq=11.0), product of:
              0.47112507 = queryWeight, product of:
                8.1261 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.011877452 = queryNorm
              1.0118291 = fieldWeight in 915, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.0625 = fieldNorm(doc=915)
        0.16 = coord(4/25)
    
  3. Zavalin, V.: Exploration of subject and genre representation in bibliographic metadata representing works of fiction for children and young adults (2024) 0.13
    0.12669022 = sum of:
      0.12669022 = product of:
        0.7918139 = sum of:
          0.025934136 = weight(abstract_txt:quality in 1152) [ClassicSimilarity], result of:
            0.025934136 = score(doc=1152,freq=1.0), product of:
              0.07134479 = queryWeight, product of:
                1.2909801 = boost
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.011877452 = queryNorm
              0.36350426 = fieldWeight in 1152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.078125 = fieldNorm(doc=1152)
          0.032789066 = weight(abstract_txt:fields in 1152) [ClassicSimilarity], result of:
            0.032789066 = score(doc=1152,freq=1.0), product of:
              0.08341941 = queryWeight, product of:
                1.3959568 = boost
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.011877452 = queryNorm
              0.39306277 = fieldWeight in 1152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.078125 = fieldNorm(doc=1152)
          0.4219064 = weight(abstract_txt:completeness in 1152) [ClassicSimilarity], result of:
            0.4219064 = score(doc=1152,freq=1.0), product of:
              0.6954745 = queryWeight, product of:
                7.540724 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.011877452 = queryNorm
              0.6066454 = fieldWeight in 1152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.078125 = fieldNorm(doc=1152)
          0.3111843 = weight(abstract_txt:metadata in 1152) [ClassicSimilarity], result of:
            0.3111843 = score(doc=1152,freq=3.0), product of:
              0.47112507 = queryWeight, product of:
                8.1261 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.011877452 = queryNorm
              0.6605131 = fieldWeight in 1152, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.078125 = fieldNorm(doc=1152)
        0.16 = coord(4/25)
    
  4. McElfresh, L.K.: Creator name standardization using faceted vocabularies in the BTAA geoportal : Michigan State University libraries digital repository case study (2023) 0.12
    0.1158577 = sum of:
      0.1158577 = product of:
        0.96548086 = sum of:
          0.028003504 = weight(abstract_txt:field in 1178) [ClassicSimilarity], result of:
            0.028003504 = score(doc=1178,freq=1.0), product of:
              0.06649697 = queryWeight, product of:
                1.246348 = boost
                4.491995 = idf(docFreq=1345, maxDocs=44218)
                0.011877452 = queryNorm
              0.42112452 = fieldWeight in 1178, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.491995 = idf(docFreq=1345, maxDocs=44218)
                0.09375 = fieldNorm(doc=1178)
          0.50628775 = weight(abstract_txt:completeness in 1178) [ClassicSimilarity], result of:
            0.50628775 = score(doc=1178,freq=1.0), product of:
              0.6954745 = queryWeight, product of:
                7.540724 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.011877452 = queryNorm
              0.72797453 = fieldWeight in 1178, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.09375 = fieldNorm(doc=1178)
          0.4311896 = weight(abstract_txt:metadata in 1178) [ClassicSimilarity], result of:
            0.4311896 = score(doc=1178,freq=4.0), product of:
              0.47112507 = queryWeight, product of:
                8.1261 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.011877452 = queryNorm
              0.91523385 = fieldWeight in 1178, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.09375 = fieldNorm(doc=1178)
        0.12 = coord(3/25)
    
  5. Margaritopoulos, T.; Margaritopoulos, M.; Mavridis, I.; Manitsaris, A.: ¬A conceptual framework for metadata quality assessment (2008) 0.12
    0.115577474 = sum of:
      0.115577474 = product of:
        0.57788736 = sum of:
          0.053903088 = weight(abstract_txt:quality in 2643) [ClassicSimilarity], result of:
            0.053903088 = score(doc=2643,freq=3.0), product of:
              0.07134479 = queryWeight, product of:
                1.2909801 = boost
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.011877452 = queryNorm
              0.7555294 = fieldWeight in 2643, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.09375 = fieldNorm(doc=2643)
          0.039346877 = weight(abstract_txt:fields in 2643) [ClassicSimilarity], result of:
            0.039346877 = score(doc=2643,freq=1.0), product of:
              0.08341941 = queryWeight, product of:
                1.3959568 = boost
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.011877452 = queryNorm
              0.4716753 = fieldWeight in 2643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0312033 = idf(docFreq=784, maxDocs=44218)
                0.09375 = fieldNorm(doc=2643)
          0.05068542 = weight(abstract_txt:record in 2643) [ClassicSimilarity], result of:
            0.05068542 = score(doc=2643,freq=1.0), product of:
              0.098760284 = queryWeight, product of:
                1.5189013 = boost
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.011877452 = queryNorm
              0.5132166 = fieldWeight in 2643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.474311 = idf(docFreq=503, maxDocs=44218)
                0.09375 = fieldNorm(doc=2643)
          0.060530815 = weight(abstract_txt:values in 2643) [ClassicSimilarity], result of:
            0.060530815 = score(doc=2643,freq=1.0), product of:
              0.111167535 = queryWeight, product of:
                1.6114892 = boost
                5.808009 = idf(docFreq=360, maxDocs=44218)
                0.011877452 = queryNorm
              0.5445008 = fieldWeight in 2643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.808009 = idf(docFreq=360, maxDocs=44218)
                0.09375 = fieldNorm(doc=2643)
          0.37342116 = weight(abstract_txt:metadata in 2643) [ClassicSimilarity], result of:
            0.37342116 = score(doc=2643,freq=3.0), product of:
              0.47112507 = queryWeight, product of:
                8.1261 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.011877452 = queryNorm
              0.7926158 = fieldWeight in 2643, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.09375 = fieldNorm(doc=2643)
        0.2 = coord(5/25)