Document (#39663)

Author
Tani, A.
Candela, L.
Castelli, D.
Title
Dealing with metadata quality : the legacy of digital library efforts
Source
Information processing and management. 49(2013) no.6, S.1194-1205
Year
2013
Abstract
In this work, we elaborate on the meaning of metadata quality by surveying efforts and experiences matured in the digital library domain. In particular, an overview of the frameworks developed to characterize such a multi-faceted concept is presented. Moreover, the most common quality-related problems affecting metadata both during the creation and the aggregation phase are discussed together with the approaches, technologies and tools developed to mitigate them. This survey on digital library developments is expected to contribute to the ongoing discussion on data and metadata quality occurring in the emerging yet more general framework of data infrastructures.
Content
Vgl.: doi: 10.1016/j.ipm.2013.05.003.
Theme
Metadaten

Similar documents (author)

  1. Candela, L.; Castelli, D.; Manghi, P.; Tani, A.: Data journals : a survey (2015) 4.35
    4.3454504 = sum of:
      4.3454504 = sum of:
        2.1216395 = weight(author_txt:castelli in 3156) [ClassicSimilarity], result of:
          2.1216395 = score(doc=3156,freq=1.0), product of:
            0.6959363 = queryWeight, product of:
              9.755557 = idf(docFreq=6, maxDocs=44421)
              0.071337424 = queryNorm
            3.0486116 = fieldWeight in 3156, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.755557 = idf(docFreq=6, maxDocs=44421)
              0.3125 = fieldNorm(doc=3156)
        2.2238111 = weight(author_txt:candela in 3156) [ClassicSimilarity], result of:
          2.2238111 = score(doc=3156,freq=1.0), product of:
            0.7181035 = queryWeight, product of:
              1.0158013 = boost
              9.909708 = idf(docFreq=5, maxDocs=44421)
              0.071337424 = queryNorm
            3.0967836 = fieldWeight in 3156, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.909708 = idf(docFreq=5, maxDocs=44421)
              0.3125 = fieldNorm(doc=3156)
    
  2. Candela, G.: ¬An automatic data quality approach to assess semantic data from cultural heritage institutions (2023) 2.22
    2.2238111 = sum of:
      2.2238111 = product of:
        4.4476223 = sum of:
          4.4476223 = weight(author_txt:candela in 1999) [ClassicSimilarity], result of:
            4.4476223 = score(doc=1999,freq=1.0), product of:
              0.7181035 = queryWeight, product of:
                1.0158013 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.071337424 = queryNorm
              6.1935673 = fieldWeight in 1999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.625 = fieldNorm(doc=1999)
        0.5 = coord(1/2)
    
  3. Castelli, V.: Progressive search and retrieval from image databases (2002) 2.12
    2.1216395 = sum of:
      2.1216395 = product of:
        4.243279 = sum of:
          4.243279 = weight(author_txt:castelli in 5253) [ClassicSimilarity], result of:
            4.243279 = score(doc=5253,freq=1.0), product of:
              0.6959363 = queryWeight, product of:
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.071337424 = queryNorm
              6.0972233 = fieldWeight in 5253, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.625 = fieldNorm(doc=5253)
        0.5 = coord(1/2)
    
  4. Castelli, D.: Digital libraries of the future - and the role of libraries (2006) 2.12
    2.1216395 = sum of:
      2.1216395 = product of:
        4.243279 = sum of:
          4.243279 = weight(author_txt:castelli in 3589) [ClassicSimilarity], result of:
            4.243279 = score(doc=3589,freq=1.0), product of:
              0.6959363 = queryWeight, product of:
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.071337424 = queryNorm
              6.0972233 = fieldWeight in 3589, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.625 = fieldNorm(doc=3589)
        0.5 = coord(1/2)
    
  5. Castelli, V.: Still image search and retrieval (2009) 2.12
    2.1216395 = sum of:
      2.1216395 = product of:
        4.243279 = sum of:
          4.243279 = weight(author_txt:castelli in 872) [ClassicSimilarity], result of:
            4.243279 = score(doc=872,freq=1.0), product of:
              0.6959363 = queryWeight, product of:
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.071337424 = queryNorm
              6.0972233 = fieldWeight in 872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.625 = fieldNorm(doc=872)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Mullin, C.A.: ¬An amicable divorce : programmatic derivation of faceted data from Library of Congress Subject Headings for music (2018) 0.23
    0.23337206 = sum of:
      0.23337206 = product of:
        0.83347166 = sum of:
          0.1252545 = weight(abstract_txt:faceted in 183) [ClassicSimilarity], result of:
            0.1252545 = score(doc=183,freq=3.0), product of:
              0.12896632 = queryWeight, product of:
                1.0064938 = boost
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.02142299 = queryNorm
              0.9712187 = fieldWeight in 183, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.09375 = fieldNorm(doc=183)
          0.03530557 = weight(abstract_txt:data in 183) [ClassicSimilarity], result of:
            0.03530557 = score(doc=183,freq=2.0), product of:
              0.07996194 = queryWeight, product of:
                1.1208038 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.02142299 = queryNorm
              0.4415297 = fieldWeight in 183, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=183)
          0.14502335 = weight(abstract_txt:legacy in 183) [ClassicSimilarity], result of:
            0.14502335 = score(doc=183,freq=1.0), product of:
              0.20509093 = queryWeight, product of:
                1.269247 = boost
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.02142299 = queryNorm
              0.7071173 = fieldWeight in 183, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.09375 = fieldNorm(doc=183)
          0.20628281 = weight(abstract_txt:mitigate in 183) [ClassicSimilarity], result of:
            0.20628281 = score(doc=183,freq=1.0), product of:
              0.25939596 = queryWeight, product of:
                1.4274291 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.02142299 = queryNorm
              0.79524297 = fieldWeight in 183, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.09375 = fieldNorm(doc=183)
          0.046497557 = weight(abstract_txt:library in 183) [ClassicSimilarity], result of:
            0.046497557 = score(doc=183,freq=2.0), product of:
              0.1099778 = queryWeight, product of:
                1.6098526 = boost
                3.188885 = idf(docFreq=4976, maxDocs=44421)
                0.02142299 = queryNorm
              0.4227904 = fieldWeight in 183, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.188885 = idf(docFreq=4976, maxDocs=44421)
                0.09375 = fieldNorm(doc=183)
          0.118070886 = weight(abstract_txt:efforts in 183) [ClassicSimilarity], result of:
            0.118070886 = score(doc=183,freq=1.0), product of:
              0.22529924 = queryWeight, product of:
                1.8813423 = boost
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.02142299 = queryNorm
              0.5240625 = fieldWeight in 183, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.09375 = fieldNorm(doc=183)
          0.15703698 = weight(abstract_txt:metadata in 183) [ClassicSimilarity], result of:
            0.15703698 = score(doc=183,freq=1.0), product of:
              0.34330156 = queryWeight, product of:
                3.284285 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.02142299 = queryNorm
              0.45743155 = fieldWeight in 183, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.09375 = fieldNorm(doc=183)
        0.28 = coord(7/25)
    
  2. Palavitsinis, N.; Manouselis, N.; Sanchez-Alonso, S.: Metadata quality in digital repositories : empirical results from the cross-domain transfer of a quality assurance process (2014) 0.15
    0.15287517 = sum of:
      0.15287517 = product of:
        0.6369799 = sum of:
          0.06830919 = weight(abstract_txt:frameworks in 2288) [ClassicSimilarity], result of:
            0.06830919 = score(doc=2288,freq=1.0), product of:
              0.16269271 = queryWeight, product of:
                1.1304647 = boost
                6.717861 = idf(docFreq=145, maxDocs=44421)
                0.02142299 = queryNorm
              0.41986632 = fieldWeight in 2288, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.717861 = idf(docFreq=145, maxDocs=44421)
                0.0625 = fieldNorm(doc=2288)
          0.03329008 = weight(abstract_txt:developed in 2288) [ClassicSimilarity], result of:
            0.03329008 = score(doc=2288,freq=1.0), product of:
              0.126941 = queryWeight, product of:
                1.4121763 = boost
                4.1959753 = idf(docFreq=1817, maxDocs=44421)
                0.02142299 = queryNorm
              0.26224846 = fieldWeight in 2288, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1959753 = idf(docFreq=1817, maxDocs=44421)
                0.0625 = fieldNorm(doc=2288)
          0.02191916 = weight(abstract_txt:library in 2288) [ClassicSimilarity], result of:
            0.02191916 = score(doc=2288,freq=1.0), product of:
              0.1099778 = queryWeight, product of:
                1.6098526 = boost
                3.188885 = idf(docFreq=4976, maxDocs=44421)
                0.02142299 = queryNorm
              0.19930531 = fieldWeight in 2288, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.188885 = idf(docFreq=4976, maxDocs=44421)
                0.0625 = fieldNorm(doc=2288)
          0.07746815 = weight(abstract_txt:digital in 2288) [ClassicSimilarity], result of:
            0.07746815 = score(doc=2288,freq=2.0), product of:
              0.20253243 = queryWeight, product of:
                2.184645 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.02142299 = queryNorm
              0.38249752 = fieldWeight in 2288, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.0625 = fieldNorm(doc=2288)
          0.20189638 = weight(abstract_txt:quality in 2288) [ClassicSimilarity], result of:
            0.20189638 = score(doc=2288,freq=5.0), product of:
              0.3110506 = queryWeight, product of:
                3.1262124 = boost
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.02142299 = queryNorm
              0.6490789 = fieldWeight in 2288, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.0625 = fieldNorm(doc=2288)
          0.2340969 = weight(abstract_txt:metadata in 2288) [ClassicSimilarity], result of:
            0.2340969 = score(doc=2288,freq=5.0), product of:
              0.34330156 = queryWeight, product of:
                3.284285 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.02142299 = queryNorm
              0.6818987 = fieldWeight in 2288, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.0625 = fieldNorm(doc=2288)
        0.24 = coord(6/25)
    
  3. Park, J.-r.; Maszaros, S.: Metadata Object Description Schema (MODS) in digital repositories : an exploratory study of metadata use and quality (2009) 0.15
    0.1521869 = sum of:
      0.1521869 = product of:
        0.6341121 = sum of:
          0.047283337 = weight(abstract_txt:contribute in 245) [ClassicSimilarity], result of:
            0.047283337 = score(doc=245,freq=1.0), product of:
              0.12730753 = queryWeight, product of:
                5.942566 = idf(docFreq=316, maxDocs=44421)
                0.02142299 = queryNorm
              0.37141037 = fieldWeight in 245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.942566 = idf(docFreq=316, maxDocs=44421)
                0.0625 = fieldNorm(doc=245)
          0.016643206 = weight(abstract_txt:data in 245) [ClassicSimilarity], result of:
            0.016643206 = score(doc=245,freq=1.0), product of:
              0.07996194 = queryWeight, product of:
                1.1208038 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.02142299 = queryNorm
              0.20813909 = fieldWeight in 245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=245)
          0.0883527 = weight(abstract_txt:occurring in 245) [ClassicSimilarity], result of:
            0.0883527 = score(doc=245,freq=1.0), product of:
              0.19313541 = queryWeight, product of:
                1.2316971 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.02142299 = queryNorm
              0.45746505 = fieldWeight in 245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.0625 = fieldNorm(doc=245)
          0.07746815 = weight(abstract_txt:digital in 245) [ClassicSimilarity], result of:
            0.07746815 = score(doc=245,freq=2.0), product of:
              0.20253243 = queryWeight, product of:
                2.184645 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.02142299 = queryNorm
              0.38249752 = fieldWeight in 245, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.0625 = fieldNorm(doc=245)
          0.09029081 = weight(abstract_txt:quality in 245) [ClassicSimilarity], result of:
            0.09029081 = score(doc=245,freq=1.0), product of:
              0.3110506 = queryWeight, product of:
                3.1262124 = boost
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.02142299 = queryNorm
              0.2902769 = fieldWeight in 245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.0625 = fieldNorm(doc=245)
          0.31407395 = weight(abstract_txt:metadata in 245) [ClassicSimilarity], result of:
            0.31407395 = score(doc=245,freq=9.0), product of:
              0.34330156 = queryWeight, product of:
                3.284285 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.02142299 = queryNorm
              0.9148631 = fieldWeight in 245, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.0625 = fieldNorm(doc=245)
        0.24 = coord(6/25)
    
  4. Kurth, M.; Ruddy, D.; Rupp, N.: Repurposing MARC metadata : using digital project experience to develop a metadata management design (2004) 0.15
    0.14771557 = sum of:
      0.14771557 = product of:
        0.73857784 = sum of:
          0.020804007 = weight(abstract_txt:data in 5748) [ClassicSimilarity], result of:
            0.020804007 = score(doc=5748,freq=1.0), product of:
              0.07996194 = queryWeight, product of:
                1.1208038 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.02142299 = queryNorm
              0.26017386 = fieldWeight in 5748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=5748)
          0.047456373 = weight(abstract_txt:library in 5748) [ClassicSimilarity], result of:
            0.047456373 = score(doc=5748,freq=3.0), product of:
              0.1099778 = queryWeight, product of:
                1.6098526 = boost
                3.188885 = idf(docFreq=4976, maxDocs=44421)
                0.02142299 = queryNorm
              0.43150866 = fieldWeight in 5748, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.188885 = idf(docFreq=4976, maxDocs=44421)
                0.078125 = fieldNorm(doc=5748)
          0.098392405 = weight(abstract_txt:efforts in 5748) [ClassicSimilarity], result of:
            0.098392405 = score(doc=5748,freq=1.0), product of:
              0.22529924 = queryWeight, product of:
                1.8813423 = boost
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.02142299 = queryNorm
              0.43671876 = fieldWeight in 5748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.078125 = fieldNorm(doc=5748)
          0.1185984 = weight(abstract_txt:digital in 5748) [ClassicSimilarity], result of:
            0.1185984 = score(doc=5748,freq=3.0), product of:
              0.20253243 = queryWeight, product of:
                2.184645 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.02142299 = queryNorm
              0.58557737 = fieldWeight in 5748, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.078125 = fieldNorm(doc=5748)
          0.45332664 = weight(abstract_txt:metadata in 5748) [ClassicSimilarity], result of:
            0.45332664 = score(doc=5748,freq=12.0), product of:
              0.34330156 = queryWeight, product of:
                3.284285 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.02142299 = queryNorm
              1.3204911 = fieldWeight in 5748, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.078125 = fieldNorm(doc=5748)
        0.2 = coord(5/25)
    
  5. Jarke, M.; Lenzerini, M.; Vassiliou, Y.: Fundamentals of data warehousing (1999) 0.14
    0.14256246 = sum of:
      0.14256246 = product of:
        0.7128123 = sum of:
          0.04992962 = weight(abstract_txt:data in 2302) [ClassicSimilarity], result of:
            0.04992962 = score(doc=2302,freq=4.0), product of:
              0.07996194 = queryWeight, product of:
                1.1208038 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.02142299 = queryNorm
              0.6244173 = fieldWeight in 2302, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=2302)
          0.13119228 = weight(abstract_txt:aggregation in 2302) [ClassicSimilarity], result of:
            0.13119228 = score(doc=2302,freq=1.0), product of:
              0.1918345 = queryWeight, product of:
                1.2275418 = boost
                7.2947483 = idf(docFreq=81, maxDocs=44421)
                0.02142299 = queryNorm
              0.68388265 = fieldWeight in 2302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2947483 = idf(docFreq=81, maxDocs=44421)
                0.09375 = fieldNorm(doc=2302)
          0.118070886 = weight(abstract_txt:efforts in 2302) [ClassicSimilarity], result of:
            0.118070886 = score(doc=2302,freq=1.0), product of:
              0.22529924 = queryWeight, product of:
                1.8813423 = boost
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.02142299 = queryNorm
              0.5240625 = fieldWeight in 2302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.09375 = fieldNorm(doc=2302)
          0.19153573 = weight(abstract_txt:quality in 2302) [ClassicSimilarity], result of:
            0.19153573 = score(doc=2302,freq=2.0), product of:
              0.3110506 = queryWeight, product of:
                3.1262124 = boost
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.02142299 = queryNorm
              0.61577034 = fieldWeight in 2302, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.09375 = fieldNorm(doc=2302)
          0.2220838 = weight(abstract_txt:metadata in 2302) [ClassicSimilarity], result of:
            0.2220838 = score(doc=2302,freq=2.0), product of:
              0.34330156 = queryWeight, product of:
                3.284285 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.02142299 = queryNorm
              0.6469059 = fieldWeight in 2302, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.09375 = fieldNorm(doc=2302)
        0.2 = coord(5/25)