Document (#38588)

Author
Stiller, J.
Olensky, M.
Petras, V.
Title
¬A framework for the evaluation of automatic metadata enrichments
Source
Metadata and semantics research: 8th Research Conference, MTSR 2014, Karlsruhe, Germany, November 27-29, 2014, Proceedings. Eds.: S. Closs et al
Imprint
Cham : Springer
Year
2014
Pages
S.238-249
Series
Communications in computer and information science; 478
Abstract
Automatic enrichment of collections connects data to vocabularies, which supports the contextualization of content and adds searchable text to metadata. The paper introduces a framework of four dimensions (frequency, coverage, relevance and error rate) that measure both the suitability of the enrichment for the object and the enrichments' contribution to search success. To verify the framework, it is applied to the evaluation of automatic enrichments in the digital library Europeana. The analysis of 100 result sets and their corresponding queries (1,121 documents total) shows the framework is a valuable tool for guiding enrichments and determining the value of enrichment efforts.
Theme
Metadaten
Object
Europeana

Similar documents (author)

  1. Gradmann, S.; Olensky, M.: Semantische Kontextualisierung von Museumsbeständen in Europeana (2013) 1.91
    1.9115031 = sum of:
      1.9115031 = product of:
        3.8230062 = sum of:
          3.8230062 = weight(author_txt:olensky in 939) [ClassicSimilarity], result of:
            3.8230062 = score(doc=939,freq=1.0), product of:
              0.7719247 = queryWeight, product of:
                1.1019365 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.07072261 = queryNorm
              4.952564 = fieldWeight in 939, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.5 = fieldNorm(doc=939)
        0.5 = coord(1/2)
    
  2. Petras, V.: Heterogenitätsbehandlung und Terminology Mapping durch Crosskonkordanzen : eine Fallstudie (2010) 1.79
    1.7857282 = sum of:
      1.7857282 = product of:
        3.5714564 = sum of:
          3.5714564 = weight(author_txt:petras in 3730) [ClassicSimilarity], result of:
            3.5714564 = score(doc=3730,freq=1.0), product of:
              0.63571405 = queryWeight, product of:
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.07072261 = queryNorm
              5.6180234 = fieldWeight in 3730, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.625 = fieldNorm(doc=3730)
        0.5 = coord(1/2)
    
  3. Petras, V.: ¬The identity of information science (2023) 1.79
    1.7857282 = sum of:
      1.7857282 = product of:
        3.5714564 = sum of:
          3.5714564 = weight(author_txt:petras in 1077) [ClassicSimilarity], result of:
            3.5714564 = score(doc=1077,freq=1.0), product of:
              0.63571405 = queryWeight, product of:
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.07072261 = queryNorm
              5.6180234 = fieldWeight in 1077, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.625 = fieldNorm(doc=1077)
        0.5 = coord(1/2)
    
  4. Olensky, M.; Schmidt, M.; Eck, N.J. van: Evaluation of the citation matching algorithms of CWTS and iFQ in comparison to the Web of science (2016) 1.43
    1.4336272 = sum of:
      1.4336272 = product of:
        2.8672545 = sum of:
          2.8672545 = weight(author_txt:olensky in 3130) [ClassicSimilarity], result of:
            2.8672545 = score(doc=3130,freq=1.0), product of:
              0.7719247 = queryWeight, product of:
                1.1019365 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.07072261 = queryNorm
              3.7144227 = fieldWeight in 3130, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.375 = fieldNorm(doc=3130)
        0.5 = coord(1/2)
    
  5. Petras, V.; Bank, M.: Vergleich der Suchmaschinen AltaVista und HotBot bezüglich Treffermengen und Aktualität (1998) 1.43
    1.4285825 = sum of:
      1.4285825 = product of:
        2.857165 = sum of:
          2.857165 = weight(author_txt:petras in 2514) [ClassicSimilarity], result of:
            2.857165 = score(doc=2514,freq=1.0), product of:
              0.63571405 = queryWeight, product of:
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.07072261 = queryNorm
              4.4944186 = fieldWeight in 2514, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.5 = fieldNorm(doc=2514)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Chen, S.-J.: Semantic enrichment of linked archival materials (2019) 0.10
    0.09659542 = sum of:
      0.09659542 = product of:
        0.6037214 = sum of:
          0.041097328 = weight(abstract_txt:vocabularies in 5488) [ClassicSimilarity], result of:
            0.041097328 = score(doc=5488,freq=1.0), product of:
              0.111204185 = queryWeight, product of:
                1.0195404 = boost
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.018446086 = queryNorm
              0.36956638 = fieldWeight in 5488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.0625 = fieldNorm(doc=5488)
          0.099341296 = weight(abstract_txt:europeana in 5488) [ClassicSimilarity], result of:
            0.099341296 = score(doc=5488,freq=1.0), product of:
              0.20029277 = queryWeight, product of:
                1.3682848 = boost
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.018446086 = queryNorm
              0.49598044 = fieldWeight in 5488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.935687 = idf(docFreq=42, maxDocs=44218)
                0.0625 = fieldNorm(doc=5488)
          0.06539034 = weight(abstract_txt:metadata in 5488) [ClassicSimilarity], result of:
            0.06539034 = score(doc=5488,freq=2.0), product of:
              0.15156111 = queryWeight, product of:
                1.683266 = boost
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.018446086 = queryNorm
              0.43144536 = fieldWeight in 5488, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.881247 = idf(docFreq=911, maxDocs=44218)
                0.0625 = fieldNorm(doc=5488)
          0.3978924 = weight(abstract_txt:enrichment in 5488) [ClassicSimilarity], result of:
            0.3978924 = score(doc=5488,freq=2.0), product of:
              0.57825524 = queryWeight, product of:
                4.0268393 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.018446086 = queryNorm
              0.6880913 = fieldWeight in 5488, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.0625 = fieldNorm(doc=5488)
        0.16 = coord(4/25)
    
  2. Nicholls, P.; Ridley, J.: ¬A context for evaluating for multimedia (1996) 0.09
    0.090717025 = sum of:
      0.090717025 = product of:
        0.56698143 = sum of:
          0.078796245 = weight(abstract_txt:introduces in 5132) [ClassicSimilarity], result of:
            0.078796245 = score(doc=5132,freq=1.0), product of:
              0.10811744 = queryWeight, product of:
                1.0052909 = boost
                5.830419 = idf(docFreq=352, maxDocs=44218)
                0.018446086 = queryNorm
              0.7288024 = fieldWeight in 5132, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.830419 = idf(docFreq=352, maxDocs=44218)
                0.125 = fieldNorm(doc=5132)
          0.17469598 = weight(abstract_txt:adds in 5132) [ClassicSimilarity], result of:
            0.17469598 = score(doc=5132,freq=1.0), product of:
              0.183829 = queryWeight, product of:
                1.3108436 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.018446086 = queryNorm
              0.95031786 = fieldWeight in 5132, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.125 = fieldNorm(doc=5132)
          0.10151939 = weight(abstract_txt:evaluation in 5132) [ClassicSimilarity], result of:
            0.10151939 = score(doc=5132,freq=2.0), product of:
              0.12801418 = queryWeight, product of:
                1.5469915 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.018446086 = queryNorm
              0.7930324 = fieldWeight in 5132, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.125 = fieldNorm(doc=5132)
          0.21196976 = weight(abstract_txt:framework in 5132) [ClassicSimilarity], result of:
            0.21196976 = score(doc=5132,freq=2.0), product of:
              0.26348224 = queryWeight, product of:
                3.138698 = boost
                4.550903 = idf(docFreq=1268, maxDocs=44218)
                0.018446086 = queryNorm
              0.80449355 = fieldWeight in 5132, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.550903 = idf(docFreq=1268, maxDocs=44218)
                0.125 = fieldNorm(doc=5132)
        0.16 = coord(4/25)
    
  3. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 0.09
    0.08648845 = sum of:
      0.08648845 = product of:
        0.7207371 = sum of:
          0.08141038 = weight(abstract_txt:determining in 3092) [ClassicSimilarity], result of:
            0.08141038 = score(doc=3092,freq=1.0), product of:
              0.13385597 = queryWeight, product of:
                1.1185689 = boost
                6.487401 = idf(docFreq=182, maxDocs=44218)
                0.018446086 = queryNorm
              0.6081939 = fieldWeight in 3092, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.487401 = idf(docFreq=182, maxDocs=44218)
                0.09375 = fieldNorm(doc=3092)
          0.21729808 = weight(abstract_txt:automatic in 3092) [ClassicSimilarity], result of:
            0.21729808 = score(doc=3092,freq=3.0), product of:
              0.25756598 = queryWeight, product of:
                2.6875017 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.018446086 = queryNorm
              0.8436599 = fieldWeight in 3092, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.09375 = fieldNorm(doc=3092)
          0.42202863 = weight(abstract_txt:enrichment in 3092) [ClassicSimilarity], result of:
            0.42202863 = score(doc=3092,freq=1.0), product of:
              0.57825524 = queryWeight, product of:
                4.0268393 = boost
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.018446086 = queryNorm
              0.72983104 = fieldWeight in 3092, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7848644 = idf(docFreq=49, maxDocs=44218)
                0.09375 = fieldNorm(doc=3092)
        0.12 = coord(3/25)
    
  4. Rindflesch, T.C.; Fizsman, M.: The interaction of domain knowledge and linguistic structure in natural language processing : interpreting hypernymic propositions in biomedical text (2003) 0.08
    0.0750639 = sum of:
      0.0750639 = product of:
        0.31276625 = sum of:
          0.03877933 = weight(abstract_txt:contribution in 2097) [ClassicSimilarity], result of:
            0.03877933 = score(doc=2097,freq=1.0), product of:
              0.10698238 = queryWeight, product of:
                5.799733 = idf(docFreq=363, maxDocs=44218)
                0.018446086 = queryNorm
              0.36248332 = fieldWeight in 2097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.799733 = idf(docFreq=363, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.041954387 = weight(abstract_txt:valuable in 2097) [ClassicSimilarity], result of:
            0.041954387 = score(doc=2097,freq=1.0), product of:
              0.11274492 = queryWeight, product of:
                1.026579 = boost
                5.953884 = idf(docFreq=311, maxDocs=44218)
                0.018446086 = queryNorm
              0.37211776 = fieldWeight in 2097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.953884 = idf(docFreq=311, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.04853416 = weight(abstract_txt:supports in 2097) [ClassicSimilarity], result of:
            0.04853416 = score(doc=2097,freq=1.0), product of:
              0.124244474 = queryWeight, product of:
                1.0776616 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.018446086 = queryNorm
              0.39063436 = fieldWeight in 2097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.0639678 = weight(abstract_txt:error in 2097) [ClassicSimilarity], result of:
            0.0639678 = score(doc=2097,freq=1.0), product of:
              0.14935496 = queryWeight, product of:
                1.1815544 = boost
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.018446086 = queryNorm
              0.42829376 = fieldWeight in 2097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8527 = idf(docFreq=126, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.035892524 = weight(abstract_txt:evaluation in 2097) [ClassicSimilarity], result of:
            0.035892524 = score(doc=2097,freq=1.0), product of:
              0.12801418 = queryWeight, product of:
                1.5469915 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.018446086 = queryNorm
              0.2803793 = fieldWeight in 2097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
          0.08363807 = weight(abstract_txt:automatic in 2097) [ClassicSimilarity], result of:
            0.08363807 = score(doc=2097,freq=1.0), product of:
              0.25756598 = queryWeight, product of:
                2.6875017 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.018446086 = queryNorm
              0.32472485 = fieldWeight in 2097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=2097)
        0.24 = coord(6/25)
    
  5. Hobson, S.P.; Dorr, B.J.; Monz, C.; Schwartz, R.: Task-based evaluation of text summarization using Relevance Prediction (2007) 0.07
    0.0744304 = sum of:
      0.0744304 = product of:
        0.37215197 = sum of:
          0.039398123 = weight(abstract_txt:introduces in 938) [ClassicSimilarity], result of:
            0.039398123 = score(doc=938,freq=1.0), product of:
              0.10811744 = queryWeight, product of:
                1.0052909 = boost
                5.830419 = idf(docFreq=352, maxDocs=44218)
                0.018446086 = queryNorm
              0.3644012 = fieldWeight in 938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.830419 = idf(docFreq=352, maxDocs=44218)
                0.0625 = fieldNorm(doc=938)
          0.05077817 = weight(abstract_txt:corresponding in 938) [ClassicSimilarity], result of:
            0.05077817 = score(doc=938,freq=1.0), product of:
              0.12804523 = queryWeight, product of:
                1.0940208 = boost
                6.345029 = idf(docFreq=210, maxDocs=44218)
                0.018446086 = queryNorm
              0.3965643 = fieldWeight in 938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.345029 = idf(docFreq=210, maxDocs=44218)
                0.0625 = fieldNorm(doc=938)
          0.062167674 = weight(abstract_txt:evaluation in 938) [ClassicSimilarity], result of:
            0.062167674 = score(doc=938,freq=3.0), product of:
              0.12801418 = queryWeight, product of:
                1.5469915 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.018446086 = queryNorm
              0.48563117 = fieldWeight in 938, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.0625 = fieldNorm(doc=938)
          0.14486538 = weight(abstract_txt:automatic in 938) [ClassicSimilarity], result of:
            0.14486538 = score(doc=938,freq=3.0), product of:
              0.25756598 = queryWeight, product of:
                2.6875017 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.018446086 = queryNorm
              0.5624399 = fieldWeight in 938, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=938)
          0.07494263 = weight(abstract_txt:framework in 938) [ClassicSimilarity], result of:
            0.07494263 = score(doc=938,freq=1.0), product of:
              0.26348224 = queryWeight, product of:
                3.138698 = boost
                4.550903 = idf(docFreq=1268, maxDocs=44218)
                0.018446086 = queryNorm
              0.28443143 = fieldWeight in 938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.550903 = idf(docFreq=1268, maxDocs=44218)
                0.0625 = fieldNorm(doc=938)
        0.2 = coord(5/25)