Document (#38588)

Author
Stiller, J.
Olensky, M.
Petras, V.
Title
¬A framework for the evaluation of automatic metadata enrichments
Source
Metadata and semantics research: 8th Research Conference, MTSR 2014, Karlsruhe, Germany, November 27-29, 2014, Proceedings. Eds.: S. Closs et al.
Imprint
Cham : Springer
Year
2014
Pages
pp. 238-249
Series
Communications in computer and information science; 478
Abstract
Automatic enrichment of collections connects data to vocabularies, which supports the contextualization of content and adds searchable text to metadata. The paper introduces a framework of four dimensions (frequency, coverage, relevance and error rate) that measure both the suitability of the enrichment for the object and the enrichments' contribution to search success. To verify the framework, it is applied to the evaluation of automatic enrichments in the digital library Europeana. The analysis of 100 result sets and their corresponding queries (1,121 documents total) shows the framework is a valuable tool for guiding enrichments and determining the value of enrichment efforts.
Theme
Metadata
Object
Europeana

Similar documents (author)

  1. Gradmann, S.; Olensky, M.: Semantische Kontextualisierung von Museumsbeständen in Europeana (2013) 1.91
    1.912314 = sum of:
      1.912314 = product of:
        3.824628 = sum of:
          3.824628 = weight(author_txt:olensky in 1939) [ClassicSimilarity], result of:
            3.824628 = score(doc=1939,freq=1.0), product of:
              0.77189523 = queryWeight, product of:
                1.1018846 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.07069056 = queryNorm
              4.954854 = fieldWeight in 1939, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.5 = fieldNorm(doc=1939)
        0.5 = coord(1/2)
    
  2. Petras, V.: Heterogenitätsbehandlung und Terminology Mapping durch Crosskonkordanzen : eine Fallstudie (2010) 1.79
    1.7867383 = sum of:
      1.7867383 = product of:
        3.5734766 = sum of:
          3.5734766 = weight(author_txt:petras in 717) [ClassicSimilarity], result of:
            3.5734766 = score(doc=717,freq=1.0), product of:
              0.6357497 = queryWeight, product of:
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.07069056 = queryNorm
              5.620886 = fieldWeight in 717, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.625 = fieldNorm(doc=717)
        0.5 = coord(1/2)
    
  3. Petras, V.: ¬The identity of information science (2023) 1.79
    1.7867383 = sum of:
      1.7867383 = product of:
        3.5734766 = sum of:
          3.5734766 = weight(author_txt:petras in 2079) [ClassicSimilarity], result of:
            3.5734766 = score(doc=2079,freq=1.0), product of:
              0.6357497 = queryWeight, product of:
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.07069056 = queryNorm
              5.620886 = fieldWeight in 2079, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.625 = fieldNorm(doc=2079)
        0.5 = coord(1/2)
    
  4. Olensky, M.; Schmidt, M.; Eck, N.J. van: Evaluation of the citation matching algorithms of CWTS and iFQ in comparison to the Web of science (2016) 1.43
    1.4342356 = sum of:
      1.4342356 = product of:
        2.8684711 = sum of:
          2.8684711 = weight(author_txt:olensky in 4130) [ClassicSimilarity], result of:
            2.8684711 = score(doc=4130,freq=1.0), product of:
              0.77189523 = queryWeight, product of:
                1.1018846 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.07069056 = queryNorm
              3.7161405 = fieldWeight in 4130, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.375 = fieldNorm(doc=4130)
        0.5 = coord(1/2)
    
  5. Petras, V.; Bank, M.: Vergleich der Suchmaschinen AltaVista und HotBot bezüglich Treffermengen und Aktualität (1998) 1.43
    1.4293907 = sum of:
      1.4293907 = product of:
        2.8587813 = sum of:
          2.8587813 = weight(author_txt:petras in 3514) [ClassicSimilarity], result of:
            2.8587813 = score(doc=3514,freq=1.0), product of:
              0.6357497 = queryWeight, product of:
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.07069056 = queryNorm
              4.496709 = fieldWeight in 3514, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.5 = fieldNorm(doc=3514)
        0.5 = coord(1/2)
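
The score trees above follow the classic TF-IDF pattern: a term's score is queryWeight times fieldWeight, and the document score multiplies the sum by the coord factor. The following is a minimal sketch in Python that recomputes the first entry (doc=1939) from the values listed there. The function names are hypothetical, and the idf formula (1 + ln(maxDocs / (docFreq + 1))) and tf formula (square root of freq) are assumptions that happen to reproduce the listed figures; this is not the search engine's own code.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Assumed idf form; it reproduces idf(docFreq=5, maxDocs=44421) = 9.909708.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_term_score(freq: float, doc_freq: int, max_docs: int,
                       query_norm: float, field_norm: float,
                       boost: float = 1.0) -> float:
    idf = classic_idf(doc_freq, max_docs)
    tf = math.sqrt(freq)                      # tf(freq=1.0) = 1.0
    query_weight = boost * idf * query_norm   # "queryWeight, product of:"
    field_weight = tf * idf * field_norm      # "fieldWeight in ..., product of:"
    return query_weight * field_weight


# Values copied from the explanation for author_txt:olensky in doc 1939.
term = classic_term_score(freq=1.0, doc_freq=5, max_docs=44421,
                          query_norm=0.07069056, field_norm=0.5,
                          boost=1.1018846)

# coord(1/2): one of the two query clauses matched.
total = term * 0.5
print(term, total)
```

Up to floating-point rounding, `term` should agree with the listed 3.824628 and `total` with the listed 1.912314. Entries without a boost line correspond to `boost=1.0`.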
    

Similar documents (content)

  1. Chen, S.-J.: Semantic enrichment of linked archival materials (2019) 0.10
    0.09676511 = sum of:
      0.09676511 = product of:
        0.6047819 = sum of:
          0.041144684 = weight(abstract_txt:vocabularies in 488) [ClassicSimilarity], result of:
            0.041144684 = score(doc=488,freq=1.0), product of:
              0.11130393 = queryWeight, product of:
                1.0213964 = boost
                5.9145703 = idf(docFreq=325, maxDocs=44421)
                0.018424384 = queryNorm
              0.36966065 = fieldWeight in 488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9145703 = idf(docFreq=325, maxDocs=44421)
                0.0625 = fieldNorm(doc=488)
          0.099551894 = weight(abstract_txt:europeana in 488) [ClassicSimilarity], result of:
            0.099551894 = score(doc=488,freq=1.0), product of:
              0.2006016 = queryWeight, product of:
                1.3712173 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.018424384 = queryNorm
              0.49626672 = fieldWeight in 488, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.0625 = fieldNorm(doc=488)
          0.06533617 = weight(abstract_txt:metadata in 488) [ClassicSimilarity], result of:
            0.06533617 = score(doc=488,freq=2.0), product of:
              0.15149692 = queryWeight, product of:
                1.6852175 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.018424384 = queryNorm
              0.4312706 = fieldWeight in 488, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.0625 = fieldNorm(doc=488)
          0.39874917 = weight(abstract_txt:enrichment in 488) [ClassicSimilarity], result of:
            0.39874917 = score(doc=488,freq=2.0), product of:
              0.5791597 = queryWeight, product of:
                4.035514 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.018424384 = queryNorm
              0.6884961 = fieldWeight in 488, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.0625 = fieldNorm(doc=488)
        0.16 = coord(4/25)
    
  2. Nicholls, P.; Ridley, J.: ¬A context for evaluating multimedia (1996) 0.09
    0.09038268 = sum of:
      0.09038268 = product of:
        0.56489176 = sum of:
          0.07866935 = weight(abstract_txt:introduces in 5200) [ClassicSimilarity], result of:
            0.07866935 = score(doc=5200,freq=1.0), product of:
              0.10801525 = queryWeight, product of:
                1.0061938 = boost
                5.8265367 = idf(docFreq=355, maxDocs=44421)
                0.018424384 = queryNorm
              0.7283171 = fieldWeight in 5200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8265367 = idf(docFreq=355, maxDocs=44421)
                0.125 = fieldNorm(doc=5200)
          0.17394078 = weight(abstract_txt:adds in 5200) [ClassicSimilarity], result of:
            0.17394078 = score(doc=5200,freq=1.0), product of:
              0.18332244 = queryWeight, product of:
                1.3108315 = boost
                7.590594 = idf(docFreq=60, maxDocs=44421)
                0.018424384 = queryNorm
              0.9488242 = fieldWeight in 5200, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.590594 = idf(docFreq=60, maxDocs=44421)
                0.125 = fieldNorm(doc=5200)
          0.101122096 = weight(abstract_txt:evaluation in 5200) [ClassicSimilarity], result of:
            0.101122096 = score(doc=5200,freq=2.0), product of:
              0.12769642 = queryWeight, product of:
                1.5471892 = boost
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.018424384 = queryNorm
              0.7918945 = fieldWeight in 5200, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.125 = fieldNorm(doc=5200)
          0.21115951 = weight(abstract_txt:framework in 5200) [ClassicSimilarity], result of:
            0.21115951 = score(doc=5200,freq=2.0), product of:
              0.26284423 = queryWeight, product of:
                3.139195 = boost
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.018424384 = queryNorm
              0.8033637 = fieldWeight in 5200, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.125 = fieldNorm(doc=5200)
        0.16 = coord(4/25)
    
  3. Milstead, J.L.: Methodologies for subject analysis in bibliographic databases (1992) 0.09
    0.08663337 = sum of:
      0.08663337 = product of:
        0.72194475 = sum of:
          0.08161448 = weight(abstract_txt:determining in 3091) [ClassicSimilarity], result of:
            0.08161448 = score(doc=3091,freq=1.0), product of:
              0.13409688 = queryWeight, product of:
                1.1211104 = boost
                6.4919815 = idf(docFreq=182, maxDocs=44421)
                0.018424384 = queryNorm
              0.60862327 = fieldWeight in 3091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4919815 = idf(docFreq=182, maxDocs=44421)
                0.09375 = fieldNorm(doc=3091)
          0.21739289 = weight(abstract_txt:automatic in 3091) [ClassicSimilarity], result of:
            0.21739289 = score(doc=3091,freq=3.0), product of:
              0.2576741 = queryWeight, product of:
                2.6917522 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.018424384 = queryNorm
              0.8436738 = fieldWeight in 3091, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.09375 = fieldNorm(doc=3091)
          0.4229374 = weight(abstract_txt:enrichment in 3091) [ClassicSimilarity], result of:
            0.4229374 = score(doc=3091,freq=1.0), product of:
              0.5791597 = queryWeight, product of:
                4.035514 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.018424384 = queryNorm
              0.73026043 = fieldWeight in 3091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.09375 = fieldNorm(doc=3091)
        0.12 = coord(3/25)
    
  4. Rindflesch, T.C.; Fiszman, M.: The interaction of domain knowledge and linguistic structure in natural language processing : interpreting hypernymic propositions in biomedical text (2003) 0.08
    0.07500433 = sum of:
      0.07500433 = product of:
        0.31251806 = sum of:
          0.038612753 = weight(abstract_txt:contribution in 3097) [ClassicSimilarity], result of:
            0.038612753 = score(doc=3097,freq=1.0), product of:
              0.10668954 = queryWeight, product of:
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.018424384 = queryNorm
              0.36191693 = fieldWeight in 3097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.790671 = idf(docFreq=368, maxDocs=44421)
                0.0625 = fieldNorm(doc=3097)
          0.04179831 = weight(abstract_txt:valuable in 3097) [ClassicSimilarity], result of:
            0.04179831 = score(doc=3097,freq=1.0), product of:
              0.11247962 = queryWeight, product of:
                1.0267767 = boost
                5.9457254 = idf(docFreq=315, maxDocs=44421)
                0.018424384 = queryNorm
              0.37160784 = fieldWeight in 3097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9457254 = idf(docFreq=315, maxDocs=44421)
                0.0625 = fieldNorm(doc=3097)
          0.04855944 = weight(abstract_txt:supports in 3097) [ClassicSimilarity], result of:
            0.04855944 = score(doc=3097,freq=1.0), product of:
              0.124303624 = queryWeight, product of:
                1.0793964 = boost
                6.250429 = idf(docFreq=232, maxDocs=44421)
                0.018424384 = queryNorm
              0.39065182 = fieldWeight in 3097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.250429 = idf(docFreq=232, maxDocs=44421)
                0.0625 = fieldNorm(doc=3097)
          0.06412095 = weight(abstract_txt:error in 3097) [ClassicSimilarity], result of:
            0.06412095 = score(doc=3097,freq=1.0), product of:
              0.14961253 = queryWeight, product of:
                1.1841946 = boost
                6.8572807 = idf(docFreq=126, maxDocs=44421)
                0.018424384 = queryNorm
              0.42858005 = fieldWeight in 3097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8572807 = idf(docFreq=126, maxDocs=44421)
                0.0625 = fieldNorm(doc=3097)
          0.03575206 = weight(abstract_txt:evaluation in 3097) [ClassicSimilarity], result of:
            0.03575206 = score(doc=3097,freq=1.0), product of:
              0.12769642 = queryWeight, product of:
                1.5471892 = boost
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.018424384 = queryNorm
              0.279977 = fieldWeight in 3097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.0625 = fieldNorm(doc=3097)
          0.083674565 = weight(abstract_txt:automatic in 3097) [ClassicSimilarity], result of:
            0.083674565 = score(doc=3097,freq=1.0), product of:
              0.2576741 = queryWeight, product of:
                2.6917522 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.018424384 = queryNorm
              0.32473022 = fieldWeight in 3097, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=3097)
        0.24 = coord(6/25)
    
  5. Hobson, S.P.; Dorr, B.J.; Monz, C.; Schwartz, R.: Task-based evaluation of text summarization using Relevance Prediction (2007) 0.07
    0.07428258 = sum of:
      0.07428258 = product of:
        0.3714129 = sum of:
          0.039334673 = weight(abstract_txt:introduces in 1938) [ClassicSimilarity], result of:
            0.039334673 = score(doc=1938,freq=1.0), product of:
              0.10801525 = queryWeight, product of:
                1.0061938 = boost
                5.8265367 = idf(docFreq=355, maxDocs=44421)
                0.018424384 = queryNorm
              0.36415854 = fieldWeight in 1938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8265367 = idf(docFreq=355, maxDocs=44421)
                0.0625 = fieldNorm(doc=1938)
          0.050569084 = weight(abstract_txt:corresponding in 1938) [ClassicSimilarity], result of:
            0.050569084 = score(doc=1938,freq=1.0), product of:
              0.12770995 = queryWeight, product of:
                1.0940859 = boost
                6.3354917 = idf(docFreq=213, maxDocs=44421)
                0.018424384 = queryNorm
              0.39596823 = fieldWeight in 1938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3354917 = idf(docFreq=213, maxDocs=44421)
                0.0625 = fieldNorm(doc=1938)
          0.061924383 = weight(abstract_txt:evaluation in 1938) [ClassicSimilarity], result of:
            0.061924383 = score(doc=1938,freq=3.0), product of:
              0.12769642 = queryWeight, product of:
                1.5471892 = boost
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.018424384 = queryNorm
              0.48493436 = fieldWeight in 1938, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.0625 = fieldNorm(doc=1938)
          0.14492859 = weight(abstract_txt:automatic in 1938) [ClassicSimilarity], result of:
            0.14492859 = score(doc=1938,freq=3.0), product of:
              0.2576741 = queryWeight, product of:
                2.6917522 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.018424384 = queryNorm
              0.5624492 = fieldWeight in 1938, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=1938)
          0.074656166 = weight(abstract_txt:framework in 1938) [ClassicSimilarity], result of:
            0.074656166 = score(doc=1938,freq=1.0), product of:
              0.26284423 = queryWeight, product of:
                3.139195 = boost
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.018424384 = queryNorm
              0.28403196 = fieldWeight in 1938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.0625 = fieldNorm(doc=1938)
        0.2 = coord(5/25)
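
For the content-based matches, each explanation sums the scores of the matched abstract terms and scales the sum by coord(matched/25), where 25 appears to be the number of query terms. A minimal sketch, using the four term scores listed for the first content match (doc=488); the helper name `coord_score` is hypothetical and the per-term values are copied from the listing rather than recomputed:

```python
def coord_score(term_scores: list[float], total_query_terms: int) -> float:
    # coord = matched terms / total query terms, e.g. coord(4/25) = 0.16
    coord = len(term_scores) / total_query_terms
    return sum(term_scores) * coord


# Term scores for doc 488: vocabularies, europeana, metadata, enrichment.
chen_terms = [0.041144684, 0.099551894, 0.06533617, 0.39874917]
chen_score = coord_score(chen_terms, 25)
print(chen_score)
```

Up to rounding, the result should agree with the listed 0.09676511. The coord factor explains why a document matching six weak terms (coord 0.24) can still rank below one matching four strong terms (coord 0.16).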