Document (#43378)

Author
Wartena, C.
Golub, K.
Title
Evaluierung von Verschlagwortung im Kontext des Information Retrievals
Source
Qualität in der Inhaltserschließung. Ed.: M. Franke-Maier et al.
Imprint
München : DeGruyter-Saur
Year
2021
Pages
pp. 325-348
Series
Bibliotheks- und Informationspraxis; 70
Abstract
This chapter gives an overview of the possibilities, challenges, and limits discussed in the literature of using retrieval as an extrinsic evaluation method for the results of verbal subject indexing. Subject indexing in general, and keyword assignment in particular, can be evaluated intrinsically or extrinsically. Intrinsic evaluation looks at properties of the indexing that are presumed to be suitable indicators of its quality, such as formal uniformity (with regard to the number of descriptors assigned per document, granularity, etc.), consistency, or agreement between the results of different indexers. Extrinsic evaluation measures the quality of the chosen descriptors by how well they actually perform in search. Although extrinsic evaluation gives more direct information on whether the indexing fulfils its purpose, and should therefore be preferred, it is complicated and often problematic. In a retrieval system, various algorithms and data sources interlock in complex ways, and during evaluation they additionally interact with users and search tasks. A component of the system cannot be evaluated simply by swapping it out and comparing it with another component, because the same resource or the same algorithm can behave differently in different environments. We present and discuss relevant evaluation approaches and conclude with some recommendations for evaluating subject indexing in the context of retrieval.
Theme
Retrieval studies

Similar documents (author)

  1. Golub, K.: Automated subject classification of textual web documents (2006) 5.28
    5.277107 = sum of:
      5.277107 = weight(author_txt:golub in 600) [ClassicSimilarity], result of:
        5.277107 = fieldWeight in 600, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.443371 = idf(docFreq=25, maxDocs=44421)
          0.625 = fieldNorm(doc=600)
    
  2. Golub, K.: Automated subject classification of textual Web pages, based on a controlled vocabulary : challenges and recommendations (2006) 5.28
    5.277107 = sum of:
      5.277107 = weight(author_txt:golub in 897) [ClassicSimilarity], result of:
        5.277107 = fieldWeight in 897, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.443371 = idf(docFreq=25, maxDocs=44421)
          0.625 = fieldNorm(doc=897)
    
  3. Golub, K.: Subject access to information : an interdisciplinary approach (2015) 5.28
    5.277107 = sum of:
      5.277107 = weight(author_txt:golub in 1134) [ClassicSimilarity], result of:
        5.277107 = fieldWeight in 1134, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.443371 = idf(docFreq=25, maxDocs=44421)
          0.625 = fieldNorm(doc=1134)
    
  4. Golub, K.: Automated subject classification of textual documents in the context of Web-based hierarchical browsing (2011) 5.28
    5.277107 = sum of:
      5.277107 = weight(author_txt:golub in 558) [ClassicSimilarity], result of:
        5.277107 = fieldWeight in 558, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.443371 = idf(docFreq=25, maxDocs=44421)
          0.625 = fieldNorm(doc=558)
    
  5. Golub, K.: Subject access in Swedish discovery services (2018) 5.28
    5.277107 = sum of:
      5.277107 = weight(author_txt:golub in 379) [ClassicSimilarity], result of:
        5.277107 = fieldWeight in 379, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.443371 = idf(docFreq=25, maxDocs=44421)
          0.625 = fieldNorm(doc=379)
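
The author-match scores above can be checked by hand. The following is a minimal sketch, assuming the conventions that the explain output itself implies: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), and fieldWeight = tf * idf * fieldNorm. The function names are illustrative only and are not part of any Lucene API.

```python
import math

# Assumption: formulas inferred from the explain tree above
# (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))).
# Function names are hypothetical helpers, not Lucene calls.

def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)

def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as it appears in the explain output."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values taken from the author_txt:golub entries above.
score = field_weight(freq=1.0, doc_freq=25, max_docs=44421, field_norm=0.625)
print(round(score, 4))  # approximately 5.2771, matching the listed 5.277107
```

Small differences in the last digits are expected, because the listed values appear to be single-precision floats while Python computes in double precision.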
    

Similar documents (content)

  1. Nitzsche, J.: Inhaltserschließung von medizinischen Internetquellen und Multimediaprodukten (2001) 0.18
    0.18080895 = sum of:
      0.18080895 = product of:
        0.64574623 = sum of:
          0.021554044 = weight(abstract_txt:werden in 6674) [ClassicSimilarity], result of:
            0.021554044 = score(doc=6674,freq=3.0), product of:
              0.05676157 = queryWeight, product of:
                1.1612113 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.013935079 = queryNorm
              0.3797295 = fieldWeight in 6674, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.0625 = fieldNorm(doc=6674)
          0.01834662 = weight(abstract_txt:sich in 6674) [ClassicSimilarity], result of:
            0.01834662 = score(doc=6674,freq=2.0), product of:
              0.058358353 = queryWeight, product of:
                1.1774312 = boost
                3.5567884 = idf(docFreq=3444, maxDocs=44421)
                0.013935079 = queryNorm
              0.31437865 = fieldWeight in 6674, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5567884 = idf(docFreq=3444, maxDocs=44421)
                0.0625 = fieldNorm(doc=6674)
          0.029228555 = weight(abstract_txt:einer in 6674) [ClassicSimilarity], result of:
            0.029228555 = score(doc=6674,freq=3.0), product of:
              0.06954087 = queryWeight, product of:
                1.2852988 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.013935079 = queryNorm
              0.42030758 = fieldWeight in 6674, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.0625 = fieldNorm(doc=6674)
          0.045455586 = weight(abstract_txt:kontext in 6674) [ClassicSimilarity], result of:
            0.045455586 = score(doc=6674,freq=1.0), product of:
              0.117607966 = queryWeight, product of:
                1.3647616 = boost
                6.184015 = idf(docFreq=248, maxDocs=44421)
                0.013935079 = queryNorm
              0.38650092 = fieldWeight in 6674, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.184015 = idf(docFreq=248, maxDocs=44421)
                0.0625 = fieldNorm(doc=6674)
          0.022000961 = weight(abstract_txt:oder in 6674) [ClassicSimilarity], result of:
            0.022000961 = score(doc=6674,freq=1.0), product of:
              0.0829921 = queryWeight, product of:
                1.4041142 = boost
                4.241553 = idf(docFreq=1736, maxDocs=44421)
                0.013935079 = queryNorm
              0.26509705 = fieldWeight in 6674, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.241553 = idf(docFreq=1736, maxDocs=44421)
                0.0625 = fieldNorm(doc=6674)
          0.17812994 = weight(abstract_txt:erschließung in 6674) [ClassicSimilarity], result of:
            0.17812994 = score(doc=6674,freq=5.0), product of:
              0.21538882 = queryWeight, product of:
                2.6119509 = boost
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.013935079 = queryNorm
              0.8270157 = fieldWeight in 6674, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.0625 = fieldNorm(doc=6674)
          0.3310305 = weight(abstract_txt:evaluierung in 6674) [ClassicSimilarity], result of:
            0.3310305 = score(doc=6674,freq=1.0), product of:
              0.67088264 = queryWeight, product of:
                6.0981164 = boost
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.013935079 = queryNorm
              0.4934253 = fieldWeight in 6674, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.0625 = fieldNorm(doc=6674)
        0.28 = coord(7/25)
    
  2. Wolff, C.: Effektivität von Recherchen im WWW : Vergleichende Evaluierung von Such- und Metasuchmaschinen (2000) 0.17
    0.17020187 = sum of:
      0.17020187 = product of:
        0.8510093 = sum of:
          0.026398202 = weight(abstract_txt:werden in 6463) [ClassicSimilarity], result of:
            0.026398202 = score(doc=6463,freq=2.0), product of:
              0.05676157 = queryWeight, product of:
                1.1612113 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.013935079 = queryNorm
              0.46507174 = fieldWeight in 6463, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.09375 = fieldNorm(doc=6463)
          0.019459529 = weight(abstract_txt:sich in 6463) [ClassicSimilarity], result of:
            0.019459529 = score(doc=6463,freq=1.0), product of:
              0.058358353 = queryWeight, product of:
                1.1774312 = boost
                3.5567884 = idf(docFreq=3444, maxDocs=44421)
                0.013935079 = queryNorm
              0.33344892 = fieldWeight in 6463, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5567884 = idf(docFreq=3444, maxDocs=44421)
                0.09375 = fieldNorm(doc=6463)
          0.06713241 = weight(abstract_txt:ergebnisse in 6463) [ClassicSimilarity], result of:
            0.06713241 = score(doc=6463,freq=2.0), product of:
              0.09238382 = queryWeight, product of:
                1.209585 = boost
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.013935079 = queryNorm
              0.72666854 = fieldWeight in 6463, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.09375 = fieldNorm(doc=6463)
          0.035797525 = weight(abstract_txt:einer in 6463) [ClassicSimilarity], result of:
            0.035797525 = score(doc=6463,freq=2.0), product of:
              0.06954087 = queryWeight, product of:
                1.2852988 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.013935079 = queryNorm
              0.51476955 = fieldWeight in 6463, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.09375 = fieldNorm(doc=6463)
          0.70222163 = weight(abstract_txt:evaluierung in 6463) [ClassicSimilarity], result of:
            0.70222163 = score(doc=6463,freq=2.0), product of:
              0.67088264 = queryWeight, product of:
                6.0981164 = boost
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.013935079 = queryNorm
              1.0467131 = fieldWeight in 6463, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.09375 = fieldNorm(doc=6463)
        0.2 = coord(5/25)
    
  3. Herb, U.: Relevanz von Impact-Maßen für Open Access (2013) 0.15
    0.154659 = sum of:
      0.154659 = product of:
        0.9666188 = sum of:
          0.01866635 = weight(abstract_txt:werden in 1926) [ClassicSimilarity], result of:
            0.01866635 = score(doc=1926,freq=1.0), product of:
              0.05676157 = queryWeight, product of:
                1.1612113 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.013935079 = queryNorm
              0.3288554 = fieldWeight in 1926, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.09375 = fieldNorm(doc=1926)
          0.019459529 = weight(abstract_txt:sich in 1926) [ClassicSimilarity], result of:
            0.019459529 = score(doc=1926,freq=1.0), product of:
              0.058358353 = queryWeight, product of:
                1.1774312 = boost
                3.5567884 = idf(docFreq=3444, maxDocs=44421)
                0.013935079 = queryNorm
              0.33344892 = fieldWeight in 1926, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5567884 = idf(docFreq=3444, maxDocs=44421)
                0.09375 = fieldNorm(doc=1926)
          0.06845047 = weight(abstract_txt:qualität in 1926) [ClassicSimilarity], result of:
            0.06845047 = score(doc=1926,freq=1.0), product of:
              0.1179149 = queryWeight, product of:
                1.3665413 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.013935079 = queryNorm
              0.5805074 = fieldWeight in 1926, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.09375 = fieldNorm(doc=1926)
          0.8600424 = weight(abstract_txt:evaluierung in 1926) [ClassicSimilarity], result of:
            0.8600424 = score(doc=1926,freq=3.0), product of:
              0.67088264 = queryWeight, product of:
                6.0981164 = boost
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.013935079 = queryNorm
              1.2819566 = fieldWeight in 1926, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.09375 = fieldNorm(doc=1926)
        0.16 = coord(4/25)
    
  4. Hänger, C.; Krätzsch, C.; Niemann, C.: Was vom Tagging übrig blieb : Erkenntnisse und Einsichten aus zwei Jahren Projektarbeit (2011) 0.13
    0.13046713 = sum of:
      0.13046713 = product of:
        0.4077098 = sum of:
          0.015398951 = weight(abstract_txt:werden in 519) [ClassicSimilarity], result of:
            0.015398951 = score(doc=519,freq=2.0), product of:
              0.05676157 = queryWeight, product of:
                1.1612113 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.013935079 = queryNorm
              0.27129185 = fieldWeight in 519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.0546875 = fieldNorm(doc=519)
          0.016053291 = weight(abstract_txt:sich in 519) [ClassicSimilarity], result of:
            0.016053291 = score(doc=519,freq=2.0), product of:
              0.058358353 = queryWeight, product of:
                1.1774312 = boost
                3.5567884 = idf(docFreq=3444, maxDocs=44421)
                0.013935079 = queryNorm
              0.2750813 = fieldWeight in 519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5567884 = idf(docFreq=3444, maxDocs=44421)
                0.0546875 = fieldNorm(doc=519)
          0.027690709 = weight(abstract_txt:ergebnisse in 519) [ClassicSimilarity], result of:
            0.027690709 = score(doc=519,freq=1.0), product of:
              0.09238382 = queryWeight, product of:
                1.209585 = boost
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.013935079 = queryNorm
              0.2997355 = fieldWeight in 519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.0546875 = fieldNorm(doc=519)
          0.0147657255 = weight(abstract_txt:einer in 519) [ClassicSimilarity], result of:
            0.0147657255 = score(doc=519,freq=1.0), product of:
              0.06954087 = queryWeight, product of:
                1.2852988 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.013935079 = queryNorm
              0.21233161 = fieldWeight in 519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.0546875 = fieldNorm(doc=519)
          0.06915982 = weight(abstract_txt:qualität in 519) [ClassicSimilarity], result of:
            0.06915982 = score(doc=519,freq=3.0), product of:
              0.1179149 = queryWeight, product of:
                1.3665413 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.013935079 = queryNorm
              0.5865232 = fieldWeight in 519, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0546875 = fieldNorm(doc=519)
          0.033343434 = weight(abstract_txt:oder in 519) [ClassicSimilarity], result of:
            0.033343434 = score(doc=519,freq=3.0), product of:
              0.0829921 = queryWeight, product of:
                1.4041142 = boost
                4.241553 = idf(docFreq=1736, maxDocs=44421)
                0.013935079 = queryNorm
              0.40176636 = fieldWeight in 519, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.241553 = idf(docFreq=1736, maxDocs=44421)
                0.0546875 = fieldNorm(doc=519)
          0.09188913 = weight(abstract_txt:gleiche in 519) [ClassicSimilarity], result of:
            0.09188913 = score(doc=519,freq=1.0), product of:
              0.20553349 = queryWeight, product of:
                1.8041794 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.013935079 = queryNorm
              0.44707617 = fieldWeight in 519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.0546875 = fieldNorm(doc=519)
          0.13940874 = weight(abstract_txt:erschließung in 519) [ClassicSimilarity], result of:
            0.13940874 = score(doc=519,freq=4.0), product of:
              0.21538882 = queryWeight, product of:
                2.6119509 = boost
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.013935079 = queryNorm
              0.6472422 = fieldWeight in 519, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.0546875 = fieldNorm(doc=519)
        0.32 = coord(8/25)
    
  5. Behnert, C.; Plassmeier, K.; Borst, T.; Lewandowski, D.: Evaluierung von Rankingverfahren für bibliothekarische Informationssysteme (2019) 0.13
    0.12866484 = sum of:
      0.12866484 = product of:
        0.8041553 = sum of:
          0.01866635 = weight(abstract_txt:werden in 23) [ClassicSimilarity], result of:
            0.01866635 = score(doc=23,freq=1.0), product of:
              0.05676157 = queryWeight, product of:
                1.1612113 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.013935079 = queryNorm
              0.3288554 = fieldWeight in 23, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.09375 = fieldNorm(doc=23)
          0.047469787 = weight(abstract_txt:ergebnisse in 23) [ClassicSimilarity], result of:
            0.047469787 = score(doc=23,freq=1.0), product of:
              0.09238382 = queryWeight, product of:
                1.209585 = boost
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.013935079 = queryNorm
              0.5138323 = fieldWeight in 23, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.09375 = fieldNorm(doc=23)
          0.035797525 = weight(abstract_txt:einer in 23) [ClassicSimilarity], result of:
            0.035797525 = score(doc=23,freq=2.0), product of:
              0.06954087 = queryWeight, product of:
                1.2852988 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.013935079 = queryNorm
              0.51476955 = fieldWeight in 23, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.09375 = fieldNorm(doc=23)
          0.70222163 = weight(abstract_txt:evaluierung in 23) [ClassicSimilarity], result of:
            0.70222163 = score(doc=23,freq=2.0), product of:
              0.67088264 = queryWeight, product of:
                6.0981164 = boost
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.013935079 = queryNorm
              1.0467131 = fieldWeight in 23, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.09375 = fieldNorm(doc=23)
        0.16 = coord(4/25)
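
The content-similarity scores combine several per-term scores and a coordination factor. As a hedged sketch, the total for entry 5 (doc=23) can be recomputed from the numbers in its explain tree, assuming each term contributes queryWeight * fieldWeight, with queryWeight = boost * idf * queryNorm and fieldWeight = sqrt(freq) * idf * fieldNorm, and that the sum is multiplied by coord = matched terms / query terms. The helper names are hypothetical and chosen for this illustration.

```python
import math

MAX_DOCS = 44421          # maxDocs from the explain output
QUERY_NORM = 0.013935079  # queryNorm from the explain output

def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    # Assumption: idf = 1 + ln(maxDocs / (docFreq + 1)), inferred from the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    # fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)
    fld_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * fld_weight

# (freq, docFreq, boost) for the four matched terms of doc=23:
# werden, ergebnisse, einer, evaluierung
matched_terms = [
    (1.0, 3617, 1.1612113),
    (1.0, 502, 1.209585),
    (2.0, 2486, 1.2852988),
    (2.0, 44, 6.0981164),
]
FIELD_NORM = 0.09375

raw_sum = sum(term_score(f, df, b, FIELD_NORM) for f, df, b in matched_terms)
coord = 4 / 25            # 4 of the 25 query terms matched
total = raw_sum * coord
print(round(raw_sum, 3), round(total, 3))
```

The result should agree with the listed 0.8041553 (sum) and 0.12866484 (final score) up to floating-point rounding. The coord factor explains why entry 4, which matches 8 of 25 terms, can rank close to entries with much higher per-term sums.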