Document (#43372)

Author
Voß, J.
Title
Datenqualität als Grundlage qualitativer Inhaltserschließung
Source
Qualität in der Inhaltserschließung. Ed. by M. Franke-Maier et al.
Imprint
München : DeGruyter-Saur
Year
2021
Pages
pp. 167-176
Series
Bibliotheks- und Informationspraxis; 70
Abstract
Since the beginning of the 21st century at the latest, subject indexing of documents has taken place almost exclusively in digital form. This applies both to professional subject indexing by libraries and other documentation institutions and to the many forms of content description in databases, from product descriptions in online retail to social tagging. Even where analog origins exist, for example handwritten notes or retroconverted finding aids, the subject indexing ultimately exists as data. There are, however, many different ways in which these data can be concretely shaped. This contribution gives an overview of how different data processing practices affect the quality of subject indexing and how the quality of indexing data can be assessed. The focus is therefore not on the content of indexing data but on their form. The form of data is no mere technical detail but highly relevant: subject indexing of excellent content is useless if the indexing data cannot be used because of incompatible data formats. To assess the quality of subject indexing, it is therefore necessary to be clear about how and in what form the indexing data are processed.

Similar documents (content)

  1. Mödden, E.: Maschinelle Beschlagwortung mit Algorithmen : Ein Blick in die Werkstatt des KI-Projektes der Deutschen Nationalbibliothek (2024) 0.21
    0.20665364 = sum of:
      0.20665364 = product of:
        1.0332682 = sum of:
          0.068832 = weight(abstract_txt:erschließung in 1051) [ClassicSimilarity], result of:
            0.068832 = score(doc=1051,freq=1.0), product of:
              0.12407103 = queryWeight, product of:
                1.4528741 = boost
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.014430909 = queryNorm
              0.554779 = fieldWeight in 1051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.09375 = fieldNorm(doc=1051)
          0.070494086 = weight(abstract_txt:liegt in 1051) [ClassicSimilarity], result of:
            0.070494086 = score(doc=1051,freq=1.0), product of:
              0.12606037 = queryWeight, product of:
                1.4644753 = boost
                5.9648952 = idf(docFreq=309, maxDocs=44421)
                0.014430909 = queryNorm
              0.5592089 = fieldWeight in 1051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9648952 = idf(docFreq=309, maxDocs=44421)
                0.09375 = fieldNorm(doc=1051)
          0.07885943 = weight(abstract_txt:qualität in 1051) [ClassicSimilarity], result of:
            0.07885943 = score(doc=1051,freq=1.0), product of:
              0.1358457 = queryWeight, product of:
                1.5202525 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.014430909 = queryNorm
              0.5805074 = fieldWeight in 1051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.09375 = fieldNorm(doc=1051)
          0.24475184 = weight(abstract_txt:inhaltserschließung in 1051) [ClassicSimilarity], result of:
            0.24475184 = score(doc=1051,freq=1.0), product of:
              0.36416996 = queryWeight, product of:
                3.520139 = boost
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.014430909 = queryNorm
              0.67208135 = fieldWeight in 1051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.09375 = fieldNorm(doc=1051)
          0.5703308 = weight(abstract_txt:erschließungsdaten in 1051) [ClassicSimilarity], result of:
            0.5703308 = score(doc=1051,freq=1.0), product of:
              0.6400855 = queryWeight, product of:
                4.6668816 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.014430909 = queryNorm
              0.8910228 = fieldWeight in 1051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.09375 = fieldNorm(doc=1051)
        0.2 = coord(5/25)
    
  2. Scheven, E.: Effiziente Sacherschließung in schwierigen Zeiten : Gedanken zur Zukunft der SWD (2005) 0.14
    0.13701391 = sum of:
      0.13701391 = product of:
        0.4281685 = sum of:
          0.014492648 = weight(abstract_txt:sondern in 4555) [ClassicSimilarity], result of:
            0.014492648 = score(doc=4555,freq=1.0), product of:
              0.09133897 = queryWeight, product of:
                1.2465819 = boost
                5.0774026 = idf(docFreq=752, maxDocs=44421)
                0.014430909 = queryNorm
              0.15866883 = fieldWeight in 4555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0774026 = idf(docFreq=752, maxDocs=44421)
                0.03125 = fieldNorm(doc=4555)
          0.010137487 = weight(abstract_txt:werden in 4555) [ClassicSimilarity], result of:
            0.010137487 = score(doc=4555,freq=2.0), product of:
              0.06539305 = queryWeight, product of:
                1.2918265 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.014430909 = queryNorm
              0.15502392 = fieldWeight in 4555, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.03125 = fieldNorm(doc=4555)
          0.032447718 = weight(abstract_txt:erschließung in 4555) [ClassicSimilarity], result of:
            0.032447718 = score(doc=4555,freq=2.0), product of:
              0.12407103 = queryWeight, product of:
                1.4528741 = boost
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.014430909 = queryNorm
              0.26152533 = fieldWeight in 4555, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.03125 = fieldNorm(doc=4555)
          0.03323123 = weight(abstract_txt:liegt in 4555) [ClassicSimilarity], result of:
            0.03323123 = score(doc=4555,freq=2.0), product of:
              0.12606037 = queryWeight, product of:
                1.4644753 = boost
                5.9648952 = idf(docFreq=309, maxDocs=44421)
                0.014430909 = queryNorm
              0.2636136 = fieldWeight in 4555, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9648952 = idf(docFreq=309, maxDocs=44421)
                0.03125 = fieldNorm(doc=4555)
          0.037174694 = weight(abstract_txt:qualität in 4555) [ClassicSimilarity], result of:
            0.037174694 = score(doc=4555,freq=2.0), product of:
              0.1358457 = queryWeight, product of:
                1.5202525 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.014430909 = queryNorm
              0.2736538 = fieldWeight in 4555, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.03125 = fieldNorm(doc=4555)
          0.034005918 = weight(abstract_txt:daten in 4555) [ClassicSimilarity], result of:
            0.034005918 = score(doc=4555,freq=2.0), product of:
              0.14653714 = queryWeight, product of:
                1.9338031 = boost
                5.250997 = idf(docFreq=632, maxDocs=44421)
                0.014430909 = queryNorm
              0.23206347 = fieldWeight in 4555, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.250997 = idf(docFreq=632, maxDocs=44421)
                0.03125 = fieldNorm(doc=4555)
          0.021926975 = weight(abstract_txt:form in 4555) [ClassicSimilarity], result of:
            0.021926975 = score(doc=4555,freq=1.0), product of:
              0.1633767 = queryWeight, product of:
                2.6360755 = boost
                4.294757 = idf(docFreq=1646, maxDocs=44421)
                0.014430909 = queryNorm
              0.13421115 = fieldWeight in 4555, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.294757 = idf(docFreq=1646, maxDocs=44421)
                0.03125 = fieldNorm(doc=4555)
          0.24475184 = weight(abstract_txt:inhaltserschließung in 4555) [ClassicSimilarity], result of:
            0.24475184 = score(doc=4555,freq=9.0), product of:
              0.36416996 = queryWeight, product of:
                3.520139 = boost
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.014430909 = queryNorm
              0.67208135 = fieldWeight in 4555, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.03125 = fieldNorm(doc=4555)
        0.32 = coord(8/25)
    
  3. Franke-Maier, M.: Anforderungen an die Qualität der Inhaltserschließung im Spannungsfeld von intellektuell und automatisch erzeugten Metadaten (2018) 0.10
    0.09860236 = sum of:
      0.09860236 = product of:
        0.61626476 = sum of:
          0.08977729 = weight(abstract_txt:spätestens in 344) [ClassicSimilarity], result of:
            0.08977729 = score(doc=344,freq=1.0), product of:
              0.11755591 = queryWeight, product of:
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.014430909 = queryNorm
              0.7636987 = fieldWeight in 344, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.09375 = fieldNorm(doc=344)
          0.068832 = weight(abstract_txt:erschließung in 344) [ClassicSimilarity], result of:
            0.068832 = score(doc=344,freq=1.0), product of:
              0.12407103 = queryWeight, product of:
                1.4528741 = boost
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.014430909 = queryNorm
              0.554779 = fieldWeight in 344, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.09375 = fieldNorm(doc=344)
          0.11152408 = weight(abstract_txt:qualität in 344) [ClassicSimilarity], result of:
            0.11152408 = score(doc=344,freq=2.0), product of:
              0.1358457 = queryWeight, product of:
                1.5202525 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.014430909 = queryNorm
              0.8209614 = fieldWeight in 344, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.09375 = fieldNorm(doc=344)
          0.34613138 = weight(abstract_txt:inhaltserschließung in 344) [ClassicSimilarity], result of:
            0.34613138 = score(doc=344,freq=2.0), product of:
              0.36416996 = queryWeight, product of:
                3.520139 = boost
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.014430909 = queryNorm
              0.95046663 = fieldWeight in 344, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.09375 = fieldNorm(doc=344)
        0.16 = coord(4/25)
    
  4. Henze, V.; Junger, U.; Mödden, E.: Grundzüge und erste Schritte der künftigen inhaltlichen Erschliessung von Publikationen in der Deutschen Nationalbibliothek (2017) 0.09
    0.09479242 = sum of:
      0.09479242 = product of:
        0.47396207 = sum of:
          0.030743549 = weight(abstract_txt:sondern in 4772) [ClassicSimilarity], result of:
            0.030743549 = score(doc=4772,freq=2.0), product of:
              0.09133897 = queryWeight, product of:
                1.2465819 = boost
                5.0774026 = idf(docFreq=752, maxDocs=44421)
                0.014430909 = queryNorm
              0.33658743 = fieldWeight in 4772, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0774026 = idf(docFreq=752, maxDocs=44421)
                0.046875 = fieldNorm(doc=4772)
          0.018623754 = weight(abstract_txt:werden in 4772) [ClassicSimilarity], result of:
            0.018623754 = score(doc=4772,freq=3.0), product of:
              0.06539305 = queryWeight, product of:
                1.2918265 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.014430909 = queryNorm
              0.28479713 = fieldWeight in 4772, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.046875 = fieldNorm(doc=4772)
          0.07695652 = weight(abstract_txt:erschließung in 4772) [ClassicSimilarity], result of:
            0.07695652 = score(doc=4772,freq=5.0), product of:
              0.12407103 = queryWeight, product of:
                1.4528741 = boost
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.014430909 = queryNorm
              0.6202618 = fieldWeight in 4772, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.046875 = fieldNorm(doc=4772)
          0.06247286 = weight(abstract_txt:daten in 4772) [ClassicSimilarity], result of:
            0.06247286 = score(doc=4772,freq=3.0), product of:
              0.14653714 = queryWeight, product of:
                1.9338031 = boost
                5.250997 = idf(docFreq=632, maxDocs=44421)
                0.014430909 = queryNorm
              0.42632782 = fieldWeight in 4772, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.250997 = idf(docFreq=632, maxDocs=44421)
                0.046875 = fieldNorm(doc=4772)
          0.2851654 = weight(abstract_txt:erschließungsdaten in 4772) [ClassicSimilarity], result of:
            0.2851654 = score(doc=4772,freq=1.0), product of:
              0.6400855 = queryWeight, product of:
                4.6668816 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.014430909 = queryNorm
              0.4455114 = fieldWeight in 4772, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.046875 = fieldNorm(doc=4772)
        0.2 = coord(5/25)
    
  5. Wiesenmüller, H.: Verbale Erschließung in Katalogen und Discovery-Systemen : Überlegungen zur Qualität (2021) 0.09
    0.09439476 = sum of:
      0.09439476 = product of:
        0.47197378 = sum of:
          0.07305321 = weight(abstract_txt:inhaltlicher in 1375) [ClassicSimilarity], result of:
            0.07305321 = score(doc=1375,freq=1.0), product of:
              0.13426222 = queryWeight, product of:
                1.0686972 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.014430909 = queryNorm
              0.54410845 = fieldWeight in 1375, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.0625 = fieldNorm(doc=1375)
          0.014336573 = weight(abstract_txt:werden in 1375) [ClassicSimilarity], result of:
            0.014336573 = score(doc=1375,freq=1.0), product of:
              0.06539305 = queryWeight, product of:
                1.2918265 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.014430909 = queryNorm
              0.21923694 = fieldWeight in 1375, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.0625 = fieldNorm(doc=1375)
          0.07948035 = weight(abstract_txt:erschließung in 1375) [ClassicSimilarity], result of:
            0.07948035 = score(doc=1375,freq=3.0), product of:
              0.12407103 = queryWeight, product of:
                1.4528741 = boost
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.014430909 = queryNorm
              0.6406036 = fieldWeight in 1375, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.0625 = fieldNorm(doc=1375)
          0.07434939 = weight(abstract_txt:qualität in 1375) [ClassicSimilarity], result of:
            0.07434939 = score(doc=1375,freq=2.0), product of:
              0.1358457 = queryWeight, product of:
                1.5202525 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.014430909 = queryNorm
              0.5473076 = fieldWeight in 1375, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0625 = fieldNorm(doc=1375)
          0.23075426 = weight(abstract_txt:inhaltserschließung in 1375) [ClassicSimilarity], result of:
            0.23075426 = score(doc=1375,freq=2.0), product of:
              0.36416996 = queryWeight, product of:
                3.520139 = boost
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.014430909 = queryNorm
              0.6336444 = fieldWeight in 1375, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.0625 = fieldNorm(doc=1375)
        0.2 = coord(5/25)
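
The score trees above follow the usual TF-IDF pattern of Lucene's ClassicSimilarity: each term score is queryWeight (boost * idf * queryNorm) times fieldWeight (tf * idf * fieldNorm), and the sum over matching terms is scaled by coord. The following Python sketch recomputes the score of the first entry (doc 1051) from the values listed in its tree. The function names are hypothetical, and the formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are assumptions that happen to reproduce the listed numbers; this is not taken from the catalog's actual configuration.

```python
import math

# Constants copied from the explain output above.
MAX_DOCS = 44421
QUERY_NORM = 0.014430909


def idf(doc_freq, max_docs=MAX_DOCS):
    """Assumed inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq, field_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf = sqrt(freq)
    return query_weight * field_weight


def document_score(terms, field_norm, total_query_terms):
    """Sum of term scores, scaled by coord = matched terms / query terms."""
    raw = sum(term_score(b, df, f, field_norm) for b, df, f in terms)
    coord = len(terms) / total_query_terms
    return raw * coord


# (boost, docFreq, termFreq) for the five terms matched in doc 1051.
terms_1051 = [
    (1.4528741, 324, 1.0),  # erschließung
    (1.4644753, 309, 1.0),  # liegt
    (1.5202525, 246, 1.0),  # qualität
    (3.520139, 92, 1.0),    # inhaltserschließung
    (4.6668816, 8, 1.0),    # erschließungsdaten
]

score_1051 = document_score(terms_1051, field_norm=0.09375, total_query_terms=25)
print(score_1051)  # should be close to the listed 0.20665364
```

The same functions apply to the other entries once their boost, docFreq, termFreq and fieldNorm values are substituted. A term listed without a boost line, such as "spätestens" in entry 3, would correspond to a boost of 1.0.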