Document (#41311)

Author
Munkelt, J.
Title
Erstellung einer DNB-Retrieval-Testkollektion
Imprint
Köln : Technische Hochschule, Fakultät für Informations- und Kommunikationswissenschaften
Year
2018
Pages
II, 79 pp.
Abstract
Since autumn 2017, the Deutsche Nationalbibliothek (German National Library) has carried out the subject indexing of certain media works purely by machine. The quality of this procedure, which can significantly shape how libraries organize their processes, is debated controversially among experts. Their positions are first explained in sufficient detail, before the need for a quality assessment of the procedure and the foundations of such an assessment are set out. A central component of any future assessment is a test collection. Its creation and the documentation of that creation are the focus of this thesis. In this context, the history of test collections and the requirements for successful ones are also covered. Finally, a retrieval test is carried out that demonstrates the usability of the test collection developed here. Its results serve solely to verify that the collection works. No quality assessment of machine subject indexing, whether in this specific case or in general, is made, and none is intended.
Content
Bachelor's thesis, Library Science, Fakultät für Informations- und Kommunikationswissenschaften, Technische Hochschule Köln
Footnote
Munkelt_Bachelorarbeit_DNB_Retrievaltest.pdf.
Theme
Retrieval studies
Automatic indexing

Similar documents (content)

  1. Körber, S.: Suchmuster erfahrener und unerfahrener Suchmaschinennutzer im deutschsprachigen World Wide Web (2000) 0.09
    0.086299405 = sum of:
      0.086299405 = product of:
        0.35958084 = sum of:
          0.09276858 = weight(abstract_txt:prüfung in 5938) [ClassicSimilarity], result of:
            0.09276858 = score(doc=5938,freq=1.0), product of:
              0.19381317 = queryWeight, product of:
                1.0712221 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.020671608 = queryNorm
              0.4786495 = fieldWeight in 5938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5938)
          0.09834098 = weight(abstract_txt:ausarbeitung in 5938) [ClassicSimilarity], result of:
            0.09834098 = score(doc=5938,freq=1.0), product of:
              0.20149876 = queryWeight, product of:
                1.092255 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.020671608 = queryNorm
              0.48804757 = fieldWeight in 5938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5938)
          0.054109786 = weight(abstract_txt:ihre in 5938) [ClassicSimilarity], result of:
            0.054109786 = score(doc=5938,freq=3.0), product of:
              0.11819601 = queryWeight, product of:
                1.1830531 = boost
                4.8330836 = idf(docFreq=956, maxDocs=44218)
                0.020671608 = queryNorm
              0.45779705 = fieldWeight in 5938, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.8330836 = idf(docFreq=956, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5938)
          0.01789222 = weight(abstract_txt:werden in 5938) [ClassicSimilarity], result of:
            0.01789222 = score(doc=5938,freq=1.0), product of:
              0.093310945 = queryWeight, product of:
                1.2874037 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.020671608 = queryNorm
              0.19174835 = fieldWeight in 5938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5938)
          0.024314791 = weight(abstract_txt:einer in 5938) [ClassicSimilarity], result of:
            0.024314791 = score(doc=5938,freq=1.0), product of:
              0.11448189 = queryWeight, product of:
                1.4259913 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.020671608 = queryNorm
              0.21238984 = fieldWeight in 5938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5938)
          0.07215449 = weight(abstract_txt:findet in 5938) [ClassicSimilarity], result of:
            0.07215449 = score(doc=5938,freq=1.0), product of:
              0.20652293 = queryWeight, product of:
                1.5638207 = boost
                6.3886194 = idf(docFreq=201, maxDocs=44218)
                0.020671608 = queryNorm
              0.34937763 = fieldWeight in 5938, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3886194 = idf(docFreq=201, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5938)
        0.24 = coord(6/25)
    
  2. Bertram, J.: Einführung in die inhaltliche Erschließung : Grundlagen - Methoden - Instrumente (2005) 0.09
    0.08552227 = sum of:
      0.08552227 = product of:
        0.3563428 = sum of:
          0.043727316 = weight(abstract_txt:ihre in 210) [ClassicSimilarity], result of:
            0.043727316 = score(doc=210,freq=6.0), product of:
              0.11819601 = queryWeight, product of:
                1.1830531 = boost
                4.8330836 = idf(docFreq=956, maxDocs=44218)
                0.020671608 = queryNorm
              0.36995593 = fieldWeight in 210, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.8330836 = idf(docFreq=956, maxDocs=44218)
                0.03125 = fieldNorm(doc=210)
          0.02286184 = weight(abstract_txt:werden in 210) [ClassicSimilarity], result of:
            0.02286184 = score(doc=210,freq=5.0), product of:
              0.093310945 = queryWeight, product of:
                1.2874037 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.020671608 = queryNorm
              0.24500707 = fieldWeight in 210, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.03125 = fieldNorm(doc=210)
          0.024065401 = weight(abstract_txt:einer in 210) [ClassicSimilarity], result of:
            0.024065401 = score(doc=210,freq=3.0), product of:
              0.11448189 = queryWeight, product of:
                1.4259913 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.020671608 = queryNorm
              0.21021143 = fieldWeight in 210, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.03125 = fieldNorm(doc=210)
          0.041231137 = weight(abstract_txt:findet in 210) [ClassicSimilarity], result of:
            0.041231137 = score(doc=210,freq=1.0), product of:
              0.20652293 = queryWeight, product of:
                1.5638207 = boost
                6.3886194 = idf(docFreq=201, maxDocs=44218)
                0.020671608 = queryNorm
              0.19964436 = fieldWeight in 210, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3886194 = idf(docFreq=201, maxDocs=44218)
                0.03125 = fieldNorm(doc=210)
          0.081382215 = weight(abstract_txt:erstellung in 210) [ClassicSimilarity], result of:
            0.081382215 = score(doc=210,freq=3.0), product of:
              0.22531874 = queryWeight, product of:
                1.6334337 = boost
                6.6730065 = idf(docFreq=151, maxDocs=44218)
                0.020671608 = queryNorm
              0.36118707 = fieldWeight in 210, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.6730065 = idf(docFreq=151, maxDocs=44218)
                0.03125 = fieldNorm(doc=210)
          0.1430749 = weight(abstract_txt:inhaltserschließung in 210) [ClassicSimilarity], result of:
            0.1430749 = score(doc=210,freq=6.0), product of:
              0.26050133 = queryWeight, product of:
                1.7563368 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.020671608 = queryNorm
              0.5492291 = fieldWeight in 210, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.03125 = fieldNorm(doc=210)
        0.24 = coord(6/25)
    
  3. Fichtner, K.: Boyer-Moore Suchalgorithmus (2005) 0.08
    0.08295156 = sum of:
      0.08295156 = product of:
        0.4147578 = sum of:
          0.1404871 = weight(abstract_txt:ausarbeitung in 864) [ClassicSimilarity], result of:
            0.1404871 = score(doc=864,freq=1.0), product of:
              0.20149876 = queryWeight, product of:
                1.092255 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.020671608 = queryNorm
              0.6972108 = fieldWeight in 864, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.078125 = fieldNorm(doc=864)
          0.036147743 = weight(abstract_txt:werden in 864) [ClassicSimilarity], result of:
            0.036147743 = score(doc=864,freq=2.0), product of:
              0.093310945 = queryWeight, product of:
                1.2874037 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.020671608 = queryNorm
              0.3873902 = fieldWeight in 864, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.078125 = fieldNorm(doc=864)
          0.03473541 = weight(abstract_txt:einer in 864) [ClassicSimilarity], result of:
            0.03473541 = score(doc=864,freq=1.0), product of:
              0.11448189 = queryWeight, product of:
                1.4259913 = boost
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.020671608 = queryNorm
              0.30341405 = fieldWeight in 864, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8837 = idf(docFreq=2472, maxDocs=44218)
                0.078125 = fieldNorm(doc=864)
          0.10030969 = weight(abstract_txt:statt in 864) [ClassicSimilarity], result of:
            0.10030969 = score(doc=864,freq=1.0), product of:
              0.20280872 = queryWeight, product of:
                1.5496948 = boost
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.020671608 = queryNorm
              0.49460244 = fieldWeight in 864, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.330911 = idf(docFreq=213, maxDocs=44218)
                0.078125 = fieldNorm(doc=864)
          0.10307784 = weight(abstract_txt:findet in 864) [ClassicSimilarity], result of:
            0.10307784 = score(doc=864,freq=1.0), product of:
              0.20652293 = queryWeight, product of:
                1.5638207 = boost
                6.3886194 = idf(docFreq=201, maxDocs=44218)
                0.020671608 = queryNorm
              0.49911088 = fieldWeight in 864, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3886194 = idf(docFreq=201, maxDocs=44218)
                0.078125 = fieldNorm(doc=864)
        0.2 = coord(5/25)
    
  4. Pollmeier, M.: Verlagsschlagwörter als Grundlage für den Einsatz eines maschinellen Verfahrens zur verbalen Erschließung der Kinder- und Jugendliteratur durch die Deutsche Nationalbibliothek : eine Datenanalyse (2019) 0.08
    0.07813488 = sum of:
      0.07813488 = product of:
        0.48834303 = sum of:
          0.08818323 = weight(abstract_txt:maschinell in 1081) [ClassicSimilarity], result of:
            0.08818323 = score(doc=1081,freq=1.0), product of:
              0.17141365 = queryWeight, product of:
                1.00742 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.020671608 = queryNorm
              0.514447 = fieldWeight in 1081, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.0625 = fieldNorm(doc=1081)
          0.028918196 = weight(abstract_txt:werden in 1081) [ClassicSimilarity], result of:
            0.028918196 = score(doc=1081,freq=2.0), product of:
              0.093310945 = queryWeight, product of:
                1.2874037 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.020671608 = queryNorm
              0.30991215 = fieldWeight in 1081, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0625 = fieldNorm(doc=1081)
          0.20233847 = weight(abstract_txt:inhaltserschließung in 1081) [ClassicSimilarity], result of:
            0.20233847 = score(doc=1081,freq=3.0), product of:
              0.26050133 = queryWeight, product of:
                1.7563368 = boost
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.020671608 = queryNorm
              0.7767272 = fieldWeight in 1081, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1750984 = idf(docFreq=91, maxDocs=44218)
                0.0625 = fieldNorm(doc=1081)
          0.16890314 = weight(abstract_txt:verfahrens in 1081) [ClassicSimilarity], result of:
            0.16890314 = score(doc=1081,freq=1.0), product of:
              0.33308613 = queryWeight, product of:
                1.9860086 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.020671608 = queryNorm
              0.5070855 = fieldWeight in 1081, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.0625 = fieldNorm(doc=1081)
        0.16 = coord(4/25)
    
  5. Wilhelmy, A.: Phonetische Ähnlichkeitssuche in Datenbanken (1991) 0.08
    0.07671118 = sum of:
      0.07671118 = product of:
        0.47944486 = sum of:
          0.11022904 = weight(abstract_txt:maschinell in 5684) [ClassicSimilarity], result of:
            0.11022904 = score(doc=5684,freq=1.0), product of:
              0.17141365 = queryWeight, product of:
                1.00742 = boost
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.020671608 = queryNorm
              0.6430587 = fieldWeight in 5684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.231152 = idf(docFreq=31, maxDocs=44218)
                0.078125 = fieldNorm(doc=5684)
          0.13252655 = weight(abstract_txt:maschineller in 5684) [ClassicSimilarity], result of:
            0.13252655 = score(doc=5684,freq=1.0), product of:
              0.19381317 = queryWeight, product of:
                1.0712221 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.020671608 = queryNorm
              0.683785 = fieldWeight in 5684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.078125 = fieldNorm(doc=5684)
          0.025560316 = weight(abstract_txt:werden in 5684) [ClassicSimilarity], result of:
            0.025560316 = score(doc=5684,freq=1.0), product of:
              0.093310945 = queryWeight, product of:
                1.2874037 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.020671608 = queryNorm
              0.27392623 = fieldWeight in 5684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.078125 = fieldNorm(doc=5684)
          0.21112894 = weight(abstract_txt:verfahrens in 5684) [ClassicSimilarity], result of:
            0.21112894 = score(doc=5684,freq=1.0), product of:
              0.33308613 = queryWeight, product of:
                1.9860086 = boost
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.020671608 = queryNorm
              0.6338569 = fieldWeight in 5684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.113368 = idf(docFreq=35, maxDocs=44218)
                0.078125 = fieldNorm(doc=5684)
        0.16 = coord(4/25)
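
The score trees above follow the classic TF-IDF pattern reported by Lucene's ClassicSimilarity: each term score is queryWeight (boost x idf x queryNorm) times fieldWeight (tf x idf x fieldNorm), and the sum over matching terms is scaled by the coord factor. The sketch below recomputes the score of entry 5 (doc 5684) from the values printed in the listing. The function names are illustrative assumptions, not part of any library, and the formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are the ones the listed numbers are consistent with. Python uses 64-bit floats where the listing shows 32-bit values, so agreement is only expected up to rounding.

```python
import math

MAX_DOCS = 44218          # maxDocs as printed in the listing
QUERY_NORM = 0.020671608  # queryNorm as printed in the listing


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, boost, field_norm, query_norm=QUERY_NORM):
    """Score of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm, total_query_terms):
    """Sum of term scores, scaled by coord = matched / total query terms."""
    total = sum(term_score(f, df, b, field_norm) for f, df, b in terms)
    coord = len(terms) / total_query_terms
    return total * coord


# Entry 5 (doc 5684): (freq, docFreq, boost) for each matching term,
# copied from the listing.
doc_5684_terms = [
    (1.0, 18, 1.0712221),    # maschineller
    (1.0, 31, 1.00742),      # maschinell
    (1.0, 3606, 1.2874037),  # werden
    (1.0, 35, 1.9860086),    # verfahrens
]

score = document_score(doc_5684_terms, field_norm=0.078125, total_query_terms=25)
print(round(score, 6))
```

The result should agree with the 0.07671118 shown for entry 5 up to floating-point rounding. The same functions can be applied to the other entries by swapping in their term lists, fieldNorm values, and coord fractions.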