Document (#44253)

Author
Blischke, K.
Title
Konvertierung bibliografischer Referenzdaten in ein neutrales Austauschformat : Probleme und Lösungsmöglichkeiten am Beispiel der Datenbank "Literatur zur Informationserschließung"
Imprint
Köln : Technische Hochschule / Fakultät für Informations- und Kommunikationswissenschaften
Year
2024
Pages
IV, 43 S
Abstract
Die Konvertierung von bibliographischen Daten in andere Formate stellt eine häufige Herausforderung in der bibliothekarischen Arbeit dar, wie die Systemumstellung vieler Bibliotheken auf das Bibliotheksmanagementsystem Alma zeigt. Dabei ist die verlustfreie Durchführung dieses Prozesses eine besondere Schwierigkeit, die aus der Verschiedenheit der Formate resultiert. Ein konkretes Beispiel für eine solche zu konvertierende Datenmenge ist die Literaturdatenbank "Literatur zur Informationserschließung", welche 44.218 bibliographische Einträge enthält und von einer modifizierten Form des Allegro-Neutralformats in das RIS-Format konvertiert werden soll. Dabei wird auf der Grundlage von erarbeiteten Konkordanzen zwischen beiden Formaten und Untersuchungen der Datenbank mit regulären Ausdrücken, sowie einem Pythonskript ein Programm geschrieben, das die Datenbank in das Zielformat konvertieren soll. Das Ergebnis wird anhand einer proportionalen Schichtenstichprobe evaluiert. Abschließend werden der Entwicklungsprozess und das Ergebnis hinsichtlich des stattgefundenen Informationsverlustes bei dem Konvertierungsprozess reflektiert.
Footnote
Bachelorarbeit zur Erlangung des akademischen Grades Bachelor of Arts im Studiengang Bibliothek und digitale Kommunikation an der Fakultät für Informations- und Kommunikationswissenschaften der Technischen Hochschule Köln.
Theme
Datenformate
Bibliographische Software
Object
RIS
Allegro

Similar documents (content)

  1. Enderle, W.: Neue Wege der bibliothekarischen Informationserschließung : von der Erschließung unselbständiger Literatur über Volltextindizierung bis zu Hypertext- und Expertensystemen (1994) 0.09
    0.09431142 = sum of:
      0.09431142 = product of:
        0.5894464 = sum of:
          0.122741506 = weight(abstract_txt:lösungsmöglichkeiten in 2103) [ClassicSimilarity], result of:
            0.122741506 = score(doc=2103,freq=1.0), product of:
              0.17336352 = queryWeight, product of:
                1.0524929 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.018175855 = queryNorm
              0.7080008 = fieldWeight in 2103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.078125 = fieldNorm(doc=2103)
          0.06477378 = weight(abstract_txt:beispiel in 2103) [ClassicSimilarity], result of:
            0.06477378 = score(doc=2103,freq=1.0), product of:
              0.14263943 = queryWeight, product of:
                1.3501284 = boost
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.018175855 = queryNorm
              0.45410857 = fieldWeight in 2103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.078125 = fieldNorm(doc=2103)
          0.06988679 = weight(abstract_txt:literatur in 2103) [ClassicSimilarity], result of:
            0.06988679 = score(doc=2103,freq=1.0), product of:
              0.15005028 = queryWeight, product of:
                1.3847574 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.018175855 = queryNorm
              0.46575582 = fieldWeight in 2103, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.078125 = fieldNorm(doc=2103)
          0.33204433 = weight(abstract_txt:informationserschließung in 2103) [ClassicSimilarity], result of:
            0.33204433 = score(doc=2103,freq=2.0), product of:
              0.3365845 = queryWeight, product of:
                2.0739694 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.018175855 = queryNorm
              0.98651105 = fieldWeight in 2103, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.078125 = fieldNorm(doc=2103)
        0.16 = coord(4/25)
    
  2. Kaiser, A.: Computer-unterstütztes Indexieren in Intelligenten Information Retrieval Systemen : Ein Relevanz-Feedback orientierter Ansatz zur Informationserschließung in unformatierten Datenbanken (1993) 0.08
    0.07655631 = sum of:
      0.07655631 = product of:
        0.31898463 = sum of:
          0.023509841 = weight(abstract_txt:dabei in 284) [ClassicSimilarity], result of:
            0.023509841 = score(doc=284,freq=3.0), product of:
              0.092695676 = queryWeight, product of:
                1.0883912 = boost
                4.6857553 = idf(docFreq=1113, maxDocs=44421)
                0.018175855 = queryNorm
              0.25362393 = fieldWeight in 284, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6857553 = idf(docFreq=1113, maxDocs=44421)
                0.03125 = fieldNorm(doc=284)
          0.02377124 = weight(abstract_txt:eine in 284) [ClassicSimilarity], result of:
            0.02377124 = score(doc=284,freq=8.0), product of:
              0.077084735 = queryWeight, product of:
                1.2155843 = boost
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.018175855 = queryNorm
              0.30837804 = fieldWeight in 284, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.03125 = fieldNorm(doc=284)
          0.019067455 = weight(abstract_txt:soll in 284) [ClassicSimilarity], result of:
            0.019067455 = score(doc=284,freq=1.0), product of:
              0.116268456 = queryWeight, product of:
                1.218951 = boost
                5.247843 = idf(docFreq=634, maxDocs=44421)
                0.018175855 = queryNorm
              0.16399509 = fieldWeight in 284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.247843 = idf(docFreq=634, maxDocs=44421)
                0.03125 = fieldNorm(doc=284)
          0.027954718 = weight(abstract_txt:literatur in 284) [ClassicSimilarity], result of:
            0.027954718 = score(doc=284,freq=1.0), product of:
              0.15005028 = queryWeight, product of:
                1.3847574 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.018175855 = queryNorm
              0.18630233 = fieldWeight in 284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.03125 = fieldNorm(doc=284)
          0.03684871 = weight(abstract_txt:ergebnis in 284) [ClassicSimilarity], result of:
            0.03684871 = score(doc=284,freq=1.0), product of:
              0.18039103 = queryWeight, product of:
                1.518318 = boost
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.018175855 = queryNorm
              0.2042713 = fieldWeight in 284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.03125 = fieldNorm(doc=284)
          0.18783264 = weight(abstract_txt:informationserschließung in 284) [ClassicSimilarity], result of:
            0.18783264 = score(doc=284,freq=4.0), product of:
              0.3365845 = queryWeight, product of:
                2.0739694 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.018175855 = queryNorm
              0.5580549 = fieldWeight in 284, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.03125 = fieldNorm(doc=284)
        0.24 = coord(6/25)
    
  3. Garbe, G.: Informationeller Mehrwert durch den Einsatz von Inhouse-Retrieval-Systemen : am Beispiel von Literaturbestellungen als Realisierung im Datenbanksystem STAR (1992) 0.07
    0.074905224 = sum of:
      0.074905224 = product of:
        0.46815768 = sum of:
          0.21908315 = weight(abstract_txt:konkretes in 558) [ClassicSimilarity], result of:
            0.21908315 = score(doc=558,freq=1.0), product of:
              0.1864759 = queryWeight, product of:
                1.0915701 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.018175855 = queryNorm
              1.1748604 = fieldWeight in 558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.125 = fieldNorm(doc=558)
          0.03361761 = weight(abstract_txt:eine in 558) [ClassicSimilarity], result of:
            0.03361761 = score(doc=558,freq=1.0), product of:
              0.077084735 = queryWeight, product of:
                1.2155843 = boost
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.018175855 = queryNorm
              0.4361124 = fieldWeight in 558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.125 = fieldNorm(doc=558)
          0.10363806 = weight(abstract_txt:beispiel in 558) [ClassicSimilarity], result of:
            0.10363806 = score(doc=558,freq=1.0), product of:
              0.14263943 = queryWeight, product of:
                1.3501284 = boost
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.018175855 = queryNorm
              0.7265737 = fieldWeight in 558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.125 = fieldNorm(doc=558)
          0.11181887 = weight(abstract_txt:literatur in 558) [ClassicSimilarity], result of:
            0.11181887 = score(doc=558,freq=1.0), product of:
              0.15005028 = queryWeight, product of:
                1.3847574 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.018175855 = queryNorm
              0.74520934 = fieldWeight in 558, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.125 = fieldNorm(doc=558)
        0.16 = coord(4/25)
    
  4. Curio, H.-J. (Mitarb.); Körner, H.P. (Mitarb.); Marohn, H.-D. (Mitarb.): Fachthesaurus Luftverkehr und Randgebiete (1990) 0.06
    0.059841443 = sum of:
      0.059841443 = product of:
        0.24933936 = sum of:
          0.028793557 = weight(abstract_txt:dabei in 5218) [ClassicSimilarity], result of:
            0.028793557 = score(doc=5218,freq=2.0), product of:
              0.092695676 = queryWeight, product of:
                1.0883912 = boost
                4.6857553 = idf(docFreq=1113, maxDocs=44421)
                0.018175855 = queryNorm
              0.3106246 = fieldWeight in 5218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6857553 = idf(docFreq=1113, maxDocs=44421)
                0.046875 = fieldNorm(doc=5218)
          0.025213206 = weight(abstract_txt:eine in 5218) [ClassicSimilarity], result of:
            0.025213206 = score(doc=5218,freq=4.0), product of:
              0.077084735 = queryWeight, product of:
                1.2155843 = boost
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.018175855 = queryNorm
              0.3270843 = fieldWeight in 5218, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.046875 = fieldNorm(doc=5218)
          0.028601183 = weight(abstract_txt:soll in 5218) [ClassicSimilarity], result of:
            0.028601183 = score(doc=5218,freq=1.0), product of:
              0.116268456 = queryWeight, product of:
                1.218951 = boost
                5.247843 = idf(docFreq=634, maxDocs=44421)
                0.018175855 = queryNorm
              0.24599263 = fieldWeight in 5218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.247843 = idf(docFreq=634, maxDocs=44421)
                0.046875 = fieldNorm(doc=5218)
          0.04193208 = weight(abstract_txt:literatur in 5218) [ClassicSimilarity], result of:
            0.04193208 = score(doc=5218,freq=1.0), product of:
              0.15005028 = queryWeight, product of:
                1.3847574 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.018175855 = queryNorm
              0.27945352 = fieldWeight in 5218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.046875 = fieldNorm(doc=5218)
          0.055273063 = weight(abstract_txt:ergebnis in 5218) [ClassicSimilarity], result of:
            0.055273063 = score(doc=5218,freq=1.0), product of:
              0.18039103 = queryWeight, product of:
                1.518318 = boost
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.018175855 = queryNorm
              0.30640694 = fieldWeight in 5218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.046875 = fieldNorm(doc=5218)
          0.06952627 = weight(abstract_txt:datenbank in 5218) [ClassicSimilarity], result of:
            0.06952627 = score(doc=5218,freq=1.0), product of:
              0.24062215 = queryWeight, product of:
                2.1476758 = boost
                6.1641335 = idf(docFreq=253, maxDocs=44421)
                0.018175855 = queryNorm
              0.28894377 = fieldWeight in 5218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1641335 = idf(docFreq=253, maxDocs=44421)
                0.046875 = fieldNorm(doc=5218)
        0.24 = coord(6/25)
    
  5. Niedermair, K.; Habersam, M.: ¬Die Bibliothek im Zeitalter ihrer Automatisierbarkeit : Die Aufgaben der Bibliothek und die Darlegung ihrer Qualität (2015) 0.05
    0.054288473 = sum of:
      0.054288473 = product of:
        0.33930296 = sum of:
          0.08422171 = weight(abstract_txt:prozesses in 2707) [ClassicSimilarity], result of:
            0.08422171 = score(doc=2707,freq=1.0), product of:
              0.15650184 = queryWeight, product of:
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.018175855 = queryNorm
              0.53815156 = fieldWeight in 2707, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.0625 = fieldNorm(doc=2707)
          0.029113702 = weight(abstract_txt:eine in 2707) [ClassicSimilarity], result of:
            0.029113702 = score(doc=2707,freq=3.0), product of:
              0.077084735 = queryWeight, product of:
                1.2155843 = boost
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.018175855 = queryNorm
              0.3776844 = fieldWeight in 2707, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.0625 = fieldNorm(doc=2707)
          0.03813491 = weight(abstract_txt:soll in 2707) [ClassicSimilarity], result of:
            0.03813491 = score(doc=2707,freq=1.0), product of:
              0.116268456 = queryWeight, product of:
                1.218951 = boost
                5.247843 = idf(docFreq=634, maxDocs=44421)
                0.018175855 = queryNorm
              0.32799017 = fieldWeight in 2707, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.247843 = idf(docFreq=634, maxDocs=44421)
                0.0625 = fieldNorm(doc=2707)
          0.18783264 = weight(abstract_txt:informationserschließung in 2707) [ClassicSimilarity], result of:
            0.18783264 = score(doc=2707,freq=1.0), product of:
              0.3365845 = queryWeight, product of:
                2.0739694 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.018175855 = queryNorm
              0.5580549 = fieldWeight in 2707, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.0625 = fieldNorm(doc=2707)
        0.16 = coord(4/25)