Document (#43374)

Author
Sack, H.
Title
Hybride Künstliche Intelligenz in der automatisierten Inhaltserschließung
Source
Qualität in der Inhaltserschließung. Hrsg.: M. Franke-Maier, u.a
Imprint
München : DeGruyter-Saur
Year
2021
Pages
S.387-405
Series
Bibliotheks- und Informationspraxis; 70
Abstract
Effizienter (Online-)Zugang zu Bibliotheks- und Archivmaterialien erfordert eine qualitativ hinreichende inhaltliche Erschließung dieser Dokumente. Die passgenaue Verschlagwortung und Kategorisierung dieser unstrukturierten Dokumente ermöglichen einen strukturell gegliederten Zugang sowohl in der analogen als auch in der digitalen Welt. Darüber hinaus erweitert eine vollständige Transkription der Dokumente den Zugang über die Möglichkeiten der Volltextsuche. Angesichts der in jüngster Zeit erzielten spektakulären Erfolge der Künstlichen Intelligenz liegt die Schlussfolgerung nahe, dass auch das Problem der automatisierten Inhaltserschließung für Bibliotheken und Archive als mehr oder weniger gelöst anzusehen wäre. Allerdings lassen sich die oftmals nur in thematisch engen Teilbereichen erzielten Erfolge nicht immer problemlos verallgemeinern oder in einen neuen Kontext übertragen. Das Ziel der vorliegenden Darstellung liegt in der Diskussion des aktuellen Stands der Technik der automatisierten inhaltlichen Erschließung anhand ausgewählter Beispiele sowie möglicher Fortschritte und Prognosen basierend auf aktuellen Entwicklungen des maschinellen Lernens und der Künstlichen Intelligenz einschließlich deren Kritik.
Theme
Automatisches Indexieren

Similar documents (content)

  1. Kasprzik, A.: Automatisierte und semiautomatisierte Klassifizierung : eine Analyse aktueller Projekte (2014) 0.23
    0.23044917 = sum of:
      0.23044917 = product of:
        0.82303274 = sum of:
          0.025362976 = weight(abstract_txt:dieser in 3470) [ClassicSimilarity], result of:
            0.025362976 = score(doc=3470,freq=1.0), product of:
              0.07416013 = queryWeight, product of:
                1.0113584 = boost
                4.377637 = idf(docFreq=1515, maxDocs=44421)
                0.01675042 = queryNorm
              0.34200287 = fieldWeight in 3470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.377637 = idf(docFreq=1515, maxDocs=44421)
                0.078125 = fieldNorm(doc=3470)
          0.07067508 = weight(abstract_txt:aktuellen in 3470) [ClassicSimilarity], result of:
            0.07067508 = score(doc=3470,freq=1.0), product of:
              0.14685245 = queryWeight, product of:
                1.4231819 = boost
                6.160204 = idf(docFreq=254, maxDocs=44421)
                0.01675042 = queryNorm
              0.48126593 = fieldWeight in 3470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.160204 = idf(docFreq=254, maxDocs=44421)
                0.078125 = fieldNorm(doc=3470)
          0.11138657 = weight(abstract_txt:inhaltserschließung in 3470) [ClassicSimilarity], result of:
            0.11138657 = score(doc=3470,freq=1.0), product of:
              0.1988805 = queryWeight, product of:
                1.656212 = boost
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.01675042 = queryNorm
              0.56006783 = fieldWeight in 3470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.078125 = fieldNorm(doc=3470)
          0.1139824 = weight(abstract_txt:künstlichen in 3470) [ClassicSimilarity], result of:
            0.1139824 = score(doc=3470,freq=1.0), product of:
              0.20195854 = queryWeight, product of:
                1.6689792 = boost
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.01675042 = queryNorm
              0.5643852 = fieldWeight in 3470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.078125 = fieldNorm(doc=3470)
          0.11096808 = weight(abstract_txt:dokumente in 3470) [ClassicSimilarity], result of:
            0.11096808 = score(doc=3470,freq=1.0), product of:
              0.22709076 = queryWeight, product of:
                2.1675303 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.01675042 = queryNorm
              0.4886508 = fieldWeight in 3470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=3470)
          0.11768451 = weight(abstract_txt:intelligenz in 3470) [ClassicSimilarity], result of:
            0.11768451 = score(doc=3470,freq=1.0), product of:
              0.23616396 = queryWeight, product of:
                2.210407 = boost
                6.3784575 = idf(docFreq=204, maxDocs=44421)
                0.01675042 = queryNorm
              0.498317 = fieldWeight in 3470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3784575 = idf(docFreq=204, maxDocs=44421)
                0.078125 = fieldNorm(doc=3470)
          0.27297312 = weight(abstract_txt:automatisierten in 3470) [ClassicSimilarity], result of:
            0.27297312 = score(doc=3470,freq=1.0), product of:
              0.4138224 = queryWeight, product of:
                2.9259875 = boost
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.01675042 = queryNorm
              0.65963835 = fieldWeight in 3470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.078125 = fieldNorm(doc=3470)
        0.28 = coord(7/25)
    
  2. Groß, T.; Faden, M.: Automatische Indexierung elektronischer Dokumente an der Deutschen Zentralbibliothek für Wirtschaftswissenschaften : Bericht über die Jahrestagung der Internationalen Buchwissenschaftlichen Gesellschaft (2010) 0.12
    0.12032917 = sum of:
      0.12032917 = product of:
        0.42974705 = sum of:
          0.015217787 = weight(abstract_txt:dieser in 51) [ClassicSimilarity], result of:
            0.015217787 = score(doc=51,freq=1.0), product of:
              0.07416013 = queryWeight, product of:
                1.0113584 = boost
                4.377637 = idf(docFreq=1515, maxDocs=44421)
                0.01675042 = queryNorm
              0.20520173 = fieldWeight in 51, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.377637 = idf(docFreq=1515, maxDocs=44421)
                0.046875 = fieldNorm(doc=51)
          0.037590533 = weight(abstract_txt:erschließung in 51) [ClassicSimilarity], result of:
            0.037590533 = score(doc=51,freq=1.0), product of:
              0.13551535 = queryWeight, product of:
                1.3671434 = boost
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.01675042 = queryNorm
              0.2773895 = fieldWeight in 51, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.046875 = fieldNorm(doc=51)
          0.042405047 = weight(abstract_txt:aktuellen in 51) [ClassicSimilarity], result of:
            0.042405047 = score(doc=51,freq=1.0), product of:
              0.14685245 = queryWeight, product of:
                1.4231819 = boost
                6.160204 = idf(docFreq=254, maxDocs=44421)
                0.01675042 = queryNorm
              0.28875956 = fieldWeight in 51, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.160204 = idf(docFreq=254, maxDocs=44421)
                0.046875 = fieldNorm(doc=51)
          0.06683194 = weight(abstract_txt:inhaltserschließung in 51) [ClassicSimilarity], result of:
            0.06683194 = score(doc=51,freq=1.0), product of:
              0.1988805 = queryWeight, product of:
                1.656212 = boost
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.01675042 = queryNorm
              0.33604068 = fieldWeight in 51, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.046875 = fieldNorm(doc=51)
          0.13834852 = weight(abstract_txt:erzielten in 51) [ClassicSimilarity], result of:
            0.13834852 = score(doc=51,freq=1.0), product of:
              0.32303718 = queryWeight, product of:
                2.110795 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.01675042 = queryNorm
              0.4282743 = fieldWeight in 51, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.046875 = fieldNorm(doc=51)
          0.06277238 = weight(abstract_txt:zugang in 51) [ClassicSimilarity], result of:
            0.06277238 = score(doc=51,freq=1.0), product of:
              0.2183462 = queryWeight, product of:
                2.1253881 = boost
                6.133123 = idf(docFreq=261, maxDocs=44421)
                0.01675042 = queryNorm
              0.28749013 = fieldWeight in 51, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.133123 = idf(docFreq=261, maxDocs=44421)
                0.046875 = fieldNorm(doc=51)
          0.06658085 = weight(abstract_txt:dokumente in 51) [ClassicSimilarity], result of:
            0.06658085 = score(doc=51,freq=1.0), product of:
              0.22709076 = queryWeight, product of:
                2.1675303 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.01675042 = queryNorm
              0.29319048 = fieldWeight in 51, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.046875 = fieldNorm(doc=51)
        0.28 = coord(7/25)
    
  3. Boltzendahl, S.: Ontologien in digitalen Bibliotheken unter dem Schwerpunkt Inhaltserschliessung und Recherche (2004) 0.11
    0.1074314 = sum of:
      0.1074314 = product of:
        0.44763082 = sum of:
          0.034028005 = weight(abstract_txt:dieser in 2414) [ClassicSimilarity], result of:
            0.034028005 = score(doc=2414,freq=5.0), product of:
              0.07416013 = queryWeight, product of:
                1.0113584 = boost
                4.377637 = idf(docFreq=1515, maxDocs=44421)
                0.01675042 = queryNorm
              0.45884502 = fieldWeight in 2414, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.377637 = idf(docFreq=1515, maxDocs=44421)
                0.046875 = fieldNorm(doc=2414)
          0.03849823 = weight(abstract_txt:liegt in 2414) [ClassicSimilarity], result of:
            0.03849823 = score(doc=2414,freq=1.0), product of:
              0.13768817 = queryWeight, product of:
                1.3780601 = boost
                5.9648952 = idf(docFreq=309, maxDocs=44421)
                0.01675042 = queryNorm
              0.27960446 = fieldWeight in 2414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9648952 = idf(docFreq=309, maxDocs=44421)
                0.046875 = fieldNorm(doc=2414)
          0.11575631 = weight(abstract_txt:inhaltserschließung in 2414) [ClassicSimilarity], result of:
            0.11575631 = score(doc=2414,freq=3.0), product of:
              0.1988805 = queryWeight, product of:
                1.656212 = boost
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.01675042 = queryNorm
              0.58203954 = fieldWeight in 2414, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.046875 = fieldNorm(doc=2414)
          0.096717276 = weight(abstract_txt:künstlichen in 2414) [ClassicSimilarity], result of:
            0.096717276 = score(doc=2414,freq=2.0), product of:
              0.20195854 = queryWeight, product of:
                1.6689792 = boost
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.01675042 = queryNorm
              0.4788967 = fieldWeight in 2414, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2241306 = idf(docFreq=87, maxDocs=44421)
                0.046875 = fieldNorm(doc=2414)
          0.06277238 = weight(abstract_txt:zugang in 2414) [ClassicSimilarity], result of:
            0.06277238 = score(doc=2414,freq=1.0), product of:
              0.2183462 = queryWeight, product of:
                2.1253881 = boost
                6.133123 = idf(docFreq=261, maxDocs=44421)
                0.01675042 = queryNorm
              0.28749013 = fieldWeight in 2414, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.133123 = idf(docFreq=261, maxDocs=44421)
                0.046875 = fieldNorm(doc=2414)
          0.09985863 = weight(abstract_txt:intelligenz in 2414) [ClassicSimilarity], result of:
            0.09985863 = score(doc=2414,freq=2.0), product of:
              0.23616396 = queryWeight, product of:
                2.210407 = boost
                6.3784575 = idf(docFreq=204, maxDocs=44421)
                0.01675042 = queryNorm
              0.422836 = fieldWeight in 2414, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.3784575 = idf(docFreq=204, maxDocs=44421)
                0.046875 = fieldNorm(doc=2414)
        0.24 = coord(6/25)
    
  4. Gabler, S.: Vergabe von DDC-Sachgruppen mittels eines Schlagwort-Thesaurus (2021) 0.08
    0.080994196 = sum of:
      0.080994196 = product of:
        0.5062137 = sum of:
          0.020290382 = weight(abstract_txt:dieser in 2002) [ClassicSimilarity], result of:
            0.020290382 = score(doc=2002,freq=1.0), product of:
              0.07416013 = queryWeight, product of:
                1.0113584 = boost
                4.377637 = idf(docFreq=1515, maxDocs=44421)
                0.01675042 = queryNorm
              0.2736023 = fieldWeight in 2002, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.377637 = idf(docFreq=1515, maxDocs=44421)
                0.0625 = fieldNorm(doc=2002)
          0.14199881 = weight(abstract_txt:kategorisierung in 2002) [ClassicSimilarity], result of:
            0.14199881 = score(doc=2002,freq=2.0), product of:
              0.1709281 = queryWeight, product of:
                1.0857043 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.01675042 = queryNorm
              0.8307517 = fieldWeight in 2002, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=2002)
          0.12554605 = weight(abstract_txt:dokumente in 2002) [ClassicSimilarity], result of:
            0.12554605 = score(doc=2002,freq=2.0), product of:
              0.22709076 = queryWeight, product of:
                2.1675303 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.01675042 = queryNorm
              0.55284524 = fieldWeight in 2002, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.0625 = fieldNorm(doc=2002)
          0.2183785 = weight(abstract_txt:automatisierten in 2002) [ClassicSimilarity], result of:
            0.2183785 = score(doc=2002,freq=1.0), product of:
              0.4138224 = queryWeight, product of:
                2.9259875 = boost
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.01675042 = queryNorm
              0.5277107 = fieldWeight in 2002, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.0625 = fieldNorm(doc=2002)
        0.16 = coord(4/25)
    
  5. Mödden, E.: Maschinelle Beschlagwortung mit Algorithmen : Ein Blick in die Werkstatt des KI-Projektes der Deutschen Nationalbibliothek (2024) 0.08
    0.07768939 = sum of:
      0.07768939 = product of:
        0.48555866 = sum of:
          0.07518107 = weight(abstract_txt:erschließung in 1051) [ClassicSimilarity], result of:
            0.07518107 = score(doc=1051,freq=1.0), product of:
              0.13551535 = queryWeight, product of:
                1.3671434 = boost
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.01675042 = queryNorm
              0.554779 = fieldWeight in 1051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.09375 = fieldNorm(doc=1051)
          0.07699646 = weight(abstract_txt:liegt in 1051) [ClassicSimilarity], result of:
            0.07699646 = score(doc=1051,freq=1.0), product of:
              0.13768817 = queryWeight, product of:
                1.3780601 = boost
                5.9648952 = idf(docFreq=309, maxDocs=44421)
                0.01675042 = queryNorm
              0.5592089 = fieldWeight in 1051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9648952 = idf(docFreq=309, maxDocs=44421)
                0.09375 = fieldNorm(doc=1051)
          0.13366388 = weight(abstract_txt:inhaltserschließung in 1051) [ClassicSimilarity], result of:
            0.13366388 = score(doc=1051,freq=1.0), product of:
              0.1988805 = queryWeight, product of:
                1.656212 = boost
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.01675042 = queryNorm
              0.67208135 = fieldWeight in 1051, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.09375 = fieldNorm(doc=1051)
          0.19971725 = weight(abstract_txt:intelligenz in 1051) [ClassicSimilarity], result of:
            0.19971725 = score(doc=1051,freq=2.0), product of:
              0.23616396 = queryWeight, product of:
                2.210407 = boost
                6.3784575 = idf(docFreq=204, maxDocs=44421)
                0.01675042 = queryNorm
              0.845672 = fieldWeight in 1051, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.3784575 = idf(docFreq=204, maxDocs=44421)
                0.09375 = fieldNorm(doc=1051)
        0.16 = coord(4/25)