Document (#43727)

Author
Hahn, J.
Title
Semi-automated methods for BIBFRAME work entity description
Source
Cataloging and classification quarterly. 59(2021) no.8, p.853-867
Year
2021
Abstract
This paper reports an investigation of machine learning methods for the semi-automated creation of a BIBFRAME Work entity description within the RDF linked data editor Sinopia (https://sinopia.io). The automated subject indexing software Annif was configured with the Library of Congress Subject Headings (LCSH) vocabulary from the Linked Data Service at https://id.loc.gov/. The training corpus comprised 9.3 million titles and LCSH linked data references from the IvyPlus POD project (https://pod.stanford.edu/) and from Share-VDE (https://wiki.share-vde.org). Semi-automated processes were explored to support and extend, not replace, professional expertise.
Content
Cf.: https://doi.org/10.1080/01639374.2021.2014011.
Footnote
Part of a special issue: Artificial intelligence (AI) and automated processes for subject access
Theme
Descriptive cataloging
Object
BIBFRAME

Similar documents (author)

  1. Hahn, G.: ¬Die Bibliothek des Wissenschaftlichen Dienstes des US-Kongresses : Eine Bibliothek in der Library of Congress (1985) 4.88
    4.881029 = sum of:
      4.881029 = weight(author_txt:hahn in 1310) [ClassicSimilarity], result of:
        4.881029 = score(doc=1310,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.12804675 = queryNorm
          4.8810296 = fieldWeight in 1310, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.625 = fieldNorm(doc=1310)
    
  2. Hahn, G.: ¬Die Entwicklung der Wirtschaftswissenschaften im Spiegel von Klassifikationssystemen : ein Beitrag zur Wissenschafts- und Klassifikationskunde der Nationalökonomie (1978) 4.88
    4.881029 = sum of:
      4.881029 = weight(author_txt:hahn in 1697) [ClassicSimilarity], result of:
        4.881029 = score(doc=1697,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.12804675 = queryNorm
          4.8810296 = fieldWeight in 1697, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.625 = fieldNorm(doc=1697)
    
  3. Hahn, G.: Sacherschließung durch Schlagwortkataloge : theoretische und praktische Fragen, dargestellt am Beispiel der Bibliotheken der Industrie- und Handelskammern (1983) 4.88
    4.881029 = sum of:
      4.881029 = weight(author_txt:hahn in 1698) [ClassicSimilarity], result of:
        4.881029 = score(doc=1698,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.12804675 = queryNorm
          4.8810296 = fieldWeight in 1698, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.625 = fieldNorm(doc=1698)
    
  4. Hahn, G.: ¬Die Bibliothek des Deutschen Bundestages : Informationsbasis für die parlamentarische Arbeit (1983) 4.88
    4.881029 = sum of:
      4.881029 = weight(author_txt:hahn in 1699) [ClassicSimilarity], result of:
        4.881029 = score(doc=1699,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.12804675 = queryNorm
          4.8810296 = fieldWeight in 1699, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.625 = fieldNorm(doc=1699)
    
  5. Hahn, G.: Information und Dokumentation in der Bibliothek des Deutschen Bundestages : ein Beispiel der Praxis für die Einheit bibliothekarischer und dokumentarischer Prinzipien (1978-79) 4.88
    4.881029 = sum of:
      4.881029 = weight(author_txt:hahn in 1700) [ClassicSimilarity], result of:
        4.881029 = score(doc=1700,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.12804675 = queryNorm
          4.8810296 = fieldWeight in 1700, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.809647 = idf(docFreq=48, maxDocs=44421)
            0.625 = fieldNorm(doc=1700)
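
The five author matches above all carry the same score because each tree multiplies the same factors. As a minimal sketch, assuming the standard ClassicSimilarity definitions (idf = 1 + ln(maxDocs / (docFreq + 1)), tf = sqrt(freq), queryNorm = 1 / sqrt(sum of squared query weights)), the arithmetic can be reproduced as follows. The function names are illustrative and are not part of Lucene; the input values are copied from the trees above.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as ClassicSimilarity defines it."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def single_term_score(freq: float, doc_freq: int, max_docs: int,
                      field_norm: float) -> float:
    """Score of a one-term, unboosted query against one document field."""
    idf = classic_idf(doc_freq, max_docs)
    # With a single unboosted term, the sum of squared weights is idf**2,
    # so queryNorm is 1/idf and queryWeight comes out as 1.0.
    query_norm = 1.0 / math.sqrt(idf * idf)
    query_weight = idf * query_norm
    # fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq).
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


# Values from the author_txt:hahn trees above.
score = single_term_score(freq=1.0, doc_freq=48, max_docs=44421,
                          field_norm=0.625)
print(f"idf={classic_idf(48, 44421):.6f} score={score:.6f}")
```

The result agrees with the 4.881029 reported above to within single-precision rounding. The listed queryWeight of 0.99999994 rather than 1.0 is consistent with the engine computing in 32-bit floats.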
    

Similar documents (content)

  1. Ahmed, M.; Mukhopadhyay, M.; Mukhopadhyay, P.: Automated knowledge organization : AI ML based subject indexing system for libraries (2023) 0.21
    0.21412523 = sum of:
      0.21412523 = product of:
        0.76473296 = sum of:
          0.025554016 = weight(abstract_txt:subject in 1979) [ClassicSimilarity], result of:
            0.025554016 = score(doc=1979,freq=2.0), product of:
              0.0739424 = queryWeight, product of:
                1.3214535 = boost
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.014311035 = queryNorm
              0.34559354 = fieldWeight in 1979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9099448 = idf(docFreq=2419, maxDocs=44421)
                0.0625 = fieldNorm(doc=1979)
          0.2547684 = weight(abstract_txt:annif in 1979) [ClassicSimilarity], result of:
            0.2547684 = score(doc=1979,freq=3.0), product of:
              0.2374893 = queryWeight, product of:
                1.674604 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.014311035 = queryNorm
              1.0727574 = fieldWeight in 1979, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0625 = fieldNorm(doc=1979)
          0.023684245 = weight(abstract_txt:data in 1979) [ClassicSimilarity], result of:
            0.023684245 = score(doc=1979,freq=2.0), product of:
              0.08046201 = queryWeight, product of:
                1.6882867 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.014311035 = queryNorm
              0.29435313 = fieldWeight in 1979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=1979)
          0.10660775 = weight(abstract_txt:lcsh in 1979) [ClassicSimilarity], result of:
            0.10660775 = score(doc=1979,freq=2.0), product of:
              0.19162254 = queryWeight, product of:
                2.1272984 = boost
                6.294296 = idf(docFreq=222, maxDocs=44421)
                0.014311035 = queryNorm
              0.5563424 = fieldWeight in 1979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.294296 = idf(docFreq=222, maxDocs=44421)
                0.0625 = fieldNorm(doc=1979)
          0.07755423 = weight(abstract_txt:linked in 1979) [ClassicSimilarity], result of:
            0.07755423 = score(doc=1979,freq=1.0), product of:
              0.22354494 = queryWeight, product of:
                2.8140588 = boost
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.014311035 = queryNorm
              0.34692904 = fieldWeight in 1979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.0625 = fieldNorm(doc=1979)
          0.1263163 = weight(abstract_txt:semi in 1979) [ClassicSimilarity], result of:
            0.1263163 = score(doc=1979,freq=1.0), product of:
              0.30945733 = queryWeight, product of:
                3.3109386 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.014311035 = queryNorm
              0.40818647 = fieldWeight in 1979, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0625 = fieldNorm(doc=1979)
          0.15024798 = weight(abstract_txt:automated in 1979) [ClassicSimilarity], result of:
            0.15024798 = score(doc=1979,freq=2.0), product of:
              0.3034845 = queryWeight, product of:
                3.7860675 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.014311035 = queryNorm
              0.49507627 = fieldWeight in 1979, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.0625 = fieldNorm(doc=1979)
        0.28 = coord(7/25)
    
  2. Samples, J.; Bigelow, I.: MARC to BIBFRAME : converting the PCC to Linked Data (2020) 0.18
    0.17892863 = sum of:
      0.17892863 = product of:
        0.8946431 = sum of:
          0.070598796 = weight(abstract_txt:share in 1120) [ClassicSimilarity], result of:
            0.070598796 = score(doc=1120,freq=2.0), product of:
              0.0881823 = queryWeight, product of:
                1.0204244 = boost
                6.038507 = idf(docFreq=287, maxDocs=44421)
                0.014311035 = queryNorm
              0.8006005 = fieldWeight in 1120, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.038507 = idf(docFreq=287, maxDocs=44421)
                0.09375 = fieldNorm(doc=1120)
          0.03489423 = weight(abstract_txt:work in 1120) [ClassicSimilarity], result of:
            0.03489423 = score(doc=1120,freq=2.0), product of:
              0.069453746 = queryWeight, product of:
                1.2807163 = boost
                3.7894108 = idf(docFreq=2729, maxDocs=44421)
                0.014311035 = queryNorm
              0.50240964 = fieldWeight in 1120, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7894108 = idf(docFreq=2729, maxDocs=44421)
                0.09375 = fieldNorm(doc=1120)
          0.035526365 = weight(abstract_txt:data in 1120) [ClassicSimilarity], result of:
            0.035526365 = score(doc=1120,freq=2.0), product of:
              0.08046201 = queryWeight, product of:
                1.6882867 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.014311035 = queryNorm
              0.4415297 = fieldWeight in 1120, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=1120)
          0.11633135 = weight(abstract_txt:linked in 1120) [ClassicSimilarity], result of:
            0.11633135 = score(doc=1120,freq=1.0), product of:
              0.22354494 = queryWeight, product of:
                2.8140588 = boost
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.014311035 = queryNorm
              0.52039355 = fieldWeight in 1120, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.09375 = fieldNorm(doc=1120)
          0.6372924 = weight(abstract_txt:bibframe in 1120) [ClassicSimilarity], result of:
            0.6372924 = score(doc=1120,freq=5.0), product of:
              0.3548998 = queryWeight, product of:
                2.8950627 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.014311035 = queryNorm
              1.7956967 = fieldWeight in 1120, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.09375 = fieldNorm(doc=1120)
        0.2 = coord(5/25)
    
  3. Zhu, L.; Xu, A.; Deng, S.; Heng, G.; Li, X.: Entity management using Wikidata for cultural heritage information (2024) 0.17
    0.17202187 = sum of:
      0.17202187 = product of:
        0.7167578 = sum of:
          0.014290856 = weight(abstract_txt:from in 1977) [ClassicSimilarity], result of:
            0.014290856 = score(doc=1977,freq=1.0), product of:
              0.055242397 = queryWeight, product of:
                1.398901 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.014311035 = queryNorm
              0.25869364 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.09375 = fieldNorm(doc=1977)
          0.043510735 = weight(abstract_txt:data in 1977) [ClassicSimilarity], result of:
            0.043510735 = score(doc=1977,freq=3.0), product of:
              0.08046201 = queryWeight, product of:
                1.6882867 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.014311035 = queryNorm
              0.54076123 = fieldWeight in 1977, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=1977)
          0.19378833 = weight(abstract_txt:entity in 1977) [ClassicSimilarity], result of:
            0.19378833 = score(doc=1977,freq=3.0), product of:
              0.19027479 = queryWeight, product of:
                2.1198041 = boost
                6.272122 = idf(docFreq=227, maxDocs=44421)
                0.014311035 = queryNorm
              1.0184656 = fieldWeight in 1977, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.272122 = idf(docFreq=227, maxDocs=44421)
                0.09375 = fieldNorm(doc=1977)
          0.11633135 = weight(abstract_txt:linked in 1977) [ClassicSimilarity], result of:
            0.11633135 = score(doc=1977,freq=1.0), product of:
              0.22354494 = queryWeight, product of:
                2.8140588 = boost
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.014311035 = queryNorm
              0.52039355 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.09375 = fieldNorm(doc=1977)
          0.18947445 = weight(abstract_txt:semi in 1977) [ClassicSimilarity], result of:
            0.18947445 = score(doc=1977,freq=1.0), product of:
              0.30945733 = queryWeight, product of:
                3.3109386 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.014311035 = queryNorm
              0.6122797 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.09375 = fieldNorm(doc=1977)
          0.15936205 = weight(abstract_txt:automated in 1977) [ClassicSimilarity], result of:
            0.15936205 = score(doc=1977,freq=1.0), product of:
              0.3034845 = queryWeight, product of:
                3.7860675 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.014311035 = queryNorm
              0.5251077 = fieldWeight in 1977, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.09375 = fieldNorm(doc=1977)
        0.24 = coord(6/25)
    
  4. Heng, G.; Cole, T.W.; Tian, T.(C.); Han, M.-J.: Rethinking authority reconciliation process (2022) 0.13
    0.12970327 = sum of:
      0.12970327 = product of:
        0.6485163 = sum of:
          0.025120934 = weight(abstract_txt:data in 1728) [ClassicSimilarity], result of:
            0.025120934 = score(doc=1728,freq=1.0), product of:
              0.08046201 = queryWeight, product of:
                1.6882867 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.014311035 = queryNorm
              0.31220865 = fieldWeight in 1728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=1728)
          0.15822752 = weight(abstract_txt:entity in 1728) [ClassicSimilarity], result of:
            0.15822752 = score(doc=1728,freq=2.0), product of:
              0.19027479 = queryWeight, product of:
                2.1198041 = boost
                6.272122 = idf(docFreq=227, maxDocs=44421)
                0.014311035 = queryNorm
              0.8315737 = fieldWeight in 1728, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.272122 = idf(docFreq=227, maxDocs=44421)
                0.09375 = fieldNorm(doc=1728)
          0.11633135 = weight(abstract_txt:linked in 1728) [ClassicSimilarity], result of:
            0.11633135 = score(doc=1728,freq=1.0), product of:
              0.22354494 = queryWeight, product of:
                2.8140588 = boost
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.014311035 = queryNorm
              0.52039355 = fieldWeight in 1728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.09375 = fieldNorm(doc=1728)
          0.18947445 = weight(abstract_txt:semi in 1728) [ClassicSimilarity], result of:
            0.18947445 = score(doc=1728,freq=1.0), product of:
              0.30945733 = queryWeight, product of:
                3.3109386 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.014311035 = queryNorm
              0.6122797 = fieldWeight in 1728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.09375 = fieldNorm(doc=1728)
          0.15936205 = weight(abstract_txt:automated in 1728) [ClassicSimilarity], result of:
            0.15936205 = score(doc=1728,freq=1.0), product of:
              0.3034845 = queryWeight, product of:
                3.7860675 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.014311035 = queryNorm
              0.5251077 = fieldWeight in 1728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.09375 = fieldNorm(doc=1728)
        0.2 = coord(5/25)
    
  5. Willer, M.; Dunsire, G.: ISBD, the UNIMARC bibliographic format, and RDA : interoperability issues in namespaces and the linked data environment (2014) 0.12
    0.118383855 = sum of:
      0.118383855 = product of:
        0.49326608 = sum of:
          0.024673948 = weight(abstract_txt:work in 2999) [ClassicSimilarity], result of:
            0.024673948 = score(doc=2999,freq=1.0), product of:
              0.069453746 = queryWeight, product of:
                1.2807163 = boost
                3.7894108 = idf(docFreq=2729, maxDocs=44421)
                0.014311035 = queryNorm
              0.35525727 = fieldWeight in 2999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7894108 = idf(docFreq=2729, maxDocs=44421)
                0.09375 = fieldNorm(doc=2999)
          0.020210324 = weight(abstract_txt:from in 2999) [ClassicSimilarity], result of:
            0.020210324 = score(doc=2999,freq=2.0), product of:
              0.055242397 = queryWeight, product of:
                1.398901 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.014311035 = queryNorm
              0.36584806 = fieldWeight in 2999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.09375 = fieldNorm(doc=2999)
          0.08897602 = weight(abstract_txt:description in 2999) [ClassicSimilarity], result of:
            0.08897602 = score(doc=2999,freq=3.0), product of:
              0.11324303 = queryWeight, product of:
                1.6353505 = boost
                4.83871 = idf(docFreq=955, maxDocs=44421)
                0.014311035 = queryNorm
              0.78570855 = fieldWeight in 2999, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.83871 = idf(docFreq=955, maxDocs=44421)
                0.09375 = fieldNorm(doc=2999)
          0.035526365 = weight(abstract_txt:data in 2999) [ClassicSimilarity], result of:
            0.035526365 = score(doc=2999,freq=2.0), product of:
              0.08046201 = queryWeight, product of:
                1.6882867 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.014311035 = queryNorm
              0.4415297 = fieldWeight in 2999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=2999)
          0.16451736 = weight(abstract_txt:linked in 2999) [ClassicSimilarity], result of:
            0.16451736 = score(doc=2999,freq=2.0), product of:
              0.22354494 = queryWeight, product of:
                2.8140588 = boost
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.014311035 = queryNorm
              0.7359476 = fieldWeight in 2999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.09375 = fieldNorm(doc=2999)
          0.15936205 = weight(abstract_txt:automated in 2999) [ClassicSimilarity], result of:
            0.15936205 = score(doc=2999,freq=1.0), product of:
              0.3034845 = queryWeight, product of:
                3.7860675 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.014311035 = queryNorm
              0.5251077 = fieldWeight in 2999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.09375 = fieldNorm(doc=2999)
        0.24 = coord(6/25)
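
The content-based trees above add a per-term boost and a coordination factor (matched query terms divided by total query terms). The sketch below recomputes the final score of the second content match (doc 1120) from the factors printed in its tree. It assumes the same ClassicSimilarity formulas; the `TermMatch` type and function names are hypothetical helpers introduced only for this illustration.

```python
import math
from typing import NamedTuple


class TermMatch(NamedTuple):
    """Factors printed for one matching term in an explain tree."""
    boost: float
    idf: float
    freq: float
    field_norm: float


def term_score(t: TermMatch, query_norm: float) -> float:
    # queryWeight = boost * idf * queryNorm
    query_weight = t.boost * t.idf * query_norm
    # fieldWeight = sqrt(freq) * idf * fieldNorm
    field_weight = math.sqrt(t.freq) * t.idf * t.field_norm
    return query_weight * field_weight


def document_score(terms: list[TermMatch], query_norm: float,
                   total_query_terms: int) -> float:
    """Sum of term scores scaled by coord = matched / total query terms."""
    coord = len(terms) / total_query_terms
    return coord * sum(term_score(t, query_norm) for t in terms)


QUERY_NORM = 0.014311035  # queryNorm shared by all content trees above

# Matching terms for doc 1120, in the order listed: share, work, data,
# linked, bibframe.
doc_1120 = [
    TermMatch(1.0204244, 6.038507, 2.0, 0.09375),
    TermMatch(1.2807163, 3.7894108, 2.0, 0.09375),
    TermMatch(1.6882867, 3.3302255, 2.0, 0.09375),
    TermMatch(2.8140588, 5.5508647, 1.0, 0.09375),
    TermMatch(2.8950627, 8.565973, 5.0, 0.09375),
]

total = document_score(doc_1120, QUERY_NORM, total_query_terms=25)
print(f"{total:.6f}")
```

With 5 of 25 query terms matched, coord is 0.2, and the result agrees with the listed 0.17892863 to within rounding. The same function applies to the other four content matches once their term factors are substituted.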