Document (#34877)

Author
Strobel, S.
Title
Englischsprachige Erweiterung des TIB / AV-Portals : Ein GND/DBpedia-Mapping zur Gewinnung eines englischen Begriffssystems
Source
o-bib: Das offene Bibliotheksjournal. 1(2014) Nr.1, S.197-204
Year
2014
Abstract
Die Videos des TIB / AV-Portals werden mit insgesamt 63.356 GND-Sachbegriffen aus Naturwissenschaft und Technik automatisch verschlagwortet. Neben den deutschsprachigen Videos verfügt das TIB / AV-Portal auch über zahlreiche englischsprachige Videos. Die GND enthält zu den in der TIB / AV-Portal-Wissensbasis verwendeten Sachbegriffen nur sehr wenige englische Bezeichner. Es fehlt demnach ein englisches Indexierungsvokabular, mit dem die englischsprachigen Videos automatisch verschlagwortet werden können. Die Lösung dieses Problems sieht wie folgt aus: Die englischen Bezeichner sollen über ein Mapping der GND-Sachbegriffe auf andere Datensätze gewonnen werden, die eine englische Übersetzung der Begriffe enthalten. Die verwendeten Mappingstrategien nutzen die DBpedia, LCSH, MACS-Ergebnisse sowie den WTI-Thesaurus. Am Ende haben 35.025 GND-Sachbegriffe (mindestens) einen englischen Bezeichner ermittelt bekommen. Diese englischen Bezeichner können für die automatische Verschlagwortung der englischsprachigen Videos unmittelbar herangezogen werden. 11.694 GND-Sachbegriffe konnten zwar nicht ins Englische "übersetzt", aber immerhin mit einem Oberbegriff assoziiert werden, der eine englische Übersetzung hat. Diese Assoziation dient der Erweiterung der Suchergebnisse.
Content
Beitrag als ausgearbeitete Form eines Vortrages während des 103. Deutschen Bibliothekartages in Bremen. Vgl.: https://www.o-bib.de/article/view/2014H1S197-204.
Theme
Metadaten
Automatisches Indexieren
Multilinguale Probleme
Form
AV-Materialien
Object
GND
DBpedia
Location
D
Hannover

Similar documents (author)

  1. Strobel, S.: ¬The complete Linux kit : fully configured LINUX system kernel (1997) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:strobel in 573) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 573, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=573)
    
  2. Strobel, G.: Konzeption und Realisierung eines WWW-Servers für den Studiengang Dokumentation der HBI Stuttgart (1995) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:strobel in 6032) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 6032, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=6032)
    
  3. Strobel, S.: Firewalls : Einführung - Praxis - Produkte (1999) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:strobel in 2531) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 2531, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=2531)
    
  4. Strobel, S.; Uhl, T.: LINUX - vom PC zur Workstation : Grundlagen, Installation und praktischer Einsatz (1994) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:strobel in 2561) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 2561, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=2561)
    
  5. Strobel, S.; Marín-Arraiza, P.: Metadata for scientific audiovisual media : current practices and perspectives of the TIB / AV-portal (2015) 4.27
    4.2680564 = sum of:
      4.2680564 = weight(author_txt:strobel in 4667) [ClassicSimilarity], result of:
        4.2680564 = fieldWeight in 4667, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.4375 = fieldNorm(doc=4667)
    

Similar documents (content)

  1. Carevic, Z.: Semi-automatische Verschlagwortung zur Integration externer semantischer Inhalte innerhalb einer medizinischen Kooperationsplattform (2012) 0.13
    0.1304429 = sum of:
      0.1304429 = product of:
        0.46586752 = sum of:
          0.064295724 = weight(abstract_txt:wissensbasis in 1897) [ClassicSimilarity], result of:
            0.064295724 = score(doc=1897,freq=2.0), product of:
              0.11433976 = queryWeight, product of:
                1.0219657 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.013189624 = queryNorm
              0.56232166 = fieldWeight in 1897, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.046875 = fieldNorm(doc=1897)
          0.12203128 = weight(abstract_txt:verschlagwortung in 1897) [ClassicSimilarity], result of:
            0.12203128 = score(doc=1897,freq=7.0), product of:
              0.115442924 = queryWeight, product of:
                1.0268838 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.013189624 = queryNorm
              1.0570703 = fieldWeight in 1897, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.046875 = fieldNorm(doc=1897)
          0.012130973 = weight(abstract_txt:diese in 1897) [ClassicSimilarity], result of:
            0.012130973 = score(doc=1897,freq=1.0), product of:
              0.059707146 = queryWeight, product of:
                1.0443975 = boost
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.013189624 = queryNorm
              0.20317456 = fieldWeight in 1897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.046875 = fieldNorm(doc=1897)
          0.018549291 = weight(abstract_txt:können in 1897) [ClassicSimilarity], result of:
            0.018549291 = score(doc=1897,freq=2.0), product of:
              0.06289809 = queryWeight, product of:
                1.0719423 = boost
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.013189624 = queryNorm
              0.29491025 = fieldWeight in 1897, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.046875 = fieldNorm(doc=1897)
          0.14497532 = weight(abstract_txt:begriffssystems in 1897) [ClassicSimilarity], result of:
            0.14497532 = score(doc=1897,freq=4.0), product of:
              0.15604934 = queryWeight, product of:
                1.1939019 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.013189624 = queryNorm
              0.9290351 = fieldWeight in 1897, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.046875 = fieldNorm(doc=1897)
          0.058417797 = weight(abstract_txt:verwendeten in 1897) [ClassicSimilarity], result of:
            0.058417797 = score(doc=1897,freq=1.0), product of:
              0.17026524 = queryWeight, product of:
                1.7636633 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.013189624 = queryNorm
              0.3430988 = fieldWeight in 1897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.046875 = fieldNorm(doc=1897)
          0.04546713 = weight(abstract_txt:werden in 1897) [ClassicSimilarity], result of:
            0.04546713 = score(doc=1897,freq=8.0), product of:
              0.097763695 = queryWeight, product of:
                2.1130583 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.013189624 = queryNorm
              0.46507174 = fieldWeight in 1897, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.046875 = fieldNorm(doc=1897)
        0.28 = coord(7/25)
    
  2. Beall, J.: Approaches to expansions : case studies from the German and Vietnamese translations (2003) 0.12
    0.11713906 = sum of:
      0.11713906 = product of:
        0.58569527 = sum of:
          0.017488442 = weight(abstract_txt:können in 2748) [ClassicSimilarity], result of:
            0.017488442 = score(doc=2748,freq=1.0), product of:
              0.06289809 = queryWeight, product of:
                1.0719423 = boost
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.013189624 = queryNorm
              0.27804407 = fieldWeight in 2748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.0625 = fieldNorm(doc=2748)
          0.10904258 = weight(abstract_txt:übersetzung in 2748) [ClassicSimilarity], result of:
            0.10904258 = score(doc=2748,freq=2.0), product of:
              0.16911837 = queryWeight, product of:
                1.7577134 = boost
                7.2947483 = idf(docFreq=81, maxDocs=44421)
                0.013189624 = queryNorm
              0.64477074 = fieldWeight in 2748, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2947483 = idf(docFreq=81, maxDocs=44421)
                0.0625 = fieldNorm(doc=2748)
          0.030311422 = weight(abstract_txt:werden in 2748) [ClassicSimilarity], result of:
            0.030311422 = score(doc=2748,freq=2.0), product of:
              0.097763695 = queryWeight, product of:
                2.1130583 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.013189624 = queryNorm
              0.31004784 = fieldWeight in 2748, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.0625 = fieldNorm(doc=2748)
          0.20436133 = weight(abstract_txt:englischen in 2748) [ClassicSimilarity], result of:
            0.20436133 = score(doc=2748,freq=1.0), product of:
              0.40808052 = queryWeight, product of:
                3.8613627 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.013189624 = queryNorm
              0.5007868 = fieldWeight in 2748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0625 = fieldNorm(doc=2748)
          0.22449145 = weight(abstract_txt:englische in 2748) [ClassicSimilarity], result of:
            0.22449145 = score(doc=2748,freq=1.0), product of:
              0.4344568 = queryWeight, product of:
                3.9841986 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.013189624 = queryNorm
              0.51671755 = fieldWeight in 2748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0625 = fieldNorm(doc=2748)
        0.2 = coord(5/25)
    
  3. Online-Enzyklopädie Wikipedia (2003) 0.10
    0.10470726 = sum of:
      0.10470726 = product of:
        0.43628028 = sum of:
          0.017509552 = weight(abstract_txt:diese in 2410) [ClassicSimilarity], result of:
            0.017509552 = score(doc=2410,freq=3.0), product of:
              0.059707146 = queryWeight, product of:
                1.0443975 = boost
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.013189624 = queryNorm
              0.2932572 = fieldWeight in 2410, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2410)
          0.015457745 = weight(abstract_txt:können in 2410) [ClassicSimilarity], result of:
            0.015457745 = score(doc=2410,freq=2.0), product of:
              0.06289809 = queryWeight, product of:
                1.0719423 = boost
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.013189624 = queryNorm
              0.24575856 = fieldWeight in 2410, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2410)
          0.112077646 = weight(abstract_txt:englischsprachigen in 2410) [ClassicSimilarity], result of:
            0.112077646 = score(doc=2410,freq=2.0), product of:
              0.23562393 = queryWeight, product of:
                2.0747337 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.013189624 = queryNorm
              0.47566327 = fieldWeight in 2410, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2410)
          0.02320235 = weight(abstract_txt:werden in 2410) [ClassicSimilarity], result of:
            0.02320235 = score(doc=2410,freq=3.0), product of:
              0.097763695 = queryWeight, product of:
                2.1130583 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.013189624 = queryNorm
              0.23733094 = fieldWeight in 2410, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2410)
          0.12772582 = weight(abstract_txt:englischen in 2410) [ClassicSimilarity], result of:
            0.12772582 = score(doc=2410,freq=1.0), product of:
              0.40808052 = queryWeight, product of:
                3.8613627 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.013189624 = queryNorm
              0.31299174 = fieldWeight in 2410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2410)
          0.14030716 = weight(abstract_txt:englische in 2410) [ClassicSimilarity], result of:
            0.14030716 = score(doc=2410,freq=1.0), product of:
              0.4344568 = queryWeight, product of:
                3.9841986 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.013189624 = queryNorm
              0.32294846 = fieldWeight in 2410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2410)
        0.24 = coord(6/25)
    
  4. Weisweiler, H.: Zusätzliche verbale Sacherschließung in englischer Sprache : Zeitschrifteninhaltsdienst Theologie (2001) 0.10
    0.09697031 = sum of:
      0.09697031 = product of:
        0.48485157 = sum of:
          0.0141528025 = weight(abstract_txt:diese in 6957) [ClassicSimilarity], result of:
            0.0141528025 = score(doc=6957,freq=1.0), product of:
              0.059707146 = queryWeight, product of:
                1.0443975 = boost
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.013189624 = queryNorm
              0.23703699 = fieldWeight in 6957, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.0546875 = fieldNorm(doc=6957)
          0.110951215 = weight(abstract_txt:englischsprachigen in 6957) [ClassicSimilarity], result of:
            0.110951215 = score(doc=6957,freq=1.0), product of:
              0.23562393 = queryWeight, product of:
                2.0747337 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.013189624 = queryNorm
              0.47088262 = fieldWeight in 6957, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.0546875 = fieldNorm(doc=6957)
          0.16217713 = weight(abstract_txt:englischsprachige in 6957) [ClassicSimilarity], result of:
            0.16217713 = score(doc=6957,freq=2.0), product of:
              0.2408691 = queryWeight, product of:
                2.0976992 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.013189624 = queryNorm
              0.67329985 = fieldWeight in 6957, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.0546875 = fieldNorm(doc=6957)
          0.018754236 = weight(abstract_txt:werden in 6957) [ClassicSimilarity], result of:
            0.018754236 = score(doc=6957,freq=1.0), product of:
              0.097763695 = queryWeight, product of:
                2.1130583 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.013189624 = queryNorm
              0.19183232 = fieldWeight in 6957, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.0546875 = fieldNorm(doc=6957)
          0.17881617 = weight(abstract_txt:englischen in 6957) [ClassicSimilarity], result of:
            0.17881617 = score(doc=6957,freq=1.0), product of:
              0.40808052 = queryWeight, product of:
                3.8613627 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.013189624 = queryNorm
              0.43818843 = fieldWeight in 6957, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0546875 = fieldNorm(doc=6957)
        0.2 = coord(5/25)
    
  5. Anglo-Amerikanische Katalogisierungsregeln : Deutsche Übersetzung der Anglo-American Cataloguing Rules, Second edition, 1998 Revision, einschließlich der Änderungen und Ergänzungen bis März 2001 (2002) 0.09
    0.09469013 = sum of:
      0.09469013 = product of:
        0.39454222 = sum of:
          0.042594954 = weight(abstract_txt:übersetzt in 601) [ClassicSimilarity], result of:
            0.042594954 = score(doc=601,freq=1.0), product of:
              0.109477445 = queryWeight, product of:
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.013189624 = queryNorm
              0.38907516 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.046875 = fieldNorm(doc=601)
          0.012130973 = weight(abstract_txt:diese in 601) [ClassicSimilarity], result of:
            0.012130973 = score(doc=601,freq=1.0), product of:
              0.059707146 = queryWeight, product of:
                1.0443975 = boost
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.013189624 = queryNorm
              0.20317456 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.046875 = fieldNorm(doc=601)
          0.16356386 = weight(abstract_txt:übersetzung in 601) [ClassicSimilarity], result of:
            0.16356386 = score(doc=601,freq=8.0), product of:
              0.16911837 = queryWeight, product of:
                1.7577134 = boost
                7.2947483 = idf(docFreq=81, maxDocs=44421)
                0.013189624 = queryNorm
              0.9671561 = fieldWeight in 601, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.2947483 = idf(docFreq=81, maxDocs=44421)
                0.046875 = fieldNorm(doc=601)
          0.058417797 = weight(abstract_txt:verwendeten in 601) [ClassicSimilarity], result of:
            0.058417797 = score(doc=601,freq=1.0), product of:
              0.17026524 = queryWeight, product of:
                1.7636633 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.013189624 = queryNorm
              0.3430988 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.046875 = fieldNorm(doc=601)
          0.09510104 = weight(abstract_txt:englischsprachigen in 601) [ClassicSimilarity], result of:
            0.09510104 = score(doc=601,freq=1.0), product of:
              0.23562393 = queryWeight, product of:
                2.0747337 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.013189624 = queryNorm
              0.4036137 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.046875 = fieldNorm(doc=601)
          0.022733565 = weight(abstract_txt:werden in 601) [ClassicSimilarity], result of:
            0.022733565 = score(doc=601,freq=2.0), product of:
              0.097763695 = queryWeight, product of:
                2.1130583 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.013189624 = queryNorm
              0.23253587 = fieldWeight in 601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.046875 = fieldNorm(doc=601)
        0.24 = coord(6/25)