Document (#42629)

Author
Busch, D.
Title
Domänenspezifische hybride automatische Indexierung von bibliographischen Metadaten
Source
B.I.T.online. 22(2019) H.6, S.465-469
Year
2019
Abstract
Im Fraunhofer-Informationszentrum Raum und Bau (IRB) wird Fachliteratur im Bereich Planen und Bauen bibliographisch erschlossen. Die daraus resultierenden Dokumente (Metadaten-Einträge) werden u.a. bei der Produktion der bibliographischen Datenbanken des IRB verwendet. In Abb. 1 ist ein Dokument dargestellt, das einen Zeitschriftenartikel beschreibt. Die Dokumente werden mit Deskriptoren von einer Nomenklatur (Schlagwortliste IRB) indexiert. Ein Deskriptor ist "eine Benennung., die für sich allein verwendbar, eindeutig zur Inhaltskennzeichnung geeignet und im betreffenden Dokumentationssystem zugelassen ist". Momentan wird die Indexierung intellektuell von menschlichen Experten durchgeführt. Die intellektuelle Indexierung ist zeitaufwendig und teuer. Eine Lösung des Problems besteht in der automatischen Indexierung, bei der die Zuordnung von Deskriptoren durch ein Computerprogramm erfolgt. Solche Computerprogramme werden im Folgenden auch als Klassifikatoren bezeichnet. In diesem Beitrag geht es um ein System zur automatischen Indexierung von deutschsprachigen Dokumenten im Bereich Bauwesen mit Deskriptoren aus der Schlagwortliste IRB.
Content
Vgl.: https://www.b-i-t-online.de/heft/2019-06-index.php.
Theme
Automatisches Indexieren
Object
IRB
Location
D

Similar documents (author)

  1. Busch, R.: Neue Wege der Buchaufstellung in den USA (1956) 5.54
    5.5426593 = sum of:
      5.5426593 = weight(author_txt:busch in 556) [ClassicSimilarity], result of:
        5.5426593 = fieldWeight in 556, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.868255 = idf(docFreq=16, maxDocs=44421)
          0.625 = fieldNorm(doc=556)
    
  2. Busch, J.: Bibliographie zum Bibliotheks- und Büchereiwesen : aus dem Nachlaß bearbeitet von U. von Dietze (1966) 5.54
    5.5426593 = sum of:
      5.5426593 = weight(author_txt:busch in 1461) [ClassicSimilarity], result of:
        5.5426593 = fieldWeight in 1461, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.868255 = idf(docFreq=16, maxDocs=44421)
          0.625 = fieldNorm(doc=1461)
    
  3. Busch, C.: Bitte ein Bit? : Zur (Be-) Deutung der Informationstheorie (1992) 5.54
    5.5426593 = sum of:
      5.5426593 = weight(author_txt:busch in 2443) [ClassicSimilarity], result of:
        5.5426593 = fieldWeight in 2443, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.868255 = idf(docFreq=16, maxDocs=44421)
          0.625 = fieldNorm(doc=2443)
    
  4. Busch, J.: ¬A method for evaluating the multiple relations between subject descriptors : related terms in the Thesaurus for Engineering and Scientific Terms, a pilot study (1978) 5.54
    5.5426593 = sum of:
      5.5426593 = weight(author_txt:busch in 2947) [ClassicSimilarity], result of:
        5.5426593 = fieldWeight in 2947, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.868255 = idf(docFreq=16, maxDocs=44421)
          0.625 = fieldNorm(doc=2947)
    
  5. Busch, J.A.: Thinking ambiguously : organizing source materials for historical research (1994) 5.54
    5.5426593 = sum of:
      5.5426593 = weight(author_txt:busch in 3046) [ClassicSimilarity], result of:
        5.5426593 = fieldWeight in 3046, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.868255 = idf(docFreq=16, maxDocs=44421)
          0.625 = fieldNorm(doc=3046)
    

Similar documents (content)

  1. Lepsky, K.: Automatisches Indexieren (2023) 0.25
    0.25053364 = sum of:
      0.25053364 = product of:
        1.2526681 = sum of:
          0.043104634 = weight(abstract_txt:werden in 1782) [ClassicSimilarity], result of:
            0.043104634 = score(doc=1782,freq=3.0), product of:
              0.07567603 = queryWeight, product of:
                1.2626776 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.017085675 = queryNorm
              0.56959426 = fieldWeight in 1782, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
          0.09405821 = weight(abstract_txt:dokumente in 1782) [ClassicSimilarity], result of:
            0.09405821 = score(doc=1782,freq=1.0), product of:
              0.16040461 = queryWeight, product of:
                1.5009842 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.017085675 = queryNorm
              0.58638096 = fieldWeight in 1782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
          0.17712335 = weight(abstract_txt:automatischen in 1782) [ClassicSimilarity], result of:
            0.17712335 = score(doc=1782,freq=2.0), product of:
              0.19414502 = queryWeight, product of:
                1.6513184 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.017085675 = queryNorm
              0.91232497 = fieldWeight in 1782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
          0.27463534 = weight(abstract_txt:deskriptoren in 1782) [ClassicSimilarity], result of:
            0.27463534 = score(doc=1782,freq=1.0), product of:
              0.37510577 = queryWeight, product of:
                2.81119 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.017085675 = queryNorm
              0.7321544 = fieldWeight in 1782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
          0.66374665 = weight(abstract_txt:indexierung in 1782) [ClassicSimilarity], result of:
            0.66374665 = score(doc=1782,freq=5.0), product of:
              0.4683932 = queryWeight, product of:
                4.0554867 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.017085675 = queryNorm
              1.4170715 = fieldWeight in 1782, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
        0.2 = coord(5/25)
    
  2. Kempf, A.O.: Automatische Indexierung in der sozialwissenschaftlichen Fachinformation : eine Evaluationsstudie zur maschinellen Erschließung für die Datenbank SOLIS (2012) 0.21
    0.21348467 = sum of:
      0.21348467 = product of:
        0.8895195 = sum of:
          0.029328987 = weight(abstract_txt:werden in 1903) [ClassicSimilarity], result of:
            0.029328987 = score(doc=1903,freq=2.0), product of:
              0.07567603 = queryWeight, product of:
                1.2626776 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.017085675 = queryNorm
              0.3875598 = fieldWeight in 1903, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.078125 = fieldNorm(doc=1903)
          0.084095106 = weight(abstract_txt:metadaten in 1903) [ClassicSimilarity], result of:
            0.084095106 = score(doc=1903,freq=1.0), product of:
              0.16810746 = queryWeight, product of:
                1.5366013 = boost
                6.40315 = idf(docFreq=199, maxDocs=44421)
                0.017085675 = queryNorm
              0.5002461 = fieldWeight in 1903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.40315 = idf(docFreq=199, maxDocs=44421)
                0.078125 = fieldNorm(doc=1903)
          0.18077578 = weight(abstract_txt:automatischen in 1903) [ClassicSimilarity], result of:
            0.18077578 = score(doc=1903,freq=3.0), product of:
              0.19414502 = queryWeight, product of:
                1.6513184 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.017085675 = queryNorm
              0.9311378 = fieldWeight in 1903, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.078125 = fieldNorm(doc=1903)
          0.11909308 = weight(abstract_txt:bibliographischen in 1903) [ClassicSimilarity], result of:
            0.11909308 = score(doc=1903,freq=1.0), product of:
              0.21199757 = queryWeight, product of:
                1.7255722 = boost
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.017085675 = queryNorm
              0.56176627 = fieldWeight in 1903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.078125 = fieldNorm(doc=1903)
          0.22886279 = weight(abstract_txt:deskriptoren in 1903) [ClassicSimilarity], result of:
            0.22886279 = score(doc=1903,freq=1.0), product of:
              0.37510577 = queryWeight, product of:
                2.81119 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.017085675 = queryNorm
              0.6101287 = fieldWeight in 1903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.078125 = fieldNorm(doc=1903)
          0.24736376 = weight(abstract_txt:indexierung in 1903) [ClassicSimilarity], result of:
            0.24736376 = score(doc=1903,freq=1.0), product of:
              0.4683932 = queryWeight, product of:
                4.0554867 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.017085675 = queryNorm
              0.52811134 = fieldWeight in 1903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.078125 = fieldNorm(doc=1903)
        0.24 = coord(6/25)
    
  3. Bunk, T.: Deskriptoren Stoppwortlisten und kryptische Zeichen (2008) 0.16
    0.16393414 = sum of:
      0.16393414 = product of:
        1.3661178 = sum of:
          0.20874187 = weight(abstract_txt:automatischen in 3471) [ClassicSimilarity], result of:
            0.20874187 = score(doc=3471,freq=1.0), product of:
              0.19414502 = queryWeight, product of:
                1.6513184 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.017085675 = queryNorm
              1.0751853 = fieldWeight in 3471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.15625 = fieldNorm(doc=3471)
          0.45772558 = weight(abstract_txt:deskriptoren in 3471) [ClassicSimilarity], result of:
            0.45772558 = score(doc=3471,freq=1.0), product of:
              0.37510577 = queryWeight, product of:
                2.81119 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.017085675 = queryNorm
              1.2202574 = fieldWeight in 3471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.15625 = fieldNorm(doc=3471)
          0.6996504 = weight(abstract_txt:indexierung in 3471) [ClassicSimilarity], result of:
            0.6996504 = score(doc=3471,freq=2.0), product of:
              0.4683932 = queryWeight, product of:
                4.0554867 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.017085675 = queryNorm
              1.4937245 = fieldWeight in 3471, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.15625 = fieldNorm(doc=3471)
        0.12 = coord(3/25)
    
  4. Schirmer, K.; Haller, J.: Zugang zu mehrsprachigen Nachrichten im Internet (2000) 0.13
    0.12697963 = sum of:
      0.12697963 = product of:
        0.6348982 = sum of:
          0.092714384 = weight(abstract_txt:indexiert in 6562) [ClassicSimilarity], result of:
            0.092714384 = score(doc=6562,freq=1.0), product of:
              0.14239496 = queryWeight, product of:
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.017085675 = queryNorm
              0.6511072 = fieldWeight in 6562, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.078125 = fieldNorm(doc=6562)
          0.054869514 = weight(abstract_txt:werden in 6562) [ClassicSimilarity], result of:
            0.054869514 = score(doc=6562,freq=7.0), product of:
              0.07567603 = queryWeight, product of:
                1.2626776 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.017085675 = queryNorm
              0.725058 = fieldWeight in 6562, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.078125 = fieldNorm(doc=6562)
          0.11084866 = weight(abstract_txt:dokumente in 6562) [ClassicSimilarity], result of:
            0.11084866 = score(doc=6562,freq=2.0), product of:
              0.16040461 = queryWeight, product of:
                1.5009842 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.017085675 = queryNorm
              0.69105655 = fieldWeight in 6562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=6562)
          0.1476028 = weight(abstract_txt:automatischen in 6562) [ClassicSimilarity], result of:
            0.1476028 = score(doc=6562,freq=2.0), product of:
              0.19414502 = queryWeight, product of:
                1.6513184 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.017085675 = queryNorm
              0.76027083 = fieldWeight in 6562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.078125 = fieldNorm(doc=6562)
          0.22886279 = weight(abstract_txt:deskriptoren in 6562) [ClassicSimilarity], result of:
            0.22886279 = score(doc=6562,freq=1.0), product of:
              0.37510577 = queryWeight, product of:
                2.81119 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.017085675 = queryNorm
              0.6101287 = fieldWeight in 6562, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.078125 = fieldNorm(doc=6562)
        0.2 = coord(5/25)
    
  5. Scherer, B.: Automatische Indexierung und ihre Anwendung im DFG-Projekt "Gemeinsames Portal für Bibliotheken, Archive und Museen (BAM)" (2003) 0.12
    0.116933346 = sum of:
      0.116933346 = product of:
        0.7308334 = sum of:
          0.037098564 = weight(abstract_txt:werden in 283) [ClassicSimilarity], result of:
            0.037098564 = score(doc=283,freq=5.0), product of:
              0.07567603 = queryWeight, product of:
                1.2626776 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.017085675 = queryNorm
              0.4902287 = fieldWeight in 283, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.0625 = fieldNorm(doc=283)
          0.042009328 = weight(abstract_txt:bereich in 283) [ClassicSimilarity], result of:
            0.042009328 = score(doc=283,freq=1.0), product of:
              0.122812815 = queryWeight, product of:
                1.3133774 = boost
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.017085675 = queryNorm
              0.3420598 = fieldWeight in 283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.0625 = fieldNorm(doc=283)
          0.1669935 = weight(abstract_txt:automatischen in 283) [ClassicSimilarity], result of:
            0.1669935 = score(doc=283,freq=4.0), product of:
              0.19414502 = queryWeight, product of:
                1.6513184 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.017085675 = queryNorm
              0.86014825 = fieldWeight in 283, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.0625 = fieldNorm(doc=283)
          0.484732 = weight(abstract_txt:indexierung in 283) [ClassicSimilarity], result of:
            0.484732 = score(doc=283,freq=6.0), product of:
              0.4683932 = queryWeight, product of:
                4.0554867 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.017085675 = queryNorm
              1.0348827 = fieldWeight in 283, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.0625 = fieldNorm(doc=283)
        0.16 = coord(4/25)