Document (#42629)

Author
Busch, D.
Title
Domänenspezifische hybride automatische Indexierung von bibliographischen Metadaten
Source
B.I.T.online. 22(2019) H.6, S.465-469
Year
2019
Abstract
Im Fraunhofer-Informationszentrum Raum und Bau (IRB) wird Fachliteratur im Bereich Planen und Bauen bibliographisch erschlossen. Die daraus resultierenden Dokumente (Metadaten-Einträge) werden u.a. bei der Produktion der bibliographischen Datenbanken des IRB verwendet. In Abb. 1 ist ein Dokument dargestellt, das einen Zeitschriftenartikel beschreibt. Die Dokumente werden mit Deskriptoren von einer Nomenklatur (Schlagwortliste IRB) indexiert. Ein Deskriptor ist "eine Benennung., die für sich allein verwendbar, eindeutig zur Inhaltskennzeichnung geeignet und im betreffenden Dokumentationssystem zugelassen ist". Momentan wird die Indexierung intellektuell von menschlichen Experten durchgeführt. Die intellektuelle Indexierung ist zeitaufwendig und teuer. Eine Lösung des Problems besteht in der automatischen Indexierung, bei der die Zuordnung von Deskriptoren durch ein Computerprogramm erfolgt. Solche Computerprogramme werden im Folgenden auch als Klassifikatoren bezeichnet. In diesem Beitrag geht es um ein System zur automatischen Indexierung von deutschsprachigen Dokumenten im Bereich Bauwesen mit Deskriptoren aus der Schlagwortliste IRB.
Content
Vgl.: https://www.b-i-t-online.de/heft/2019-06-index.php.
Theme
Automatisches Indexieren
Object
IRB
Location
D

Similar documents (author)

  1. Busch, R.: Neue Wege der Buchaufstellung in den USA (1956) 5.54
    5.5397964 = sum of:
      5.5397964 = weight(author_txt:busch in 557) [ClassicSimilarity], result of:
        5.5397964 = fieldWeight in 557, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.625 = fieldNorm(doc=557)
    
  2. Busch, J.: Bibliographie zum Bibliotheks- und Büchereiwesen : aus dem Nachlaß bearbeitet von U. von Dietze (1966) 5.54
    5.5397964 = sum of:
      5.5397964 = weight(author_txt:busch in 1462) [ClassicSimilarity], result of:
        5.5397964 = fieldWeight in 1462, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.625 = fieldNorm(doc=1462)
    
  3. Busch, C.: Bitte ein Bit? : Zur (Be-) Deutung der Informationstheorie (1992) 5.54
    5.5397964 = sum of:
      5.5397964 = weight(author_txt:busch in 2444) [ClassicSimilarity], result of:
        5.5397964 = fieldWeight in 2444, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.625 = fieldNorm(doc=2444)
    
  4. Busch, J.: ¬A method for evaluating the multiple relations between subject descriptors : related terms in the Thesaurus for Engineering and Scientific Terms, a pilot study (1978) 5.54
    5.5397964 = sum of:
      5.5397964 = weight(author_txt:busch in 2948) [ClassicSimilarity], result of:
        5.5397964 = fieldWeight in 2948, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.625 = fieldNorm(doc=2948)
    
  5. Busch, J.A.: Thinking ambiguously : organizing source materials for historical research (1994) 5.54
    5.5397964 = sum of:
      5.5397964 = weight(author_txt:busch in 2978) [ClassicSimilarity], result of:
        5.5397964 = fieldWeight in 2978, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.863674 = idf(docFreq=16, maxDocs=44218)
          0.625 = fieldNorm(doc=2978)
    

Similar documents (content)

  1. Lepsky, K.: Automatisches Indexieren (2023) 0.25
    0.2502609 = sum of:
      0.2502609 = product of:
        1.2513044 = sum of:
          0.04307867 = weight(abstract_txt:werden in 781) [ClassicSimilarity], result of:
            0.04307867 = score(doc=781,freq=3.0), product of:
              0.07566357 = queryWeight, product of:
                1.2628189 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.01708843 = queryNorm
              0.56934494 = fieldWeight in 781, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
          0.09411333 = weight(abstract_txt:dokumente in 781) [ClassicSimilarity], result of:
            0.09411333 = score(doc=781,freq=1.0), product of:
              0.1605053 = queryWeight, product of:
                1.5017469 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.01708843 = queryNorm
              0.5863565 = fieldWeight in 781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
          0.1768956 = weight(abstract_txt:automatischen in 781) [ClassicSimilarity], result of:
            0.1768956 = score(doc=781,freq=2.0), product of:
              0.19402453 = queryWeight, product of:
                1.6511266 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.01708843 = queryNorm
              0.9117177 = fieldWeight in 781, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
          0.2743474 = weight(abstract_txt:deskriptoren in 781) [ClassicSimilarity], result of:
            0.2743474 = score(doc=781,freq=1.0), product of:
              0.37493238 = queryWeight, product of:
                2.8110862 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.01708843 = queryNorm
              0.73172504 = fieldWeight in 781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
          0.66286945 = weight(abstract_txt:indexierung in 781) [ClassicSimilarity], result of:
            0.66286945 = score(doc=781,freq=5.0), product of:
              0.46809137 = queryWeight, product of:
                4.0549674 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.01708843 = queryNorm
              1.4161112 = fieldWeight in 781, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
        0.2 = coord(5/25)
    
  2. Kempf, A.O.: Automatische Indexierung in der sozialwissenschaftlichen Fachinformation : eine Evaluationsstudie zur maschinellen Erschließung für die Datenbank SOLIS (2012) 0.21
    0.21340464 = sum of:
      0.21340464 = product of:
        0.889186 = sum of:
          0.029311324 = weight(abstract_txt:werden in 903) [ClassicSimilarity], result of:
            0.029311324 = score(doc=903,freq=2.0), product of:
              0.07566357 = queryWeight, product of:
                1.2628189 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.01708843 = queryNorm
              0.3873902 = fieldWeight in 903, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.078125 = fieldNorm(doc=903)
          0.08417198 = weight(abstract_txt:metadaten in 903) [ClassicSimilarity], result of:
            0.08417198 = score(doc=903,freq=1.0), product of:
              0.16824977 = queryWeight, product of:
                1.5375502 = boost
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.01708843 = queryNorm
              0.5002799 = fieldWeight in 903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4035826 = idf(docFreq=198, maxDocs=44218)
                0.078125 = fieldNorm(doc=903)
          0.18054332 = weight(abstract_txt:automatischen in 903) [ClassicSimilarity], result of:
            0.18054332 = score(doc=903,freq=3.0), product of:
              0.19402453 = queryWeight, product of:
                1.6511266 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.01708843 = queryNorm
              0.930518 = fieldWeight in 903, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.078125 = fieldNorm(doc=903)
          0.119499765 = weight(abstract_txt:bibliographischen in 903) [ClassicSimilarity], result of:
            0.119499765 = score(doc=903,freq=1.0), product of:
              0.21253029 = queryWeight, product of:
                1.7280746 = boost
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.01708843 = queryNorm
              0.5622717 = fieldWeight in 903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1970778 = idf(docFreq=89, maxDocs=44218)
                0.078125 = fieldNorm(doc=903)
          0.22862285 = weight(abstract_txt:deskriptoren in 903) [ClassicSimilarity], result of:
            0.22862285 = score(doc=903,freq=1.0), product of:
              0.37493238 = queryWeight, product of:
                2.8110862 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.01708843 = queryNorm
              0.6097709 = fieldWeight in 903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.078125 = fieldNorm(doc=903)
          0.24703684 = weight(abstract_txt:indexierung in 903) [ClassicSimilarity], result of:
            0.24703684 = score(doc=903,freq=1.0), product of:
              0.46809137 = queryWeight, product of:
                4.0549674 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.01708843 = queryNorm
              0.5277535 = fieldWeight in 903, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.078125 = fieldNorm(doc=903)
        0.24 = coord(6/25)
    
  3. Bunk, T.: Deskriptoren Stoppwortlisten und kryptische Zeichen (2008) 0.16
    0.16373338 = sum of:
      0.16373338 = product of:
        1.3644449 = sum of:
          0.20847346 = weight(abstract_txt:automatischen in 2471) [ClassicSimilarity], result of:
            0.20847346 = score(doc=2471,freq=1.0), product of:
              0.19402453 = queryWeight, product of:
                1.6511266 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.01708843 = queryNorm
              1.0744696 = fieldWeight in 2471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.15625 = fieldNorm(doc=2471)
          0.4572457 = weight(abstract_txt:deskriptoren in 2471) [ClassicSimilarity], result of:
            0.4572457 = score(doc=2471,freq=1.0), product of:
              0.37493238 = queryWeight, product of:
                2.8110862 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.01708843 = queryNorm
              1.2195418 = fieldWeight in 2471, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.15625 = fieldNorm(doc=2471)
          0.6987257 = weight(abstract_txt:indexierung in 2471) [ClassicSimilarity], result of:
            0.6987257 = score(doc=2471,freq=2.0), product of:
              0.46809137 = queryWeight, product of:
                4.0549674 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.01708843 = queryNorm
              1.4927123 = fieldWeight in 2471, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.15625 = fieldNorm(doc=2471)
        0.12 = coord(3/25)
    
  4. Schirmer, K.; Haller, J.: Zugang zu mehrsprachigen Nachrichten im Internet (2000) 0.13
    0.12688267 = sum of:
      0.12688267 = product of:
        0.63441336 = sum of:
          0.09262743 = weight(abstract_txt:indexiert in 5562) [ClassicSimilarity], result of:
            0.09262743 = score(doc=5562,freq=1.0), product of:
              0.14233965 = queryWeight, product of:
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.01708843 = queryNorm
              0.6507493 = fieldWeight in 5562, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.078125 = fieldNorm(doc=5562)
          0.05483646 = weight(abstract_txt:werden in 5562) [ClassicSimilarity], result of:
            0.05483646 = score(doc=5562,freq=7.0), product of:
              0.07566357 = queryWeight, product of:
                1.2628189 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.01708843 = queryNorm
              0.7247406 = fieldWeight in 5562, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.078125 = fieldNorm(doc=5562)
          0.11091361 = weight(abstract_txt:dokumente in 5562) [ClassicSimilarity], result of:
            0.11091361 = score(doc=5562,freq=2.0), product of:
              0.1605053 = queryWeight, product of:
                1.5017469 = boost
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.01708843 = queryNorm
              0.69102776 = fieldWeight in 5562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2544694 = idf(docFreq=230, maxDocs=44218)
                0.078125 = fieldNorm(doc=5562)
          0.14741302 = weight(abstract_txt:automatischen in 5562) [ClassicSimilarity], result of:
            0.14741302 = score(doc=5562,freq=2.0), product of:
              0.19402453 = queryWeight, product of:
                1.6511266 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.01708843 = queryNorm
              0.7597648 = fieldWeight in 5562, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.078125 = fieldNorm(doc=5562)
          0.22862285 = weight(abstract_txt:deskriptoren in 5562) [ClassicSimilarity], result of:
            0.22862285 = score(doc=5562,freq=1.0), product of:
              0.37493238 = queryWeight, product of:
                2.8110862 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.01708843 = queryNorm
              0.6097709 = fieldWeight in 5562, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.078125 = fieldNorm(doc=5562)
        0.2 = coord(5/25)
    
  5. Scherer, B.: Automatische Indexierung und ihre Anwendung im DFG-Projekt "Gemeinsames Portal für Bibliotheken, Archive und Museen (BAM)" (2003) 0.12
    0.1167881 = sum of:
      0.1167881 = product of:
        0.72992563 = sum of:
          0.037076216 = weight(abstract_txt:werden in 4283) [ClassicSimilarity], result of:
            0.037076216 = score(doc=4283,freq=5.0), product of:
              0.07566357 = queryWeight, product of:
                1.2628189 = boost
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.01708843 = queryNorm
              0.49001414 = fieldWeight in 4283, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.5062556 = idf(docFreq=3606, maxDocs=44218)
                0.0625 = fieldNorm(doc=4283)
          0.041979183 = weight(abstract_txt:bereich in 4283) [ClassicSimilarity], result of:
            0.041979183 = score(doc=4283,freq=1.0), product of:
              0.122783154 = queryWeight, product of:
                1.3134739 = boost
                5.4703507 = idf(docFreq=505, maxDocs=44218)
                0.01708843 = queryNorm
              0.34189692 = fieldWeight in 4283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4703507 = idf(docFreq=505, maxDocs=44218)
                0.0625 = fieldNorm(doc=4283)
          0.16677877 = weight(abstract_txt:automatischen in 4283) [ClassicSimilarity], result of:
            0.16677877 = score(doc=4283,freq=4.0), product of:
              0.19402453 = queryWeight, product of:
                1.6511266 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.01708843 = queryNorm
              0.8595757 = fieldWeight in 4283, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.0625 = fieldNorm(doc=4283)
          0.48409143 = weight(abstract_txt:indexierung in 4283) [ClassicSimilarity], result of:
            0.48409143 = score(doc=4283,freq=6.0), product of:
              0.46809137 = queryWeight, product of:
                4.0549674 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.01708843 = queryNorm
              1.0341815 = fieldWeight in 4283, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.0625 = fieldNorm(doc=4283)
        0.16 = coord(4/25)