Document (#37599)

Author
Weiner, U.
Title
Vor uns die Dokumentenflut oder Automatische Indexierung als notwendige und sinnvolle Ergänzung zur intellektuellen Sacherschließung
Imprint
Wien : Universität
Year
2012
Pages
xxx Bl
Abstract
Vor dem Hintergrund veränderter Ansprüche der Bibliotheksbenutzer an Recherchemöglichkeiten - weg vom klassischen Online-Katalog hin zum "One-Stop-Shop" mit Funktionalitäten wie thematisches Browsing, Relevanzranking und dergleichen mehr - einerseits und der notwendigen Bearbeitung von Massendaten (Stichwort Dokumentenflut) andererseits rücken Systeme zur automatischen Indexierung wieder verstärkt in den Mittelpunkt des Interesses. Da in Österreich die Beschäftigung mit diesem Thema im Bibliotheksbereich bislang nur sehr selektiv, bezogen auf wenige konkrete Projekte, erfolgte, wird zuerst ein allgemeiner theoretischer Überblick über die unterschiedlichen Verfahrensansätze der automatischen Indexierung geboten. Im nächsten Schritt werden mit der IDX-basierten Indexierungssoftware MILOS (mit den Teilprojekten MILOS I, MILOS II und KASCADE) und dem modularen System intelligentCAPTURE (mit der integrierten Indexierungssoftware AUTINDEX) die bis vor wenigen Jahren im deutschsprachigen Raum einzigen im Praxiseinsatz befindlichen automatischen Indexierungssysteme vorgestellt. Mit zunehmender Notwendigkeit, neue Wege der inhaltlichen Erschließung zu beschreiten, wurden in den vergangenen 5 - 6 Jahren zahlreiche Softwareentwicklungen auf ihre Einsatzmöglichkeit im Bibliotheksbereich hin getestet. Stellvertretend für diese in Entwicklung befindlichen Systeme zur automatischen inhaltlichen Erschließung wird das Projekt PETRUS, welches in den Jahren 2009 - 2011 an der DNB durchgeführt wurde und die Komponenten PICA Match&Merge sowie die Extraction Platform der Firma Averbis beinhaltet, vorgestellt.
Footnote
Wien, Univ., Lehrgang Library and Information Studies, Master-Thesis, 2012
Theme
Automatisches Indexieren
Object
MILOS
KASCADE
intelligentCAPTURE
AUTINDEX
PETRUS

Similar documents (author)

  1. Weiner, S.T.: Electronic journals, four part series : an introduction (1997) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:weiner in 834) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 834, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=834)
    
  2. Weiner, R.G.: Information access illiterate? (1997) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:weiner in 2413) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 2413, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=2413)
    
  3. Weiner, M.: ¬Die Agenten kommen (2002) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:weiner in 734) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 734, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=734)
    
  4. Weiner, M.L.; Rusch, P.F.: New searching technologies and interfaces (1996) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:weiner in 109) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 109, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=109)
    
  5. Weiner, M.L.; Rusch, P.F.: New searching technologies and interfaces (1997) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:weiner in 321) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 321, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=321)
    

Similar documents (content)

  1. Lepsky, K.; Zimmermann, H.H.: Katalogerweiterung durch Scanning und automatische Dokumenterschließung : Ergebnisse des DFG-Projekts KASCADE (2000) 0.21
    0.2131784 = sum of:
      0.2131784 = product of:
        1.332365 = sum of:
          0.39971572 = weight(title_txt:kascade in 5966) [ClassicSimilarity], result of:
            0.39971572 = score(doc=5966,freq=1.0), product of:
              0.16822623 = queryWeight, product of:
                1.135613 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.0155864 = queryNorm
              2.3760607 = fieldWeight in 5966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.25 = fieldNorm(doc=5966)
          0.18875757 = weight(abstract_txt:indexierung in 5966) [ClassicSimilarity], result of:
            0.18875757 = score(doc=5966,freq=1.0), product of:
              0.25530002 = queryWeight, product of:
                2.42309 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.0155864 = queryNorm
              0.73935586 = fieldWeight in 5966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.109375 = fieldNorm(doc=5966)
          0.2654768 = weight(abstract_txt:automatischen in 5966) [ClassicSimilarity], result of:
            0.2654768 = score(doc=5966,freq=1.0), product of:
              0.35273227 = queryWeight, product of:
                3.2887895 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.0155864 = queryNorm
              0.7526297 = fieldWeight in 5966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.109375 = fieldNorm(doc=5966)
          0.47841492 = weight(abstract_txt:milos in 5966) [ClassicSimilarity], result of:
            0.47841492 = score(doc=5966,freq=1.0), product of:
              0.47458908 = queryWeight, product of:
                3.303718 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0155864 = queryNorm
              1.0080614 = fieldWeight in 5966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.109375 = fieldNorm(doc=5966)
        0.16 = coord(4/25)
    
  2. Schneider, A.: Moderne Retrievalverfahren in klassischen bibliotheksbezogenen Anwendungen : Projekte und Perspektiven (2008) 0.15
    0.15347621 = sum of:
      0.15347621 = product of:
        0.6394842 = sum of:
          0.04069754 = weight(abstract_txt:vorgestellt in 31) [ClassicSimilarity], result of:
            0.04069754 = score(doc=31,freq=2.0), product of:
              0.11197063 = queryWeight, product of:
                1.3102391 = boost
                5.4828677 = idf(docFreq=501, maxDocs=44421)
                0.0155864 = queryNorm
              0.3634662 = fieldWeight in 31, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4828677 = idf(docFreq=501, maxDocs=44421)
                0.046875 = fieldNorm(doc=31)
          0.036180627 = weight(abstract_txt:erschließung in 31) [ClassicSimilarity], result of:
            0.036180627 = score(doc=31,freq=1.0), product of:
              0.13043258 = queryWeight, product of:
                1.4141371 = boost
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.0155864 = queryNorm
              0.2773895 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.046875 = fieldNorm(doc=31)
          0.035426952 = weight(abstract_txt:jahren in 31) [ClassicSimilarity], result of:
            0.035426952 = score(doc=31,freq=1.0), product of:
              0.14722729 = queryWeight, product of:
                1.8400866 = boost
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.0155864 = queryNorm
              0.24062763 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.046875 = fieldNorm(doc=31)
          0.18202797 = weight(abstract_txt:bibliotheksbereich in 31) [ClassicSimilarity], result of:
            0.18202797 = score(doc=31,freq=3.0), product of:
              0.26553413 = queryWeight, product of:
                2.0177095 = boost
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.0155864 = queryNorm
              0.68551624 = fieldWeight in 31, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.046875 = fieldNorm(doc=31)
          0.14011617 = weight(abstract_txt:indexierung in 31) [ClassicSimilarity], result of:
            0.14011617 = score(doc=31,freq=3.0), product of:
              0.25530002 = queryWeight, product of:
                2.42309 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.0155864 = queryNorm
              0.54882944 = fieldWeight in 31, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.046875 = fieldNorm(doc=31)
          0.20503497 = weight(abstract_txt:milos in 31) [ClassicSimilarity], result of:
            0.20503497 = score(doc=31,freq=1.0), product of:
              0.47458908 = queryWeight, product of:
                3.303718 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0155864 = queryNorm
              0.43202633 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.046875 = fieldNorm(doc=31)
        0.24 = coord(6/25)
    
  3. Scherer, B.: Automatische Indexierung und ihre Anwendung im DFG-Projekt "Gemeinsames Portal für Bibliotheken, Archive und Museen (BAM)" (2003) 0.15
    0.14600018 = sum of:
      0.14600018 = product of:
        0.73000085 = sum of:
          0.04693449 = weight(abstract_txt:systeme in 283) [ClassicSimilarity], result of:
            0.04693449 = score(doc=283,freq=1.0), product of:
              0.1280671 = queryWeight, product of:
                1.4012554 = boost
                5.863737 = idf(docFreq=342, maxDocs=44421)
                0.0155864 = queryNorm
              0.36648357 = fieldWeight in 283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.863737 = idf(docFreq=342, maxDocs=44421)
                0.0625 = fieldNorm(doc=283)
          0.06822284 = weight(abstract_txt:erschließung in 283) [ClassicSimilarity], result of:
            0.06822284 = score(doc=283,freq=2.0), product of:
              0.13043258 = queryWeight, product of:
                1.4141371 = boost
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.0155864 = queryNorm
              0.52305067 = fieldWeight in 283, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.0625 = fieldNorm(doc=283)
          0.04723594 = weight(abstract_txt:jahren in 283) [ClassicSimilarity], result of:
            0.04723594 = score(doc=283,freq=1.0), product of:
              0.14722729 = queryWeight, product of:
                1.8400866 = boost
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.0155864 = queryNorm
              0.32083684 = fieldWeight in 283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.0625 = fieldNorm(doc=283)
          0.26420555 = weight(abstract_txt:indexierung in 283) [ClassicSimilarity], result of:
            0.26420555 = score(doc=283,freq=6.0), product of:
              0.25530002 = queryWeight, product of:
                2.42309 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.0155864 = queryNorm
              1.0348827 = fieldWeight in 283, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.0625 = fieldNorm(doc=283)
          0.30340204 = weight(abstract_txt:automatischen in 283) [ClassicSimilarity], result of:
            0.30340204 = score(doc=283,freq=4.0), product of:
              0.35273227 = queryWeight, product of:
                3.2887895 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.0155864 = queryNorm
              0.86014825 = fieldWeight in 283, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.0625 = fieldNorm(doc=283)
        0.2 = coord(5/25)
    
  4. Oberhauser, O.; Labner, J.: OPAC-Erweiterung durch automatische Indexierung : Empirische Untersuchung mit Daten aus dem Österreichischen Verbundkatalog (2002) 0.14
    0.13868321 = sum of:
      0.13868321 = product of:
        0.8667701 = sum of:
          0.05904492 = weight(abstract_txt:jahren in 1883) [ClassicSimilarity], result of:
            0.05904492 = score(doc=1883,freq=1.0), product of:
              0.14722729 = queryWeight, product of:
                1.8400866 = boost
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.0155864 = queryNorm
              0.40104604 = fieldWeight in 1883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.078125 = fieldNorm(doc=1883)
          0.13482684 = weight(abstract_txt:indexierung in 1883) [ClassicSimilarity], result of:
            0.13482684 = score(doc=1883,freq=1.0), product of:
              0.25530002 = queryWeight, product of:
                2.42309 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.0155864 = queryNorm
              0.52811134 = fieldWeight in 1883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.078125 = fieldNorm(doc=1883)
          0.18962628 = weight(abstract_txt:automatischen in 1883) [ClassicSimilarity], result of:
            0.18962628 = score(doc=1883,freq=1.0), product of:
              0.35273227 = queryWeight, product of:
                3.2887895 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.0155864 = queryNorm
              0.53759265 = fieldWeight in 1883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.078125 = fieldNorm(doc=1883)
          0.48327205 = weight(abstract_txt:milos in 1883) [ClassicSimilarity], result of:
            0.48327205 = score(doc=1883,freq=2.0), product of:
              0.47458908 = queryWeight, product of:
                3.303718 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0155864 = queryNorm
              1.0182958 = fieldWeight in 1883, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.078125 = fieldNorm(doc=1883)
        0.16 = coord(4/25)
    
  5. Lepsky, K.: Automatische Indexierung und bibliothekarische Inhaltserschließung : Ergebnisse des DFG-Projekts MILOS I (1996) 0.14
    0.13711499 = sum of:
      0.13711499 = product of:
        0.8569687 = sum of:
          0.057555016 = weight(abstract_txt:vorgestellt in 3061) [ClassicSimilarity], result of:
            0.057555016 = score(doc=3061,freq=1.0), product of:
              0.11197063 = queryWeight, product of:
                1.3102391 = boost
                5.4828677 = idf(docFreq=501, maxDocs=44421)
                0.0155864 = queryNorm
              0.51401883 = fieldWeight in 3061, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4828677 = idf(docFreq=501, maxDocs=44421)
                0.09375 = fieldNorm(doc=3061)
          0.1617922 = weight(abstract_txt:indexierung in 3061) [ClassicSimilarity], result of:
            0.1617922 = score(doc=3061,freq=1.0), product of:
              0.25530002 = queryWeight, product of:
                2.42309 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.0155864 = queryNorm
              0.63373363 = fieldWeight in 3061, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.09375 = fieldNorm(doc=3061)
          0.22755153 = weight(abstract_txt:automatischen in 3061) [ClassicSimilarity], result of:
            0.22755153 = score(doc=3061,freq=1.0), product of:
              0.35273227 = queryWeight, product of:
                3.2887895 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.0155864 = queryNorm
              0.6451112 = fieldWeight in 3061, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.09375 = fieldNorm(doc=3061)
          0.41006994 = weight(abstract_txt:milos in 3061) [ClassicSimilarity], result of:
            0.41006994 = score(doc=3061,freq=1.0), product of:
              0.47458908 = queryWeight, product of:
                3.303718 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0155864 = queryNorm
              0.86405265 = fieldWeight in 3061, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.09375 = fieldNorm(doc=3061)
        0.16 = coord(4/25)