Document (#23795)

Author
Ladewig, C.
Henkes, M.
Title
Verfahren zur automatischen inhaltlichen Erschließung von elektronischen Texten : ASPECTIX
Source
nfd Information - Wissenschaft und Praxis. 52(2001) H.3, S.159-164
Year
2001
Abstract
AspectiX, a method for the automatic syntactic subject indexing of electronic texts, is based on an index whose elements are linked to a universal aspect classification, which makes syntactic retrieval possible. Using these classification elements, which relate in content to the respective search subject, the information in electronic texts is queried with known search algorithms, and the results are evaluated according to the aspect linkage. These aspects make it possible to classify unknown text documents by content automatically, independently of subject field and language, and to search a text corpus without relying solely on character strings, as search engines on the WWW do. During these processes the index can be extended both intellectually and automatically, and it delivers retrieval results of almost 100 percent precision together with almost 100 percent recall. The AspectiX method is thus superior to all other search tools by up to 40 percent in precision or recall, as is demonstrated by numerous searches in three databases that differ in size and are thematically dissimilar.
Theme
Automatisches Indexieren
Object
AspectiX

Similar documents (author)

  1. Ladewig, C.: ¬Die Ausbildung am Institut für Information und Dokumentation der Fachhochschule Potsdam (IID) (1994) 6.19
    6.1935673 = sum of:
      6.1935673 = weight(author_txt:ladewig in 8383) [ClassicSimilarity], result of:
        6.1935673 = fieldWeight in 8383, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.625 = fieldNorm(doc=8383)
    
  2. Ladewig, C.: 'Information Retrieval ohne Linguistik?' : Erwiderung zu dem Artikel von Gerda Ruge und Sebastian Goeser, Nfd 49(1998) H.6, S.361-369 (1998) 6.19
    6.1935673 = sum of:
      6.1935673 = weight(author_txt:ladewig in 3513) [ClassicSimilarity], result of:
        6.1935673 = fieldWeight in 3513, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.625 = fieldNorm(doc=3513)
    
  3. Ladewig, C.: Grundlagen der inhaltlichen Erschließung (1997) 6.19
    6.1935673 = sum of:
      6.1935673 = weight(author_txt:ladewig in 1695) [ClassicSimilarity], result of:
        6.1935673 = fieldWeight in 1695, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.625 = fieldNorm(doc=1695)
    
  4. Ladewig, C.; Rieger, M.: Ähnlichkeitsmessung mit und ohne aspektische Indexierung (1998) 4.95
    4.954854 = sum of:
      4.954854 = weight(author_txt:ladewig in 3526) [ClassicSimilarity], result of:
        4.954854 = fieldWeight in 3526, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.5 = fieldNorm(doc=3526)
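
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming the classic TF-IDF formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); these formulas are consistent with the values printed in the traces but are not stated on this page. The function names are hypothetical and chosen for illustration only.

```python
import math


def classic_tf(freq):
    # Term frequency factor; matches "1.0 = tf(freq=1.0)" in the traces.
    return math.sqrt(freq)


def classic_idf(doc_freq, max_docs):
    # Assumed inverse document frequency formula; it reproduces
    # "9.909708 = idf(docFreq=5, maxDocs=44421)" up to float rounding.
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight is the product of tf, idf and fieldNorm, as in the traces.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Entries 1 to 3 use fieldNorm 0.625, entry 4 uses fieldNorm 0.5.
score_short_author_field = field_weight(1.0, 5, 44421, 0.625)
score_long_author_field = field_weight(1.0, 5, 44421, 0.5)
print(round(score_short_author_field, 4), round(score_long_author_field, 4))
```

Under these assumptions the only factor that separates entry 4 from entries 1 to 3 is fieldNorm, which is smaller for the record with two authors.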
    

Similar documents (content)

  1. Scherer, B.: Automatische Indexierung und ihre Anwendung im DFG-Projekt "Gemeinsames Portal für Bibliotheken, Archive und Museen (BAM)" (2003) 0.15
    0.14899577 = sum of:
      0.14899577 = product of:
        0.62081575 = sum of:
          0.0379659 = weight(abstract_txt:einem in 283) [ClassicSimilarity], result of:
            0.0379659 = score(doc=283,freq=3.0), product of:
              0.08089066 = queryWeight, product of:
                1.007071 = boost
                4.3356547 = idf(docFreq=1580, maxDocs=44421)
                0.018526085 = queryNorm
              0.46934837 = fieldWeight in 283, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3356547 = idf(docFreq=1580, maxDocs=44421)
                0.0625 = fieldNorm(doc=283)
          0.044281192 = weight(abstract_txt:ergebnisse in 283) [ClassicSimilarity], result of:
            0.044281192 = score(doc=283,freq=1.0), product of:
              0.12926745 = queryWeight, product of:
                1.2730794 = boost
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.018526085 = queryNorm
              0.34255484 = fieldWeight in 283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.0625 = fieldNorm(doc=283)
          0.07881889 = weight(abstract_txt:erschließung in 283) [ClassicSimilarity], result of:
            0.07881889 = score(doc=283,freq=2.0), product of:
              0.15069073 = queryWeight, product of:
                1.3745297 = boost
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.018526085 = queryNorm
              0.52305067 = fieldWeight in 283, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.0625 = fieldNorm(doc=283)
          0.17526247 = weight(abstract_txt:automatischen in 283) [ClassicSimilarity], result of:
            0.17526247 = score(doc=283,freq=4.0), product of:
              0.20375845 = queryWeight, product of:
                1.5983382 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.018526085 = queryNorm
              0.86014825 = fieldWeight in 283, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.0625 = fieldNorm(doc=283)
          0.13380653 = weight(abstract_txt:verfahren in 283) [ClassicSimilarity], result of:
            0.13380653 = score(doc=283,freq=3.0), product of:
              0.21444596 = queryWeight, product of:
                2.0082393 = boost
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.018526085 = queryNorm
              0.62396383 = fieldWeight in 283, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.0625 = fieldNorm(doc=283)
          0.15068075 = weight(abstract_txt:texten in 283) [ClassicSimilarity], result of:
            0.15068075 = score(doc=283,freq=1.0), product of:
              0.33476904 = queryWeight, product of:
                2.5091646 = boost
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.018526085 = queryNorm
              0.4501036 = fieldWeight in 283, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.0625 = fieldNorm(doc=283)
        0.24 = coord(6/25)
    
  2. Heyer, G.; Quasthoff, U.; Wittig, T.: Text Mining : Wissensrohstoff Text. Konzepte, Algorithmen, Ergebnisse (2006) 0.13
    0.12819202 = sum of:
      0.12819202 = product of:
        0.45782864 = sum of:
          0.019374395 = weight(abstract_txt:einem in 218) [ClassicSimilarity], result of:
            0.019374395 = score(doc=218,freq=2.0), product of:
              0.08089066 = queryWeight, product of:
                1.007071 = boost
                4.3356547 = idf(docFreq=1580, maxDocs=44421)
                0.018526085 = queryNorm
              0.23951335 = fieldWeight in 218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3356547 = idf(docFreq=1580, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.061135486 = weight(abstract_txt:unbekannte in 218) [ClassicSimilarity], result of:
            0.061135486 = score(doc=218,freq=1.0), product of:
              0.17402378 = queryWeight, product of:
                1.0444801 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.018526085 = queryNorm
              0.35130537 = fieldWeight in 218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.027675744 = weight(abstract_txt:ergebnisse in 218) [ClassicSimilarity], result of:
            0.027675744 = score(doc=218,freq=1.0), product of:
              0.12926745 = queryWeight, product of:
                1.2730794 = boost
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.018526085 = queryNorm
              0.21409677 = fieldWeight in 218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.05768022 = weight(abstract_txt:automatisch in 218) [ClassicSimilarity], result of:
            0.05768022 = score(doc=218,freq=1.0), product of:
              0.21091506 = queryWeight, product of:
                1.6261653 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.018526085 = queryNorm
              0.27347606 = fieldWeight in 218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.06221204 = weight(abstract_txt:diesen in 218) [ClassicSimilarity], result of:
            0.06221204 = score(doc=218,freq=2.0), product of:
              0.20153928 = queryWeight, product of:
                1.9468673 = boost
                5.5877852 = idf(docFreq=451, maxDocs=44421)
                0.018526085 = queryNorm
              0.30868444 = fieldWeight in 218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5877852 = idf(docFreq=451, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.09656654 = weight(abstract_txt:verfahren in 218) [ClassicSimilarity], result of:
            0.09656654 = score(doc=218,freq=4.0), product of:
              0.21444596 = queryWeight, product of:
                2.0082393 = boost
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.018526085 = queryNorm
              0.45030713 = fieldWeight in 218, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.13318422 = weight(abstract_txt:texten in 218) [ClassicSimilarity], result of:
            0.13318422 = score(doc=218,freq=2.0), product of:
              0.33476904 = queryWeight, product of:
                2.5091646 = boost
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.018526085 = queryNorm
              0.39783913 = fieldWeight in 218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
        0.28 = coord(7/25)
    
  3. Lepsky, K.; Zimmermann, H.H.: Katalogerweiterung durch Scanning und automatische Dokumenterschließung : Ergebnisse des DFG-Projekts KASCADE (2000) 0.13
    0.12755834 = sum of:
      0.12755834 = product of:
        0.63779163 = sum of:
          0.054248303 = weight(abstract_txt:einem in 5966) [ClassicSimilarity], result of:
            0.054248303 = score(doc=5966,freq=2.0), product of:
              0.08089066 = queryWeight, product of:
                1.007071 = boost
                4.3356547 = idf(docFreq=1580, maxDocs=44421)
                0.018526085 = queryNorm
              0.67063737 = fieldWeight in 5966, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3356547 = idf(docFreq=1580, maxDocs=44421)
                0.109375 = fieldNorm(doc=5966)
          0.07749209 = weight(abstract_txt:ergebnisse in 5966) [ClassicSimilarity], result of:
            0.07749209 = score(doc=5966,freq=1.0), product of:
              0.12926745 = queryWeight, product of:
                1.2730794 = boost
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.018526085 = queryNorm
              0.599471 = fieldWeight in 5966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.109375 = fieldNorm(doc=5966)
          0.15335466 = weight(abstract_txt:automatischen in 5966) [ClassicSimilarity], result of:
            0.15335466 = score(doc=5966,freq=1.0), product of:
              0.20375845 = queryWeight, product of:
                1.5983382 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.018526085 = queryNorm
              0.7526297 = fieldWeight in 5966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.109375 = fieldNorm(doc=5966)
          0.16150461 = weight(abstract_txt:automatisch in 5966) [ClassicSimilarity], result of:
            0.16150461 = score(doc=5966,freq=1.0), product of:
              0.21091506 = queryWeight, product of:
                1.6261653 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.018526085 = queryNorm
              0.76573294 = fieldWeight in 5966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.109375 = fieldNorm(doc=5966)
          0.191192 = weight(abstract_txt:verfahren in 5966) [ClassicSimilarity], result of:
            0.191192 = score(doc=5966,freq=2.0), product of:
              0.21444596 = queryWeight, product of:
                2.0082393 = boost
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.018526085 = queryNorm
              0.8915626 = fieldWeight in 5966, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.109375 = fieldNorm(doc=5966)
        0.2 = coord(5/25)
    
  4. Hänger, C.; Krätzsch, C.; Niemann, C.: Was vom Tagging übrig blieb : Erkenntnisse und Einsichten aus zwei Jahren Projektarbeit (2011) 0.12
    0.12396264 = sum of:
      0.12396264 = product of:
        0.516511 = sum of:
          0.038746044 = weight(abstract_txt:ergebnisse in 519) [ClassicSimilarity], result of:
            0.038746044 = score(doc=519,freq=1.0), product of:
              0.12926745 = queryWeight, product of:
                1.2730794 = boost
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.018526085 = queryNorm
              0.2997355 = fieldWeight in 519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.0546875 = fieldNorm(doc=519)
          0.0975334 = weight(abstract_txt:erschließung in 519) [ClassicSimilarity], result of:
            0.0975334 = score(doc=519,freq=4.0), product of:
              0.15069073 = queryWeight, product of:
                1.3745297 = boost
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.018526085 = queryNorm
              0.6472422 = fieldWeight in 519, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.9176426 = idf(docFreq=324, maxDocs=44421)
                0.0546875 = fieldNorm(doc=519)
          0.10843812 = weight(abstract_txt:automatischen in 519) [ClassicSimilarity], result of:
            0.10843812 = score(doc=519,freq=2.0), product of:
              0.20375845 = queryWeight, product of:
                1.5983382 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.018526085 = queryNorm
              0.53218955 = fieldWeight in 519, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.0546875 = fieldNorm(doc=519)
          0.080752306 = weight(abstract_txt:automatisch in 519) [ClassicSimilarity], result of:
            0.080752306 = score(doc=519,freq=1.0), product of:
              0.21091506 = queryWeight, product of:
                1.6261653 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.018526085 = queryNorm
              0.38286647 = fieldWeight in 519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0546875 = fieldNorm(doc=519)
          0.11708071 = weight(abstract_txt:verfahren in 519) [ClassicSimilarity], result of:
            0.11708071 = score(doc=519,freq=3.0), product of:
              0.21444596 = queryWeight, product of:
                2.0082393 = boost
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.018526085 = queryNorm
              0.54596835 = fieldWeight in 519, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.0546875 = fieldNorm(doc=519)
          0.073960476 = weight(abstract_txt:elektronischen in 519) [ClassicSimilarity], result of:
            0.073960476 = score(doc=519,freq=1.0), product of:
              0.22770251 = queryWeight, product of:
                2.0693808 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.018526085 = queryNorm
              0.32481185 = fieldWeight in 519, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.0546875 = fieldNorm(doc=519)
        0.24 = coord(6/25)
    
  5. Glaesener, L.: Automatisches Indexieren einer informationswissenschaftlichen Datenbank mit Mehrwortgruppen (2012) 0.12
    0.11882238 = sum of:
      0.11882238 = product of:
        0.7426399 = sum of:
          0.12524612 = weight(abstract_txt:ergebnisse in 1401) [ClassicSimilarity], result of:
            0.12524612 = score(doc=1401,freq=2.0), product of:
              0.12926745 = queryWeight, product of:
                1.2730794 = boost
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.018526085 = queryNorm
              0.9688914 = fieldWeight in 1401, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.125 = fieldNorm(doc=1401)
          0.17526247 = weight(abstract_txt:automatischen in 1401) [ClassicSimilarity], result of:
            0.17526247 = score(doc=1401,freq=1.0), product of:
              0.20375845 = queryWeight, product of:
                1.5983382 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.018526085 = queryNorm
              0.86014825 = fieldWeight in 1401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.125 = fieldNorm(doc=1401)
          0.14076978 = weight(abstract_txt:diesen in 1401) [ClassicSimilarity], result of:
            0.14076978 = score(doc=1401,freq=1.0), product of:
              0.20153928 = queryWeight, product of:
                1.9468673 = boost
                5.5877852 = idf(docFreq=451, maxDocs=44421)
                0.018526085 = queryNorm
              0.69847316 = fieldWeight in 1401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5877852 = idf(docFreq=451, maxDocs=44421)
                0.125 = fieldNorm(doc=1401)
          0.3013615 = weight(abstract_txt:texten in 1401) [ClassicSimilarity], result of:
            0.3013615 = score(doc=1401,freq=1.0), product of:
              0.33476904 = queryWeight, product of:
                2.5091646 = boost
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.018526085 = queryNorm
              0.9002072 = fieldWeight in 1401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.125 = fieldNorm(doc=1401)
        0.16 = coord(4/25)
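
The content scores combine more factors than the author scores. As a rough sketch, the arithmetic of entry 5 (doc 1401) can be retraced as follows, with all numbers copied from the trace above: each matching term contributes queryWeight * fieldWeight, where queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm, and the sum is scaled by coord (matching terms divided by the 25 query terms). The variable and function names are hypothetical, and tf = sqrt(freq) is an assumption that agrees with the printed tf values.

```python
import math

QUERY_NORM = 0.018526085  # queryNorm, identical for all terms in the traces
QUERY_TERMS = 25          # denominator of coord(4/25)

# (term, boost, idf, freq, fieldNorm), copied from the trace for doc 1401
MATCHES_DOC_1401 = [
    ("ergebnisse", 1.2730794, 5.4808774, 2.0, 0.125),
    ("automatischen", 1.5983382, 6.881186, 1.0, 0.125),
    ("diesen", 1.9468673, 5.5877852, 1.0, 0.125),
    ("texten", 2.5091646, 7.201658, 1.0, 0.125),
]


def term_score(boost, idf, freq, field_norm, query_norm=QUERY_NORM):
    # Query side: how much the term matters within the query.
    query_weight = boost * idf * query_norm
    # Document side: how strongly the term is present in the field.
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


def document_score(matches, query_terms):
    # Sum of the per-term scores, scaled by the share of query terms matched.
    total = sum(term_score(b, i, f, n) for _, b, i, f, n in matches)
    coord = len(matches) / query_terms
    return coord * total


score_doc_1401 = document_score(MATCHES_DOC_1401, QUERY_TERMS)
print(round(score_doc_1401, 4))
```

The coord factor explains why entry 5 ranks last even though its per-term sum (0.7426399) is the highest in the list: it matches only 4 of the 25 query terms.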