Document (#26563)

Author
Walther, R.
Title
Möglichkeiten und Grenzen automatischer Klassifikationen von Web-Dokumenten
Imprint
Bern : Rechts- und Wirtschaftswissenschaftliche Fakultät
Year
2001
Pages
97 pp.
Abstract
Automatic classification of web and other text documents makes it possible to provide ordered access to a company's internal and external information. Research on automatic classification has intensified in recent years. The result is a range of methods that are used in practice today, individually or in combination. This licentiate thesis covers general principles, examines several methods of automatic classification in more detail, and discusses their possibilities and limits. It also presents the results of a survey of vendors of software solutions for the automatic classification of text documents. The findings serve myax internet AG as a basis for developing its own classification product.
Content
Also available at: http://www.ie.iwi.unibe.ch/roundtable/april/hostettler/lizwalther.pdf
Footnote
Licentiate thesis (Lizenziatsarbeit) at the Rechts- und Wirtschaftswissenschaftliche Fakultät of the Universität Bern, Institut für Wirtschaftsinformatik (Prof. G. Knolmayer)
Theme
Automatisches Klassifizieren
Internet

Similar documents (author)

  1. Walther, J.: La construction d'un langage documentaire pluridisciplinaire (1992) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:walther in 2309) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 2309, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=2309)
    
  2. Walther, C.: Wie Deutschland zur Dezimalklassifikation kam (1957) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:walther in 5009) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 5009, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=5009)
    
  3. Walther, R.: In vierundzwanzig Bänden um die Welt : Die Neuauflage des 'Großen Brockhaus': wie die Enzyklopädie das Wissen der Gegenwart inventarisiert (1996) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:walther in 6280) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 6280, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=6280)
    
  4. Walther, R.: Wille und Kraft aller einzelnen Glieder : Mit Abschluß seiner 20. Auflage wird der 'Brockhaus' eingestellt (1999) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:walther in 3388) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 3388, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=3388)
    
  5. Walther, R.: Wanderung aus gestorbenen Systemen : Bibliotheken bemühen sich, digital archivierte Texte trotz des Wandels der Technik zugänglich zu halten (2003) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:walther in 1483) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 1483, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=1483)
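
The five author matches above all score 5.6180234 because each tree multiplies the same three factors: tf, idf and fieldNorm. The following is a minimal sketch of that arithmetic, not the catalog's actual code. The function names are hypothetical, and the idf form used here (1 + ln(maxDocs / (docFreq + 1))) is an assumption chosen because it reproduces the listed value 8.988837; other Lucene versions define idf slightly differently.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency; assumed form that matches the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq: float) -> float:
    """Term frequency factor: square root of the raw term count."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain trees above."""
    return tf(freq) * idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:walther trees above.
author_score = field_weight(freq=1.0, doc_freq=14, max_docs=44218, field_norm=0.625)
print(author_score)  # should agree with 5.6180234 up to float rounding
```

Since the query has a single term, the listing shows no queryWeight or coord factor for these matches; the fieldWeight is the whole score.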
    

Similar documents (content)

  1. Hoffmann, R.: Entwicklung einer benutzerunterstützten automatisierten Klassifikation von Web - Dokumenten : Untersuchung gegenwärtiger Methoden zur automatisierten Dokumentklassifikation und Implementierung eines Prototyps zum verbesserten Information Retrieval für das xFIND System (2002) 0.28
    0.27595574 = sum of:
      0.27595574 = product of:
        0.98555624 = sum of:
          0.06631344 = weight(abstract_txt:automatischer in 4197) [ClassicSimilarity], result of:
            0.06631344 = score(doc=4197,freq=1.0), product of:
              0.16983865 = queryWeight, product of:
                1.1307925 = boost
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.018031418 = queryNorm
              0.3904496 = fieldWeight in 4197, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.329592 = idf(docFreq=28, maxDocs=44218)
                0.046875 = fieldNorm(doc=4197)
          0.038973577 = weight(abstract_txt:möglichkeiten in 4197) [ClassicSimilarity], result of:
            0.038973577 = score(doc=4197,freq=1.0), product of:
              0.1501386 = queryWeight, product of:
                1.5035776 = boost
                5.5377917 = idf(docFreq=472, maxDocs=44218)
                0.018031418 = queryNorm
              0.25958398 = fieldWeight in 4197, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5377917 = idf(docFreq=472, maxDocs=44218)
                0.046875 = fieldNorm(doc=4197)
          0.088784374 = weight(abstract_txt:methoden in 4197) [ClassicSimilarity], result of:
            0.088784374 = score(doc=4197,freq=4.0), product of:
              0.16375072 = queryWeight, product of:
                1.570259 = boost
                5.7833843 = idf(docFreq=369, maxDocs=44218)
                0.018031418 = queryNorm
              0.5421923 = fieldWeight in 4197, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7833843 = idf(docFreq=369, maxDocs=44218)
                0.046875 = fieldNorm(doc=4197)
          0.086869255 = weight(abstract_txt:dokumenten in 4197) [ClassicSimilarity], result of:
            0.086869255 = score(doc=4197,freq=2.0), product of:
              0.20333536 = queryWeight, product of:
                1.7497908 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.018031418 = queryNorm
              0.4272216 = fieldWeight in 4197, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.046875 = fieldNorm(doc=4197)
          0.16686596 = weight(abstract_txt:automatischen in 4197) [ClassicSimilarity], result of:
            0.16686596 = score(doc=4197,freq=5.0), product of:
              0.23150872 = queryWeight, product of:
                1.8670818 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.018031418 = queryNorm
              0.72077614 = fieldWeight in 4197, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.046875 = fieldNorm(doc=4197)
          0.075972706 = weight(abstract_txt:automatische in 4197) [ClassicSimilarity], result of:
            0.075972706 = score(doc=4197,freq=1.0), product of:
              0.23428828 = queryWeight, product of:
                1.8782567 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.018031418 = queryNorm
              0.3242702 = fieldWeight in 4197, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.046875 = fieldNorm(doc=4197)
          0.461777 = weight(title_txt:klassifikation in 4197) [ClassicSimilarity], result of:
            0.461777 = score(doc=4197,freq=1.0), product of:
              0.3901549 = queryWeight, product of:
                3.427782 = boost
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.018031418 = queryNorm
              1.1835735 = fieldWeight in 4197, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.1875 = fieldNorm(doc=4197)
        0.28 = coord(7/25)
    
  2. Oberhauser, O.: Klassifikation in Online-Informationssystemen (1986) 0.24
    0.2383683 = sum of:
      0.2383683 = product of:
        1.4898019 = sum of:
          0.045987476 = weight(abstract_txt:sind in 588) [ClassicSimilarity], result of:
            0.045987476 = score(doc=588,freq=4.0), product of:
              0.07513104 = queryWeight, product of:
                1.063627 = boost
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.018031418 = queryNorm
              0.6120969 = fieldWeight in 588, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.078125 = fieldNorm(doc=588)
          0.064955965 = weight(abstract_txt:möglichkeiten in 588) [ClassicSimilarity], result of:
            0.064955965 = score(doc=588,freq=1.0), product of:
              0.1501386 = queryWeight, product of:
                1.5035776 = boost
                5.5377917 = idf(docFreq=472, maxDocs=44218)
                0.018031418 = queryNorm
              0.43264 = fieldWeight in 588, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5377917 = idf(docFreq=472, maxDocs=44218)
                0.078125 = fieldNorm(doc=588)
          0.14745301 = weight(abstract_txt:klassifikationen in 588) [ClassicSimilarity], result of:
            0.14745301 = score(doc=588,freq=1.0), product of:
              0.25932762 = queryWeight, product of:
                1.9760778 = boost
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.018031418 = queryNorm
              0.5685974 = fieldWeight in 588, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.078125 = fieldNorm(doc=588)
          1.2314054 = weight(title_txt:klassifikation in 588) [ClassicSimilarity], result of:
            1.2314054 = score(doc=588,freq=1.0), product of:
              0.3901549 = queryWeight, product of:
                3.427782 = boost
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.018031418 = queryNorm
              3.156196 = fieldWeight in 588, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.5 = fieldNorm(doc=588)
        0.16 = coord(4/25)
    
  3. Degens, P.O.: Hierarchische Klassifikation (1980) 0.23
    0.23065646 = sum of:
      0.23065646 = product of:
        1.9221373 = sum of:
          0.090938345 = weight(abstract_txt:möglichkeiten in 89) [ClassicSimilarity], result of:
            0.090938345 = score(doc=89,freq=1.0), product of:
              0.1501386 = queryWeight, product of:
                1.5035776 = boost
                5.5377917 = idf(docFreq=472, maxDocs=44218)
                0.018031418 = queryNorm
              0.60569596 = fieldWeight in 89, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5377917 = idf(docFreq=472, maxDocs=44218)
                0.109375 = fieldNorm(doc=89)
          0.29194206 = weight(abstract_txt:klassifikationen in 89) [ClassicSimilarity], result of:
            0.29194206 = score(doc=89,freq=2.0), product of:
              0.25932762 = queryWeight, product of:
                1.9760778 = boost
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.018031418 = queryNorm
              1.1257654 = fieldWeight in 89, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.109375 = fieldNorm(doc=89)
          1.5392568 = weight(title_txt:klassifikation in 89) [ClassicSimilarity], result of:
            1.5392568 = score(doc=89,freq=1.0), product of:
              0.3901549 = queryWeight, product of:
                3.427782 = boost
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.018031418 = queryNorm
              3.9452453 = fieldWeight in 89, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.625 = fieldNorm(doc=89)
        0.12 = coord(3/25)
    
  4. Manecke, H.-J.: Klassifikation, Klassieren (2004) 0.21
    0.21131778 = sum of:
      0.21131778 = product of:
        1.7609816 = sum of:
          0.02389579 = weight(abstract_txt:sind in 2902) [ClassicSimilarity], result of:
            0.02389579 = score(doc=2902,freq=3.0), product of:
              0.07513104 = queryWeight, product of:
                1.063627 = boost
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.018031418 = queryNorm
              0.31805485 = fieldWeight in 2902, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9174201 = idf(docFreq=2390, maxDocs=44218)
                0.046875 = fieldNorm(doc=2902)
          0.19782898 = weight(abstract_txt:klassifikationen in 2902) [ClassicSimilarity], result of:
            0.19782898 = score(doc=2902,freq=5.0), product of:
              0.25932762 = queryWeight, product of:
                1.9760778 = boost
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.018031418 = queryNorm
              0.7628535 = fieldWeight in 2902, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.046875 = fieldNorm(doc=2902)
          1.5392568 = weight(title_txt:klassifikation in 2902) [ClassicSimilarity], result of:
            1.5392568 = score(doc=2902,freq=1.0), product of:
              0.3901549 = queryWeight, product of:
                3.427782 = boost
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.018031418 = queryNorm
              3.9452453 = fieldWeight in 2902, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.625 = fieldNorm(doc=2902)
        0.12 = coord(3/25)
    
  5. Krauth, J.: Evaluation von Verfahren der automatischen Klassifikation (1983) 0.18
    0.18374245 = sum of:
      0.18374245 = product of:
        1.531187 = sum of:
          0.19899927 = weight(abstract_txt:automatischen in 111) [ClassicSimilarity], result of:
            0.19899927 = score(doc=111,freq=1.0), product of:
              0.23150872 = queryWeight, product of:
                1.8670818 = boost
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.018031418 = queryNorm
              0.8595757 = fieldWeight in 111, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8766055 = idf(docFreq=123, maxDocs=44218)
                0.125 = fieldNorm(doc=111)
          0.40863377 = weight(abstract_txt:klassifikationen in 111) [ClassicSimilarity], result of:
            0.40863377 = score(doc=111,freq=3.0), product of:
              0.25932762 = queryWeight, product of:
                1.9760778 = boost
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.018031418 = queryNorm
              1.5757433 = fieldWeight in 111, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.2780466 = idf(docFreq=82, maxDocs=44218)
                0.125 = fieldNorm(doc=111)
          0.923554 = weight(title_txt:klassifikation in 111) [ClassicSimilarity], result of:
            0.923554 = score(doc=111,freq=1.0), product of:
              0.3901549 = queryWeight, product of:
                3.427782 = boost
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.018031418 = queryNorm
              2.367147 = fieldWeight in 111, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.312392 = idf(docFreq=217, maxDocs=44218)
                0.375 = fieldNorm(doc=111)
        0.12 = coord(3/25)
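
The content-based scores combine more factors than the author matches: each matching term contributes queryWeight * fieldWeight, the contributions are summed, and the sum is scaled by coord (matched terms divided by the 25 query terms). As a hedged sketch, the code below recomputes result 3 (Degens, doc=89) from the values in its tree. The helper names are hypothetical and the idf form is the same assumption as before, chosen because it reproduces the listed idf values.

```python
import math

MAX_DOCS = 44218          # maxDocs from the listing
QUERY_NORM = 0.018031418  # queryNorm from the listing
TOTAL_QUERY_TERMS = 25    # denominator of coord(3/25)


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Assumed idf form that matches the listed values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    """One term's contribution: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (boost, docFreq, freq, fieldNorm) copied from the tree for doc=89.
matched_terms = [
    (1.5035776, 472, 1.0, 0.109375),  # abstract_txt:möglichkeiten
    (1.9760778, 82, 2.0, 0.109375),   # abstract_txt:klassifikationen
    (3.427782, 217, 1.0, 0.625),      # title_txt:klassifikation
]

raw_sum = sum(term_score(*term) for term in matched_terms)
coord = len(matched_terms) / TOTAL_QUERY_TERMS
final_score = raw_sum * coord
print(raw_sum, coord, final_score)  # compare with 1.9221373, 0.12, 0.23065646
```

The coord factor explains why result 1 ranks first despite a smaller raw sum (0.98555624): it matches 7 of the 25 query terms, so it is scaled by 0.28 rather than 0.12.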