Document (#28391)

Author
Sebastiani, F.
Title
¬A tutorial an automated text categorisation
Source
http://net.pku.edu.cn/~webg/papers/sebastiani99tutorial.pdf
Year
1999
Abstract
The automated categorisation (or classification) of texts into topical categories has a long history, dating back at least to 1960. Until the late '80s, the dominant approach to the problem involved knowledge-engineering automatic categorisers, i.e. manually building a set of rules encoding expert knowledge an how to classify documents. In the '90s, with the booming production and availability of on-line documents, automated text categorisation has witnessed an increased and renewed interest. A newer paradigm based an machine learning has superseded the previous approach. Within this paradigm, a general inductive process automatically builds a classifier by "learning", from a set of previously classified documents, the characteristics of one or more categories; the advantages are a very good effectiveness, a considerable savings in terms of expert manpower, and domain independence. In this tutorial we look at the main approaches that have been taken towards automatic text categorisation within the general machine learning paradigm. Issues of document indexing, classifier construction, and classifier evaluation, will be touched upon.
Content
Aus: Proceedings of THAI-99, European Symposium on Telematics, Hypermedia and Artificial Intelligence
Theme
Automatisches Klassifizieren
Computerlinguistik

Similar documents (author)

  1. Sebastiani, F.: On the role of logic in information retrieval (1998) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:sebastiani in 2140) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 2140, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=2140)
    
  2. Sebastiani, F.: Machine learning in automated text categorization (2002) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:sebastiani in 4389) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 4389, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=4389)
    
  3. Sebastiani, F.: Classification of text, automatic (2006) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:sebastiani in 3) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 3, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=3)
    
  4. Debole, F.; Sebastiani, F.: ¬An analysis of the relative hardness of Reuters-21578 subsets (2005) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:sebastiani in 4456) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 4456, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=4456)
    
  5. Giorgetti, D.; Sebastiani, F.: Automating survey coding by multiclass text categorization techniques (2003) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:sebastiani in 172) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 172, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=172)
    

Similar documents (content)

  1. Sebastiani, F.: Machine learning in automated text categorization (2002) 0.93
    0.9326973 = sum of:
      0.9326973 = product of:
        1.4573395 = sum of:
          0.029977422 = weight(abstract_txt:approach in 4389) [ClassicSimilarity], result of:
            0.029977422 = score(doc=4389,freq=3.0), product of:
              0.05921602 = queryWeight, product of:
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.015828319 = queryNorm
              0.5062384 = fieldWeight in 4389, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.074750654 = weight(abstract_txt:inductive in 4389) [ClassicSimilarity], result of:
            0.074750654 = score(doc=4389,freq=1.0), product of:
              0.12464746 = queryWeight, product of:
                1.0259049 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.015828319 = queryNorm
              0.5996966 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.024322867 = weight(abstract_txt:within in 4389) [ClassicSimilarity], result of:
            0.024322867 = score(doc=4389,freq=1.0), product of:
              0.07429506 = queryWeight, product of:
                1.1201092 = boost
                4.19049 = idf(docFreq=1827, maxDocs=44421)
                0.015828319 = queryNorm
              0.32738203 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.19049 = idf(docFreq=1827, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.105502844 = weight(abstract_txt:savings in 4389) [ClassicSimilarity], result of:
            0.105502844 = score(doc=4389,freq=1.0), product of:
              0.15683737 = queryWeight, product of:
                1.1507744 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.015828319 = queryNorm
              0.67268944 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.026868641 = weight(abstract_txt:general in 4389) [ClassicSimilarity], result of:
            0.026868641 = score(doc=4389,freq=1.0), product of:
              0.0793927 = queryWeight, product of:
                1.157899 = boost
                4.3318667 = idf(docFreq=1586, maxDocs=44421)
                0.015828319 = queryNorm
              0.3384271 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3318667 = idf(docFreq=1586, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.11305252 = weight(abstract_txt:witnessed in 4389) [ClassicSimilarity], result of:
            0.11305252 = score(doc=4389,freq=1.0), product of:
              0.16423294 = queryWeight, product of:
                1.1775938 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.015828319 = queryNorm
              0.6883669 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.13308825 = weight(abstract_txt:booming in 4389) [ClassicSimilarity], result of:
            0.13308825 = score(doc=4389,freq=1.0), product of:
              0.18310487 = queryWeight, product of:
                1.2434129 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.015828319 = queryNorm
              0.7268416 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.06489296 = weight(abstract_txt:categories in 4389) [ClassicSimilarity], result of:
            0.06489296 = score(doc=4389,freq=2.0), product of:
              0.113432765 = queryWeight, product of:
                1.3840432 = boost
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.015828319 = queryNorm
              0.57208306 = fieldWeight in 4389, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.068611614 = weight(abstract_txt:machine in 4389) [ClassicSimilarity], result of:
            0.068611614 = score(doc=4389,freq=2.0), product of:
              0.11772586 = queryWeight, product of:
                1.4099909 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.015828319 = queryNorm
              0.5828084 = fieldWeight in 4389, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.06169822 = weight(abstract_txt:expert in 4389) [ClassicSimilarity], result of:
            0.06169822 = score(doc=4389,freq=1.0), product of:
              0.13818637 = queryWeight, product of:
                1.5276117 = boost
                5.7150154 = idf(docFreq=397, maxDocs=44421)
                0.015828319 = queryNorm
              0.44648558 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7150154 = idf(docFreq=397, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.032714494 = weight(abstract_txt:text in 4389) [ClassicSimilarity], result of:
            0.032714494 = score(doc=4389,freq=1.0), product of:
              0.10362726 = queryWeight, product of:
                1.620179 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.015828319 = queryNorm
              0.3156939 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.04915508 = weight(abstract_txt:documents in 4389) [ClassicSimilarity], result of:
            0.04915508 = score(doc=4389,freq=2.0), product of:
              0.10789868 = queryWeight, product of:
                1.6532332 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.015828319 = queryNorm
              0.455567 = fieldWeight in 4389, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.091574796 = weight(abstract_txt:learning in 4389) [ClassicSimilarity], result of:
            0.091574796 = score(doc=4389,freq=3.0), product of:
              0.14271098 = queryWeight, product of:
                1.9013178 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.015828319 = queryNorm
              0.64168006 = fieldWeight in 4389, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.08712503 = weight(abstract_txt:automated in 4389) [ClassicSimilarity], result of:
            0.08712503 = score(doc=4389,freq=1.0), product of:
              0.19910209 = queryWeight, product of:
                2.245763 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.015828319 = queryNorm
              0.43758973 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.11657288 = weight(abstract_txt:paradigm in 4389) [ClassicSimilarity], result of:
            0.11657288 = score(doc=4389,freq=1.0), product of:
              0.24175689 = queryWeight, product of:
                2.4746594 = boost
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.015828319 = queryNorm
              0.48219052 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.37743145 = weight(abstract_txt:classifier in 4389) [ClassicSimilarity], result of:
            0.37743145 = score(doc=4389,freq=4.0), product of:
              0.33331326 = queryWeight, product of:
                2.9057105 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.015828319 = queryNorm
              1.1323626 = fieldWeight in 4389, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
        0.64 = coord(16/25)
    
  2. Sebastiani, F.: Classification of text, automatic (2006) 0.23
    0.22584647 = sum of:
      0.22584647 = product of:
        0.70577025 = sum of:
          0.020768967 = weight(abstract_txt:approach in 3) [ClassicSimilarity], result of:
            0.020768967 = score(doc=3,freq=1.0), product of:
              0.05921602 = queryWeight, product of:
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.015828319 = queryNorm
              0.35073224 = fieldWeight in 3, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.09375 = fieldNorm(doc=3)
          0.07787156 = weight(abstract_txt:categories in 3) [ClassicSimilarity], result of:
            0.07787156 = score(doc=3,freq=2.0), product of:
              0.113432765 = queryWeight, product of:
                1.3840432 = boost
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.015828319 = queryNorm
              0.6864997 = fieldWeight in 3, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.09375 = fieldNorm(doc=3)
          0.055632643 = weight(abstract_txt:automatic in 3) [ClassicSimilarity], result of:
            0.055632643 = score(doc=3,freq=1.0), product of:
              0.11421305 = queryWeight, product of:
                1.3887954 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.015828319 = queryNorm
              0.48709533 = fieldWeight in 3, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.09375 = fieldNorm(doc=3)
          0.058218885 = weight(abstract_txt:machine in 3) [ClassicSimilarity], result of:
            0.058218885 = score(doc=3,freq=1.0), product of:
              0.11772586 = queryWeight, product of:
                1.4099909 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.015828319 = queryNorm
              0.4945293 = fieldWeight in 3, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.09375 = fieldNorm(doc=3)
          0.055518337 = weight(abstract_txt:text in 3) [ClassicSimilarity], result of:
            0.055518337 = score(doc=3,freq=2.0), product of:
              0.10362726 = queryWeight, product of:
                1.620179 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.015828319 = queryNorm
              0.5357503 = fieldWeight in 3, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=3)
          0.063444875 = weight(abstract_txt:learning in 3) [ClassicSimilarity], result of:
            0.063444875 = score(doc=3,freq=1.0), product of:
              0.14271098 = queryWeight, product of:
                1.9013178 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.015828319 = queryNorm
              0.444569 = fieldWeight in 3, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.09375 = fieldNorm(doc=3)
          0.14785607 = weight(abstract_txt:automated in 3) [ClassicSimilarity], result of:
            0.14785607 = score(doc=3,freq=2.0), product of:
              0.19910209 = queryWeight, product of:
                2.245763 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.015828319 = queryNorm
              0.7426144 = fieldWeight in 3, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.09375 = fieldNorm(doc=3)
          0.22645888 = weight(abstract_txt:classifier in 3) [ClassicSimilarity], result of:
            0.22645888 = score(doc=3,freq=1.0), product of:
              0.33331326 = queryWeight, product of:
                2.9057105 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.015828319 = queryNorm
              0.67941755 = fieldWeight in 3, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.09375 = fieldNorm(doc=3)
        0.32 = coord(8/25)
    
  3. Ko, Y.; Seo, J.: Text classification from unlabeled documents with bootstrapping and feature projection techniques (2009) 0.22
    0.21998061 = sum of:
      0.21998061 = product of:
        0.68743944 = sum of:
          0.059800524 = weight(abstract_txt:inductive in 3452) [ClassicSimilarity], result of:
            0.059800524 = score(doc=3452,freq=1.0), product of:
              0.12464746 = queryWeight, product of:
                1.0259049 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.015828319 = queryNorm
              0.47975725 = fieldWeight in 3452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.021494912 = weight(abstract_txt:general in 3452) [ClassicSimilarity], result of:
            0.021494912 = score(doc=3452,freq=1.0), product of:
              0.0793927 = queryWeight, product of:
                1.157899 = boost
                4.3318667 = idf(docFreq=1586, maxDocs=44421)
                0.015828319 = queryNorm
              0.27074167 = fieldWeight in 3452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3318667 = idf(docFreq=1586, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.05488929 = weight(abstract_txt:machine in 3452) [ClassicSimilarity], result of:
            0.05488929 = score(doc=3452,freq=2.0), product of:
              0.11772586 = queryWeight, product of:
                1.4099909 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.015828319 = queryNorm
              0.4662467 = fieldWeight in 3452, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.06924353 = weight(abstract_txt:text in 3452) [ClassicSimilarity], result of:
            0.06924353 = score(doc=3452,freq=7.0), product of:
              0.10362726 = queryWeight, product of:
                1.620179 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.015828319 = queryNorm
              0.66819805 = fieldWeight in 3452, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.055612627 = weight(abstract_txt:documents in 3452) [ClassicSimilarity], result of:
            0.055612627 = score(doc=3452,freq=4.0), product of:
              0.10789868 = queryWeight, product of:
                1.6532332 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.015828319 = queryNorm
              0.51541525 = fieldWeight in 3452, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.1196328 = weight(abstract_txt:learning in 3452) [ClassicSimilarity], result of:
            0.1196328 = score(doc=3452,freq=8.0), product of:
              0.14271098 = queryWeight, product of:
                1.9013178 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.015828319 = queryNorm
              0.8382873 = fieldWeight in 3452, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.0932583 = weight(abstract_txt:paradigm in 3452) [ClassicSimilarity], result of:
            0.0932583 = score(doc=3452,freq=1.0), product of:
              0.24175689 = queryWeight, product of:
                2.4746594 = boost
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.015828319 = queryNorm
              0.3857524 = fieldWeight in 3452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
          0.21350747 = weight(abstract_txt:classifier in 3452) [ClassicSimilarity], result of:
            0.21350747 = score(doc=3452,freq=2.0), product of:
              0.33331326 = queryWeight, product of:
                2.9057105 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.015828319 = queryNorm
              0.640561 = fieldWeight in 3452, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=3452)
        0.32 = coord(8/25)
    
  4. Li, T.; Zhu, S.; Ogihara, M.: Hierarchical document classification using automatically generated hierarchy (2007) 0.17
    0.17371197 = sum of:
      0.17371197 = product of:
        0.5428499 = sum of:
          0.017307471 = weight(abstract_txt:approach in 797) [ClassicSimilarity], result of:
            0.017307471 = score(doc=797,freq=1.0), product of:
              0.05921602 = queryWeight, product of:
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.015828319 = queryNorm
              0.29227686 = fieldWeight in 797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.11305252 = weight(abstract_txt:witnessed in 797) [ClassicSimilarity], result of:
            0.11305252 = score(doc=797,freq=1.0), product of:
              0.16423294 = queryWeight, product of:
                1.1775938 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.015828319 = queryNorm
              0.6883669 = fieldWeight in 797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.13308825 = weight(abstract_txt:booming in 797) [ClassicSimilarity], result of:
            0.13308825 = score(doc=797,freq=1.0), product of:
              0.18310487 = queryWeight, product of:
                1.2434129 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.015828319 = queryNorm
              0.7268416 = fieldWeight in 797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.06489296 = weight(abstract_txt:categories in 797) [ClassicSimilarity], result of:
            0.06489296 = score(doc=797,freq=2.0), product of:
              0.113432765 = queryWeight, product of:
                1.3840432 = boost
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.015828319 = queryNorm
              0.57208306 = fieldWeight in 797, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.046360534 = weight(abstract_txt:automatic in 797) [ClassicSimilarity], result of:
            0.046360534 = score(doc=797,freq=1.0), product of:
              0.11421305 = queryWeight, product of:
                1.3887954 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.015828319 = queryNorm
              0.40591276 = fieldWeight in 797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.04626528 = weight(abstract_txt:text in 797) [ClassicSimilarity], result of:
            0.04626528 = score(doc=797,freq=2.0), product of:
              0.10362726 = queryWeight, product of:
                1.620179 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.015828319 = queryNorm
              0.4464586 = fieldWeight in 797, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.03475789 = weight(abstract_txt:documents in 797) [ClassicSimilarity], result of:
            0.03475789 = score(doc=797,freq=1.0), product of:
              0.10789868 = queryWeight, product of:
                1.6532332 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.015828319 = queryNorm
              0.32213452 = fieldWeight in 797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.08712503 = weight(abstract_txt:automated in 797) [ClassicSimilarity], result of:
            0.08712503 = score(doc=797,freq=1.0), product of:
              0.19910209 = queryWeight, product of:
                2.245763 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.015828319 = queryNorm
              0.43758973 = fieldWeight in 797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
        0.32 = coord(8/25)
    
  5. Ruiz, M.E.; Srinivasan, P.: Combining machine learning and hierarchical indexing structures for text categorization (2001) 0.14
    0.13553566 = sum of:
      0.13553566 = product of:
        0.5647319 = sum of:
          0.05506351 = weight(abstract_txt:categories in 2595) [ClassicSimilarity], result of:
            0.05506351 = score(doc=2595,freq=1.0), product of:
              0.113432765 = queryWeight, product of:
                1.3840432 = boost
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.015828319 = queryNorm
              0.4854286 = fieldWeight in 2595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.09375 = fieldNorm(doc=2595)
          0.055632643 = weight(abstract_txt:automatic in 2595) [ClassicSimilarity], result of:
            0.055632643 = score(doc=2595,freq=1.0), product of:
              0.11421305 = queryWeight, product of:
                1.3887954 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.015828319 = queryNorm
              0.48709533 = fieldWeight in 2595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.09375 = fieldNorm(doc=2595)
          0.08233394 = weight(abstract_txt:machine in 2595) [ClassicSimilarity], result of:
            0.08233394 = score(doc=2595,freq=2.0), product of:
              0.11772586 = queryWeight, product of:
                1.4099909 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.015828319 = queryNorm
              0.69937 = fieldWeight in 2595, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.09375 = fieldNorm(doc=2595)
          0.055518337 = weight(abstract_txt:text in 2595) [ClassicSimilarity], result of:
            0.055518337 = score(doc=2595,freq=2.0), product of:
              0.10362726 = queryWeight, product of:
                1.620179 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.015828319 = queryNorm
              0.5357503 = fieldWeight in 2595, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=2595)
          0.0897246 = weight(abstract_txt:learning in 2595) [ClassicSimilarity], result of:
            0.0897246 = score(doc=2595,freq=2.0), product of:
              0.14271098 = queryWeight, product of:
                1.9013178 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.015828319 = queryNorm
              0.62871546 = fieldWeight in 2595, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.09375 = fieldNorm(doc=2595)
          0.22645888 = weight(abstract_txt:classifier in 2595) [ClassicSimilarity], result of:
            0.22645888 = score(doc=2595,freq=1.0), product of:
              0.33331326 = queryWeight, product of:
                2.9057105 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.015828319 = queryNorm
              0.67941755 = fieldWeight in 2595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.09375 = fieldNorm(doc=2595)
        0.24 = coord(6/25)