Document (#34453)

Author
Ko, Y.
Seo, J.
Title
Text classification from unlabeled documents with bootstrapping and feature projection techniques
Source
Information processing and management. 45(2009) no.1, S.70-83
Year
2009
Abstract
Many machine learning algorithms have been applied to text classification tasks. In the machine learning paradigm, a general inductive process automatically builds a text classifier by learning, generally known as supervised learning. However, the supervised learning approaches have some problems. The most notable problem is that they require a large number of labeled training documents for accurate learning. While unlabeled documents are easily collected and plentiful, labeled documents are difficultly generated because a labeling task must be done by human developers. In this paper, we propose a new text classification method based on unsupervised or semi-supervised learning. The proposed method launches text classification tasks with only unlabeled documents and the title word of each category for learning, and then it automatically learns text classifier by using bootstrapping and feature projection techniques. The results of experiments showed that the proposed method achieved reasonably useful performance compared to a supervised method. If the proposed method is used in a text classification task, building text classification systems will become significantly faster and less expensive.
Theme
Automatisches Klassifizieren

Similar documents (content)

  1. Li, M.; Li, H.; Zhou, Z.-H.: Semi-supervised document retrieval (2009) 0.60
    0.59721386 = sum of:
      0.59721386 = product of:
        1.4930346 = sum of:
          0.047840986 = weight(abstract_txt:unsupervised in 218) [ClassicSimilarity], result of:
            0.047840986 = score(doc=218,freq=1.0), product of:
              0.10040173 = queryWeight, product of:
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.013169288 = queryNorm
              0.47649562 = fieldWeight in 218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.07330068 = weight(abstract_txt:labeling in 218) [ClassicSimilarity], result of:
            0.07330068 = score(doc=218,freq=2.0), product of:
              0.105909884 = queryWeight, product of:
                1.0270643 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.013169288 = queryNorm
              0.6921043 = fieldWeight in 218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.05489291 = weight(abstract_txt:machine in 218) [ClassicSimilarity], result of:
            0.05489291 = score(doc=218,freq=3.0), product of:
              0.0961291 = queryWeight, product of:
                1.3837953 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.013169288 = queryNorm
              0.57103324 = fieldWeight in 218, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.031691294 = weight(abstract_txt:proposed in 218) [ClassicSimilarity], result of:
            0.031691294 = score(doc=218,freq=1.0), product of:
              0.11003771 = queryWeight, product of:
                1.813263 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.013169288 = queryNorm
              0.28800395 = fieldWeight in 218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.18646675 = weight(abstract_txt:labeled in 218) [ClassicSimilarity], result of:
            0.18646675 = score(doc=218,freq=4.0), product of:
              0.19736284 = queryWeight, product of:
                1.9827918 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.013169288 = queryNorm
              0.9447916 = fieldWeight in 218, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.0535168 = weight(abstract_txt:documents in 218) [ClassicSimilarity], result of:
            0.0535168 = score(doc=218,freq=2.0), product of:
              0.14684118 = queryWeight, product of:
                2.7041972 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.013169288 = queryNorm
              0.3644536 = fieldWeight in 218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.1099032 = weight(abstract_txt:method in 218) [ClassicSimilarity], result of:
            0.1099032 = score(doc=218,freq=5.0), product of:
              0.17480265 = queryWeight, product of:
                2.950451 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.013169288 = queryNorm
              0.6287273 = fieldWeight in 218, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.38030177 = weight(abstract_txt:unlabeled in 218) [ClassicSimilarity], result of:
            0.38030177 = score(doc=218,freq=2.0), product of:
              0.4577803 = queryWeight, product of:
                3.69844 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.013169288 = queryNorm
              0.8307517 = fieldWeight in 218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.31144792 = weight(abstract_txt:supervised in 218) [ClassicSimilarity], result of:
            0.31144792 = score(doc=218,freq=3.0), product of:
              0.38528106 = queryWeight, product of:
                3.9178538 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.013169288 = queryNorm
              0.8083655 = fieldWeight in 218, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.24367224 = weight(abstract_txt:learning in 218) [ClassicSimilarity], result of:
            0.24367224 = score(doc=218,freq=7.0), product of:
              0.31074858 = queryWeight, product of:
                4.975984 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.013169288 = queryNorm
              0.78414595 = fieldWeight in 218, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
        0.4 = coord(10/25)
    
  2. Suakkaphong, N.; Zhang, Z.; Chen, H.: Disease named entity recognition using semisupervised learning and conditional random fields (2011) 0.52
    0.51913893 = sum of:
      0.51913893 = product of:
        1.2978473 = sum of:
          0.051831413 = weight(abstract_txt:labeling in 367) [ClassicSimilarity], result of:
            0.051831413 = score(doc=367,freq=1.0), product of:
              0.105909884 = queryWeight, product of:
                1.0270643 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.013169288 = queryNorm
              0.48939165 = fieldWeight in 367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.028425746 = weight(abstract_txt:techniques in 367) [ClassicSimilarity], result of:
            0.028425746 = score(doc=367,freq=2.0), product of:
              0.07096034 = queryWeight, product of:
                1.188919 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.013169288 = queryNorm
              0.40058637 = fieldWeight in 367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.025492517 = weight(abstract_txt:task in 367) [ClassicSimilarity], result of:
            0.025492517 = score(doc=367,freq=1.0), product of:
              0.08314311 = queryWeight, product of:
                1.2869377 = boost
                4.9057617 = idf(docFreq=893, maxDocs=44421)
                0.013169288 = queryNorm
              0.3066101 = fieldWeight in 367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9057617 = idf(docFreq=893, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.044399057 = weight(abstract_txt:feature in 367) [ClassicSimilarity], result of:
            0.044399057 = score(doc=367,freq=1.0), product of:
              0.12035578 = queryWeight, product of:
                1.5483812 = boost
                5.9023747 = idf(docFreq=329, maxDocs=44421)
                0.013169288 = queryNorm
              0.36889842 = fieldWeight in 367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9023747 = idf(docFreq=329, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.09323338 = weight(abstract_txt:labeled in 367) [ClassicSimilarity], result of:
            0.09323338 = score(doc=367,freq=1.0), product of:
              0.19736284 = queryWeight, product of:
                1.9827918 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.013169288 = queryNorm
              0.4723958 = fieldWeight in 367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.28350756 = weight(abstract_txt:bootstrapping in 367) [ClassicSimilarity], result of:
            0.28350756 = score(doc=367,freq=2.0), product of:
              0.3287892 = queryWeight, product of:
                2.5591938 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.013169288 = queryNorm
              0.86227757 = fieldWeight in 367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.38030177 = weight(abstract_txt:unlabeled in 367) [ClassicSimilarity], result of:
            0.38030177 = score(doc=367,freq=2.0), product of:
              0.4577803 = queryWeight, product of:
                3.69844 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.013169288 = queryNorm
              0.8307517 = fieldWeight in 367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.17981455 = weight(abstract_txt:supervised in 367) [ClassicSimilarity], result of:
            0.17981455 = score(doc=367,freq=1.0), product of:
              0.38528106 = queryWeight, product of:
                3.9178538 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.013169288 = queryNorm
              0.46671006 = fieldWeight in 367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.08059293 = weight(abstract_txt:text in 367) [ClassicSimilarity], result of:
            0.08059293 = score(doc=367,freq=2.0), product of:
              0.22564502 = queryWeight, product of:
                4.240209 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.013169288 = queryNorm
              0.3571669 = fieldWeight in 367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
          0.1302483 = weight(abstract_txt:learning in 367) [ClassicSimilarity], result of:
            0.1302483 = score(doc=367,freq=2.0), product of:
              0.31074858 = queryWeight, product of:
                4.975984 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.013169288 = queryNorm
              0.41914365 = fieldWeight in 367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=367)
        0.4 = coord(10/25)
    
  3. Billal, B.; Fonseca, A.; Sadat, F.; Lounis, H.: Semi-supervised learning and social media text analysis towards multi-labeling categorization (2017) 0.35
    0.34563455 = sum of:
      0.34563455 = product of:
        0.96009594 = sum of:
          0.022305952 = weight(abstract_txt:task in 95) [ClassicSimilarity], result of:
            0.022305952 = score(doc=95,freq=1.0), product of:
              0.08314311 = queryWeight, product of:
                1.2869377 = boost
                4.9057617 = idf(docFreq=893, maxDocs=44421)
                0.013169288 = queryNorm
              0.26828384 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9057617 = idf(docFreq=893, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.027730882 = weight(abstract_txt:machine in 95) [ClassicSimilarity], result of:
            0.027730882 = score(doc=95,freq=1.0), product of:
              0.0961291 = queryWeight, product of:
                1.3837953 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.013169288 = queryNorm
              0.28847542 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.027729884 = weight(abstract_txt:proposed in 95) [ClassicSimilarity], result of:
            0.027729884 = score(doc=95,freq=1.0), product of:
              0.11003771 = queryWeight, product of:
                1.813263 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.013169288 = queryNorm
              0.25200346 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.0815792 = weight(abstract_txt:labeled in 95) [ClassicSimilarity], result of:
            0.0815792 = score(doc=95,freq=1.0), product of:
              0.19736284 = queryWeight, product of:
                1.9827918 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.013169288 = queryNorm
              0.41334632 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.060820274 = weight(abstract_txt:method in 95) [ClassicSimilarity], result of:
            0.060820274 = score(doc=95,freq=2.0), product of:
              0.17480265 = queryWeight, product of:
                2.950451 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.013169288 = queryNorm
              0.3479368 = fieldWeight in 95, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.10818514 = weight(abstract_txt:classification in 95) [ClassicSimilarity], result of:
            0.10818514 = score(doc=95,freq=9.0), product of:
              0.16517732 = queryWeight, product of:
                3.1418123 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.013169288 = queryNorm
              0.6549637 = fieldWeight in 95, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.35181788 = weight(abstract_txt:supervised in 95) [ClassicSimilarity], result of:
            0.35181788 = score(doc=95,freq=5.0), product of:
              0.38528106 = queryWeight, product of:
                3.9178538 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.013169288 = queryNorm
              0.913146 = fieldWeight in 95, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.099728666 = weight(abstract_txt:text in 95) [ClassicSimilarity], result of:
            0.099728666 = score(doc=95,freq=4.0), product of:
              0.22564502 = queryWeight, product of:
                4.240209 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.013169288 = queryNorm
              0.44197148 = fieldWeight in 95, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.18019806 = weight(abstract_txt:learning in 95) [ClassicSimilarity], result of:
            0.18019806 = score(doc=95,freq=5.0), product of:
              0.31074858 = queryWeight, product of:
                4.975984 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.013169288 = queryNorm
              0.57988375 = fieldWeight in 95, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
        0.36 = coord(9/25)
    
  4. Hung, C.-M.; Chien, L.-F.: Web-based text classification in the absence of manually labeled training documents (2007) 0.30
    0.29857385 = sum of:
      0.29857385 = product of:
        0.8293718 = sum of:
          0.025125047 = weight(abstract_txt:techniques in 1087) [ClassicSimilarity], result of:
            0.025125047 = score(doc=1087,freq=1.0), product of:
              0.07096034 = queryWeight, product of:
                1.188919 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.013169288 = queryNorm
              0.35407168 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.06425073 = weight(abstract_txt:automatically in 1087) [ClassicSimilarity], result of:
            0.06425073 = score(doc=1087,freq=2.0), product of:
              0.105322175 = queryWeight, product of:
                1.4484527 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.013169288 = queryNorm
              0.6100399 = fieldWeight in 1087, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.03961412 = weight(abstract_txt:proposed in 1087) [ClassicSimilarity], result of:
            0.03961412 = score(doc=1087,freq=1.0), product of:
              0.11003771 = queryWeight, product of:
                1.813263 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.013169288 = queryNorm
              0.36000493 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.10273061 = weight(abstract_txt:classifier in 1087) [ClassicSimilarity], result of:
            0.10273061 = score(doc=1087,freq=1.0), product of:
              0.18144472 = queryWeight, product of:
                1.9011508 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.013169288 = queryNorm
              0.5661813 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.2018562 = weight(abstract_txt:labeled in 1087) [ClassicSimilarity], result of:
            0.2018562 = score(doc=1087,freq=3.0), product of:
              0.19736284 = queryWeight, product of:
                1.9827918 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.013169288 = queryNorm
              1.022767 = fieldWeight in 1087, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.10577187 = weight(abstract_txt:documents in 1087) [ClassicSimilarity], result of:
            0.10577187 = score(doc=1087,freq=5.0), product of:
              0.14684118 = queryWeight, product of:
                2.7041972 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.013169288 = queryNorm
              0.72031474 = fieldWeight in 1087, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.05151673 = weight(abstract_txt:classification in 1087) [ClassicSimilarity], result of:
            0.05151673 = score(doc=1087,freq=1.0), product of:
              0.16517732 = queryWeight, product of:
                3.1418123 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.013169288 = queryNorm
              0.31188744 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.12338222 = weight(abstract_txt:text in 1087) [ClassicSimilarity], result of:
            0.12338222 = score(doc=1087,freq=3.0), product of:
              0.22564502 = queryWeight, product of:
                4.240209 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.013169288 = queryNorm
              0.5467979 = fieldWeight in 1087, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.115124315 = weight(abstract_txt:learning in 1087) [ClassicSimilarity], result of:
            0.115124315 = score(doc=1087,freq=1.0), product of:
              0.31074858 = queryWeight, product of:
                4.975984 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.013169288 = queryNorm
              0.37047416 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
        0.36 = coord(9/25)
    
  5. Sebastiani, F.: Machine learning in automated text categorization (2002) 0.28
    0.28156665 = sum of:
      0.28156665 = product of:
        0.7821295 = sum of:
          0.06103768 = weight(abstract_txt:inductive in 4389) [ClassicSimilarity], result of:
            0.06103768 = score(doc=4389,freq=1.0), product of:
              0.101780936 = queryWeight, product of:
                1.006845 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.013169288 = queryNorm
              0.5996966 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.025125047 = weight(abstract_txt:techniques in 4389) [ClassicSimilarity], result of:
            0.025125047 = score(doc=4389,freq=1.0), product of:
              0.07096034 = queryWeight, product of:
                1.188919 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.013169288 = queryNorm
              0.35407168 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.056024842 = weight(abstract_txt:machine in 4389) [ClassicSimilarity], result of:
            0.056024842 = score(doc=4389,freq=2.0), product of:
              0.0961291 = queryWeight, product of:
                1.3837953 = boost
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.013169288 = queryNorm
              0.5828084 = fieldWeight in 4389, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.274979 = idf(docFreq=617, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.045432124 = weight(abstract_txt:automatically in 4389) [ClassicSimilarity], result of:
            0.045432124 = score(doc=4389,freq=1.0), product of:
              0.105322175 = queryWeight, product of:
                1.4484527 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.013169288 = queryNorm
              0.43136334 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.20546122 = weight(abstract_txt:classifier in 4389) [ClassicSimilarity], result of:
            0.20546122 = score(doc=4389,freq=4.0), product of:
              0.18144472 = queryWeight, product of:
                1.9011508 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.013169288 = queryNorm
              1.1323626 = fieldWeight in 4389, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.066896 = weight(abstract_txt:documents in 4389) [ClassicSimilarity], result of:
            0.066896 = score(doc=4389,freq=2.0), product of:
              0.14684118 = queryWeight, product of:
                2.7041972 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.013169288 = queryNorm
              0.455567 = fieldWeight in 4389, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.05151673 = weight(abstract_txt:classification in 4389) [ClassicSimilarity], result of:
            0.05151673 = score(doc=4389,freq=1.0), product of:
              0.16517732 = queryWeight, product of:
                3.1418123 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.013169288 = queryNorm
              0.31188744 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.07123476 = weight(abstract_txt:text in 4389) [ClassicSimilarity], result of:
            0.07123476 = score(doc=4389,freq=1.0), product of:
              0.22564502 = queryWeight, product of:
                4.240209 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.013169288 = queryNorm
              0.3156939 = fieldWeight in 4389, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
          0.19940117 = weight(abstract_txt:learning in 4389) [ClassicSimilarity], result of:
            0.19940117 = score(doc=4389,freq=3.0), product of:
              0.31074858 = queryWeight, product of:
                4.975984 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.013169288 = queryNorm
              0.64168006 = fieldWeight in 4389, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.078125 = fieldNorm(doc=4389)
        0.36 = coord(9/25)