Document (#34698)

Author
Gauch, S.
Chandramouli, A.
Ranganathan, S.
Title
Training a hierarchical classifier using inter document relationships
Source
Journal of the American Society for Information Science and Technology. 60(2009) no.1, pp.47-58
Year
2009
Abstract
Text classifiers automatically classify documents into appropriate concepts for different applications. Most classification approaches use flat classifiers that treat each concept as independent, even when the concept space is hierarchically structured. In contrast, hierarchical text classification exploits the structural relationships between the concepts. In this article, we explore the effectiveness of hierarchical classification for a large concept hierarchy. Since the quality of the classification is dependent on the quality and quantity of the training data, we evaluate the use of documents selected from subconcepts to address the sparseness of training data for the top-level classifiers and the use of document relationships to identify the most representative training documents. By selecting training documents using structural and similarity relationships, we achieve a statistically significant improvement of 39.8% (from 54.5% to 76.2%) in the accuracy of the hierarchical classifier over that of the flat classifier for a large, three-level concept hierarchy.
Theme
Automatic classification

Similar documents (author)

  1. Gauch, S.: Intelligent information retrieval : an introduction (1992) 2.49
    2.494897 = sum of:
      2.494897 = product of:
        4.989794 = sum of:
          4.989794 = weight(author_txt:gauch in 502) [ClassicSimilarity], result of:
            4.989794 = score(doc=502,freq=1.0), product of:
              0.81837153 = queryWeight, product of:
                1.1933247 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.07029749 = queryNorm
              6.0972233 = fieldWeight in 502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.625 = fieldNorm(doc=502)
        0.5 = coord(1/2)
    
  2. Gauch, S.; Smith, J.B.: An expert system for automatic query reformation (1993) 2.00
    1.9959176 = sum of:
      1.9959176 = product of:
        3.991835 = sum of:
          3.991835 = weight(author_txt:gauch in 3692) [ClassicSimilarity], result of:
            3.991835 = score(doc=3692,freq=1.0), product of:
              0.81837153 = queryWeight, product of:
                1.1933247 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.07029749 = queryNorm
              4.8777785 = fieldWeight in 3692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.5 = fieldNorm(doc=3692)
        0.5 = coord(1/2)
    
  3. Gauch, S.; Chong, M.K.: Automatic word similarity detection for TREC 4 query expansion (1996) 2.00
    1.9959176 = sum of:
      1.9959176 = product of:
        3.991835 = sum of:
          3.991835 = weight(author_txt:gauch in 3059) [ClassicSimilarity], result of:
            3.991835 = score(doc=3059,freq=1.0), product of:
              0.81837153 = queryWeight, product of:
                1.1933247 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.07029749 = queryNorm
              4.8777785 = fieldWeight in 3059, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.5 = fieldNorm(doc=3059)
        0.5 = coord(1/2)
    
  4. Gauch, S.; Wang, J.: Corpus analysis for TREC 5 query expansion (1997) 2.00
    1.9959176 = sum of:
      1.9959176 = product of:
        3.991835 = sum of:
          3.991835 = weight(author_txt:gauch in 5868) [ClassicSimilarity], result of:
            3.991835 = score(doc=5868,freq=1.0), product of:
              0.81837153 = queryWeight, product of:
                1.1933247 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.07029749 = queryNorm
              4.8777785 = fieldWeight in 5868, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.5 = fieldNorm(doc=5868)
        0.5 = coord(1/2)
    
  5. Haverkamp, D.S.; Gauch, S.: Intelligent information agents : review and challenges for distributed information sources (1998) 2.00
    1.9959176 = sum of:
      1.9959176 = product of:
        3.991835 = sum of:
          3.991835 = weight(author_txt:gauch in 3882) [ClassicSimilarity], result of:
            3.991835 = score(doc=3882,freq=1.0), product of:
              0.81837153 = queryWeight, product of:
                1.1933247 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.07029749 = queryNorm
              4.8777785 = fieldWeight in 3882, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.5 = fieldNorm(doc=3882)
        0.5 = coord(1/2)
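
The score trees above follow the TF-IDF breakdown of Lucene's ClassicSimilarity. As a minimal sketch, the Python below recomputes the first entry (doc 502) from the values printed in its tree. The function names are illustrative and not part of Lucene. The formulas assumed here are idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq), which reproduce the idf and tf values shown in the listing. The boost, queryNorm, fieldNorm and coord values are copied from the listing, not derived.

```python
import math


def idf(doc_freq, max_docs):
    # Inverse document frequency as assumed for ClassicSimilarity.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    # Term frequency factor: square root of the raw term count.
    return math.sqrt(freq)


def term_score(boost, query_norm, freq, doc_freq, max_docs, field_norm):
    # score = queryWeight * fieldWeight, mirroring the two branches of the tree.
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


# Values copied from the explanation for "author_txt:gauch in 502".
raw = term_score(
    boost=1.1933247,
    query_norm=0.07029749,
    freq=1.0,
    doc_freq=6,
    max_docs=44421,
    field_norm=0.625,
)

# coord(1/2): one of the two query clauses matched, so the sum is halved.
final = raw * 0.5

print(raw, final)
```

Up to floating-point rounding, the two printed values should agree with the 4.989794 and 2.494897 shown for doc 502. Entries 2 to 5 differ only in fieldNorm (0.5 instead of 0.625), which accounts for their lower score of 1.9959176.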
    

Similar documents (content)

  1. Ruiz, M.E.; Srinivasan, P.: Combining machine learning and hierarchical indexing structures for text categorization (2001) 0.45
    0.44532797 = sum of:
      0.44532797 = product of:
        1.39165 = sum of:
          0.01845654 = weight(abstract_txt:using in 2595) [ClassicSimilarity], result of:
            0.01845654 = score(doc=2595,freq=1.0), product of:
              0.05695028 = queryWeight, product of:
                1.0139436 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.016247964 = queryNorm
              0.32408163 = fieldWeight in 2595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.09375 = fieldNorm(doc=2595)
          0.111834615 = weight(abstract_txt:exploits in 2595) [ClassicSimilarity], result of:
            0.111834615 = score(doc=2595,freq=1.0), product of:
              0.15023455 = queryWeight, product of:
                1.16449 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.016247964 = queryNorm
              0.7444001 = fieldWeight in 2595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.09375 = fieldNorm(doc=2595)
          0.0416912 = weight(abstract_txt:text in 2595) [ClassicSimilarity], result of:
            0.0416912 = score(doc=2595,freq=2.0), product of:
              0.077818334 = queryWeight, product of:
                1.1852413 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016247964 = queryNorm
              0.5357503 = fieldWeight in 2595, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=2595)
          0.23902301 = weight(abstract_txt:flat in 2595) [ClassicSimilarity], result of:
            0.23902301 = score(doc=2595,freq=1.0), product of:
              0.3140669 = queryWeight, product of:
                2.381096 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.016247964 = queryNorm
              0.7610577 = fieldWeight in 2595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.09375 = fieldNorm(doc=2595)
          0.2550871 = weight(abstract_txt:classifier in 2595) [ClassicSimilarity], result of:
            0.2550871 = score(doc=2595,freq=1.0), product of:
              0.3754497 = queryWeight, product of:
                3.1885068 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.016247964 = queryNorm
              0.67941755 = fieldWeight in 2595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.09375 = fieldNorm(doc=2595)
          0.2858063 = weight(abstract_txt:classifiers in 2595) [ClassicSimilarity], result of:
            0.2858063 = score(doc=2595,freq=1.0), product of:
              0.40501764 = queryWeight, product of:
                3.3116806 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.016247964 = queryNorm
              0.7056638 = fieldWeight in 2595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.09375 = fieldNorm(doc=2595)
          0.29120716 = weight(abstract_txt:hierarchical in 2595) [ClassicSimilarity], result of:
            0.29120716 = score(doc=2595,freq=3.0), product of:
              0.31296802 = queryWeight, product of:
                3.3614821 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.016247964 = queryNorm
              0.9304694 = fieldWeight in 2595, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.09375 = fieldNorm(doc=2595)
          0.14854395 = weight(abstract_txt:training in 2595) [ClassicSimilarity], result of:
            0.14854395 = score(doc=2595,freq=1.0), product of:
              0.31041712 = queryWeight, product of:
                3.742904 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.016247964 = queryNorm
              0.47853017 = fieldWeight in 2595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.09375 = fieldNorm(doc=2595)
        0.32 = coord(8/25)
    
  2. Hung, C.-M.; Chien, L.-F.: Web-based text classification in the absence of manually labeled training documents (2007) 0.33
    0.33471954 = sum of:
      0.33471954 = product of:
        0.92977643 = sum of:
          0.02175124 = weight(abstract_txt:using in 1087) [ClassicSimilarity], result of:
            0.02175124 = score(doc=1087,freq=2.0), product of:
              0.05695028 = queryWeight, product of:
                1.0139436 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.016247964 = queryNorm
              0.38193387 = fieldWeight in 1087, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.022811929 = weight(abstract_txt:most in 1087) [ClassicSimilarity], result of:
            0.022811929 = score(doc=1087,freq=1.0), product of:
              0.07406695 = queryWeight, product of:
                1.1563201 = boost
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.016247964 = queryNorm
              0.30799064 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.0425509 = weight(abstract_txt:text in 1087) [ClassicSimilarity], result of:
            0.0425509 = score(doc=1087,freq=3.0), product of:
              0.077818334 = queryWeight, product of:
                1.1852413 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016247964 = queryNorm
              0.5467979 = fieldWeight in 1087, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.052751157 = weight(abstract_txt:quality in 1087) [ClassicSimilarity], result of:
            0.052751157 = score(doc=1087,freq=2.0), product of:
              0.102800325 = queryWeight, product of:
                1.3622696 = boost
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.016247964 = queryNorm
              0.51314193 = fieldWeight in 1087, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.047377612 = weight(abstract_txt:classification in 1087) [ClassicSimilarity], result of:
            0.047377612 = score(doc=1087,freq=1.0), product of:
              0.15190612 = queryWeight, product of:
                2.3419006 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.016247964 = queryNorm
              0.31188744 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.11672835 = weight(abstract_txt:documents in 1087) [ClassicSimilarity], result of:
            0.11672835 = score(doc=1087,freq=5.0), product of:
              0.16205187 = queryWeight, product of:
                2.418844 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.016247964 = queryNorm
              0.72031474 = fieldWeight in 1087, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.21257259 = weight(abstract_txt:classifier in 1087) [ClassicSimilarity], result of:
            0.21257259 = score(doc=1087,freq=1.0), product of:
              0.3754497 = queryWeight, product of:
                3.1885068 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.016247964 = queryNorm
              0.5661813 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.2381719 = weight(abstract_txt:classifiers in 1087) [ClassicSimilarity], result of:
            0.2381719 = score(doc=1087,freq=1.0), product of:
              0.40501764 = queryWeight, product of:
                3.3116806 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.016247964 = queryNorm
              0.58805317 = fieldWeight in 1087, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
          0.17506073 = weight(abstract_txt:training in 1087) [ClassicSimilarity], result of:
            0.17506073 = score(doc=1087,freq=2.0), product of:
              0.31041712 = queryWeight, product of:
                3.742904 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.016247964 = queryNorm
              0.5639532 = fieldWeight in 1087, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.078125 = fieldNorm(doc=1087)
        0.36 = coord(9/25)
    
  3. Li, T.; Zhu, S.; Ogihara, M.: Hierarchical document classification using automatically generated hierarchy (2007) 0.31
    0.3139316 = sum of:
      0.3139316 = product of:
        0.784829 = sum of:
          0.02175124 = weight(abstract_txt:using in 797) [ClassicSimilarity], result of:
            0.02175124 = score(doc=797,freq=2.0), product of:
              0.05695028 = queryWeight, product of:
                1.0139436 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.016247964 = queryNorm
              0.38193387 = fieldWeight in 797, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.022811929 = weight(abstract_txt:most in 797) [ClassicSimilarity], result of:
            0.022811929 = score(doc=797,freq=1.0), product of:
              0.07406695 = queryWeight, product of:
                1.1563201 = boost
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.016247964 = queryNorm
              0.30799064 = fieldWeight in 797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.034742665 = weight(abstract_txt:text in 797) [ClassicSimilarity], result of:
            0.034742665 = score(doc=797,freq=2.0), product of:
              0.077818334 = queryWeight, product of:
                1.1852413 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016247964 = queryNorm
              0.4464586 = fieldWeight in 797, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.029481607 = weight(abstract_txt:document in 797) [ClassicSimilarity], result of:
            0.029481607 = score(doc=797,freq=1.0), product of:
              0.08787876 = queryWeight, product of:
                1.2595279 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.016247964 = queryNorm
              0.33548045 = fieldWeight in 797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.032640614 = weight(abstract_txt:large in 797) [ClassicSimilarity], result of:
            0.032640614 = score(doc=797,freq=1.0), product of:
              0.09404926 = queryWeight, product of:
                1.3029974 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.016247964 = queryNorm
              0.3470587 = fieldWeight in 797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.06700206 = weight(abstract_txt:classification in 797) [ClassicSimilarity], result of:
            0.06700206 = score(doc=797,freq=2.0), product of:
              0.15190612 = queryWeight, product of:
                2.3419006 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.016247964 = queryNorm
              0.44107544 = fieldWeight in 797, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.19918585 = weight(abstract_txt:flat in 797) [ClassicSimilarity], result of:
            0.19918585 = score(doc=797,freq=1.0), product of:
              0.3140669 = queryWeight, product of:
                2.381096 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.016247964 = queryNorm
              0.63421476 = fieldWeight in 797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.052202504 = weight(abstract_txt:documents in 797) [ClassicSimilarity], result of:
            0.052202504 = score(doc=797,freq=1.0), product of:
              0.16205187 = queryWeight, product of:
                2.418844 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.016247964 = queryNorm
              0.32213452 = fieldWeight in 797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.08233796 = weight(abstract_txt:relationships in 797) [ClassicSimilarity], result of:
            0.08233796 = score(doc=797,freq=1.0), product of:
              0.21958023 = queryWeight, product of:
                2.815642 = boost
                4.7997303 = idf(docFreq=993, maxDocs=44421)
                0.016247964 = queryNorm
              0.37497893 = fieldWeight in 797, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7997303 = idf(docFreq=993, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
          0.24267264 = weight(abstract_txt:hierarchical in 797) [ClassicSimilarity], result of:
            0.24267264 = score(doc=797,freq=3.0), product of:
              0.31296802 = queryWeight, product of:
                3.3614821 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.016247964 = queryNorm
              0.77539116 = fieldWeight in 797, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.078125 = fieldNorm(doc=797)
        0.4 = coord(10/25)
    
  4. Sun, A.; Lim, E.-P.; Ng, W.-K.: Performance measurement framework for hierarchical text classification (2003) 0.30
    0.29842559 = sum of:
      0.29842559 = product of:
        1.0658057 = sum of:
          0.019653419 = weight(abstract_txt:text in 2808) [ClassicSimilarity], result of:
            0.019653419 = score(doc=2808,freq=1.0), product of:
              0.077818334 = queryWeight, product of:
                1.1852413 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016247964 = queryNorm
              0.25255513 = fieldWeight in 2808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.023585286 = weight(abstract_txt:document in 2808) [ClassicSimilarity], result of:
            0.023585286 = score(doc=2808,freq=1.0), product of:
              0.08787876 = queryWeight, product of:
                1.2595279 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.016247964 = queryNorm
              0.26838437 = fieldWeight in 2808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.09284079 = weight(abstract_txt:classification in 2808) [ClassicSimilarity], result of:
            0.09284079 = score(doc=2808,freq=6.0), product of:
              0.15190612 = queryWeight, product of:
                2.3419006 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.016247964 = queryNorm
              0.61117214 = fieldWeight in 2808, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.05906039 = weight(abstract_txt:documents in 2808) [ClassicSimilarity], result of:
            0.05906039 = score(doc=2808,freq=2.0), product of:
              0.16205187 = queryWeight, product of:
                2.418844 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.016247964 = queryNorm
              0.3644536 = fieldWeight in 2808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.17005807 = weight(abstract_txt:classifier in 2808) [ClassicSimilarity], result of:
            0.17005807 = score(doc=2808,freq=1.0), product of:
              0.3754497 = queryWeight, product of:
                3.1885068 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.016247964 = queryNorm
              0.45294502 = fieldWeight in 2808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.42605487 = weight(abstract_txt:classifiers in 2808) [ClassicSimilarity], result of:
            0.42605487 = score(doc=2808,freq=5.0), product of:
              0.40501764 = queryWeight, product of:
                3.3116806 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.016247964 = queryNorm
              1.0519415 = fieldWeight in 2808, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.27455276 = weight(abstract_txt:hierarchical in 2808) [ClassicSimilarity], result of:
            0.27455276 = score(doc=2808,freq=6.0), product of:
              0.31296802 = queryWeight, product of:
                3.3614821 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.016247964 = queryNorm
              0.877255 = fieldWeight in 2808, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
        0.28 = coord(7/25)
    
  5. Yoon, Y.; Lee, C.; Lee, G.G.: An effective procedure for constructing a hierarchical text classification system (2006) 0.29
    0.2876885 = sum of:
      0.2876885 = product of:
        1.0274589 = sum of:
          0.024566775 = weight(abstract_txt:text in 273) [ClassicSimilarity], result of:
            0.024566775 = score(doc=273,freq=1.0), product of:
              0.077818334 = queryWeight, product of:
                1.1852413 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016247964 = queryNorm
              0.3156939 = fieldWeight in 273, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=273)
          0.0461608 = weight(abstract_txt:large in 273) [ClassicSimilarity], result of:
            0.0461608 = score(doc=273,freq=2.0), product of:
              0.09404926 = queryWeight, product of:
                1.3029974 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.016247964 = queryNorm
              0.4908151 = fieldWeight in 273, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.078125 = fieldNorm(doc=273)
          0.18154912 = weight(abstract_txt:hierarchy in 273) [ClassicSimilarity], result of:
            0.18154912 = score(doc=273,freq=3.0), product of:
              0.2047099 = queryWeight, product of:
                1.9223624 = boost
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.016247964 = queryNorm
              0.8868605 = fieldWeight in 273, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.078125 = fieldNorm(doc=273)
          0.10593956 = weight(abstract_txt:classification in 273) [ClassicSimilarity], result of:
            0.10593956 = score(doc=273,freq=5.0), product of:
              0.15190612 = queryWeight, product of:
                2.3419006 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.016247964 = queryNorm
              0.6974015 = fieldWeight in 273, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.078125 = fieldNorm(doc=273)
          0.052202504 = weight(abstract_txt:documents in 273) [ClassicSimilarity], result of:
            0.052202504 = score(doc=273,freq=1.0), product of:
              0.16205187 = queryWeight, product of:
                2.418844 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.016247964 = queryNorm
              0.32213452 = fieldWeight in 273, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=273)
          0.33682594 = weight(abstract_txt:classifiers in 273) [ClassicSimilarity], result of:
            0.33682594 = score(doc=273,freq=2.0), product of:
              0.40501764 = queryWeight, product of:
                3.3116806 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.016247964 = queryNorm
              0.83163273 = fieldWeight in 273, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.078125 = fieldNorm(doc=273)
          0.28021422 = weight(abstract_txt:hierarchical in 273) [ClassicSimilarity], result of:
            0.28021422 = score(doc=273,freq=4.0), product of:
              0.31296802 = queryWeight, product of:
                3.3614821 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.016247964 = queryNorm
              0.8953446 = fieldWeight in 273, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.078125 = fieldNorm(doc=273)
        0.28 = coord(7/25)
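
For the content-based matches, each tree sums the per-term scores and multiplies the sum by coord, the fraction of query terms that matched. The sketch below reproduces this for the first content entry (doc 2595), using the boost, docFreq and freq values from its tree. The constant and variable names are illustrative. The idf and tf formulas are assumed to be those of ClassicSimilarity (idf = 1 + ln(maxDocs / (docFreq + 1)), tf = sqrt(freq)). The query size of 25 terms is taken from the coord(8/25) line.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.016247964   # shared by all terms of the content query
FIELD_NORM = 0.09375       # fieldNorm(doc=2595)
QUERY_TERMS = 25           # denominator of coord(8/25)

# (term, boost, docFreq, freq) for each term matched in doc 2595.
matched = [
    ("using", 1.0139436, 3806, 1.0),
    ("exploits", 1.16449, 42, 1.0),
    ("text", 1.1852413, 2122, 2.0),
    ("flat", 2.381096, 35, 1.0),
    ("classifier", 3.1885068, 85, 1.0),
    ("classifiers", 3.3116806, 64, 1.0),
    ("hierarchical", 3.3614821, 391, 3.0),
    ("training", 3.742904, 732, 1.0),
]


def term_score(boost, doc_freq, freq):
    # queryWeight * fieldWeight for a single matched term.
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


# Sum the matched-term scores, then scale by the fraction of query terms matched.
total = sum(term_score(boost, df, freq) for _, boost, df, freq in matched)
coord = len(matched) / QUERY_TERMS
score = total * coord

print(total, coord, score)
```

Up to floating-point rounding, the result should agree with the 1.39165 sum and the 0.44532797 final score listed for doc 2595. The coord factor alone does not decide the ranking: entry 3 matches 10 of 25 terms but has a smaller per-term sum (0.784829), so it ranks below entry 1.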