Document (#34761)

Author
Liu, R.-L.
Title
Context recognition for hierarchical text classification
Source
Journal of the American Society for Information Science and Technology. 60(2009) no.4, pp. 803-813
Year
2009
Abstract
Information is often organized as a text hierarchy. A hierarchical text-classification system is thus essential for the management, sharing, and dissemination of information. It aims to automatically classify each incoming document into zero, one, or several categories in the text hierarchy. In this paper, we present a technique called CRHTC (context recognition for hierarchical text classification) that performs hierarchical text classification by recognizing the context of discussion (COD) of each category. A category's COD is governed by its ancestor categories, whose contents indicate contextual backgrounds of the category. A document may be classified into a category only if its content matches the category's COD. CRHTC does not require any trials to manually set parameters, and hence is more portable and easier to implement than other methods. It is empirically evaluated under various conditions. The results show that CRHTC achieves both better and more stable performance than several hierarchical and nonhierarchical text-classification methodologies.
Theme
Automatic classification
Object
CRHTC

Similar documents (content)

  1. Yoon, Y.; Lee, C.; Lee, G.G.: An effective procedure for constructing a hierarchical text classification system (2006) 0.32
    0.32290587 = sum of:
      0.32290587 = product of:
        1.0090809 = sum of:
          0.07801316 = weight(abstract_txt:performs in 273) [ClassicSimilarity], result of:
            0.07801316 = score(doc=273,freq=1.0), product of:
              0.13887121 = queryWeight, product of:
                1.0030326 = boost
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.01925447 = queryNorm
              0.56176627 = fieldWeight in 273, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.078125 = fieldNorm(doc=273)
          0.021207297 = weight(abstract_txt:into in 273) [ClassicSimilarity], result of:
            0.021207297 = score(doc=273,freq=1.0), product of:
              0.073423296 = queryWeight, product of:
                1.0314326 = boost
                3.697102 = idf(docFreq=2993, maxDocs=44421)
                0.01925447 = queryNorm
              0.2888361 = fieldWeight in 273, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.697102 = idf(docFreq=2993, maxDocs=44421)
                0.078125 = fieldNorm(doc=273)
          0.024755213 = weight(abstract_txt:than in 273) [ClassicSimilarity], result of:
            0.024755213 = score(doc=273,freq=1.0), product of:
              0.08139944 = queryWeight, product of:
                1.086012 = boost
                3.8927383 = idf(docFreq=2461, maxDocs=44421)
                0.01925447 = queryNorm
              0.30412018 = fieldWeight in 273, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8927383 = idf(docFreq=2461, maxDocs=44421)
                0.078125 = fieldNorm(doc=273)
          0.039489496 = weight(abstract_txt:several in 273) [ClassicSimilarity], result of:
            0.039489496 = score(doc=273,freq=1.0), product of:
              0.111130014 = queryWeight, product of:
                1.2689357 = boost
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.01925447 = queryNorm
              0.355345 = fieldWeight in 273, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.078125 = fieldNorm(doc=273)
          0.20463298 = weight(abstract_txt:hierarchy in 273) [ClassicSimilarity], result of:
            0.20463298 = score(doc=273,freq=3.0), product of:
              0.23073864 = queryWeight, product of:
                1.8284541 = boost
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.01925447 = queryNorm
              0.8868605 = fieldWeight in 273, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.078125 = fieldNorm(doc=273)
          0.14926212 = weight(abstract_txt:classification in 273) [ClassicSimilarity], result of:
            0.14926212 = score(doc=273,freq=5.0), product of:
              0.21402608 = queryWeight, product of:
                2.784372 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.01925447 = queryNorm
              0.6974015 = fieldWeight in 273, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.078125 = fieldNorm(doc=273)
          0.096916474 = weight(abstract_txt:text in 273) [ClassicSimilarity], result of:
            0.096916474 = score(doc=273,freq=1.0), product of:
              0.30699506 = queryWeight, product of:
                3.945696 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.01925447 = queryNorm
              0.3156939 = fieldWeight in 273, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=273)
          0.39480406 = weight(abstract_txt:hierarchical in 273) [ClassicSimilarity], result of:
            0.39480406 = score(doc=273,freq=4.0), product of:
              0.44095206 = queryWeight, product of:
                3.9965901 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.01925447 = queryNorm
              0.8953446 = fieldWeight in 273, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.078125 = fieldNorm(doc=273)
        0.32 = coord(8/25)
    
  2. Sun, A.; Lim, E.-P.; Ng, W.-K.: Performance measurement framework for hierarchical text classification (2003) 0.31
    0.30857438 = sum of:
      0.30857438 = product of:
        0.9642949 = sum of:
          0.016965838 = weight(abstract_txt:into in 2808) [ClassicSimilarity], result of:
            0.016965838 = score(doc=2808,freq=1.0), product of:
              0.073423296 = queryWeight, product of:
                1.0314326 = boost
                3.697102 = idf(docFreq=2993, maxDocs=44421)
                0.01925447 = queryNorm
              0.23106888 = fieldWeight in 2808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.697102 = idf(docFreq=2993, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.01980417 = weight(abstract_txt:than in 2808) [ClassicSimilarity], result of:
            0.01980417 = score(doc=2808,freq=1.0), product of:
              0.08139944 = queryWeight, product of:
                1.086012 = boost
                3.8927383 = idf(docFreq=2461, maxDocs=44421)
                0.01925447 = queryNorm
              0.24329615 = fieldWeight in 2808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8927383 = idf(docFreq=2461, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.026584137 = weight(abstract_txt:document in 2808) [ClassicSimilarity], result of:
            0.026584137 = score(doc=2808,freq=1.0), product of:
              0.09905248 = queryWeight, product of:
                1.1979994 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.01925447 = queryNorm
              0.26838437 = fieldWeight in 2808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.09321445 = weight(abstract_txt:categories in 2808) [ClassicSimilarity], result of:
            0.09321445 = score(doc=2808,freq=4.0), product of:
              0.14401878 = queryWeight, product of:
                1.444553 = boost
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.01925447 = queryNorm
              0.64723814 = fieldWeight in 2808, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.21255897 = weight(abstract_txt:category in 2808) [ClassicSimilarity], result of:
            0.21255897 = score(doc=2808,freq=3.0), product of:
              0.31435952 = queryWeight, product of:
                2.613863 = boost
                6.2461467 = idf(docFreq=233, maxDocs=44421)
                0.01925447 = queryNorm
              0.6761652 = fieldWeight in 2808, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2461467 = idf(docFreq=233, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.13080677 = weight(abstract_txt:classification in 2808) [ClassicSimilarity], result of:
            0.13080677 = score(doc=2808,freq=6.0), product of:
              0.21402608 = queryWeight, product of:
                2.784372 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.01925447 = queryNorm
              0.61117214 = fieldWeight in 2808, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.07753318 = weight(abstract_txt:text in 2808) [ClassicSimilarity], result of:
            0.07753318 = score(doc=2808,freq=1.0), product of:
              0.30699506 = queryWeight, product of:
                3.945696 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.01925447 = queryNorm
              0.25255513 = fieldWeight in 2808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
          0.3868274 = weight(abstract_txt:hierarchical in 2808) [ClassicSimilarity], result of:
            0.3868274 = score(doc=2808,freq=6.0), product of:
              0.44095206 = queryWeight, product of:
                3.9965901 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.01925447 = queryNorm
              0.877255 = fieldWeight in 2808, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.0625 = fieldNorm(doc=2808)
        0.32 = coord(8/25)
    
  3. Gauch, S.; Chandramouli, A.; Ranganathan, S.: Training a hierarchical classifier using inter document relationships (2009) 0.24
    0.24172252 = sum of:
      0.24172252 = product of:
        0.8632947 = sum of:
          0.021207297 = weight(abstract_txt:into in 3697) [ClassicSimilarity], result of:
            0.021207297 = score(doc=3697,freq=1.0), product of:
              0.073423296 = queryWeight, product of:
                1.0314326 = boost
                3.697102 = idf(docFreq=2993, maxDocs=44421)
                0.01925447 = queryNorm
              0.2888361 = fieldWeight in 3697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.697102 = idf(docFreq=2993, maxDocs=44421)
                0.078125 = fieldNorm(doc=3697)
          0.029300079 = weight(abstract_txt:each in 3697) [ClassicSimilarity], result of:
            0.029300079 = score(doc=3697,freq=1.0), product of:
              0.09107996 = queryWeight, product of:
                1.1487759 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.01925447 = queryNorm
              0.32169622 = fieldWeight in 3697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.078125 = fieldNorm(doc=3697)
          0.03323017 = weight(abstract_txt:document in 3697) [ClassicSimilarity], result of:
            0.03323017 = score(doc=3697,freq=1.0), product of:
              0.09905248 = queryWeight, product of:
                1.1979994 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.01925447 = queryNorm
              0.33548045 = fieldWeight in 3697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=3697)
          0.16708215 = weight(abstract_txt:hierarchy in 3697) [ClassicSimilarity], result of:
            0.16708215 = score(doc=3697,freq=2.0), product of:
              0.23073864 = queryWeight, product of:
                1.8284541 = boost
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.01925447 = queryNorm
              0.7241186 = fieldWeight in 3697, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.078125 = fieldNorm(doc=3697)
          0.1335041 = weight(abstract_txt:classification in 3697) [ClassicSimilarity], result of:
            0.1335041 = score(doc=3697,freq=4.0), product of:
              0.21402608 = queryWeight, product of:
                2.784372 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.01925447 = queryNorm
              0.6237749 = fieldWeight in 3697, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.078125 = fieldNorm(doc=3697)
          0.13706058 = weight(abstract_txt:text in 3697) [ClassicSimilarity], result of:
            0.13706058 = score(doc=3697,freq=2.0), product of:
              0.30699506 = queryWeight, product of:
                3.945696 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.01925447 = queryNorm
              0.4464586 = fieldWeight in 3697, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=3697)
          0.34191033 = weight(abstract_txt:hierarchical in 3697) [ClassicSimilarity], result of:
            0.34191033 = score(doc=3697,freq=3.0), product of:
              0.44095206 = queryWeight, product of:
                3.9965901 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.01925447 = queryNorm
              0.77539116 = fieldWeight in 3697, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.078125 = fieldNorm(doc=3697)
        0.28 = coord(7/25)
    
  4. Yang, C.C.; Lin, J.; Wei, C.-P.: Retaining knowledge for document management : category-tree integration by exploiting category relationships and hierarchical structures (2010) 0.24
    0.24081963 = sum of:
      0.24081963 = product of:
        0.8600701 = sum of:
          0.12106625 = weight(abstract_txt:achieves in 568) [ClassicSimilarity], result of:
            0.12106625 = score(doc=568,freq=3.0), product of:
              0.14976671 = queryWeight, product of:
                1.0416374 = boost
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.01925447 = queryNorm
              0.8083655 = fieldWeight in 568, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.467361 = idf(docFreq=68, maxDocs=44421)
                0.0625 = fieldNorm(doc=568)
          0.01980417 = weight(abstract_txt:than in 568) [ClassicSimilarity], result of:
            0.01980417 = score(doc=568,freq=1.0), product of:
              0.08139944 = queryWeight, product of:
                1.086012 = boost
                3.8927383 = idf(docFreq=2461, maxDocs=44421)
                0.01925447 = queryNorm
              0.24329615 = fieldWeight in 568, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8927383 = idf(docFreq=2461, maxDocs=44421)
                0.0625 = fieldNorm(doc=568)
          0.026584137 = weight(abstract_txt:document in 568) [ClassicSimilarity], result of:
            0.026584137 = score(doc=568,freq=1.0), product of:
              0.09905248 = queryWeight, product of:
                1.1979994 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.01925447 = queryNorm
              0.26838437 = fieldWeight in 568, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=568)
          0.09321445 = weight(abstract_txt:categories in 568) [ClassicSimilarity], result of:
            0.09321445 = score(doc=568,freq=4.0), product of:
              0.14401878 = queryWeight, product of:
                1.444553 = boost
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.01925447 = queryNorm
              0.64723814 = fieldWeight in 568, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.0625 = fieldNorm(doc=568)
          0.38807783 = weight(abstract_txt:category in 568) [ClassicSimilarity], result of:
            0.38807783 = score(doc=568,freq=10.0), product of:
              0.31435952 = queryWeight, product of:
                2.613863 = boost
                6.2461467 = idf(docFreq=233, maxDocs=44421)
                0.01925447 = queryNorm
              1.2345031 = fieldWeight in 568, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                6.2461467 = idf(docFreq=233, maxDocs=44421)
                0.0625 = fieldNorm(doc=568)
          0.053401638 = weight(abstract_txt:classification in 568) [ClassicSimilarity], result of:
            0.053401638 = score(doc=568,freq=1.0), product of:
              0.21402608 = queryWeight, product of:
                2.784372 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.01925447 = queryNorm
              0.24950996 = fieldWeight in 568, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=568)
          0.15792163 = weight(abstract_txt:hierarchical in 568) [ClassicSimilarity], result of:
            0.15792163 = score(doc=568,freq=1.0), product of:
              0.44095206 = queryWeight, product of:
                3.9965901 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.01925447 = queryNorm
              0.35813785 = fieldWeight in 568, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.0625 = fieldNorm(doc=568)
        0.28 = coord(7/25)
    
  5. Liu, R.-L.: Dynamic category profiling for text filtering and classification (2007) 0.23
    0.22762625 = sum of:
      0.22762625 = product of:
        0.711332 = sum of:
          0.021207297 = weight(abstract_txt:into in 1900) [ClassicSimilarity], result of:
            0.021207297 = score(doc=1900,freq=1.0), product of:
              0.073423296 = queryWeight, product of:
                1.0314326 = boost
                3.697102 = idf(docFreq=2993, maxDocs=44421)
                0.01925447 = queryNorm
              0.2888361 = fieldWeight in 1900, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.697102 = idf(docFreq=2993, maxDocs=44421)
                0.078125 = fieldNorm(doc=1900)
          0.024755213 = weight(abstract_txt:than in 1900) [ClassicSimilarity], result of:
            0.024755213 = score(doc=1900,freq=1.0), product of:
              0.08139944 = queryWeight, product of:
                1.086012 = boost
                3.8927383 = idf(docFreq=2461, maxDocs=44421)
                0.01925447 = queryNorm
              0.30412018 = fieldWeight in 1900, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8927383 = idf(docFreq=2461, maxDocs=44421)
                0.078125 = fieldNorm(doc=1900)
          0.029300079 = weight(abstract_txt:each in 1900) [ClassicSimilarity], result of:
            0.029300079 = score(doc=1900,freq=1.0), product of:
              0.09107996 = queryWeight, product of:
                1.1487759 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.01925447 = queryNorm
              0.32169622 = fieldWeight in 1900, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.078125 = fieldNorm(doc=1900)
          0.04699456 = weight(abstract_txt:document in 1900) [ClassicSimilarity], result of:
            0.04699456 = score(doc=1900,freq=2.0), product of:
              0.09905248 = queryWeight, product of:
                1.1979994 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.01925447 = queryNorm
              0.47444102 = fieldWeight in 1900, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=1900)
          0.0823907 = weight(abstract_txt:categories in 1900) [ClassicSimilarity], result of:
            0.0823907 = score(doc=1900,freq=2.0), product of:
              0.14401878 = queryWeight, product of:
                1.444553 = boost
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.01925447 = queryNorm
              0.57208306 = fieldWeight in 1900, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.078125 = fieldNorm(doc=1900)
          0.34301558 = weight(abstract_txt:category in 1900) [ClassicSimilarity], result of:
            0.34301558 = score(doc=1900,freq=5.0), product of:
              0.31435952 = queryWeight, product of:
                2.613863 = boost
                6.2461467 = idf(docFreq=233, maxDocs=44421)
                0.01925447 = queryNorm
              1.091157 = fieldWeight in 1900, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.2461467 = idf(docFreq=233, maxDocs=44421)
                0.078125 = fieldNorm(doc=1900)
          0.06675205 = weight(abstract_txt:classification in 1900) [ClassicSimilarity], result of:
            0.06675205 = score(doc=1900,freq=1.0), product of:
              0.21402608 = queryWeight, product of:
                2.784372 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.01925447 = queryNorm
              0.31188744 = fieldWeight in 1900, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.078125 = fieldNorm(doc=1900)
          0.096916474 = weight(abstract_txt:text in 1900) [ClassicSimilarity], result of:
            0.096916474 = score(doc=1900,freq=1.0), product of:
              0.30699506 = queryWeight, product of:
                3.945696 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.01925447 = queryNorm
              0.3156939 = fieldWeight in 1900, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=1900)
        0.32 = coord(8/25)
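
The score trees above all follow the same arithmetic, so a small worked example may help in reading them. The sketch below recomputes the first entry (doc 273) from the numbers shown in its tree. It assumes the usual ClassicSimilarity definitions, which are consistent with the listed values: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and coord = matched terms / query terms. The function names and the DOC_273_TERMS list are illustrative only and are not part of the catalog software. Boosts, queryNorm and fieldNorm are copied from the listing, not derived.

```python
import math


def idf(doc_freq, max_docs):
    """Inverse document frequency as printed in the trees above."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, freq, *, max_docs, query_norm, field_norm):
    """Score contribution of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, query_term_count, **stats):
    """Sum the per-term scores, then scale by coord(matched / total)."""
    total = sum(term_score(b, df, f, **stats) for b, df, f in terms)
    coord = len(terms) / query_term_count
    return total * coord


# (boost, docFreq, freq) for the eight matching terms of doc 273,
# copied from the first tree; the list itself is a hypothetical helper.
DOC_273_TERMS = [
    (1.0030326, 90, 1.0),    # performs
    (1.0314326, 2993, 1.0),  # into
    (1.086012, 2461, 1.0),   # than
    (1.2689357, 1277, 1.0),  # several
    (1.8284541, 171, 3.0),   # hierarchy
    (2.784372, 2228, 5.0),   # classification
    (3.945696, 2122, 1.0),   # text
    (3.9965901, 391, 4.0),   # hierarchical
]

score = document_score(
    DOC_273_TERMS,
    25,                      # the query has 25 terms: coord(8/25)
    max_docs=44421,
    query_norm=0.01925447,
    field_norm=0.078125,
)
print(round(score, 4))  # 0.3229 (the listing shows 0.32290587)
```

The last digits can differ slightly from the listing, because the listed values appear to be computed in single precision while Python uses double precision.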