Document (#22864)

Author
Wong, M.L.
Leung, K.S.
Cheng, J.C.Y.
Title
Discovering knowledge from noisy databases using genetic programming
Source
Journal of the American Society for Information Science. 51(2000) no.9, S.870-881
Year
2000
Abstract
In data mining, we emphasize the need for learning from huge, incomplete, and imperfect data sets. To handle noise in the problem domain, existing learning systems avoid overfitting the imperfect training examples by excluding insignificant patterns. The problem is that these systems use a limiting attribute-value language for representing the training examples and the induced knowledge. Moreover, some important patterns are ignored because they are statistically insignificant. In this article, we present a framework that combines genetic programming and inductive logic programming to induce knowledge represented in various knowledge representation formalisms from noisy databases (LOGENPRO). Moreover, the system is applied to one real-life medical database. The knowledge discovered provides insights to and allows better understanding of the medical domains
Theme
Data Mining
Field
Medizin

Similar documents (author)

  1. Cheng, K.-S.; Young, G.H.; Wong, K.-F.: ¬A study on word-based and integral-bit Chinese text compression algorithms (1999) 1.76
    1.7601374 = sum of:
      1.7601374 = product of:
        2.640206 = sum of:
          1.2891681 = weight(author_txt:wong in 4056) [ClassicSimilarity], result of:
            1.2891681 = score(doc=4056,freq=1.0), product of:
              0.50278586 = queryWeight, product of:
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.061278287 = queryNorm
              2.56405 = fieldWeight in 4056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.3125 = fieldNorm(doc=4056)
          1.351038 = weight(author_txt:cheng in 4056) [ClassicSimilarity], result of:
            1.351038 = score(doc=4056,freq=1.0), product of:
              0.5187464 = queryWeight, product of:
                1.015748 = boost
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.061278287 = queryNorm
              2.6044288 = fieldWeight in 4056, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.3125 = fieldNorm(doc=4056)
        0.6666667 = coord(2/3)
    
  2. Leung, S.W.: MARC CIP records and MARC LC records : an evaluative study of their discrepancies (1983) 1.39
    1.3860807 = sum of:
      1.3860807 = product of:
        4.158242 = sum of:
          4.158242 = weight(author_txt:leung in 452) [ClassicSimilarity], result of:
            4.158242 = score(doc=452,freq=1.0), product of:
              0.69145393 = queryWeight, product of:
                1.1727085 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.061278287 = queryNorm
              6.0137663 = fieldWeight in 452, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.625 = fieldNorm(doc=452)
        0.33333334 = coord(1/3)
    
  3. Leung, C.H.C.; Hibler, J.N.D.: Architecture of a pictorial database management system (1991) 1.11
    1.1088648 = sum of:
      1.1088648 = product of:
        3.326594 = sum of:
          3.326594 = weight(author_txt:leung in 4796) [ClassicSimilarity], result of:
            3.326594 = score(doc=4796,freq=1.0), product of:
              0.69145393 = queryWeight, product of:
                1.1727085 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.061278287 = queryNorm
              4.811013 = fieldWeight in 4796, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.5 = fieldNorm(doc=4796)
        0.33333334 = coord(1/3)
    
  4. Tam, A.M.; Leung, C.H.C.: Structured natural-language descriptions for semantic content retrieval of visual materials (2001) 1.11
    1.1088648 = sum of:
      1.1088648 = product of:
        3.326594 = sum of:
          3.326594 = weight(author_txt:leung in 531) [ClassicSimilarity], result of:
            3.326594 = score(doc=531,freq=1.0), product of:
              0.69145393 = queryWeight, product of:
                1.1727085 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.061278287 = queryNorm
              4.811013 = fieldWeight in 531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.5 = fieldNorm(doc=531)
        0.33333334 = coord(1/3)
    
  5. Li, Y.-O.; Leung, S.W.: Computer cataloging of electronic Journals in unstable Aggregator Databases the Hong Kong Baptist University Library experience (2001) 0.97
    0.9702567 = sum of:
      0.9702567 = product of:
        2.91077 = sum of:
          2.91077 = weight(author_txt:leung in 289) [ClassicSimilarity], result of:
            2.91077 = score(doc=289,freq=1.0), product of:
              0.69145393 = queryWeight, product of:
                1.1727085 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.061278287 = queryNorm
              4.2096367 = fieldWeight in 289, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.4375 = fieldNorm(doc=289)
        0.33333334 = coord(1/3)
    

Similar documents (content)

  1. DeRaedt, L.: Logical settings for concept-learning (1997) 0.13
    0.12672453 = sum of:
      0.12672453 = product of:
        0.63362265 = sum of:
          0.13206773 = weight(abstract_txt:inductive in 4780) [ClassicSimilarity], result of:
            0.13206773 = score(doc=4780,freq=1.0), product of:
              0.13764015 = queryWeight, product of:
                1.0319041 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.017376577 = queryNorm
              0.9595145 = fieldWeight in 4780, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.125 = fieldNorm(doc=4780)
          0.031878464 = weight(abstract_txt:from in 4780) [ClassicSimilarity], result of:
            0.031878464 = score(doc=4780,freq=3.0), product of:
              0.053359564 = queryWeight, product of:
                1.1128421 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.017376577 = queryNorm
              0.59742737 = fieldWeight in 4780, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.125 = fieldNorm(doc=4780)
          0.051787753 = weight(abstract_txt:problem in 4780) [ClassicSimilarity], result of:
            0.051787753 = score(doc=4780,freq=1.0), product of:
              0.09290563 = queryWeight, product of:
                1.1989548 = boost
                4.4593854 = idf(docFreq=1396, maxDocs=44421)
                0.017376577 = queryNorm
              0.5574232 = fieldWeight in 4780, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4593854 = idf(docFreq=1396, maxDocs=44421)
                0.125 = fieldNorm(doc=4780)
          0.13924858 = weight(abstract_txt:learning in 4780) [ClassicSimilarity], result of:
            0.13924858 = score(doc=4780,freq=5.0), product of:
              0.105057694 = queryWeight, product of:
                1.2749575 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.017376577 = queryNorm
              1.3254486 = fieldWeight in 4780, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.125 = fieldNorm(doc=4780)
          0.27864018 = weight(abstract_txt:programming in 4780) [ClassicSimilarity], result of:
            0.27864018 = score(doc=4780,freq=1.0), product of:
              0.32655042 = queryWeight, product of:
                2.7529767 = boost
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.017376577 = queryNorm
              0.85328376 = fieldWeight in 4780, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.125 = fieldNorm(doc=4780)
        0.2 = coord(5/25)
    
  2. Cohen, D.J.: From Babel to knowledge : data mining large digital collections (2006) 0.10
    0.103891075 = sum of:
      0.103891075 = product of:
        0.43287948 = sum of:
          0.01690611 = weight(abstract_txt:from in 2178) [ClassicSimilarity], result of:
            0.01690611 = score(doc=2178,freq=6.0), product of:
              0.053359564 = queryWeight, product of:
                1.1128421 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.017376577 = queryNorm
              0.31683373 = fieldWeight in 2178, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.046875 = fieldNorm(doc=2178)
          0.01883228 = weight(abstract_txt:databases in 2178) [ClassicSimilarity], result of:
            0.01883228 = score(doc=2178,freq=1.0), product of:
              0.09102033 = queryWeight, product of:
                1.1867275 = boost
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.017376577 = queryNorm
              0.2069019 = fieldWeight in 2178, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.046875 = fieldNorm(doc=2178)
          0.03202588 = weight(abstract_txt:patterns in 2178) [ClassicSimilarity], result of:
            0.03202588 = score(doc=2178,freq=1.0), product of:
              0.12967928 = queryWeight, product of:
                1.4165016 = boost
                5.2685275 = idf(docFreq=621, maxDocs=44421)
                0.017376577 = queryNorm
              0.24696222 = fieldWeight in 2178, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2685275 = idf(docFreq=621, maxDocs=44421)
                0.046875 = fieldNorm(doc=2178)
          0.024419338 = weight(abstract_txt:knowledge in 2178) [ClassicSimilarity], result of:
            0.024419338 = score(doc=2178,freq=1.0), product of:
              0.14689457 = queryWeight, product of:
                2.3837168 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.017376577 = queryNorm
              0.16623716 = fieldWeight in 2178, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.046875 = fieldNorm(doc=2178)
          0.23620579 = weight(abstract_txt:imperfect in 2178) [ClassicSimilarity], result of:
            0.23620579 = score(doc=2178,freq=2.0), product of:
              0.38999006 = queryWeight, product of:
                2.4564536 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.017376577 = queryNorm
              0.6056713 = fieldWeight in 2178, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.046875 = fieldNorm(doc=2178)
          0.104490064 = weight(abstract_txt:programming in 2178) [ClassicSimilarity], result of:
            0.104490064 = score(doc=2178,freq=1.0), product of:
              0.32655042 = queryWeight, product of:
                2.7529767 = boost
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.017376577 = queryNorm
              0.3199814 = fieldWeight in 2178, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.046875 = fieldNorm(doc=2178)
        0.24 = coord(6/25)
    
  3. Chen, L.; Fang, H.: ¬An automatic method for ex-tracting innovative ideas based on the Scopus® database (2019) 0.10
    0.09506613 = sum of:
      0.09506613 = product of:
        0.47533065 = sum of:
          0.01840504 = weight(abstract_txt:from in 310) [ClassicSimilarity], result of:
            0.01840504 = score(doc=310,freq=4.0), product of:
              0.053359564 = queryWeight, product of:
                1.1128421 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.017376577 = queryNorm
              0.34492487 = fieldWeight in 310, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=310)
          0.12977387 = weight(abstract_txt:excluding in 310) [ClassicSimilarity], result of:
            0.12977387 = score(doc=310,freq=2.0), product of:
              0.17140186 = queryWeight, product of:
                1.1515281 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.017376577 = queryNorm
              0.75713223 = fieldWeight in 310, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.0625 = fieldNorm(doc=310)
          0.066497274 = weight(abstract_txt:moreover in 310) [ClassicSimilarity], result of:
            0.066497274 = score(doc=310,freq=1.0), product of:
              0.1742261 = queryWeight, product of:
                1.6418686 = boost
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.017376577 = queryNorm
              0.38167226 = fieldWeight in 310, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.0625 = fieldNorm(doc=310)
          0.22809535 = weight(abstract_txt:noisy in 310) [ClassicSimilarity], result of:
            0.22809535 = score(doc=310,freq=2.0), product of:
              0.31451762 = queryWeight, product of:
                2.2059937 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.017376577 = queryNorm
              0.7252228 = fieldWeight in 310, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.0625 = fieldNorm(doc=310)
          0.032559115 = weight(abstract_txt:knowledge in 310) [ClassicSimilarity], result of:
            0.032559115 = score(doc=310,freq=1.0), product of:
              0.14689457 = queryWeight, product of:
                2.3837168 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.017376577 = queryNorm
              0.22164954 = fieldWeight in 310, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.0625 = fieldNorm(doc=310)
        0.2 = coord(5/25)
    
  4. Scott, J.E.: Organizational knowledge and the Intranet (2002) 0.09
    0.09294714 = sum of:
      0.09294714 = product of:
        0.2904598 = sum of:
          0.04127116 = weight(abstract_txt:inductive in 5246) [ClassicSimilarity], result of:
            0.04127116 = score(doc=5246,freq=1.0), product of:
              0.13764015 = queryWeight, product of:
                1.0319041 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.017376577 = queryNorm
              0.2998483 = fieldWeight in 5246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5246)
          0.008133955 = weight(abstract_txt:from in 5246) [ClassicSimilarity], result of:
            0.008133955 = score(doc=5246,freq=2.0), product of:
              0.053359564 = queryWeight, product of:
                1.1128421 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.017376577 = queryNorm
              0.15243669 = fieldWeight in 5246, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5246)
          0.0624183 = weight(abstract_txt:induce in 5246) [ClassicSimilarity], result of:
            0.0624183 = score(doc=5246,freq=1.0), product of:
              0.18135184 = queryWeight, product of:
                1.1844801 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.017376577 = queryNorm
              0.34418344 = fieldWeight in 5246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5246)
          0.019460581 = weight(abstract_txt:learning in 5246) [ClassicSimilarity], result of:
            0.019460581 = score(doc=5246,freq=1.0), product of:
              0.105057694 = queryWeight, product of:
                1.2749575 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.017376577 = queryNorm
              0.18523708 = fieldWeight in 5246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5246)
          0.024269804 = weight(abstract_txt:training in 5246) [ClassicSimilarity], result of:
            0.024269804 = score(doc=5246,freq=1.0), product of:
              0.121721745 = queryWeight, product of:
                1.3723531 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.017376577 = queryNorm
              0.19938758 = fieldWeight in 5246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5246)
          0.03578829 = weight(abstract_txt:medical in 5246) [ClassicSimilarity], result of:
            0.03578829 = score(doc=5246,freq=1.0), product of:
              0.15769501 = queryWeight, product of:
                1.562035 = boost
                5.8098235 = idf(docFreq=361, maxDocs=44421)
                0.017376577 = queryNorm
              0.22694623 = fieldWeight in 5246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8098235 = idf(docFreq=361, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5246)
          0.041560795 = weight(abstract_txt:moreover in 5246) [ClassicSimilarity], result of:
            0.041560795 = score(doc=5246,freq=1.0), product of:
              0.1742261 = queryWeight, product of:
                1.6418686 = boost
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.017376577 = queryNorm
              0.23854516 = fieldWeight in 5246, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5246)
          0.057556927 = weight(abstract_txt:knowledge in 5246) [ClassicSimilarity], result of:
            0.057556927 = score(doc=5246,freq=8.0), product of:
              0.14689457 = queryWeight, product of:
                2.3837168 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.017376577 = queryNorm
              0.39182472 = fieldWeight in 5246, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5246)
        0.32 = coord(8/25)
    
  5. Malo, P.; Sinha, A.; Wallenius, J.; Korhonen, P.: Concept-based document classification using Wikipedia and value function (2011) 0.09
    0.08897348 = sum of:
      0.08897348 = product of:
        0.37072283 = sum of:
          0.01992404 = weight(abstract_txt:from in 948) [ClassicSimilarity], result of:
            0.01992404 = score(doc=948,freq=3.0), product of:
              0.053359564 = queryWeight, product of:
                1.1128421 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.017376577 = queryNorm
              0.3733921 = fieldWeight in 948, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.078125 = fieldNorm(doc=948)
          0.032367345 = weight(abstract_txt:problem in 948) [ClassicSimilarity], result of:
            0.032367345 = score(doc=948,freq=1.0), product of:
              0.09290563 = queryWeight, product of:
                1.1989548 = boost
                4.4593854 = idf(docFreq=1396, maxDocs=44421)
                0.017376577 = queryNorm
              0.34838948 = fieldWeight in 948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4593854 = idf(docFreq=1396, maxDocs=44421)
                0.078125 = fieldNorm(doc=948)
          0.05504283 = weight(abstract_txt:learning in 948) [ClassicSimilarity], result of:
            0.05504283 = score(doc=948,freq=2.0), product of:
              0.105057694 = queryWeight, product of:
                1.2749575 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.017376577 = queryNorm
              0.52392954 = fieldWeight in 948, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.078125 = fieldNorm(doc=948)
          0.04853961 = weight(abstract_txt:training in 948) [ClassicSimilarity], result of:
            0.04853961 = score(doc=948,freq=1.0), product of:
              0.121721745 = queryWeight, product of:
                1.3723531 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.017376577 = queryNorm
              0.39877516 = fieldWeight in 948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.078125 = fieldNorm(doc=948)
          0.040698897 = weight(abstract_txt:knowledge in 948) [ClassicSimilarity], result of:
            0.040698897 = score(doc=948,freq=1.0), product of:
              0.14689457 = queryWeight, product of:
                2.3837168 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.017376577 = queryNorm
              0.27706194 = fieldWeight in 948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.078125 = fieldNorm(doc=948)
          0.17415011 = weight(abstract_txt:programming in 948) [ClassicSimilarity], result of:
            0.17415011 = score(doc=948,freq=1.0), product of:
              0.32655042 = queryWeight, product of:
                2.7529767 = boost
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.017376577 = queryNorm
              0.53330237 = fieldWeight in 948, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.82627 = idf(docFreq=130, maxDocs=44421)
                0.078125 = fieldNorm(doc=948)
        0.24 = coord(6/25)