Document (#43722)

Author
Lowe, D.B.
Dollinger, I.
Koster, T.
Herbert, B.E.
Title
Text mining for type of research classification
Source
Cataloging and classification quarterly. 59(2021) no.8, p.815-834
Year
2021
Abstract
This project brought together undergraduate students in Computer Science with librarians to mine abstracts of articles from the Texas A&M University Libraries' institutional repository, OAKTrust, in order to probe the creation of new metadata to improve discovery and use. The mining operation task consisted simply of classifying the articles into two categories of research type: basic research ("for understanding," "curiosity-based," or "knowledge-based") and applied research ("use-based"). These categories are fundamental especially for funders but are also important to researchers. The mining-to-classification steps took several iterations, but ultimately, we achieved good results with the toolkit BERT (Bidirectional Encoder Representations from Transformers). The project and its workflows represent a preview of what may lie ahead in the future of crafting metadata using text mining techniques to enhance discoverability.
Content
Vgl.: https://doi.org/10.1080/01639374.2021.1998281.
Footnote
Teil eines Themenheftes: Artificial intelligence (AI) and automated processes for subject sccess
Theme
Automatisches Indexieren
Data Mining

Similar documents (author)

  1. Lowe, D.: Leverhulme Trust award to catalogue the archive of Stefan Heym (1995) 5.87
    5.874302 = sum of:
      5.874302 = weight(author_txt:lowe in 3756) [ClassicSimilarity], result of:
        5.874302 = fieldWeight in 3756, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.625 = fieldNorm(doc=3756)
    
  2. Steichen, B.; Lowe, R.: How do multilingual users search? : An investigation of query and result list language choices (2021) 4.70
    4.6994414 = sum of:
      4.6994414 = weight(author_txt:lowe in 1247) [ClassicSimilarity], result of:
        4.6994414 = fieldWeight in 1247, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.5 = fieldNorm(doc=1247)
    
  3. Bartolo, L.M.; Lowe, C.S.; Glotzer, S.C.: Information management of microstructures : non-print, multidisciplinary information in a materials science digital library (2004) 3.52
    3.524581 = sum of:
      3.524581 = weight(author_txt:lowe in 3669) [ClassicSimilarity], result of:
        3.524581 = fieldWeight in 3669, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.375 = fieldNorm(doc=3669)
    
  4. Spitzer, K.L.; Eisenberg, M.B.; Lowe, C.A.: Information literacy : essential skills for the information age (1998) 3.52
    3.524581 = sum of:
      3.524581 = weight(author_txt:lowe in 4682) [ClassicSimilarity], result of:
        3.524581 = fieldWeight in 4682, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.375 = fieldNorm(doc=4682)
    
  5. Spitzer, K.L.; Eisenberg, M.B.; Lowe, C.A.: Information literacy : essential skills for the information age (2004) 3.52
    3.524581 = sum of:
      3.524581 = weight(author_txt:lowe in 4686) [ClassicSimilarity], result of:
        3.524581 = fieldWeight in 4686, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.375 = fieldNorm(doc=4686)
    

Similar documents (content)

  1. Chou, C.; Chu, T.: ¬An analysis of BERT (NLP) for assisted subject indexing for Project Gutenberg (2022) 0.31
    0.30987695 = sum of:
      0.30987695 = product of:
        1.1067034 = sum of:
          0.032300748 = weight(abstract_txt:classification in 2141) [ClassicSimilarity], result of:
            0.032300748 = score(doc=2141,freq=1.0), product of:
              0.08630449 = queryWeight, product of:
                1.0302049 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.02098466 = queryNorm
              0.37426496 = fieldWeight in 2141, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.09375 = fieldNorm(doc=2141)
          0.042647887 = weight(abstract_txt:project in 2141) [ClassicSimilarity], result of:
            0.042647887 = score(doc=2141,freq=1.0), product of:
              0.10386998 = queryWeight, product of:
                1.1301912 = boost
                4.3796177 = idf(docFreq=1512, maxDocs=44421)
                0.02098466 = queryNorm
              0.41058916 = fieldWeight in 2141, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3796177 = idf(docFreq=1512, maxDocs=44421)
                0.09375 = fieldNorm(doc=2141)
          0.18892533 = weight(abstract_txt:bidirectional in 2141) [ClassicSimilarity], result of:
            0.18892533 = score(doc=2141,freq=1.0), product of:
              0.22236948 = queryWeight, product of:
                1.1693095 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.02098466 = queryNorm
              0.849601 = fieldWeight in 2141, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.09375 = fieldNorm(doc=2141)
          0.19873105 = weight(abstract_txt:encoder in 2141) [ClassicSimilarity], result of:
            0.19873105 = score(doc=2141,freq=1.0), product of:
              0.22999877 = queryWeight, product of:
                1.1891993 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.02098466 = queryNorm
              0.86405265 = fieldWeight in 2141, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.09375 = fieldNorm(doc=2141)
          0.3650466 = weight(abstract_txt:bert in 2141) [ClassicSimilarity], result of:
            0.3650466 = score(doc=2141,freq=3.0), product of:
              0.23918842 = queryWeight, product of:
                1.212724 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.02098466 = queryNorm
              1.5261884 = fieldWeight in 2141, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.09375 = fieldNorm(doc=2141)
          0.24702536 = weight(abstract_txt:transformers in 2141) [ClassicSimilarity], result of:
            0.24702536 = score(doc=2141,freq=1.0), product of:
              0.26589453 = queryWeight, product of:
                1.278635 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.02098466 = queryNorm
              0.9290351 = fieldWeight in 2141, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.09375 = fieldNorm(doc=2141)
          0.032026377 = weight(abstract_txt:research in 2141) [ClassicSimilarity], result of:
            0.032026377 = score(doc=2141,freq=1.0), product of:
              0.10812022 = queryWeight, product of:
                1.6307048 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.02098466 = queryNorm
              0.2962108 = fieldWeight in 2141, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.09375 = fieldNorm(doc=2141)
        0.28 = coord(7/25)
    
  2. Meng, K.; Ba, Z.; Ma, Y.; Li, G.: ¬A network coupling approach to detecting hierarchical linkages between science and technology (2024) 0.15
    0.14595206 = sum of:
      0.14595206 = product of:
        0.6081336 = sum of:
          0.12595022 = weight(abstract_txt:bidirectional in 2207) [ClassicSimilarity], result of:
            0.12595022 = score(doc=2207,freq=1.0), product of:
              0.22236948 = queryWeight, product of:
                1.1693095 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.02098466 = queryNorm
              0.56640065 = fieldWeight in 2207, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.0625 = fieldNorm(doc=2207)
          0.13248736 = weight(abstract_txt:encoder in 2207) [ClassicSimilarity], result of:
            0.13248736 = score(doc=2207,freq=1.0), product of:
              0.22999877 = queryWeight, product of:
                1.1891993 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.02098466 = queryNorm
              0.5760351 = fieldWeight in 2207, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=2207)
          0.14050649 = weight(abstract_txt:bert in 2207) [ClassicSimilarity], result of:
            0.14050649 = score(doc=2207,freq=1.0), product of:
              0.23918842 = queryWeight, product of:
                1.212724 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.02098466 = queryNorm
              0.5874302 = fieldWeight in 2207, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=2207)
          0.02315499 = weight(abstract_txt:based in 2207) [ClassicSimilarity], result of:
            0.02315499 = score(doc=2207,freq=2.0), product of:
              0.08230055 = queryWeight, product of:
                1.2321225 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.02098466 = queryNorm
              0.28134674 = fieldWeight in 2207, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=2207)
          0.16468358 = weight(abstract_txt:transformers in 2207) [ClassicSimilarity], result of:
            0.16468358 = score(doc=2207,freq=1.0), product of:
              0.26589453 = queryWeight, product of:
                1.278635 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.02098466 = queryNorm
              0.61935675 = fieldWeight in 2207, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0625 = fieldNorm(doc=2207)
          0.021350918 = weight(abstract_txt:research in 2207) [ClassicSimilarity], result of:
            0.021350918 = score(doc=2207,freq=1.0), product of:
              0.10812022 = queryWeight, product of:
                1.6307048 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.02098466 = queryNorm
              0.19747387 = fieldWeight in 2207, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0625 = fieldNorm(doc=2207)
        0.24 = coord(6/25)
    
  3. Chen, K.; Zhao, Y.; Song, N.; Han, Y.; Peng, J.; Wang, J.: You are not alone: : characterizing users' relationship-layer identities in online health communities (2024) 0.14
    0.14432439 = sum of:
      0.14432439 = product of:
        0.6013516 = sum of:
          0.12595022 = weight(abstract_txt:bidirectional in 2300) [ClassicSimilarity], result of:
            0.12595022 = score(doc=2300,freq=1.0), product of:
              0.22236948 = queryWeight, product of:
                1.1693095 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.02098466 = queryNorm
              0.56640065 = fieldWeight in 2300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.0625 = fieldNorm(doc=2300)
          0.13248736 = weight(abstract_txt:encoder in 2300) [ClassicSimilarity], result of:
            0.13248736 = score(doc=2300,freq=1.0), product of:
              0.22999877 = queryWeight, product of:
                1.1891993 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.02098466 = queryNorm
              0.5760351 = fieldWeight in 2300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=2300)
          0.14050649 = weight(abstract_txt:bert in 2300) [ClassicSimilarity], result of:
            0.14050649 = score(doc=2300,freq=1.0), product of:
              0.23918842 = queryWeight, product of:
                1.212724 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.02098466 = queryNorm
              0.5874302 = fieldWeight in 2300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=2300)
          0.016373053 = weight(abstract_txt:based in 2300) [ClassicSimilarity], result of:
            0.016373053 = score(doc=2300,freq=1.0), product of:
              0.08230055 = queryWeight, product of:
                1.2321225 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.02098466 = queryNorm
              0.1989422 = fieldWeight in 2300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=2300)
          0.16468358 = weight(abstract_txt:transformers in 2300) [ClassicSimilarity], result of:
            0.16468358 = score(doc=2300,freq=1.0), product of:
              0.26589453 = queryWeight, product of:
                1.278635 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.02098466 = queryNorm
              0.61935675 = fieldWeight in 2300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0625 = fieldNorm(doc=2300)
          0.021350918 = weight(abstract_txt:research in 2300) [ClassicSimilarity], result of:
            0.021350918 = score(doc=2300,freq=1.0), product of:
              0.10812022 = queryWeight, product of:
                1.6307048 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.02098466 = queryNorm
              0.19747387 = fieldWeight in 2300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0625 = fieldNorm(doc=2300)
        0.24 = coord(6/25)
    
  4. Joo, S.; Choi, I.; Choi, N.: Topic analysis of the research domain in knowledge organization : a Latent Dirichlet Allocation approach (2018) 0.12
    0.11617721 = sum of:
      0.11617721 = product of:
        0.4149186 = sum of:
          0.02153383 = weight(abstract_txt:classification in 304) [ClassicSimilarity], result of:
            0.02153383 = score(doc=304,freq=1.0), product of:
              0.08630449 = queryWeight, product of:
                1.0302049 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.02098466 = queryNorm
              0.24950996 = fieldWeight in 304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=304)
          0.038680036 = weight(abstract_txt:text in 304) [ClassicSimilarity], result of:
            0.038680036 = score(doc=304,freq=3.0), product of:
              0.08842398 = queryWeight, product of:
                1.0427781 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02098466 = queryNorm
              0.4374383 = fieldWeight in 304, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=304)
          0.016373053 = weight(abstract_txt:based in 304) [ClassicSimilarity], result of:
            0.016373053 = score(doc=304,freq=1.0), product of:
              0.08230055 = queryWeight, product of:
                1.2321225 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.02098466 = queryNorm
              0.1989422 = fieldWeight in 304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=304)
          0.036959786 = weight(abstract_txt:articles in 304) [ClassicSimilarity], result of:
            0.036959786 = score(doc=304,freq=1.0), product of:
              0.12371969 = queryWeight, product of:
                1.2334635 = boost
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.02098466 = queryNorm
              0.2987381 = fieldWeight in 304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.0625 = fieldNorm(doc=304)
          0.03931536 = weight(abstract_txt:metadata in 304) [ClassicSimilarity], result of:
            0.03931536 = score(doc=304,freq=1.0), product of:
              0.1289221 = queryWeight, product of:
                1.25913 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.02098466 = queryNorm
              0.30495438 = fieldWeight in 304, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.0625 = fieldNorm(doc=304)
          0.03698087 = weight(abstract_txt:research in 304) [ClassicSimilarity], result of:
            0.03698087 = score(doc=304,freq=3.0), product of:
              0.10812022 = queryWeight, product of:
                1.6307048 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.02098466 = queryNorm
              0.34203476 = fieldWeight in 304, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0625 = fieldNorm(doc=304)
          0.22507568 = weight(abstract_txt:mining in 304) [ClassicSimilarity], result of:
            0.22507568 = score(doc=304,freq=2.0), product of:
              0.4125769 = queryWeight, product of:
                3.185476 = boost
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.02098466 = queryNorm
              0.5455363 = fieldWeight in 304, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.0625 = fieldNorm(doc=304)
        0.28 = coord(7/25)
    
  5. Altinel, B.; Ganiz, M.C.: Semantic text classification : a survey of past and recent advances (2018) 0.11
    0.111748345 = sum of:
      0.111748345 = product of:
        0.39910123 = sum of:
          0.06793616 = weight(abstract_txt:classification in 51) [ClassicSimilarity], result of:
            0.06793616 = score(doc=51,freq=13.0), product of:
              0.08630449 = queryWeight, product of:
                1.0302049 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.02098466 = queryNorm
              0.7871683 = fieldWeight in 51, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0546875 = fieldNorm(doc=51)
          0.07045405 = weight(abstract_txt:text in 51) [ClassicSimilarity], result of:
            0.07045405 = score(doc=51,freq=13.0), product of:
              0.08842398 = queryWeight, product of:
                1.0427781 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02098466 = queryNorm
              0.7967754 = fieldWeight in 51, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0546875 = fieldNorm(doc=51)
          0.02481409 = weight(abstract_txt:based in 51) [ClassicSimilarity], result of:
            0.02481409 = score(doc=51,freq=3.0), product of:
              0.08230055 = queryWeight, product of:
                1.2321225 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.02098466 = queryNorm
              0.30150574 = fieldWeight in 51, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0546875 = fieldNorm(doc=51)
          0.03684445 = weight(abstract_txt:type in 51) [ClassicSimilarity], result of:
            0.03684445 = score(doc=51,freq=1.0), product of:
              0.13495696 = queryWeight, product of:
                1.288263 = boost
                4.992163 = idf(docFreq=819, maxDocs=44421)
                0.02098466 = queryNorm
              0.2730089 = fieldWeight in 51, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.992163 = idf(docFreq=819, maxDocs=44421)
                0.0546875 = fieldNorm(doc=51)
          0.04111194 = weight(abstract_txt:categories in 51) [ClassicSimilarity], result of:
            0.04111194 = score(doc=51,freq=1.0), product of:
              0.14518636 = queryWeight, product of:
                1.3361949 = boost
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.02098466 = queryNorm
              0.28316668 = fieldWeight in 51, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.0546875 = fieldNorm(doc=51)
          0.018682053 = weight(abstract_txt:research in 51) [ClassicSimilarity], result of:
            0.018682053 = score(doc=51,freq=1.0), product of:
              0.10812022 = queryWeight, product of:
                1.6307048 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.02098466 = queryNorm
              0.17278963 = fieldWeight in 51, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0546875 = fieldNorm(doc=51)
          0.13925847 = weight(abstract_txt:mining in 51) [ClassicSimilarity], result of:
            0.13925847 = score(doc=51,freq=1.0), product of:
              0.4125769 = queryWeight, product of:
                3.185476 = boost
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.02098466 = queryNorm
              0.33753335 = fieldWeight in 51, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.0546875 = fieldNorm(doc=51)
        0.28 = coord(7/25)