Document (#13268)

Author
Yang, Y.
Wilbur, J.
Title
Using corpus statistics to remove redundant words in text categorization
Source
Journal of the American Society for Information Science. 47(1996) no.5, S.357-369
Year
1996
Abstract
This article studies aggressive word removal in text categorization to reduce the noice in free texts to enhance the computational efficiency of categorization. We use a novel stop word identification method to automatically generate domain specific stoplists which are much larger than a conventional domain-independent stoplist. In our tests with 3 categorization methods on text collections from different domains/applications, significant numbers of words were removed without sacrificing categorization effectiveness. In the test of the Expert Network method on CACM documents, for example, an 87% removal of unique qords reduced the vocabulary of documents from 8.002 distinct words to 1.045 words, which resulted in a 63% time savings and a 74% memory savings in the computation of category ranking, with a 10% precision improvement on average over not using word removal. It is evident in this study that automated word removal based on corpus statistics has a practical and significant impact on the computational tractability of categorization methods in large databases
Theme
Computerlinguistik

Similar documents (author)

  1. Wilbur, W.J.: Global term weights for document retrieval learned from TREC data (2001) 2.37
    2.3720903 = sum of:
      2.3720903 = product of:
        4.7441807 = sum of:
          4.7441807 = weight(author_txt:wilbur in 2646) [ClassicSimilarity], result of:
            4.7441807 = score(doc=2646,freq=1.0), product of:
              0.8440272 = queryWeight, product of:
                1.2545102 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.074809626 = queryNorm
              5.620886 = fieldWeight in 2646, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.625 = fieldNorm(doc=2646)
        0.5 = coord(1/2)
    
  2. Wilbur, W.J.: Human subjectivity and performance limits in document retrieval (1996) 2.37
    2.3720903 = sum of:
      2.3720903 = product of:
        4.7441807 = sum of:
          4.7441807 = weight(author_txt:wilbur in 6675) [ClassicSimilarity], result of:
            4.7441807 = score(doc=6675,freq=1.0), product of:
              0.8440272 = queryWeight, product of:
                1.2545102 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.074809626 = queryNorm
              5.620886 = fieldWeight in 6675, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.625 = fieldNorm(doc=6675)
        0.5 = coord(1/2)
    
  3. Wilbur, W.J.: ¬A comparison of group and individual performance among subject experts and untrained workers at the document retrieval task (1998) 2.37
    2.3720903 = sum of:
      2.3720903 = product of:
        4.7441807 = sum of:
          4.7441807 = weight(author_txt:wilbur in 4263) [ClassicSimilarity], result of:
            4.7441807 = score(doc=4263,freq=1.0), product of:
              0.8440272 = queryWeight, product of:
                1.2545102 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.074809626 = queryNorm
              5.620886 = fieldWeight in 4263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.625 = fieldNorm(doc=4263)
        0.5 = coord(1/2)
    
  4. Wilbur, W.J.: Human subjectivity and performance limits in document retrieval (1999) 2.37
    2.3720903 = sum of:
      2.3720903 = product of:
        4.7441807 = sum of:
          4.7441807 = weight(author_txt:wilbur in 5539) [ClassicSimilarity], result of:
            4.7441807 = score(doc=5539,freq=1.0), product of:
              0.8440272 = queryWeight, product of:
                1.2545102 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.074809626 = queryNorm
              5.620886 = fieldWeight in 5539, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.625 = fieldNorm(doc=5539)
        0.5 = coord(1/2)
    
  5. Wilbur, W.J.: ¬A retrieval system based on automatic relevance weighting of search terms (1992) 2.37
    2.3720903 = sum of:
      2.3720903 = product of:
        4.7441807 = sum of:
          4.7441807 = weight(author_txt:wilbur in 6269) [ClassicSimilarity], result of:
            4.7441807 = score(doc=6269,freq=1.0), product of:
              0.8440272 = queryWeight, product of:
                1.2545102 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.074809626 = queryNorm
              5.620886 = fieldWeight in 6269, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.625 = fieldNorm(doc=6269)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Goren-Bar, D.; Kuflik, T.: Supporting user-subjective categorization with self-organizing maps and learning vector quantization (2005) 0.20
    0.19809066 = sum of:
      0.19809066 = product of:
        0.82537776 = sum of:
          0.009888737 = weight(abstract_txt:using in 4325) [ClassicSimilarity], result of:
            0.009888737 = score(doc=4325,freq=1.0), product of:
              0.045769658 = queryWeight, product of:
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0132402 = queryNorm
              0.21605442 = fieldWeight in 4325, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=4325)
          0.029066568 = weight(abstract_txt:documents in 4325) [ClassicSimilarity], result of:
            0.029066568 = score(doc=4325,freq=3.0), product of:
              0.06511872 = queryWeight, product of:
                1.1927903 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0132402 = queryNorm
              0.4463627 = fieldWeight in 4325, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=4325)
          0.01702883 = weight(abstract_txt:methods in 4325) [ClassicSimilarity], result of:
            0.01702883 = score(doc=4325,freq=1.0), product of:
              0.065756746 = queryWeight, product of:
                1.1986195 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0132402 = queryNorm
              0.25896704 = fieldWeight in 4325, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=4325)
          0.021796329 = weight(abstract_txt:method in 4325) [ClassicSimilarity], result of:
            0.021796329 = score(doc=4325,freq=1.0), product of:
              0.07751862 = queryWeight, product of:
                1.3014101 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0132402 = queryNorm
              0.2811754 = fieldWeight in 4325, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=4325)
          0.03579943 = weight(abstract_txt:domain in 4325) [ClassicSimilarity], result of:
            0.03579943 = score(doc=4325,freq=2.0), product of:
              0.08564944 = queryWeight, product of:
                1.3679601 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0132402 = queryNorm
              0.41797623 = fieldWeight in 4325, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0625 = fieldNorm(doc=4325)
          0.7117979 = weight(abstract_txt:categorization in 4325) [ClassicSimilarity], result of:
            0.7117979 = score(doc=4325,freq=12.0), product of:
              0.4989246 = queryWeight, product of:
                5.718594 = boost
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.0132402 = queryNorm
              1.4266642 = fieldWeight in 4325, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.0625 = fieldNorm(doc=4325)
        0.24 = coord(6/25)
    
  2. Díaz, I.; Ranilla, J.; Montañes, E.; Fernández, J.; Combarro, E.F.: Improving performance of text categorization by combining filtering and support vector machines (2004) 0.17
    0.16534607 = sum of:
      0.16534607 = product of:
        0.5905217 = sum of:
          0.020976989 = weight(abstract_txt:documents in 3234) [ClassicSimilarity], result of:
            0.020976989 = score(doc=3234,freq=1.0), product of:
              0.06511872 = queryWeight, product of:
                1.1927903 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0132402 = queryNorm
              0.32213452 = fieldWeight in 3234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=3234)
          0.027245412 = weight(abstract_txt:method in 3234) [ClassicSimilarity], result of:
            0.027245412 = score(doc=3234,freq=1.0), product of:
              0.07751862 = queryWeight, product of:
                1.3014101 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0132402 = queryNorm
              0.35146925 = fieldWeight in 3234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.078125 = fieldNorm(doc=3234)
          0.029615648 = weight(abstract_txt:text in 3234) [ClassicSimilarity], result of:
            0.029615648 = score(doc=3234,freq=1.0), product of:
              0.09381127 = queryWeight, product of:
                1.7534133 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0132402 = queryNorm
              0.3156939 = fieldWeight in 3234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=3234)
          0.06765193 = weight(abstract_txt:corpus in 3234) [ClassicSimilarity], result of:
            0.06765193 = score(doc=3234,freq=1.0), product of:
              0.14214467 = queryWeight, product of:
                1.7622862 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.0132402 = queryNorm
              0.47593716 = fieldWeight in 3234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.078125 = fieldNorm(doc=3234)
          0.091941595 = weight(abstract_txt:words in 3234) [ClassicSimilarity], result of:
            0.091941595 = score(doc=3234,freq=1.0), product of:
              0.21973293 = queryWeight, product of:
                3.0986586 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0132402 = queryNorm
              0.4184243 = fieldWeight in 3234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.078125 = fieldNorm(doc=3234)
          0.09624222 = weight(abstract_txt:word in 3234) [ClassicSimilarity], result of:
            0.09624222 = score(doc=3234,freq=1.0), product of:
              0.22653268 = queryWeight, product of:
                3.146238 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0132402 = queryNorm
              0.42484915 = fieldWeight in 3234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.078125 = fieldNorm(doc=3234)
          0.25684795 = weight(abstract_txt:categorization in 3234) [ClassicSimilarity], result of:
            0.25684795 = score(doc=3234,freq=1.0), product of:
              0.4989246 = queryWeight, product of:
                5.718594 = boost
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.0132402 = queryNorm
              0.5148031 = fieldWeight in 3234, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.078125 = fieldNorm(doc=3234)
        0.28 = coord(7/25)
    
  3. Han, K.; Rezapour, R.; Nakamura, K.; Devkota, D.; Miller, D.C.; Diesner, J.: ¬An expert-in-the-loop method for domain-specific document categorization based on small training data (2023) 0.16
    0.16421908 = sum of:
      0.16421908 = product of:
        0.5864967 = sum of:
          0.023732753 = weight(abstract_txt:documents in 1969) [ClassicSimilarity], result of:
            0.023732753 = score(doc=1969,freq=2.0), product of:
              0.06511872 = queryWeight, product of:
                1.1927903 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0132402 = queryNorm
              0.3644536 = fieldWeight in 1969, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=1969)
          0.024082402 = weight(abstract_txt:methods in 1969) [ClassicSimilarity], result of:
            0.024082402 = score(doc=1969,freq=2.0), product of:
              0.065756746 = queryWeight, product of:
                1.1986195 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0132402 = queryNorm
              0.3662347 = fieldWeight in 1969, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=1969)
          0.021796329 = weight(abstract_txt:method in 1969) [ClassicSimilarity], result of:
            0.021796329 = score(doc=1969,freq=1.0), product of:
              0.07751862 = queryWeight, product of:
                1.3014101 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0132402 = queryNorm
              0.2811754 = fieldWeight in 1969, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=1969)
          0.05062804 = weight(abstract_txt:domain in 1969) [ClassicSimilarity], result of:
            0.05062804 = score(doc=1969,freq=4.0), product of:
              0.08564944 = queryWeight, product of:
                1.3679601 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0132402 = queryNorm
              0.59110767 = fieldWeight in 1969, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0625 = fieldNorm(doc=1969)
          0.023692518 = weight(abstract_txt:text in 1969) [ClassicSimilarity], result of:
            0.023692518 = score(doc=1969,freq=1.0), product of:
              0.09381127 = queryWeight, product of:
                1.7534133 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0132402 = queryNorm
              0.25255513 = fieldWeight in 1969, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=1969)
          0.086665735 = weight(abstract_txt:computational in 1969) [ClassicSimilarity], result of:
            0.086665735 = score(doc=1969,freq=2.0), product of:
              0.15442066 = queryWeight, product of:
                1.8368084 = boost
                6.3496094 = idf(docFreq=210, maxDocs=44421)
                0.0132402 = queryNorm
              0.5612315 = fieldWeight in 1969, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.3496094 = idf(docFreq=210, maxDocs=44421)
                0.0625 = fieldNorm(doc=1969)
          0.35589895 = weight(abstract_txt:categorization in 1969) [ClassicSimilarity], result of:
            0.35589895 = score(doc=1969,freq=3.0), product of:
              0.4989246 = queryWeight, product of:
                5.718594 = boost
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.0132402 = queryNorm
              0.7133321 = fieldWeight in 1969, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.0625 = fieldNorm(doc=1969)
        0.28 = coord(7/25)
    
  4. Kim, W.; Wilbur, W.J.: Corpus-based statistical screening for content-bearing terms (2001) 0.16
    0.16019762 = sum of:
      0.16019762 = product of:
        0.44499338 = sum of:
          0.007416553 = weight(abstract_txt:using in 188) [ClassicSimilarity], result of:
            0.007416553 = score(doc=188,freq=1.0), product of:
              0.045769658 = queryWeight, product of:
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0132402 = queryNorm
              0.16204081 = fieldWeight in 188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.05413989 = weight(abstract_txt:stop in 188) [ClassicSimilarity], result of:
            0.05413989 = score(doc=188,freq=2.0), product of:
              0.108501196 = queryWeight, product of:
                1.088713 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0132402 = queryNorm
              0.49897966 = fieldWeight in 188, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.028143587 = weight(abstract_txt:documents in 188) [ClassicSimilarity], result of:
            0.028143587 = score(doc=188,freq=5.0), product of:
              0.06511872 = queryWeight, product of:
                1.1927903 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0132402 = queryNorm
              0.43218887 = fieldWeight in 188, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.05072736 = weight(abstract_txt:removed in 188) [ClassicSimilarity], result of:
            0.05072736 = score(doc=188,freq=1.0), product of:
              0.13089642 = queryWeight, product of:
                1.1958041 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0132402 = queryNorm
              0.38753816 = fieldWeight in 188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.022121098 = weight(abstract_txt:methods in 188) [ClassicSimilarity], result of:
            0.022121098 = score(doc=188,freq=3.0), product of:
              0.065756746 = queryWeight, product of:
                1.1986195 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0132402 = queryNorm
              0.33640805 = fieldWeight in 188, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.016347248 = weight(abstract_txt:method in 188) [ClassicSimilarity], result of:
            0.016347248 = score(doc=188,freq=1.0), product of:
              0.07751862 = queryWeight, product of:
                1.3014101 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0132402 = queryNorm
              0.21088156 = fieldWeight in 188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.017769389 = weight(abstract_txt:text in 188) [ClassicSimilarity], result of:
            0.017769389 = score(doc=188,freq=1.0), product of:
              0.09381127 = queryWeight, product of:
                1.7534133 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0132402 = queryNorm
              0.18941635 = fieldWeight in 188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.0955485 = weight(abstract_txt:words in 188) [ClassicSimilarity], result of:
            0.0955485 = score(doc=188,freq=3.0), product of:
              0.21973293 = queryWeight, product of:
                3.0986586 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0132402 = queryNorm
              0.43483928 = fieldWeight in 188, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.15277977 = weight(abstract_txt:word in 188) [ClassicSimilarity], result of:
            0.15277977 = score(doc=188,freq=7.0), product of:
              0.22653268 = queryWeight, product of:
                3.146238 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0132402 = queryNorm
              0.6744271 = fieldWeight in 188, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
        0.36 = coord(9/25)
    
  5. Tseng, Y.-H.: Automatic thesaurus generation for Chinese documents (2002) 0.15
    0.15300705 = sum of:
      0.15300705 = product of:
        0.47814703 = sum of:
          0.009888737 = weight(abstract_txt:using in 226) [ClassicSimilarity], result of:
            0.009888737 = score(doc=226,freq=1.0), product of:
              0.045769658 = queryWeight, product of:
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0132402 = queryNorm
              0.21605442 = fieldWeight in 226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=226)
          0.040445957 = weight(abstract_txt:reduced in 226) [ClassicSimilarity], result of:
            0.040445957 = score(doc=226,freq=1.0), product of:
              0.092908874 = queryWeight, product of:
                1.007453 = boost
                6.965269 = idf(docFreq=113, maxDocs=44421)
                0.0132402 = queryNorm
              0.43532932 = fieldWeight in 226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.965269 = idf(docFreq=113, maxDocs=44421)
                0.0625 = fieldNorm(doc=226)
          0.051043577 = weight(abstract_txt:stop in 226) [ClassicSimilarity], result of:
            0.051043577 = score(doc=226,freq=1.0), product of:
              0.108501196 = queryWeight, product of:
                1.088713 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0132402 = queryNorm
              0.47044253 = fieldWeight in 226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=226)
          0.05490908 = weight(abstract_txt:computation in 226) [ClassicSimilarity], result of:
            0.05490908 = score(doc=226,freq=1.0), product of:
              0.11391211 = queryWeight, product of:
                1.1155297 = boost
                7.7124834 = idf(docFreq=53, maxDocs=44421)
                0.0132402 = queryNorm
              0.4820302 = fieldWeight in 226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7124834 = idf(docFreq=53, maxDocs=44421)
                0.0625 = fieldNorm(doc=226)
          0.01678159 = weight(abstract_txt:documents in 226) [ClassicSimilarity], result of:
            0.01678159 = score(doc=226,freq=1.0), product of:
              0.06511872 = queryWeight, product of:
                1.1927903 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0132402 = queryNorm
              0.25770763 = fieldWeight in 226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=226)
          0.023692518 = weight(abstract_txt:text in 226) [ClassicSimilarity], result of:
            0.023692518 = score(doc=226,freq=1.0), product of:
              0.09381127 = queryWeight, product of:
                1.7534133 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0132402 = queryNorm
              0.25255513 = fieldWeight in 226, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=226)
          0.12739801 = weight(abstract_txt:words in 226) [ClassicSimilarity], result of:
            0.12739801 = score(doc=226,freq=3.0), product of:
              0.21973293 = queryWeight, product of:
                3.0986586 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0132402 = queryNorm
              0.5797857 = fieldWeight in 226, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0625 = fieldNorm(doc=226)
          0.15398756 = weight(abstract_txt:word in 226) [ClassicSimilarity], result of:
            0.15398756 = score(doc=226,freq=4.0), product of:
              0.22653268 = queryWeight, product of:
                3.146238 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0132402 = queryNorm
              0.67975867 = fieldWeight in 226, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0625 = fieldNorm(doc=226)
        0.32 = coord(8/25)