Document (#36776)

Author
Maghsoodi, N.
Homayounpour, M.M.
Title
Improving Farsi multiclass text classification using a thesaurus and two-stage feature selection
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.10, pp. 2055-2066
Year
2011
Abstract
The steady growth of information content has made systems for automatic document classification necessary. This article presents a system for categorizing multiclass Farsi documents that requires fewer training examples and can help compensate for the shortcomings of the standard training dataset. The proposed idea is to extend the feature vector with words extracted from a thesaurus and then filter the extended vector by applying a secondary feature selection that discards inappropriate features. This secondary phase chooses the more appropriate features among those added from the thesaurus, which enhances the effect of the thesaurus on the efficiency of the classifier. To evaluate the proposed system, a corpus was gathered from the Farsi Wikipedia website and from articles in the Hamshahri newspaper, the Roshd periodical, and the Soroush magazine. In addition to the role of the thesaurus and of secondary feature selection, the effects of the number of categories, the size of the training dataset, and the average number of words in the test data are examined. The results indicate that this approach improves classification efficiency, especially when the available data are insufficient for some text categories.
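The pipeline described in the abstract (primary feature selection, thesaurus expansion, then a secondary selection restricted to the added words) can be sketched roughly as follows. This is an illustration only, not the authors' implementation: the abstract does not name the selection criterion, so the class-spread score used here is a stand-in, and the function names, the toy English tokens, and the tiny thesaurus are all hypothetical.

```python
from collections import Counter, defaultdict

def term_scores(docs, labels):
    """Stand-in criterion (an assumption): spread of a term's per-class document rate."""
    per_term = defaultdict(Counter)  # term -> class -> number of docs containing it
    for tokens, label in zip(docs, labels):
        for term in set(tokens):
            per_term[term][label] += 1
    class_sizes = Counter(labels)
    scores = {}
    for term, per_class in per_term.items():
        rates = [per_class.get(c, 0) / n for c, n in class_sizes.items()]
        scores[term] = max(rates) - min(rates)
    return scores

def select_top(scores, candidates, k, min_score=0.0):
    """Keep the k best candidates whose score exceeds min_score (ties broken alphabetically)."""
    kept = [t for t in candidates if scores.get(t, 0.0) > min_score]
    return sorted(kept, key=lambda t: (-scores[t], t))[:k]

def two_stage_features(docs, labels, thesaurus, k_primary, k_secondary):
    scores = term_scores(docs, labels)
    # Stage 1: ordinary feature selection over the training vocabulary.
    primary = select_top(scores, scores.keys(), k_primary)
    # Expansion: add thesaurus entries related to the selected features.
    added = {rel for t in primary for rel in thesaurus.get(t, []) if rel not in primary}
    # Stage 2: filter only the added words, discarding those with no support.
    secondary = select_top(scores, added, k_secondary)
    return primary + secondary

# Toy data (hypothetical, English tokens standing in for Farsi words).
docs = [
    ["football", "match", "goal"],
    ["match", "team", "soccer"],
    ["election", "vote", "party"],
    ["vote", "parliament", "law"],
]
labels = ["sport", "sport", "politics", "politics"]
thesaurus = {"match": ["game", "soccer"], "vote": ["ballot", "parliament"]}

features = two_stage_features(docs, labels, thesaurus, k_primary=2, k_secondary=2)
print(features)
```

In this toy run the thesaurus proposes four extra words, and the secondary stage keeps only the two that actually discriminate between classes in the training data. A real system would substitute its own selection measure and thesaurus lookup.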
Theme
Automatic classification

Similar documents (content)

  1. Mengle, S.S.R.; Goharian, N.: Ambiguity measure feature-selection algorithm (2009) 0.49
    0.48905146 = sum of:
      0.48905146 = product of:
        1.1114806 = sum of:
          0.04794571 = weight(abstract_txt:text in 3804) [ClassicSimilarity], result of:
            0.04794571 = score(doc=3804,freq=6.0), product of:
              0.07750289 = queryWeight, product of:
                1.0148128 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.018899737 = queryNorm
              0.61863124 = fieldWeight in 3804, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.029410489 = weight(abstract_txt:documents in 3804) [ClassicSimilarity], result of:
            0.029410489 = score(doc=3804,freq=2.0), product of:
              0.080697484 = queryWeight, product of:
                1.0355165 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.018899737 = queryNorm
              0.3644536 = fieldWeight in 3804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.020983819 = weight(abstract_txt:number in 3804) [ClassicSimilarity], result of:
            0.020983819 = score(doc=3804,freq=1.0), product of:
              0.081181705 = queryWeight, product of:
                1.0386186 = boost
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.018899737 = queryNorm
              0.25847965 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.009349325 = weight(abstract_txt:from in 3804) [ClassicSimilarity], result of:
            0.009349325 = score(doc=3804,freq=1.0), product of:
              0.05421079 = queryWeight, product of:
                1.0394784 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.018899737 = queryNorm
              0.17246243 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.04721188 = weight(abstract_txt:effect in 3804) [ClassicSimilarity], result of:
            0.04721188 = score(doc=3804,freq=1.0), product of:
              0.13939142 = queryWeight, product of:
                1.3609589 = boost
                5.419201 = idf(docFreq=534, maxDocs=44421)
                0.018899737 = queryNorm
              0.33870006 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.419201 = idf(docFreq=534, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.028311338 = weight(abstract_txt:classification in 3804) [ClassicSimilarity], result of:
            0.028311338 = score(doc=3804,freq=1.0), product of:
              0.11346777 = queryWeight, product of:
                1.5038651 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.018899737 = queryNorm
              0.24950996 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.082210094 = weight(abstract_txt:vector in 3804) [ClassicSimilarity], result of:
            0.082210094 = score(doc=3804,freq=1.0), product of:
              0.20175235 = queryWeight, product of:
                1.6373303 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.018899737 = queryNorm
              0.40748024 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.07215117 = weight(abstract_txt:features in 3804) [ClassicSimilarity], result of:
            0.07215117 = score(doc=3804,freq=3.0), product of:
              0.14678694 = queryWeight, product of:
                1.7104734 = boost
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.018899737 = queryNorm
              0.4915367 = fieldWeight in 3804, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.083688535 = weight(abstract_txt:training in 3804) [ClassicSimilarity], result of:
            0.083688535 = score(doc=3804,freq=2.0), product of:
              0.18549529 = queryWeight, product of:
                1.9228219 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.018899737 = queryNorm
              0.45116258 = fieldWeight in 3804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.2060502 = weight(abstract_txt:selection in 3804) [ClassicSimilarity], result of:
            0.2060502 = score(doc=3804,freq=5.0), product of:
              0.27428612 = queryWeight, product of:
                2.6998765 = boost
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.018899737 = queryNorm
              0.75122356 = fieldWeight in 3804, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.48416808 = weight(abstract_txt:feature in 3804) [ClassicSimilarity], result of:
            0.48416808 = score(doc=3804,freq=7.0), product of:
              0.49606702 = queryWeight, product of:
                4.4469047 = boost
                5.9023747 = idf(docFreq=329, maxDocs=44421)
                0.018899737 = queryNorm
              0.9760135 = fieldWeight in 3804, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.9023747 = idf(docFreq=329, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
        0.44 = coord(11/25)
    
  2. Duwairi, R.M.: Machine learning for Arabic text categorization (2006) 0.29
    0.2938974 = sum of:
      0.2938974 = product of:
        0.8163816 = sum of:
          0.02446719 = weight(abstract_txt:text in 115) [ClassicSimilarity], result of:
            0.02446719 = score(doc=115,freq=1.0), product of:
              0.07750289 = queryWeight, product of:
                1.0148128 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.018899737 = queryNorm
              0.3156939 = fieldWeight in 115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=115)
          0.051990893 = weight(abstract_txt:documents in 115) [ClassicSimilarity], result of:
            0.051990893 = score(doc=115,freq=4.0), product of:
              0.080697484 = queryWeight, product of:
                1.0355165 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.018899737 = queryNorm
              0.64426905 = fieldWeight in 115, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=115)
          0.03628364 = weight(abstract_txt:proposed in 115) [ClassicSimilarity], result of:
            0.03628364 = score(doc=115,freq=1.0), product of:
              0.10078651 = queryWeight, product of:
                1.1572527 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.018899737 = queryNorm
              0.36000493 = fieldWeight in 115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.078125 = fieldNorm(doc=115)
          0.07280023 = weight(abstract_txt:categories in 115) [ClassicSimilarity], result of:
            0.07280023 = score(doc=115,freq=2.0), product of:
              0.12725465 = queryWeight, product of:
                1.3003607 = boost
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.018899737 = queryNorm
              0.57208306 = fieldWeight in 115, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.177905 = idf(docFreq=680, maxDocs=44421)
                0.078125 = fieldNorm(doc=115)
          0.05696868 = weight(abstract_txt:words in 115) [ClassicSimilarity], result of:
            0.05696868 = score(doc=115,freq=1.0), product of:
              0.13615051 = queryWeight, product of:
                1.3450445 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.018899737 = queryNorm
              0.4184243 = fieldWeight in 115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.078125 = fieldNorm(doc=115)
          0.10276262 = weight(abstract_txt:vector in 115) [ClassicSimilarity], result of:
            0.10276262 = score(doc=115,freq=1.0), product of:
              0.20175235 = queryWeight, product of:
                1.6373303 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.018899737 = queryNorm
              0.5093503 = fieldWeight in 115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.078125 = fieldNorm(doc=115)
          0.07363898 = weight(abstract_txt:features in 115) [ClassicSimilarity], result of:
            0.07363898 = score(doc=115,freq=2.0), product of:
              0.14678694 = queryWeight, product of:
                1.7104734 = boost
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.018899737 = queryNorm
              0.50167257 = fieldWeight in 115, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.078125 = fieldNorm(doc=115)
          0.073970914 = weight(abstract_txt:training in 115) [ClassicSimilarity], result of:
            0.073970914 = score(doc=115,freq=1.0), product of:
              0.18549529 = queryWeight, product of:
                1.9228219 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.018899737 = queryNorm
              0.39877516 = fieldWeight in 115, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.078125 = fieldNorm(doc=115)
          0.32349843 = weight(abstract_txt:feature in 115) [ClassicSimilarity], result of:
            0.32349843 = score(doc=115,freq=2.0), product of:
              0.49606702 = queryWeight, product of:
                4.4469047 = boost
                5.9023747 = idf(docFreq=329, maxDocs=44421)
                0.018899737 = queryNorm
              0.65212643 = fieldWeight in 115, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9023747 = idf(docFreq=329, maxDocs=44421)
                0.078125 = fieldNorm(doc=115)
        0.36 = coord(9/25)
    
  3. Malenica, M.; Smuc, T.; Snajder, J.; Basic, B.D.: Language morphology offset : text classification on a Croatian-English parallel corpus (2008) 0.29
    0.28754872 = sum of:
      0.28754872 = product of:
        0.89858973 = sum of:
          0.02446719 = weight(abstract_txt:text in 3035) [ClassicSimilarity], result of:
            0.02446719 = score(doc=3035,freq=1.0), product of:
              0.07750289 = queryWeight, product of:
                1.0148128 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.018899737 = queryNorm
              0.3156939 = fieldWeight in 3035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=3035)
          0.0370945 = weight(abstract_txt:number in 3035) [ClassicSimilarity], result of:
            0.0370945 = score(doc=3035,freq=2.0), product of:
              0.081181705 = queryWeight, product of:
                1.0386186 = boost
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.018899737 = queryNorm
              0.45693177 = fieldWeight in 3035, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.078125 = fieldNorm(doc=3035)
          0.035389174 = weight(abstract_txt:classification in 3035) [ClassicSimilarity], result of:
            0.035389174 = score(doc=3035,freq=1.0), product of:
              0.11346777 = queryWeight, product of:
                1.5038651 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.018899737 = queryNorm
              0.31188744 = fieldWeight in 3035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.078125 = fieldNorm(doc=3035)
          0.10276262 = weight(abstract_txt:vector in 3035) [ClassicSimilarity], result of:
            0.10276262 = score(doc=3035,freq=1.0), product of:
              0.20175235 = queryWeight, product of:
                1.6373303 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.018899737 = queryNorm
              0.5093503 = fieldWeight in 3035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.078125 = fieldNorm(doc=3035)
          0.07363898 = weight(abstract_txt:features in 3035) [ClassicSimilarity], result of:
            0.07363898 = score(doc=3035,freq=2.0), product of:
              0.14678694 = queryWeight, product of:
                1.7104734 = boost
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.018899737 = queryNorm
              0.50167257 = fieldWeight in 3035, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.078125 = fieldNorm(doc=3035)
          0.11384869 = weight(abstract_txt:applying in 3035) [ClassicSimilarity], result of:
            0.11384869 = score(doc=3035,freq=1.0), product of:
              0.24727352 = queryWeight, product of:
                2.2200432 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.018899737 = queryNorm
              0.46041605 = fieldWeight in 3035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.078125 = fieldNorm(doc=3035)
          0.11518556 = weight(abstract_txt:selection in 3035) [ClassicSimilarity], result of:
            0.11518556 = score(doc=3035,freq=1.0), product of:
              0.27428612 = queryWeight, product of:
                2.6998765 = boost
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.018899737 = queryNorm
              0.41994673 = fieldWeight in 3035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.078125 = fieldNorm(doc=3035)
          0.39620304 = weight(abstract_txt:feature in 3035) [ClassicSimilarity], result of:
            0.39620304 = score(doc=3035,freq=3.0), product of:
              0.49606702 = queryWeight, product of:
                4.4469047 = boost
                5.9023747 = idf(docFreq=329, maxDocs=44421)
                0.018899737 = queryNorm
              0.79868853 = fieldWeight in 3035, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9023747 = idf(docFreq=329, maxDocs=44421)
                0.078125 = fieldNorm(doc=3035)
        0.32 = coord(8/25)
    
  4. Ikae, C.; Savoy, J.: Gender identification on Twitter (2022) 0.28
    0.27827036 = sum of:
      0.27827036 = product of:
        0.8695949 = sum of:
          0.020983819 = weight(abstract_txt:number in 1446) [ClassicSimilarity], result of:
            0.020983819 = score(doc=1446,freq=1.0), product of:
              0.081181705 = queryWeight, product of:
                1.0386186 = boost
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.018899737 = queryNorm
              0.25847965 = fieldWeight in 1446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.0625 = fieldNorm(doc=1446)
          0.029026913 = weight(abstract_txt:proposed in 1446) [ClassicSimilarity], result of:
            0.029026913 = score(doc=1446,freq=1.0), product of:
              0.10078651 = queryWeight, product of:
                1.1572527 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.018899737 = queryNorm
              0.28800395 = fieldWeight in 1446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0625 = fieldNorm(doc=1446)
          0.045574944 = weight(abstract_txt:words in 1446) [ClassicSimilarity], result of:
            0.045574944 = score(doc=1446,freq=1.0), product of:
              0.13615051 = queryWeight, product of:
                1.3450445 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.018899737 = queryNorm
              0.33473945 = fieldWeight in 1446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0625 = fieldNorm(doc=1446)
          0.022150021 = weight(abstract_txt:some in 1446) [ClassicSimilarity], result of:
            0.022150021 = score(doc=1446,freq=1.0), product of:
              0.096341856 = queryWeight, product of:
                1.3857348 = boost
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.018899737 = queryNorm
              0.22991067 = fieldWeight in 1446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.0625 = fieldNorm(doc=1446)
          0.082210094 = weight(abstract_txt:vector in 1446) [ClassicSimilarity], result of:
            0.082210094 = score(doc=1446,freq=1.0), product of:
              0.20175235 = queryWeight, product of:
                1.6373303 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.018899737 = queryNorm
              0.40748024 = fieldWeight in 1446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.0625 = fieldNorm(doc=1446)
          0.09107896 = weight(abstract_txt:applying in 1446) [ClassicSimilarity], result of:
            0.09107896 = score(doc=1446,freq=1.0), product of:
              0.24727352 = queryWeight, product of:
                2.2200432 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.018899737 = queryNorm
              0.36833283 = fieldWeight in 1446, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.0625 = fieldNorm(doc=1446)
          0.13031758 = weight(abstract_txt:selection in 1446) [ClassicSimilarity], result of:
            0.13031758 = score(doc=1446,freq=2.0), product of:
              0.27428612 = queryWeight, product of:
                2.6998765 = boost
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.018899737 = queryNorm
              0.47511548 = fieldWeight in 1446, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.0625 = fieldNorm(doc=1446)
          0.44825256 = weight(abstract_txt:feature in 1446) [ClassicSimilarity], result of:
            0.44825256 = score(doc=1446,freq=6.0), product of:
              0.49606702 = queryWeight, product of:
                4.4469047 = boost
                5.9023747 = idf(docFreq=329, maxDocs=44421)
                0.018899737 = queryNorm
              0.9036129 = fieldWeight in 1446, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.9023747 = idf(docFreq=329, maxDocs=44421)
                0.0625 = fieldNorm(doc=1446)
        0.32 = coord(8/25)
    
  5. Tseng, Y.-H.; Lin, C.-J.; Lin, Y.-I.: Text mining techniques for patent analysis (2007) 0.24
    0.23959076 = sum of:
      0.23959076 = product of:
        0.5989769 = sum of:
          0.027681466 = weight(abstract_txt:text in 1935) [ClassicSimilarity], result of:
            0.027681466 = score(doc=1935,freq=2.0), product of:
              0.07750289 = queryWeight, product of:
                1.0148128 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.018899737 = queryNorm
              0.3571669 = fieldWeight in 1935, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.020796357 = weight(abstract_txt:documents in 1935) [ClassicSimilarity], result of:
            0.020796357 = score(doc=1935,freq=1.0), product of:
              0.080697484 = queryWeight, product of:
                1.0355165 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.018899737 = queryNorm
              0.25770763 = fieldWeight in 1935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.041050255 = weight(abstract_txt:proposed in 1935) [ClassicSimilarity], result of:
            0.041050255 = score(doc=1935,freq=2.0), product of:
              0.10078651 = queryWeight, product of:
                1.1572527 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.018899737 = queryNorm
              0.4072991 = fieldWeight in 1935, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.045574944 = weight(abstract_txt:words in 1935) [ClassicSimilarity], result of:
            0.045574944 = score(doc=1935,freq=1.0), product of:
              0.13615051 = queryWeight, product of:
                1.3450445 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.018899737 = queryNorm
              0.33473945 = fieldWeight in 1935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.03132486 = weight(abstract_txt:some in 1935) [ClassicSimilarity], result of:
            0.03132486 = score(doc=1935,freq=2.0), product of:
              0.096341856 = queryWeight, product of:
                1.3857348 = boost
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.018899737 = queryNorm
              0.32514277 = fieldWeight in 1935, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.049036674 = weight(abstract_txt:classification in 1935) [ClassicSimilarity], result of:
            0.049036674 = score(doc=1935,freq=3.0), product of:
              0.11346777 = queryWeight, product of:
                1.5038651 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.018899737 = queryNorm
              0.43216392 = fieldWeight in 1935, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.06670906 = weight(abstract_txt:efficiency in 1935) [ClassicSimilarity], result of:
            0.06670906 = score(doc=1935,freq=1.0), product of:
              0.17551936 = queryWeight, product of:
                1.5271775 = boost
                6.0810666 = idf(docFreq=275, maxDocs=44421)
                0.018899737 = queryNorm
              0.38006666 = fieldWeight in 1935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0810666 = idf(docFreq=275, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.041656498 = weight(abstract_txt:features in 1935) [ClassicSimilarity], result of:
            0.041656498 = score(doc=1935,freq=1.0), product of:
              0.14678694 = queryWeight, product of:
                1.7104734 = boost
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.018899737 = queryNorm
              0.28378886 = fieldWeight in 1935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.092148446 = weight(abstract_txt:selection in 1935) [ClassicSimilarity], result of:
            0.092148446 = score(doc=1935,freq=1.0), product of:
              0.27428612 = queryWeight, product of:
                2.6998765 = boost
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.018899737 = queryNorm
              0.33595738 = fieldWeight in 1935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
          0.18299834 = weight(abstract_txt:feature in 1935) [ClassicSimilarity], result of:
            0.18299834 = score(doc=1935,freq=1.0), product of:
              0.49606702 = queryWeight, product of:
                4.4469047 = boost
                5.9023747 = idf(docFreq=329, maxDocs=44421)
                0.018899737 = queryNorm
              0.36889842 = fieldWeight in 1935, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9023747 = idf(docFreq=329, maxDocs=44421)
                0.0625 = fieldNorm(doc=1935)
        0.4 = coord(10/25)
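
The score breakdowns above follow the classic Lucene TF-IDF scheme: each matching term contributes queryWeight × fieldWeight, the contributions are summed, and the sum is multiplied by the coordination factor. The sketch below recomputes the numbers for entry 5 (doc 1935) from the values printed in the listing. The function names are illustrative, not Lucene API names, and the idf formula `1 + ln(maxDocs / (docFreq + 1))` is the ClassicSimilarity definition, which agrees with the idf values shown. Lucene works in 32-bit floats, so results match only to about six significant digits.

```python
import math

def classic_idf(doc_freq, max_docs):
    # ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq):
    # ClassicSimilarity tf: square root of the term frequency
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm              # "queryWeight" in the listing
    field_weight = classic_tf(freq) * idf * field_norm   # "fieldWeight" in the listing
    return query_weight * field_weight

def document_score(term_scores, total_query_terms):
    # coord = matched query terms / all query terms
    coord = len(term_scores) / total_query_terms
    return sum(term_scores) * coord

# Term "text" in doc 1935, using the values printed in the listing.
text_score = term_score(freq=2.0, doc_freq=2122, max_docs=44421,
                        boost=1.0148128, query_norm=0.018899737,
                        field_norm=0.0625)

# The ten per-term scores listed for doc 1935, combined with coord(10/25).
scores_1935 = [0.027681466, 0.020796357, 0.041050255, 0.045574944,
               0.03132486, 0.049036674, 0.06670906, 0.041656498,
               0.092148446, 0.18299834]
total_1935 = document_score(scores_1935, total_query_terms=25)

print(text_score, total_1935)
```

The same two functions apply to the other four entries; only the per-term frequencies, the fieldNorm, and the number of matched terms change.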