Document (#35302)

Author
Dolamic, L.
Savoy, J.
Title
Indexing and searching strategies for the Russian language
Source
Journal of the American Society for Information Science and Technology. 60(2009) no.12, pp. 2540-2547
Year
2009
Abstract
This paper describes and evaluates various stemming and indexing strategies for the Russian language. We design and evaluate two stemming approaches, a light and a more aggressive one, and compare these stemmers to the Snowball stemmer, to no stemming, and also to a language-independent approach (n-gram). To evaluate the suggested stemming strategies we apply various probabilistic information retrieval (IR) models, including the Okapi, the Divergence from Randomness (DFR), a statistical language model (LM), as well as two vector-space approaches, namely, the classical tf-idf scheme and the dtu-dtn model. We find that the vector-space dtu-dtn and the DFR models tend to result in better retrieval effectiveness than the Okapi, LM, or tf-idf models, while only the latter two IR approaches result in statistically significant performance differences. Ignoring stemming generally reduces the MAP by more than 50%, and these differences are always significant. An n-gram approach usually yields lower performance than an approach involving stemming. Finally, our light stemmer tends to perform best, although performance differences between the light, aggressive, and Snowball stemmers are not statistically significant.
Theme
Automatic indexing

Similar documents (author)

  1. Savoy, J.: Stemming of French words based on grammatical categories (1993) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:savoy in 4649) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 4649, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=4649)
    
  2. Savoy, J.: Effectiveness of information retrieval systems used in a hypertext environment (1993) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:savoy in 6510) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 6510, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=6510)
    
  3. Savoy, J.: A learning scheme for information retrieval in hypertext (1994) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:savoy in 7291) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 7291, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=7291)
    
  4. Savoy, J.: Bayesian inference networks and spreading activation in hypertext systems (1992) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:savoy in 260) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 260, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=260)
    
  5. Savoy, J.: Searching information in legal hypertext systems (1993/94) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:savoy in 825) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 825, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=825)
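
The author scores above each break down into the same three factors: tf, idf, and fieldNorm. The following is a minimal Python sketch of that arithmetic, assuming the standard ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are hypothetical, and fieldNorm is taken as given from the explain output rather than recomputed, since it is a stored, quantized length norm.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as listed in the explain output."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:savoy entries above.
author_score = field_weight(freq=1.0, doc_freq=28, max_docs=44421, field_norm=0.625)
print(author_score)
```

The result should agree with the listed 5.2088575 to within rounding; small differences in the last digits are expected because the explain output is computed with 32-bit floats.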
    

Similar documents (content)

  1. Savoy, J.: Searching strategies for the Hungarian language (2008) 1.19
    1.1870561 = sum of:
      1.1870561 = product of:
        1.8547752 = sum of:
          0.042276293 = weight(abstract_txt:space in 3037) [ClassicSimilarity], result of:
            0.042276293 = score(doc=3037,freq=1.0), product of:
              0.100333676 = queryWeight, product of:
                1.2292353 = boost
                5.393369 = idf(docFreq=548, maxDocs=44421)
                0.015133924 = queryNorm
              0.42135698 = fieldWeight in 3037, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.393369 = idf(docFreq=548, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
          0.036659077 = weight(abstract_txt:approach in 3037) [ClassicSimilarity], result of:
            0.036659077 = score(doc=3037,freq=3.0), product of:
              0.07241465 = queryWeight, product of:
                1.2789999 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.015133924 = queryNorm
              0.5062384 = fieldWeight in 3037, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
          0.023843685 = weight(abstract_txt:than in 3037) [ClassicSimilarity], result of:
            0.023843685 = score(doc=3037,freq=1.0), product of:
              0.07840218 = queryWeight, product of:
                1.3308262 = boost
                3.8927383 = idf(docFreq=2461, maxDocs=44421)
                0.015133924 = queryNorm
              0.30412018 = fieldWeight in 3037, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8927383 = idf(docFreq=2461, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
          0.074678585 = weight(abstract_txt:vector in 3037) [ClassicSimilarity], result of:
            0.074678585 = score(doc=3037,freq=1.0), product of:
              0.14661537 = queryWeight, product of:
                1.4859405 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.015133924 = queryNorm
              0.5093503 = fieldWeight in 3037, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
          0.085430816 = weight(abstract_txt:statistically in 3037) [ClassicSimilarity], result of:
            0.085430816 = score(doc=3037,freq=1.0), product of:
              0.16037075 = queryWeight, product of:
                1.5540831 = boost
                6.8186655 = idf(docFreq=131, maxDocs=44421)
                0.015133924 = queryNorm
              0.5327082 = fieldWeight in 3037, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8186655 = idf(docFreq=131, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
          0.039853428 = weight(abstract_txt:performance in 3037) [ClassicSimilarity], result of:
            0.039853428 = score(doc=3037,freq=1.0), product of:
              0.11042218 = queryWeight, product of:
                1.5793757 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.015133924 = queryNorm
              0.36091867 = fieldWeight in 3037, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
          0.039940633 = weight(abstract_txt:models in 3037) [ClassicSimilarity], result of:
            0.039940633 = score(doc=3037,freq=1.0), product of:
              0.1105832 = queryWeight, product of:
                1.5805268 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.015133924 = queryNorm
              0.36118174 = fieldWeight in 3037, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
          0.043495797 = weight(abstract_txt:significant in 3037) [ClassicSimilarity], result of:
            0.043495797 = score(doc=3037,freq=1.0), product of:
              0.11705161 = queryWeight, product of:
                1.6260953 = boost
                4.7564163 = idf(docFreq=1037, maxDocs=44421)
                0.015133924 = queryNorm
              0.37159503 = fieldWeight in 3037, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7564163 = idf(docFreq=1037, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
          0.07645113 = weight(abstract_txt:strategies in 3037) [ClassicSimilarity], result of:
            0.07645113 = score(doc=3037,freq=2.0), product of:
              0.13530852 = queryWeight, product of:
                1.7483157 = boost
                5.113918 = idf(docFreq=725, maxDocs=44421)
                0.015133924 = queryNorm
              0.5650134 = fieldWeight in 3037, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.113918 = idf(docFreq=725, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
          0.1386229 = weight(abstract_txt:okapi in 3037) [ClassicSimilarity], result of:
            0.1386229 = score(doc=3037,freq=1.0), product of:
              0.22144817 = queryWeight, product of:
                1.8261974 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.015133924 = queryNorm
              0.6259835 = fieldWeight in 3037, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
          0.067736655 = weight(abstract_txt:language in 3037) [ClassicSimilarity], result of:
            0.067736655 = score(doc=3037,freq=3.0), product of:
              0.12001463 = queryWeight, product of:
                1.9012698 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.015133924 = queryNorm
              0.5644033 = fieldWeight in 3037, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
          0.11864439 = weight(abstract_txt:light in 3037) [ClassicSimilarity], result of:
            0.11864439 = score(doc=3037,freq=2.0), product of:
              0.1813708 = queryWeight, product of:
                2.024142 = boost
                5.920724 = idf(docFreq=323, maxDocs=44421)
                0.015133924 = queryNorm
              0.65415376 = fieldWeight in 3037, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.920724 = idf(docFreq=323, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
          0.21097122 = weight(abstract_txt:stemmer in 3037) [ClassicSimilarity], result of:
            0.21097122 = score(doc=3037,freq=1.0), product of:
              0.29299775 = queryWeight, product of:
                2.1006021 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.015133924 = queryNorm
              0.72004384 = fieldWeight in 3037, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
          0.21097122 = weight(abstract_txt:aggressive in 3037) [ClassicSimilarity], result of:
            0.21097122 = score(doc=3037,freq=1.0), product of:
              0.29299775 = queryWeight, product of:
                2.1006021 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.015133924 = queryNorm
              0.72004384 = fieldWeight in 3037, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
          0.065521404 = weight(abstract_txt:differences in 3037) [ClassicSimilarity], result of:
            0.065521404 = score(doc=3037,freq=1.0), product of:
              0.16929635 = queryWeight, product of:
                2.258138 = boost
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.015133924 = queryNorm
              0.38702196 = fieldWeight in 3037, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
          0.579678 = weight(abstract_txt:stemming in 3037) [ClassicSimilarity], result of:
            0.579678 = score(doc=3037,freq=3.0), product of:
              0.57478666 = queryWeight, product of:
                5.0959554 = boost
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.015133924 = queryNorm
              1.0085099 = fieldWeight in 3037, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.078125 = fieldNorm(doc=3037)
        0.64 = coord(16/25)
    
  2. Fautsch, C.; Savoy, J.: Algorithmic stemmers or morphological analysis? : an evaluation (2009) 0.52
    0.5229036 = sum of:
      0.5229036 = product of:
        1.3072591 = sum of:
          0.039423257 = weight(abstract_txt:various in 3950) [ClassicSimilarity], result of:
            0.039423257 = score(doc=3950,freq=3.0), product of:
              0.06640132 = queryWeight, product of:
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.015133924 = queryNorm
              0.593712 = fieldWeight in 3950, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.078125 = fieldNorm(doc=3950)
          0.039296675 = weight(abstract_txt:approaches in 3950) [ClassicSimilarity], result of:
            0.039296675 = score(doc=3950,freq=1.0), product of:
              0.10939138 = queryWeight, product of:
                1.5719866 = boost
                4.5981455 = idf(docFreq=1215, maxDocs=44421)
                0.015133924 = queryNorm
              0.3592301 = fieldWeight in 3950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5981455 = idf(docFreq=1215, maxDocs=44421)
                0.078125 = fieldNorm(doc=3950)
          0.056361258 = weight(abstract_txt:performance in 3950) [ClassicSimilarity], result of:
            0.056361258 = score(doc=3950,freq=2.0), product of:
              0.11042218 = queryWeight, product of:
                1.5793757 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.015133924 = queryNorm
              0.5104161 = fieldWeight in 3950, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.078125 = fieldNorm(doc=3950)
          0.039940633 = weight(abstract_txt:models in 3950) [ClassicSimilarity], result of:
            0.039940633 = score(doc=3950,freq=1.0), product of:
              0.1105832 = queryWeight, product of:
                1.5805268 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.015133924 = queryNorm
              0.36118174 = fieldWeight in 3950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.078125 = fieldNorm(doc=3950)
          0.043495797 = weight(abstract_txt:significant in 3950) [ClassicSimilarity], result of:
            0.043495797 = score(doc=3950,freq=1.0), product of:
              0.11705161 = queryWeight, product of:
                1.6260953 = boost
                4.7564163 = idf(docFreq=1037, maxDocs=44421)
                0.015133924 = queryNorm
              0.37159503 = fieldWeight in 3950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7564163 = idf(docFreq=1037, maxDocs=44421)
                0.078125 = fieldNorm(doc=3950)
          0.055306748 = weight(abstract_txt:language in 3950) [ClassicSimilarity], result of:
            0.055306748 = score(doc=3950,freq=2.0), product of:
              0.12001463 = queryWeight, product of:
                1.9012698 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.015133924 = queryNorm
              0.46083337 = fieldWeight in 3950, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=3950)
          0.28363684 = weight(abstract_txt:stemmers in 3950) [ClassicSimilarity], result of:
            0.28363684 = score(doc=3950,freq=2.0), product of:
              0.28327867 = queryWeight, product of:
                2.0654685 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.015133924 = queryNorm
              1.0012643 = fieldWeight in 3950, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.078125 = fieldNorm(doc=3950)
          0.21097122 = weight(abstract_txt:stemmer in 3950) [ClassicSimilarity], result of:
            0.21097122 = score(doc=3950,freq=1.0), product of:
              0.29299775 = queryWeight, product of:
                2.1006021 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.015133924 = queryNorm
              0.72004384 = fieldWeight in 3950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.078125 = fieldNorm(doc=3950)
          0.065521404 = weight(abstract_txt:differences in 3950) [ClassicSimilarity], result of:
            0.065521404 = score(doc=3950,freq=1.0), product of:
              0.16929635 = queryWeight, product of:
                2.258138 = boost
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.015133924 = queryNorm
              0.38702196 = fieldWeight in 3950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.078125 = fieldNorm(doc=3950)
          0.4733051 = weight(abstract_txt:stemming in 3950) [ClassicSimilarity], result of:
            0.4733051 = score(doc=3950,freq=2.0), product of:
              0.57478666 = queryWeight, product of:
                5.0959554 = boost
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.015133924 = queryNorm
              0.82344484 = fieldWeight in 3950, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.078125 = fieldNorm(doc=3950)
        0.4 = coord(10/25)
    
  3. Dolamic, L.; Savoy, J.: When stopword lists make the difference (2009) 0.37
    0.370075 = sum of:
      0.370075 = product of:
        0.7709896 = sum of:
          0.108501494 = weight(abstract_txt:randomness in 306) [ClassicSimilarity], result of:
            0.108501494 = score(doc=306,freq=1.0), product of:
              0.14927804 = queryWeight, product of:
                1.0602167 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.015133924 = queryNorm
              0.7268416 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
          0.035637103 = weight(abstract_txt:result in 306) [ClassicSimilarity], result of:
            0.035637103 = score(doc=306,freq=1.0), product of:
              0.08953312 = queryWeight, product of:
                1.1611906 = boost
                5.0948176 = idf(docFreq=739, maxDocs=44421)
                0.015133924 = queryNorm
              0.39803264 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0948176 = idf(docFreq=739, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
          0.040489115 = weight(abstract_txt:evaluate in 306) [ClassicSimilarity], result of:
            0.040489115 = score(doc=306,freq=1.0), product of:
              0.09748571 = queryWeight, product of:
                1.2116638 = boost
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.015133924 = queryNorm
              0.41533384 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
          0.021165127 = weight(abstract_txt:approach in 306) [ClassicSimilarity], result of:
            0.021165127 = score(doc=306,freq=1.0), product of:
              0.07241465 = queryWeight, product of:
                1.2789999 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.015133924 = queryNorm
              0.29227686 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
          0.085430816 = weight(abstract_txt:statistically in 306) [ClassicSimilarity], result of:
            0.085430816 = score(doc=306,freq=1.0), product of:
              0.16037075 = queryWeight, product of:
                1.5540831 = boost
                6.8186655 = idf(docFreq=131, maxDocs=44421)
                0.015133924 = queryNorm
              0.5327082 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8186655 = idf(docFreq=131, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
          0.039296675 = weight(abstract_txt:approaches in 306) [ClassicSimilarity], result of:
            0.039296675 = score(doc=306,freq=1.0), product of:
              0.10939138 = queryWeight, product of:
                1.5719866 = boost
                4.5981455 = idf(docFreq=1215, maxDocs=44421)
                0.015133924 = queryNorm
              0.3592301 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5981455 = idf(docFreq=1215, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
          0.056361258 = weight(abstract_txt:performance in 306) [ClassicSimilarity], result of:
            0.056361258 = score(doc=306,freq=2.0), product of:
              0.11042218 = queryWeight, product of:
                1.5793757 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.015133924 = queryNorm
              0.5104161 = fieldWeight in 306, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
          0.039940633 = weight(abstract_txt:models in 306) [ClassicSimilarity], result of:
            0.039940633 = score(doc=306,freq=1.0), product of:
              0.1105832 = queryWeight, product of:
                1.5805268 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.015133924 = queryNorm
              0.36118174 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
          0.043495797 = weight(abstract_txt:significant in 306) [ClassicSimilarity], result of:
            0.043495797 = score(doc=306,freq=1.0), product of:
              0.11705161 = queryWeight, product of:
                1.6260953 = boost
                4.7564163 = idf(docFreq=1037, maxDocs=44421)
                0.015133924 = queryNorm
              0.37159503 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7564163 = idf(docFreq=1037, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
          0.19604239 = weight(abstract_txt:okapi in 306) [ClassicSimilarity], result of:
            0.19604239 = score(doc=306,freq=2.0), product of:
              0.22144817 = queryWeight, product of:
                1.8261974 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.015133924 = queryNorm
              0.88527435 = fieldWeight in 306, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
          0.039107777 = weight(abstract_txt:language in 306) [ClassicSimilarity], result of:
            0.039107777 = score(doc=306,freq=1.0), product of:
              0.12001463 = queryWeight, product of:
                1.9012698 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.015133924 = queryNorm
              0.3258584 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
          0.065521404 = weight(abstract_txt:differences in 306) [ClassicSimilarity], result of:
            0.065521404 = score(doc=306,freq=1.0), product of:
              0.16929635 = queryWeight, product of:
                2.258138 = boost
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.015133924 = queryNorm
              0.38702196 = fieldWeight in 306, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.078125 = fieldNorm(doc=306)
        0.48 = coord(12/25)
    
  4. Fox, B.; Fox, C.J.: Efficient stemmer generation (2002) 0.34
    0.3436663 = sum of:
      0.3436663 = product of:
        1.7183315 = sum of:
          0.047208082 = weight(abstract_txt:than in 3585) [ClassicSimilarity], result of:
            0.047208082 = score(doc=3585,freq=2.0), product of:
              0.07840218 = queryWeight, product of:
                1.3308262 = boost
                3.8927383 = idf(docFreq=2461, maxDocs=44421)
                0.015133924 = queryNorm
              0.6021272 = fieldWeight in 3585, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8927383 = idf(docFreq=2461, maxDocs=44421)
                0.109375 = fieldNorm(doc=3585)
          0.055794798 = weight(abstract_txt:performance in 3585) [ClassicSimilarity], result of:
            0.055794798 = score(doc=3585,freq=1.0), product of:
              0.11042218 = queryWeight, product of:
                1.5793757 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.015133924 = queryNorm
              0.50528616 = fieldWeight in 3585, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.109375 = fieldNorm(doc=3585)
          0.48633587 = weight(abstract_txt:stemmers in 3585) [ClassicSimilarity], result of:
            0.48633587 = score(doc=3585,freq=3.0), product of:
              0.28327867 = queryWeight, product of:
                2.0654685 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.015133924 = queryNorm
              1.7168107 = fieldWeight in 3585, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.109375 = fieldNorm(doc=3585)
          0.66044444 = weight(abstract_txt:stemmer in 3585) [ClassicSimilarity], result of:
            0.66044444 = score(doc=3585,freq=5.0), product of:
              0.29299775 = queryWeight, product of:
                2.1006021 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.015133924 = queryNorm
              2.254094 = fieldWeight in 3585, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.109375 = fieldNorm(doc=3585)
          0.46854818 = weight(abstract_txt:stemming in 3585) [ClassicSimilarity], result of:
            0.46854818 = score(doc=3585,freq=1.0), product of:
              0.57478666 = queryWeight, product of:
                5.0959554 = boost
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.015133924 = queryNorm
              0.81516886 = fieldWeight in 3585, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.109375 = fieldNorm(doc=3585)
        0.2 = coord(5/25)
    
  5. Kettunen, K.; Kunttu, T.; Järvelin, K.: To stem or lemmatize a highly inflectional language in a probabilistic IR environment? (2005) 0.28
    0.28455573 = sum of:
      0.28455573 = product of:
        0.7904326 = sum of:
          0.014815589 = weight(abstract_txt:approach in 5395) [ClassicSimilarity], result of:
            0.014815589 = score(doc=5395,freq=1.0), product of:
              0.07241465 = queryWeight, product of:
                1.2789999 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.015133924 = queryNorm
              0.20459381 = fieldWeight in 5395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5395)
          0.016690578 = weight(abstract_txt:than in 5395) [ClassicSimilarity], result of:
            0.016690578 = score(doc=5395,freq=1.0), product of:
              0.07840218 = queryWeight, product of:
                1.3308262 = boost
                3.8927383 = idf(docFreq=2461, maxDocs=44421)
                0.015133924 = queryNorm
              0.21288413 = fieldWeight in 5395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8927383 = idf(docFreq=2461, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5395)
          0.059801575 = weight(abstract_txt:statistically in 5395) [ClassicSimilarity], result of:
            0.059801575 = score(doc=5395,freq=1.0), product of:
              0.16037075 = queryWeight, product of:
                1.5540831 = boost
                6.8186655 = idf(docFreq=131, maxDocs=44421)
                0.015133924 = queryNorm
              0.37289578 = fieldWeight in 5395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8186655 = idf(docFreq=131, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5395)
          0.027897399 = weight(abstract_txt:performance in 5395) [ClassicSimilarity], result of:
            0.027897399 = score(doc=5395,freq=1.0), product of:
              0.11042218 = queryWeight, product of:
                1.5793757 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.015133924 = queryNorm
              0.25264308 = fieldWeight in 5395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5395)
          0.030447057 = weight(abstract_txt:significant in 5395) [ClassicSimilarity], result of:
            0.030447057 = score(doc=5395,freq=1.0), product of:
              0.11705161 = queryWeight, product of:
                1.6260953 = boost
                4.7564163 = idf(docFreq=1037, maxDocs=44421)
                0.015133924 = queryNorm
              0.26011652 = fieldWeight in 5395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7564163 = idf(docFreq=1037, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5395)
          0.054750886 = weight(abstract_txt:language in 5395) [ClassicSimilarity], result of:
            0.054750886 = score(doc=5395,freq=4.0), product of:
              0.12001463 = queryWeight, product of:
                1.9012698 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.015133924 = queryNorm
              0.45620176 = fieldWeight in 5395, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5395)
          0.20885085 = weight(abstract_txt:stemmer in 5395) [ClassicSimilarity], result of:
            0.20885085 = score(doc=5395,freq=2.0), product of:
              0.29299775 = queryWeight, product of:
                2.1006021 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.015133924 = queryNorm
              0.712807 = fieldWeight in 5395, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5395)
          0.045864988 = weight(abstract_txt:differences in 5395) [ClassicSimilarity], result of:
            0.045864988 = score(doc=5395,freq=1.0), product of:
              0.16929635 = queryWeight, product of:
                2.258138 = boost
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.015133924 = queryNorm
              0.2709154 = fieldWeight in 5395, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5395)
          0.3313136 = weight(abstract_txt:stemming in 5395) [ClassicSimilarity], result of:
            0.3313136 = score(doc=5395,freq=2.0), product of:
              0.57478666 = queryWeight, product of:
                5.0959554 = boost
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.015133924 = queryNorm
              0.5764114 = fieldWeight in 5395, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5395)
        0.36 = coord(9/25)
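
The content scores above combine a per-term product of queryWeight and fieldWeight with a final coord factor. The sketch below reproduces that arithmetic in Python, assuming the ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are hypothetical; boost, queryNorm, and fieldNorm are copied from the explain output rather than derived.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def term_score(freq: float, doc_freq: int, max_docs: int,
               field_norm: float, boost: float, query_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm          # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


def document_score(term_scores: list[float], matched: int, total: int) -> float:
    """Final score: sum of term scores scaled by coord = matched / total."""
    return sum(term_scores) * (matched / total)


# "stemming" in document 3037 (result 1 above).
stemming_3037 = term_score(freq=3.0, doc_freq=69, max_docs=44421,
                           field_norm=0.078125, boost=5.0959554,
                           query_norm=0.015133924)

# Result 4 (doc 3585): the five term scores listed above, with coord(5/25).
fox_terms = [0.047208082, 0.055794798, 0.48633587, 0.66044444, 0.46854818]
fox_score = document_score(fox_terms, matched=5, total=25)

print(stemming_3037, fox_score)
```

Because coord scales the sum by the fraction of query terms matched, result 4 ranks below result 1 even though its individual stemmer and stemmers weights are higher: it matches only 5 of the 25 query terms.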