Document (#38122)

Author
Manning, C.D.
Title
Part-of-Speech Tagging from 97% to 100% : is it time for some linguistics?
Source
Computational Linguistics and Intelligent Text Processing, 12th International Conference, CICLing 2011, Proceedings, Part I. Ed.: Alexander Gelbukh
Imprint
Berlin : Springer
Year
2011
Pages
S.171-189
Series
Lecture notes in computer science; 6608
Abstract
I examine what would be necessary to move part-of-speech tagging performance from its current level of about 97.3% token accuracy (56% sentence accuracy) to close to 100% accuracy. I suggest that it must still be possible to greatly increase tagging performance and examine some useful improvements that have recently been made to the Stanford Part-of-Speech Tagger. However, an error analysis of some of the remaining errors suggests that there is limited further mileage to be had either from better machine learning or better features in a discriminative sequence classifier. The prospects for further gains from semisupervised learning also seem quite limited. Rather, I suggest and begin to demonstrate that the largest opportunity for further progress comes from improving the taxonomic basis of the linguistic resources from which taggers are trained. That is, from improved descriptive linguistics. However, I conclude by suggesting that there are also limits to this process. The status of some words may not be able to be adequately captured by assigning them to one of a small number of categories. While conventions can be used in such cases to improve tagging consistency, they lack a strong linguistic basis.
Content
Vgl.: http://nlp.stanford.edu/~manning/papers/CICLing2011-manning-tagging.pdf.
Theme
Computerlinguistik

Similar documents (author)

  1. Manning, R.W.: ¬The Anglo-American Cataloguing Rules and their future (1999) 5.76
    5.7603507 = sum of:
      5.7603507 = weight(author_txt:manning in 809) [ClassicSimilarity], result of:
        5.7603507 = fieldWeight in 809, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.625 = fieldNorm(doc=809)
    
  2. Manning, R.W.: ¬The Anglo American Cataloguing Rules and their future (2000) 5.76
    5.7603507 = sum of:
      5.7603507 = weight(author_txt:manning in 314) [ClassicSimilarity], result of:
        5.7603507 = fieldWeight in 314, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.625 = fieldNorm(doc=314)
    
  3. Mallett, J.; Manning, C.: Multimedia and database design : a discussion of database technology and its use in multimedia (1993) 4.61
    4.6082807 = sum of:
      4.6082807 = weight(author_txt:manning in 6276) [ClassicSimilarity], result of:
        4.6082807 = fieldWeight in 6276, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.5 = fieldNorm(doc=6276)
    
  4. Toutanova, K.; Manning, C.D.: Enriching the knowledge sources used in a maximum entropy Part-of-Speech Tagger (2000) 4.61
    4.6082807 = sum of:
      4.6082807 = weight(author_txt:manning in 2060) [ClassicSimilarity], result of:
        4.6082807 = fieldWeight in 2060, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.5 = fieldNorm(doc=2060)
    
  5. Manning, C.D.; Schütze, H.: Foundations of statistical natural language processing (2000) 4.61
    4.6082807 = sum of:
      4.6082807 = weight(author_txt:manning in 2603) [ClassicSimilarity], result of:
        4.6082807 = fieldWeight in 2603, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.5 = fieldNorm(doc=2603)
    

Similar documents (content)

  1. L'Homme, D.; L'Homme, M.-C.; Lemay, C.: Benchmarking the performance of two Part-of-Speech (POS) taggers for terminological purposes (2002) 0.50
    0.50291365 = sum of:
      0.50291365 = product of:
        1.257284 = sum of:
          0.031449523 = weight(abstract_txt:however in 2855) [ClassicSimilarity], result of:
            0.031449523 = score(doc=2855,freq=1.0), product of:
              0.09576168 = queryWeight, product of:
                1.052519 = boost
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.021643601 = queryNorm
              0.3284145 = fieldWeight in 2855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.078125 = fieldNorm(doc=2855)
          0.3123119 = weight(abstract_txt:taggers in 2855) [ClassicSimilarity], result of:
            0.3123119 = score(doc=2855,freq=5.0), product of:
              0.2053563 = queryWeight, product of:
                1.0898659 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.021643601 = queryNorm
              1.5208293 = fieldWeight in 2855, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.078125 = fieldNorm(doc=2855)
          0.19752337 = weight(abstract_txt:tagger in 2855) [ClassicSimilarity], result of:
            0.19752337 = score(doc=2855,freq=2.0), product of:
              0.2053563 = queryWeight, product of:
                1.0898659 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.021643601 = queryNorm
              0.9618569 = fieldWeight in 2855, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.078125 = fieldNorm(doc=2855)
          0.04174218 = weight(abstract_txt:performance in 2855) [ClassicSimilarity], result of:
            0.04174218 = score(doc=2855,freq=1.0), product of:
              0.11565536 = queryWeight, product of:
                1.1566899 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.021643601 = queryNorm
              0.36091867 = fieldWeight in 2855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.078125 = fieldNorm(doc=2855)
          0.061051443 = weight(abstract_txt:part in 2855) [ClassicSimilarity], result of:
            0.061051443 = score(doc=2855,freq=1.0), product of:
              0.170586 = queryWeight, product of:
                1.7204869 = boost
                4.581023 = idf(docFreq=1236, maxDocs=44421)
                0.021643601 = queryNorm
              0.35789245 = fieldWeight in 2855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.581023 = idf(docFreq=1236, maxDocs=44421)
                0.078125 = fieldNorm(doc=2855)
          0.016799493 = weight(abstract_txt:that in 2855) [ClassicSimilarity], result of:
            0.016799493 = score(doc=2855,freq=1.0), product of:
              0.09092575 = queryWeight, product of:
                1.7763892 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.021643601 = queryNorm
              0.18476056 = fieldWeight in 2855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=2855)
          0.13412559 = weight(abstract_txt:accuracy in 2855) [ClassicSimilarity], result of:
            0.13412559 = score(doc=2855,freq=1.0), product of:
              0.288284 = queryWeight, product of:
                2.2366083 = boost
                5.9552646 = idf(docFreq=312, maxDocs=44421)
                0.021643601 = queryNorm
              0.46525505 = fieldWeight in 2855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9552646 = idf(docFreq=312, maxDocs=44421)
                0.078125 = fieldNorm(doc=2855)
          0.04402949 = weight(abstract_txt:from in 2855) [ClassicSimilarity], result of:
            0.04402949 = score(doc=2855,freq=2.0), product of:
              0.14441894 = queryWeight, product of:
                2.4181328 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.021643601 = queryNorm
              0.30487338 = fieldWeight in 2855, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.078125 = fieldNorm(doc=2855)
          0.20619428 = weight(abstract_txt:speech in 2855) [ClassicSimilarity], result of:
            0.20619428 = score(doc=2855,freq=1.0), product of:
              0.38399938 = queryWeight, product of:
                2.5813384 = boost
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.021643601 = queryNorm
              0.53696513 = fieldWeight in 2855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.078125 = fieldNorm(doc=2855)
          0.21205671 = weight(abstract_txt:tagging in 2855) [ClassicSimilarity], result of:
            0.21205671 = score(doc=2855,freq=1.0), product of:
              0.4306195 = queryWeight, product of:
                3.156428 = boost
                6.3033047 = idf(docFreq=220, maxDocs=44421)
                0.021643601 = queryNorm
              0.49244568 = fieldWeight in 2855, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3033047 = idf(docFreq=220, maxDocs=44421)
                0.078125 = fieldNorm(doc=2855)
        0.4 = coord(10/25)
    
  2. Toutanova, K.; Manning, C.D.: Enriching the knowledge sources used in a maximum entropy Part-of-Speech Tagger (2000) 0.30
    0.2969659 = sum of:
      0.2969659 = product of:
        1.0605925 = sum of:
          0.23702806 = weight(abstract_txt:tagger in 2060) [ClassicSimilarity], result of:
            0.23702806 = score(doc=2060,freq=2.0), product of:
              0.2053563 = queryWeight, product of:
                1.0898659 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.021643601 = queryNorm
              1.1542283 = fieldWeight in 2060, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.09375 = fieldNorm(doc=2060)
          0.05009062 = weight(abstract_txt:performance in 2060) [ClassicSimilarity], result of:
            0.05009062 = score(doc=2060,freq=1.0), product of:
              0.11565536 = queryWeight, product of:
                1.1566899 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.021643601 = queryNorm
              0.43310243 = fieldWeight in 2060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.09375 = fieldNorm(doc=2060)
          0.07326173 = weight(abstract_txt:part in 2060) [ClassicSimilarity], result of:
            0.07326173 = score(doc=2060,freq=1.0), product of:
              0.170586 = queryWeight, product of:
                1.7204869 = boost
                4.581023 = idf(docFreq=1236, maxDocs=44421)
                0.021643601 = queryNorm
              0.42947093 = fieldWeight in 2060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.581023 = idf(docFreq=1236, maxDocs=44421)
                0.09375 = fieldNorm(doc=2060)
          0.1609507 = weight(abstract_txt:accuracy in 2060) [ClassicSimilarity], result of:
            0.1609507 = score(doc=2060,freq=1.0), product of:
              0.288284 = queryWeight, product of:
                2.2366083 = boost
                5.9552646 = idf(docFreq=312, maxDocs=44421)
                0.021643601 = queryNorm
              0.55830604 = fieldWeight in 2060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9552646 = idf(docFreq=312, maxDocs=44421)
                0.09375 = fieldNorm(doc=2060)
          0.037360262 = weight(abstract_txt:from in 2060) [ClassicSimilarity], result of:
            0.037360262 = score(doc=2060,freq=1.0), product of:
              0.14441894 = queryWeight, product of:
                2.4181328 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.021643601 = queryNorm
              0.25869364 = fieldWeight in 2060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.09375 = fieldNorm(doc=2060)
          0.24743313 = weight(abstract_txt:speech in 2060) [ClassicSimilarity], result of:
            0.24743313 = score(doc=2060,freq=1.0), product of:
              0.38399938 = queryWeight, product of:
                2.5813384 = boost
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.021643601 = queryNorm
              0.64435816 = fieldWeight in 2060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.09375 = fieldNorm(doc=2060)
          0.25446805 = weight(abstract_txt:tagging in 2060) [ClassicSimilarity], result of:
            0.25446805 = score(doc=2060,freq=1.0), product of:
              0.4306195 = queryWeight, product of:
                3.156428 = boost
                6.3033047 = idf(docFreq=220, maxDocs=44421)
                0.021643601 = queryNorm
              0.5909348 = fieldWeight in 2060, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3033047 = idf(docFreq=220, maxDocs=44421)
                0.09375 = fieldNorm(doc=2060)
        0.28 = coord(7/25)
    
  3. Xu, C.; Ma, B.; Chen, X.; Ma, F.: Social tagging in the scholarly world (2013) 0.28
    0.27875996 = sum of:
      0.27875996 = product of:
        0.99557126 = sum of:
          0.023242246 = weight(abstract_txt:there in 2091) [ClassicSimilarity], result of:
            0.023242246 = score(doc=2091,freq=1.0), product of:
              0.090832464 = queryWeight, product of:
                1.0250726 = boost
                4.094086 = idf(docFreq=2012, maxDocs=44421)
                0.021643601 = queryNorm
              0.2558804 = fieldWeight in 2091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.094086 = idf(docFreq=2012, maxDocs=44421)
                0.0625 = fieldNorm(doc=2091)
          0.1117361 = weight(abstract_txt:taggers in 2091) [ClassicSimilarity], result of:
            0.1117361 = score(doc=2091,freq=1.0), product of:
              0.2053563 = queryWeight, product of:
                1.0898659 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.021643601 = queryNorm
              0.54410845 = fieldWeight in 2091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.0625 = fieldNorm(doc=2091)
          0.047341224 = weight(abstract_txt:suggest in 2091) [ClassicSimilarity], result of:
            0.047341224 = score(doc=2091,freq=1.0), product of:
              0.14595379 = queryWeight, product of:
                1.2993966 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.021643601 = queryNorm
              0.32435763 = fieldWeight in 2091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.0625 = fieldNorm(doc=2091)
          0.055284202 = weight(abstract_txt:limited in 2091) [ClassicSimilarity], result of:
            0.055284202 = score(doc=2091,freq=1.0), product of:
              0.16185386 = queryWeight, product of:
                1.3683449 = boost
                5.465098 = idf(docFreq=510, maxDocs=44421)
                0.021643601 = queryNorm
              0.34156862 = fieldWeight in 2091, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.465098 = idf(docFreq=510, maxDocs=44421)
                0.0625 = fieldNorm(doc=2091)
          0.02327806 = weight(abstract_txt:that in 2091) [ClassicSimilarity], result of:
            0.02327806 = score(doc=2091,freq=3.0), product of:
              0.09092575 = queryWeight, product of:
                1.7763892 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.021643601 = queryNorm
              0.25601172 = fieldWeight in 2091, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2091)
          0.035223592 = weight(abstract_txt:from in 2091) [ClassicSimilarity], result of:
            0.035223592 = score(doc=2091,freq=2.0), product of:
              0.14441894 = queryWeight, product of:
                2.4181328 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.021643601 = queryNorm
              0.2438987 = fieldWeight in 2091, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=2091)
          0.6994658 = weight(abstract_txt:tagging in 2091) [ClassicSimilarity], result of:
            0.6994658 = score(doc=2091,freq=17.0), product of:
              0.4306195 = queryWeight, product of:
                3.156428 = boost
                6.3033047 = idf(docFreq=220, maxDocs=44421)
                0.021643601 = queryNorm
              1.6243244 = fieldWeight in 2091, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                6.3033047 = idf(docFreq=220, maxDocs=44421)
                0.0625 = fieldNorm(doc=2091)
        0.28 = coord(7/25)
    
  4. Heckner, M.; Mühlbacher, S.; Wolff, C.: Tagging tagging : a classification model for user keywords in scientific bibliography management systems (2007) 0.26
    0.257394 = sum of:
      0.257394 = product of:
        0.7149832 = sum of:
          0.018869715 = weight(abstract_txt:however in 1533) [ClassicSimilarity], result of:
            0.018869715 = score(doc=1533,freq=1.0), product of:
              0.09576168 = queryWeight, product of:
                1.052519 = boost
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.021643601 = queryNorm
              0.19704871 = fieldWeight in 1533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.046875 = fieldNorm(doc=1533)
          0.083802074 = weight(abstract_txt:tagger in 1533) [ClassicSimilarity], result of:
            0.083802074 = score(doc=1533,freq=1.0), product of:
              0.2053563 = queryWeight, product of:
                1.0898659 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.021643601 = queryNorm
              0.40808135 = fieldWeight in 1533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.046875 = fieldNorm(doc=1533)
          0.026074154 = weight(abstract_txt:basis in 1533) [ClassicSimilarity], result of:
            0.026074154 = score(doc=1533,freq=1.0), product of:
              0.11880144 = queryWeight, product of:
                1.1723166 = boost
                4.682171 = idf(docFreq=1117, maxDocs=44421)
                0.021643601 = queryNorm
              0.21947676 = fieldWeight in 1533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.682171 = idf(docFreq=1117, maxDocs=44421)
                0.046875 = fieldNorm(doc=1533)
          0.049885854 = weight(abstract_txt:linguistic in 1533) [ClassicSimilarity], result of:
            0.049885854 = score(doc=1533,freq=1.0), product of:
              0.18309078 = queryWeight, product of:
                1.4553494 = boost
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.021643601 = queryNorm
              0.27246514 = fieldWeight in 1533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.046875 = fieldNorm(doc=1533)
          0.074541286 = weight(abstract_txt:linguistics in 1533) [ClassicSimilarity], result of:
            0.074541286 = score(doc=1533,freq=1.0), product of:
              0.23930188 = queryWeight, product of:
                1.6638229 = boost
                6.6452217 = idf(docFreq=156, maxDocs=44421)
                0.021643601 = queryNorm
              0.31149477 = fieldWeight in 1533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6452217 = idf(docFreq=156, maxDocs=44421)
                0.046875 = fieldNorm(doc=1533)
          0.017458545 = weight(abstract_txt:that in 1533) [ClassicSimilarity], result of:
            0.017458545 = score(doc=1533,freq=3.0), product of:
              0.09092575 = queryWeight, product of:
                1.7763892 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.021643601 = queryNorm
              0.1920088 = fieldWeight in 1533, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.046875 = fieldNorm(doc=1533)
          0.02528924 = weight(abstract_txt:some in 1533) [ClassicSimilarity], result of:
            0.02528924 = score(doc=1533,freq=1.0), product of:
              0.14666125 = queryWeight, product of:
                1.8420726 = boost
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.021643601 = queryNorm
              0.172433 = fieldWeight in 1533, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.046875 = fieldNorm(doc=1533)
          0.037360262 = weight(abstract_txt:from in 1533) [ClassicSimilarity], result of:
            0.037360262 = score(doc=1533,freq=4.0), product of:
              0.14441894 = queryWeight, product of:
                2.4181328 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.021643601 = queryNorm
              0.25869364 = fieldWeight in 1533, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.046875 = fieldNorm(doc=1533)
          0.3817021 = weight(abstract_txt:tagging in 1533) [ClassicSimilarity], result of:
            0.3817021 = score(doc=1533,freq=9.0), product of:
              0.4306195 = queryWeight, product of:
                3.156428 = boost
                6.3033047 = idf(docFreq=220, maxDocs=44421)
                0.021643601 = queryNorm
              0.88640225 = fieldWeight in 1533, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.3033047 = idf(docFreq=220, maxDocs=44421)
                0.046875 = fieldNorm(doc=1533)
        0.36 = coord(9/25)
    
  5. Losee, R.M.: Learning syntactic rules and tags with genetic algorithms for information retrieval and filtering : an empirical basis for grammatical rules (1996) 0.25
    0.25086668 = sum of:
      0.25086668 = product of:
        0.69685185 = sum of:
          0.04174218 = weight(abstract_txt:performance in 4136) [ClassicSimilarity], result of:
            0.04174218 = score(doc=4136,freq=1.0), product of:
              0.11565536 = queryWeight, product of:
                1.1566899 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.021643601 = queryNorm
              0.36091867 = fieldWeight in 4136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.078125 = fieldNorm(doc=4136)
          0.045146164 = weight(abstract_txt:learning in 4136) [ClassicSimilarity], result of:
            0.045146164 = score(doc=4136,freq=1.0), product of:
              0.1218605 = queryWeight, product of:
                1.1873138 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.021643601 = queryNorm
              0.37047416 = fieldWeight in 4136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.078125 = fieldNorm(doc=4136)
          0.08314309 = weight(abstract_txt:linguistic in 4136) [ClassicSimilarity], result of:
            0.08314309 = score(doc=4136,freq=1.0), product of:
              0.18309078 = queryWeight, product of:
                1.4553494 = boost
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.021643601 = queryNorm
              0.45410857 = fieldWeight in 4136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.078125 = fieldNorm(doc=4136)
          0.061051443 = weight(abstract_txt:part in 4136) [ClassicSimilarity], result of:
            0.061051443 = score(doc=4136,freq=1.0), product of:
              0.170586 = queryWeight, product of:
                1.7204869 = boost
                4.581023 = idf(docFreq=1236, maxDocs=44421)
                0.021643601 = queryNorm
              0.35789245 = fieldWeight in 4136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.581023 = idf(docFreq=1236, maxDocs=44421)
                0.078125 = fieldNorm(doc=4136)
          0.06422991 = weight(abstract_txt:further in 4136) [ClassicSimilarity], result of:
            0.06422991 = score(doc=4136,freq=1.0), product of:
              0.1764565 = queryWeight, product of:
                1.7498407 = boost
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.021643601 = queryNorm
              0.36399856 = fieldWeight in 4136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.078125 = fieldNorm(doc=4136)
          0.023758072 = weight(abstract_txt:that in 4136) [ClassicSimilarity], result of:
            0.023758072 = score(doc=4136,freq=2.0), product of:
              0.09092575 = queryWeight, product of:
                1.7763892 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.021643601 = queryNorm
              0.2612909 = fieldWeight in 4136, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=4136)
          0.04214873 = weight(abstract_txt:some in 4136) [ClassicSimilarity], result of:
            0.04214873 = score(doc=4136,freq=1.0), product of:
              0.14666125 = queryWeight, product of:
                1.8420726 = boost
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.021643601 = queryNorm
              0.28738832 = fieldWeight in 4136, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.078125 = fieldNorm(doc=4136)
          0.04402949 = weight(abstract_txt:from in 4136) [ClassicSimilarity], result of:
            0.04402949 = score(doc=4136,freq=2.0), product of:
              0.14441894 = queryWeight, product of:
                2.4181328 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.021643601 = queryNorm
              0.30487338 = fieldWeight in 4136, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.078125 = fieldNorm(doc=4136)
          0.29160273 = weight(abstract_txt:speech in 4136) [ClassicSimilarity], result of:
            0.29160273 = score(doc=4136,freq=2.0), product of:
              0.38399938 = queryWeight, product of:
                2.5813384 = boost
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.021643601 = queryNorm
              0.7593834 = fieldWeight in 4136, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.078125 = fieldNorm(doc=4136)
        0.36 = coord(9/25)