Document (#29707)

Author
Huang, X.
Robertson, S.E.
Title
Application of probabilistic methods to Chinese text retrieval
Source
Journal of documentation. 53(1997) no.1, p.74-79
Year
1997
Abstract
Discusses the use of text retrieval methods based on the probabilistic model with Chinese language material. Since Chinese text has no natural word boundaries, either a dictionary-based word segmentation method must be applied to the text, or indexing and searching must be done in terms of single Chinese characters. In either case, it becomes important to have a good way of dealing with phrases or contiguous strings of characters; the probabilistic model does not at present have such a facility. Proposes some ad hoc modifications of the probabilistic weighting function and matching method for this purpose.
Content
See also: http://www.emeraldinsight.com/10.1108/EUM0000000007193.
Footnote
Contribution to a thematic issue on Okapi and information retrieval research
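
The abstract above contrasts two ways of turning unsegmented Chinese text into index terms: single-character indexing and dictionary-based word segmentation. The following is a minimal sketch of both, intended only as an illustration. The function names, the toy dictionary, and the choice of greedy forward maximum matching are assumptions; the record does not say which segmentation method the paper uses.

```python
def char_tokens(text):
    """Single-character indexing: every non-space character is a term."""
    return [ch for ch in text if not ch.isspace()]


def max_match_tokens(text, dictionary, max_len=4):
    """Greedy forward maximum matching against a word list (assumed method).

    At each position the longest dictionary entry is taken; if nothing
    matches, the single character is emitted as its own token.
    """
    tokens = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in dictionary:
                tokens.append(candidate)
                i += size
                break
    return tokens


# Toy dictionary, purely illustrative.
toy_dictionary = {"中文", "文本", "检索"}
sample = "中文文本检索"
chars = char_tokens(sample)
words = max_match_tokens(sample, toy_dictionary)
```

With character indexing, a phrase can only be recovered by requiring the characters to be contiguous at search time, which is the kind of facility the abstract says the probabilistic model lacks.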

Similar documents (author)

  1. Beaulieu, M.M.; Gatford, M.; Huang, X.; Robertson, S.E.; Walker, S.; Williams, P.: Okapi at TREC-5 (1997) 2.56
    2.5631065 = sum of:
      2.5631065 = sum of:
        1.2445025 = weight(author_txt:huang in 4097) [ClassicSimilarity], result of:
          1.2445025 = score(doc=4097,freq=1.0), product of:
            0.69334716 = queryWeight, product of:
              7.179679 = idf(docFreq=91, maxDocs=44421)
              0.096570775 = queryNorm
            1.7949197 = fieldWeight in 4097, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.179679 = idf(docFreq=91, maxDocs=44421)
              0.25 = fieldNorm(doc=4097)
        1.318604 = weight(author_txt:robertson in 4097) [ClassicSimilarity], result of:
          1.318604 = score(doc=4097,freq=1.0), product of:
            0.7206037 = queryWeight, product of:
              1.0194663 = boost
              7.319441 = idf(docFreq=79, maxDocs=44421)
              0.096570775 = queryNorm
            1.8298602 = fieldWeight in 4097, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.319441 = idf(docFreq=79, maxDocs=44421)
              0.25 = fieldNorm(doc=4097)
    
  2. Robertson, M.A.: Windows 3.0 for the online searcher (1991) 1.65
    1.6482551 = sum of:
      1.6482551 = product of:
        3.2965102 = sum of:
          3.2965102 = weight(author_txt:robertson in 591) [ClassicSimilarity], result of:
            3.2965102 = score(doc=591,freq=1.0), product of:
              0.7206037 = queryWeight, product of:
                1.0194663 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.096570775 = queryNorm
              4.574651 = fieldWeight in 591, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.625 = fieldNorm(doc=591)
        0.5 = coord(1/2)
    
  3. Robertson, S.E.: Some recent theories and models in information retrieval (1980) 1.65
    1.6482551 = sum of:
      1.6482551 = product of:
        3.2965102 = sum of:
          3.2965102 = weight(author_txt:robertson in 1325) [ClassicSimilarity], result of:
            3.2965102 = score(doc=1325,freq=1.0), product of:
              0.7206037 = queryWeight, product of:
                1.0194663 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.096570775 = queryNorm
              4.574651 = fieldWeight in 1325, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.625 = fieldNorm(doc=1325)
        0.5 = coord(1/2)
    
  4. Robertson, S.E.: Theories and models in information retrieval (1977) 1.65
    1.6482551 = sum of:
      1.6482551 = product of:
        3.2965102 = sum of:
          3.2965102 = weight(author_txt:robertson in 1843) [ClassicSimilarity], result of:
            3.2965102 = score(doc=1843,freq=1.0), product of:
              0.7206037 = queryWeight, product of:
                1.0194663 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.096570775 = queryNorm
              4.574651 = fieldWeight in 1843, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.625 = fieldNorm(doc=1843)
        0.5 = coord(1/2)
    
  5. Robertson, S.E.: On term selection for query expansion (1990) 1.65
    1.6482551 = sum of:
      1.6482551 = product of:
        3.2965102 = sum of:
          3.2965102 = weight(author_txt:robertson in 2649) [ClassicSimilarity], result of:
            3.2965102 = score(doc=2649,freq=1.0), product of:
              0.7206037 = queryWeight, product of:
                1.0194663 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.096570775 = queryNorm
              4.574651 = fieldWeight in 2649, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.625 = fieldNorm(doc=2649)
        0.5 = coord(1/2)
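
The term weights in the explanations above are consistent with classic TF-IDF arithmetic: idf = 1 + ln(maxDocs / (docFreq + 1)), tf = sqrt(freq), and weight = queryWeight × fieldWeight. The sketch below recomputes them in Python. It is an illustration, not the search engine's code: the function names are hypothetical, and queryNorm, fieldNorm, and boost are simply copied from the listing rather than derived.

```python
import math


def classic_idf(doc_freq, max_docs):
    """idf as it appears in the listing: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def classic_tf(freq):
    """tf is the square root of the raw term frequency."""
    return math.sqrt(freq)


def term_weight(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """weight = queryWeight * fieldWeight for one matching term."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm
    field_weight = classic_tf(freq) * idf * field_norm
    return query_weight * field_weight


# Values copied from the explanation for doc 4097 (first author hit).
QUERY_NORM = 0.096570775
huang = term_weight(1.0, 91, 44421, QUERY_NORM, 0.25)
robertson = term_weight(1.0, 79, 44421, QUERY_NORM, 0.25, boost=1.0194663)
doc_4097 = huang + robertson

# Doc 591 matches only "robertson", with a larger fieldNorm of 0.625.
robertson_591 = term_weight(1.0, 79, 44421, QUERY_NORM, 0.625, boost=1.0194663)
```

Under these assumptions the results agree with the listed values (about 1.2445 and 1.3186 for doc 4097, about 3.2965 for doc 591) up to single-precision rounding. The listed documents 2 to 5 then halve that weight through coord(1/2), because only one of the two author terms matched.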
    

Similar documents (content)

  1. Yang, C.C.; Li, K.W.: A heuristic method based on a statistical approach for Chinese text segmentation (2005) 0.67
    0.6693839 = sum of:
      0.6693839 = product of:
        1.521327 = sum of:
          0.046536755 = weight(abstract_txt:dealing in 5580) [ClassicSimilarity], result of:
            0.046536755 = score(doc=5580,freq=1.0), product of:
              0.120169744 = queryWeight, product of:
                1.0618914 = boost
                6.196136 = idf(docFreq=245, maxDocs=44421)
                0.018263923 = queryNorm
              0.3872585 = fieldWeight in 5580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.196136 = idf(docFreq=245, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.017845098 = weight(abstract_txt:based in 5580) [ClassicSimilarity], result of:
            0.017845098 = score(doc=5580,freq=2.0), product of:
              0.06342742 = queryWeight, product of:
                1.0910285 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.018263923 = queryNorm
              0.28134674 = fieldWeight in 5580, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.012813389 = weight(abstract_txt:have in 5580) [ClassicSimilarity], result of:
            0.012813389 = score(doc=5580,freq=1.0), product of:
              0.0640792 = queryWeight, product of:
                1.0966198 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.018263923 = queryNorm
              0.19996175 = fieldWeight in 5580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.05724219 = weight(abstract_txt:dictionary in 5580) [ClassicSimilarity], result of:
            0.05724219 = score(doc=5580,freq=1.0), product of:
              0.13795641 = queryWeight, product of:
                1.1377674 = boost
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.018263923 = queryNorm
              0.41492954 = fieldWeight in 5580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.016439553 = weight(abstract_txt:retrieval in 5580) [ClassicSimilarity], result of:
            0.016439553 = score(doc=5580,freq=1.0), product of:
              0.07566024 = queryWeight, product of:
                1.1916025 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.018263923 = queryNorm
              0.21728125 = fieldWeight in 5580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.2938055 = weight(abstract_txt:segmentation in 5580) [ClassicSimilarity], result of:
            0.2938055 = score(doc=5580,freq=9.0), product of:
              0.19734381 = queryWeight, product of:
                1.3608 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.018263923 = queryNorm
              1.4888002 = fieldWeight in 5580, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.10687508 = weight(abstract_txt:method in 5580) [ClassicSimilarity], result of:
            0.10687508 = score(doc=5580,freq=9.0), product of:
              0.12670036 = queryWeight, product of:
                1.5420074 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.018263923 = queryNorm
              0.84352624 = fieldWeight in 5580, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.08898411 = weight(abstract_txt:word in 5580) [ClassicSimilarity], result of:
            0.08898411 = score(doc=5580,freq=2.0), product of:
              0.18512826 = queryWeight, product of:
                1.8639485 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.018263923 = queryNorm
              0.48066196 = fieldWeight in 5580, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.15751964 = weight(abstract_txt:characters in 5580) [ClassicSimilarity], result of:
            0.15751964 = score(doc=5580,freq=1.0), product of:
              0.34132195 = queryWeight, product of:
                2.5309272 = boost
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.018263923 = queryNorm
              0.4614987 = fieldWeight in 5580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.13660632 = weight(abstract_txt:text in 5580) [ClassicSimilarity], result of:
            0.13660632 = score(doc=5580,freq=7.0), product of:
              0.20443988 = queryWeight, product of:
                2.7700994 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.018263923 = queryNorm
              0.66819805 = fieldWeight in 5580, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.5866594 = weight(abstract_txt:chinese in 5580) [ClassicSimilarity], result of:
            0.5866594 = score(doc=5580,freq=9.0), product of:
              0.4967382 = queryWeight, product of:
                4.3179374 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.018263923 = queryNorm
              1.1810232 = fieldWeight in 5580, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
        0.44 = coord(11/25)
    
  2. Khoo, C.S.G.; Dai, D.; Loh, T.E.: Using statistical and contextual information to identify two- and three-character words in Chinese text (2002) 0.47
    0.46857843 = sum of:
      0.46857843 = product of:
        1.1714461 = sum of:
          0.012813389 = weight(abstract_txt:have in 206) [ClassicSimilarity], result of:
            0.012813389 = score(doc=206,freq=1.0), product of:
              0.0640792 = queryWeight, product of:
                1.0966198 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.018263923 = queryNorm
              0.19996175 = fieldWeight in 206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0625 = fieldNorm(doc=206)
          0.07175295 = weight(abstract_txt:modifications in 206) [ClassicSimilarity], result of:
            0.07175295 = score(doc=206,freq=1.0), product of:
              0.16038272 = queryWeight, product of:
                1.2267649 = boost
                7.1581726 = idf(docFreq=93, maxDocs=44421)
                0.018263923 = queryNorm
              0.4473858 = fieldWeight in 206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1581726 = idf(docFreq=93, maxDocs=44421)
                0.0625 = fieldNorm(doc=206)
          0.23989119 = weight(abstract_txt:segmentation in 206) [ClassicSimilarity], result of:
            0.23989119 = score(doc=206,freq=6.0), product of:
              0.19734381 = queryWeight, product of:
                1.3608 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.018263923 = queryNorm
              1.2156003 = fieldWeight in 206, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.0625 = fieldNorm(doc=206)
          0.024718544 = weight(abstract_txt:model in 206) [ClassicSimilarity], result of:
            0.024718544 = score(doc=206,freq=1.0), product of:
              0.099301614 = queryWeight, product of:
                1.3651353 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.018263923 = queryNorm
              0.24892388 = fieldWeight in 206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=206)
          0.02783279 = weight(abstract_txt:methods in 206) [ClassicSimilarity], result of:
            0.02783279 = score(doc=206,freq=1.0), product of:
              0.10747618 = queryWeight, product of:
                1.4202136 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.018263923 = queryNorm
              0.25896704 = fieldWeight in 206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=206)
          0.06292127 = weight(abstract_txt:word in 206) [ClassicSimilarity], result of:
            0.06292127 = score(doc=206,freq=1.0), product of:
              0.18512826 = queryWeight, product of:
                1.8639485 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.018263923 = queryNorm
              0.33987933 = fieldWeight in 206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0625 = fieldNorm(doc=206)
          0.068343736 = weight(abstract_txt:either in 206) [ClassicSimilarity], result of:
            0.068343736 = score(doc=206,freq=1.0), product of:
              0.19561714 = queryWeight, product of:
                1.9160242 = boost
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.018263923 = queryNorm
              0.349375 = fieldWeight in 206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.0625 = fieldNorm(doc=206)
          0.272832 = weight(abstract_txt:characters in 206) [ClassicSimilarity], result of:
            0.272832 = score(doc=206,freq=3.0), product of:
              0.34132195 = queryWeight, product of:
                2.5309272 = boost
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.018263923 = queryNorm
              0.7993392 = fieldWeight in 206, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.0625 = fieldNorm(doc=206)
          0.05163234 = weight(abstract_txt:text in 206) [ClassicSimilarity], result of:
            0.05163234 = score(doc=206,freq=1.0), product of:
              0.20443988 = queryWeight, product of:
                2.7700994 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.018263923 = queryNorm
              0.25255513 = fieldWeight in 206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=206)
          0.33870792 = weight(abstract_txt:chinese in 206) [ClassicSimilarity], result of:
            0.33870792 = score(doc=206,freq=3.0), product of:
              0.4967382 = queryWeight, product of:
                4.3179374 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.018263923 = queryNorm
              0.6818641 = fieldWeight in 206, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0625 = fieldNorm(doc=206)
        0.4 = coord(10/25)
    
  3. Lee, K.H.; Ng, M.K.M.; Lu, Q.: Text segmentation for Chinese spell checking (1999) 0.44
    0.44075167 = sum of:
      0.44075167 = product of:
        1.2243102 = sum of:
          0.017845098 = weight(abstract_txt:based in 4913) [ClassicSimilarity], result of:
            0.017845098 = score(doc=4913,freq=2.0), product of:
              0.06342742 = queryWeight, product of:
                1.0910285 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.018263923 = queryNorm
              0.28134674 = fieldWeight in 4913, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=4913)
          0.05724219 = weight(abstract_txt:dictionary in 4913) [ClassicSimilarity], result of:
            0.05724219 = score(doc=4913,freq=1.0), product of:
              0.13795641 = queryWeight, product of:
                1.1377674 = boost
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.018263923 = queryNorm
              0.41492954 = fieldWeight in 4913, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.0625 = fieldNorm(doc=4913)
          0.19587034 = weight(abstract_txt:segmentation in 4913) [ClassicSimilarity], result of:
            0.19587034 = score(doc=4913,freq=4.0), product of:
              0.19734381 = queryWeight, product of:
                1.3608 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.018263923 = queryNorm
              0.99253345 = fieldWeight in 4913, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.0625 = fieldNorm(doc=4913)
          0.050381392 = weight(abstract_txt:method in 4913) [ClassicSimilarity], result of:
            0.050381392 = score(doc=4913,freq=2.0), product of:
              0.12670036 = queryWeight, product of:
                1.5420074 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.018263923 = queryNorm
              0.39764208 = fieldWeight in 4913, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=4913)
          0.051173765 = weight(abstract_txt:must in 4913) [ClassicSimilarity], result of:
            0.051173765 = score(doc=4913,freq=1.0), product of:
              0.16130184 = queryWeight, product of:
                1.7398716 = boost
                5.076075 = idf(docFreq=753, maxDocs=44421)
                0.018263923 = queryNorm
              0.3172547 = fieldWeight in 4913, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.076075 = idf(docFreq=753, maxDocs=44421)
                0.0625 = fieldNorm(doc=4913)
          0.12584254 = weight(abstract_txt:word in 4913) [ClassicSimilarity], result of:
            0.12584254 = score(doc=4913,freq=4.0), product of:
              0.18512826 = queryWeight, product of:
                1.8639485 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.018263923 = queryNorm
              0.67975867 = fieldWeight in 4913, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0625 = fieldNorm(doc=4913)
          0.15751964 = weight(abstract_txt:characters in 4913) [ClassicSimilarity], result of:
            0.15751964 = score(doc=4913,freq=1.0), product of:
              0.34132195 = queryWeight, product of:
                2.5309272 = boost
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.018263923 = queryNorm
              0.4614987 = fieldWeight in 4913, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.0625 = fieldNorm(doc=4913)
          0.08942983 = weight(abstract_txt:text in 4913) [ClassicSimilarity], result of:
            0.08942983 = score(doc=4913,freq=3.0), product of:
              0.20443988 = queryWeight, product of:
                2.7700994 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.018263923 = queryNorm
              0.4374383 = fieldWeight in 4913, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=4913)
          0.47900537 = weight(abstract_txt:chinese in 4913) [ClassicSimilarity], result of:
            0.47900537 = score(doc=4913,freq=6.0), product of:
              0.4967382 = queryWeight, product of:
                4.3179374 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.018263923 = queryNorm
              0.96430147 = fieldWeight in 4913, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0625 = fieldNorm(doc=4913)
        0.36 = coord(9/25)
    
  4. Wang, F.L.; Yang, C.C.: Mining Web data for Chinese segmentation (2007) 0.41
    0.40786204 = sum of:
      0.40786204 = product of:
        1.1329501 = sum of:
          0.01261839 = weight(abstract_txt:based in 1604) [ClassicSimilarity], result of:
            0.01261839 = score(doc=1604,freq=1.0), product of:
              0.06342742 = queryWeight, product of:
                1.0910285 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.018263923 = queryNorm
              0.1989422 = fieldWeight in 1604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.022193441 = weight(abstract_txt:have in 1604) [ClassicSimilarity], result of:
            0.022193441 = score(doc=1604,freq=3.0), product of:
              0.0640792 = queryWeight, product of:
                1.0966198 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.018263923 = queryNorm
              0.3463439 = fieldWeight in 1604, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.05675703 = weight(abstract_txt:boundaries in 1604) [ClassicSimilarity], result of:
            0.05675703 = score(doc=1604,freq=1.0), product of:
              0.1371758 = queryWeight, product of:
                1.1345439 = boost
                6.6200633 = idf(docFreq=160, maxDocs=44421)
                0.018263923 = queryNorm
              0.41375396 = fieldWeight in 1604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6200633 = idf(docFreq=160, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.05724219 = weight(abstract_txt:dictionary in 1604) [ClassicSimilarity], result of:
            0.05724219 = score(doc=1604,freq=1.0), product of:
              0.13795641 = queryWeight, product of:
                1.1377674 = boost
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.018263923 = queryNorm
              0.41492954 = fieldWeight in 1604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.016439553 = weight(abstract_txt:retrieval in 1604) [ClassicSimilarity], result of:
            0.016439553 = score(doc=1604,freq=1.0), product of:
              0.07566024 = queryWeight, product of:
                1.1916025 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.018263923 = queryNorm
              0.21728125 = fieldWeight in 1604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.3096982 = weight(abstract_txt:segmentation in 1604) [ClassicSimilarity], result of:
            0.3096982 = score(doc=1604,freq=10.0), product of:
              0.19734381 = queryWeight, product of:
                1.3608 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.018263923 = queryNorm
              1.5693332 = fieldWeight in 1604, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.08898411 = weight(abstract_txt:word in 1604) [ClassicSimilarity], result of:
            0.08898411 = score(doc=1604,freq=2.0), product of:
              0.18512826 = queryWeight, product of:
                1.8639485 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.018263923 = queryNorm
              0.48066196 = fieldWeight in 1604, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.05163234 = weight(abstract_txt:text in 1604) [ClassicSimilarity], result of:
            0.05163234 = score(doc=1604,freq=1.0), product of:
              0.20443988 = queryWeight, product of:
                2.7700994 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.018263923 = queryNorm
              0.25255513 = fieldWeight in 1604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.5173849 = weight(abstract_txt:chinese in 1604) [ClassicSimilarity], result of:
            0.5173849 = score(doc=1604,freq=7.0), product of:
              0.4967382 = queryWeight, product of:
                4.3179374 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.018263923 = queryNorm
              1.0415646 = fieldWeight in 1604, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
        0.36 = coord(9/25)
    
  5. Kwok, K.L.: Employing multiple representations for Chinese information retrieval (1999) 0.39
    0.39498022 = sum of:
      0.39498022 = product of:
        1.0971673 = sum of:
          0.03886474 = weight(abstract_txt:done in 4773) [ClassicSimilarity], result of:
            0.03886474 = score(doc=4773,freq=1.0), product of:
              0.10656998 = queryWeight, product of:
                5.8349996 = idf(docFreq=352, maxDocs=44421)
                0.018263923 = queryNorm
              0.36468747 = fieldWeight in 4773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8349996 = idf(docFreq=352, maxDocs=44421)
                0.0625 = fieldNorm(doc=4773)
          0.04931866 = weight(abstract_txt:retrieval in 4773) [ClassicSimilarity], result of:
            0.04931866 = score(doc=4773,freq=9.0), product of:
              0.07566024 = queryWeight, product of:
                1.1916025 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.018263923 = queryNorm
              0.6518438 = fieldWeight in 4773, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=4773)
          0.13850124 = weight(abstract_txt:segmentation in 4773) [ClassicSimilarity], result of:
            0.13850124 = score(doc=4773,freq=2.0), product of:
              0.19734381 = queryWeight, product of:
                1.3608 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.018263923 = queryNorm
              0.7018271 = fieldWeight in 4773, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.0625 = fieldNorm(doc=4773)
          0.02783279 = weight(abstract_txt:methods in 4773) [ClassicSimilarity], result of:
            0.02783279 = score(doc=4773,freq=1.0), product of:
              0.10747618 = queryWeight, product of:
                1.4202136 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.018263923 = queryNorm
              0.25896704 = fieldWeight in 4773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=4773)
          0.035625026 = weight(abstract_txt:method in 4773) [ClassicSimilarity], result of:
            0.035625026 = score(doc=4773,freq=1.0), product of:
              0.12670036 = queryWeight, product of:
                1.5420074 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.018263923 = queryNorm
              0.2811754 = fieldWeight in 4773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=4773)
          0.12584254 = weight(abstract_txt:word in 4773) [ClassicSimilarity], result of:
            0.12584254 = score(doc=4773,freq=4.0), product of:
              0.18512826 = queryWeight, product of:
                1.8639485 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.018263923 = queryNorm
              0.67975867 = fieldWeight in 4773, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0625 = fieldNorm(doc=4773)
          0.22276641 = weight(abstract_txt:characters in 4773) [ClassicSimilarity], result of:
            0.22276641 = score(doc=4773,freq=2.0), product of:
              0.34132195 = queryWeight, product of:
                2.5309272 = boost
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.018263923 = queryNorm
              0.65265775 = fieldWeight in 4773, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.0625 = fieldNorm(doc=4773)
          0.18186192 = weight(abstract_txt:probabilistic in 4773) [ClassicSimilarity], result of:
            0.18186192 = score(doc=4773,freq=1.0), product of:
              0.42999756 = queryWeight, product of:
                3.4791741 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.018263923 = queryNorm
              0.4229371 = fieldWeight in 4773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.0625 = fieldNorm(doc=4773)
          0.27655387 = weight(abstract_txt:chinese in 4773) [ClassicSimilarity], result of:
            0.27655387 = score(doc=4773,freq=2.0), product of:
              0.4967382 = queryWeight, product of:
                4.3179374 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.018263923 = queryNorm
              0.5567397 = fieldWeight in 4773, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0625 = fieldNorm(doc=4773)
        0.36 = coord(9/25)
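
Each content-based score above is the sum of the matching term weights multiplied by a coordination factor, coord(matched/total), where total is the number of query terms (25 here). A small Python sketch of that last step follows; the function name is hypothetical and the term weights are copied verbatim from the explanation for doc 5580.

```python
def combined_score(term_weights, total_query_terms):
    """Sum the per-term weights, then scale by the share of query terms matched."""
    coord = len(term_weights) / total_query_terms
    return sum(term_weights) * coord


# The eleven term weights listed for doc 5580 (Yang and Li, 2005).
doc_5580_weights = [
    0.046536755,  # dealing
    0.017845098,  # based
    0.012813389,  # have
    0.05724219,   # dictionary
    0.016439553,  # retrieval
    0.2938055,    # segmentation
    0.10687508,   # method
    0.08898411,   # word
    0.15751964,   # characters
    0.13660632,   # text
    0.5866594,    # chinese
]
doc_5580 = combined_score(doc_5580_weights, 25)
```

The factor rewards documents that cover more of the query: doc 5580 matches 11 of 25 terms and keeps 44% of its raw sum, while the documents that match 9 terms keep 36%.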