Document (#30055)

Author
Qin, J.
Zhou, Y.
Chau, M.
Chen, H.
Title
Multilingual Web retrieval : an experiment in English-Chinese business intelligence
Source
Journal of the American Society for Information Science and Technology. 57(2006) no.5, pp. 671-683
Year
2006
Abstract
As increasing numbers of non-English resources have become available on the Web, the interesting and important issue of how Web users can retrieve documents in different languages has arisen. Cross-language information retrieval (CLIR), the study of retrieving information in one language by queries expressed in another language, is a promising approach to the problem. Cross-language information retrieval has attracted much attention in recent years. Most research systems have achieved satisfactory performance on standard Text REtrieval Conference (TREC) collections such as news articles, but CLIR techniques have not been widely studied and evaluated for applications such as Web portals. In this article, the authors present their research in developing and evaluating a multilingual English-Chinese Web portal that incorporates various CLIR techniques for use in the business domain. A dictionary-based approach was adopted that combines phrasal translation, co-occurrence analysis, and pre- and posttranslation query expansion. The portal was evaluated by domain experts, using a set of queries in both English and Chinese. The experimental results showed that co-occurrence-based phrasal translation achieved a 74.6% improvement in precision over simple word-by-word translation. When used together, pre- and posttranslation query expansion improved the performance slightly, achieving a 78.0% improvement over the baseline word-by-word translation approach. In general, applying CLIR techniques in Web applications shows promise.
Footnote
Contribution to a special topic section on multilingual information systems
Theme
Multilingual problems
Area
Information economy

Similar documents (author)

  1. Chau, M.; Wong, C.H.; Zhou, Y.; Qin, J.; Chen, H.: Evaluating the use of search engine development tools in IT education (2010) 4.18
    4.1825457 = sum of:
      4.1825457 = sum of:
        0.65183544 = weight(author_txt:chen in 312) [ClassicSimilarity], result of:
          0.65183544 = score(doc=312,freq=1.0), product of:
            0.3398878 = queryWeight, product of:
              6.136947 = idf(docFreq=260, maxDocs=44421)
              0.055383857 = queryNorm
            1.917796 = fieldWeight in 312, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.136947 = idf(docFreq=260, maxDocs=44421)
              0.3125 = fieldNorm(doc=312)
        1.3227684 = weight(author_txt:zhou in 312) [ClassicSimilarity], result of:
          1.3227684 = score(doc=312,freq=1.0), product of:
            0.5447946 = queryWeight, product of:
              1.2660434 = boost
              7.769642 = idf(docFreq=50, maxDocs=44421)
              0.055383857 = queryNorm
            2.428013 = fieldWeight in 312, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.769642 = idf(docFreq=50, maxDocs=44421)
              0.3125 = fieldNorm(doc=312)
        2.2079415 = weight(author_txt:chau in 312) [ClassicSimilarity], result of:
          2.2079415 = score(doc=312,freq=1.0), product of:
            0.7665997 = queryWeight, product of:
              1.5018153 = boost
              9.216561 = idf(docFreq=11, maxDocs=44421)
              0.055383857 = queryNorm
            2.8801754 = fieldWeight in 312, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.216561 = idf(docFreq=11, maxDocs=44421)
              0.3125 = fieldNorm(doc=312)
    
  2. Chen, H.; Chau, M.: Web mining : machine learning for Web applications (2003) 3.05
    3.050429 = sum of:
      3.050429 = product of:
        4.5756435 = sum of:
          1.0429367 = weight(author_txt:chen in 5242) [ClassicSimilarity], result of:
            1.0429367 = score(doc=5242,freq=1.0), product of:
              0.3398878 = queryWeight, product of:
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.055383857 = queryNorm
              3.0684736 = fieldWeight in 5242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.5 = fieldNorm(doc=5242)
          3.5327067 = weight(author_txt:chau in 5242) [ClassicSimilarity], result of:
            3.5327067 = score(doc=5242,freq=1.0), product of:
              0.7665997 = queryWeight, product of:
                1.5018153 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.055383857 = queryNorm
              4.6082807 = fieldWeight in 5242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.5 = fieldNorm(doc=5242)
        0.6666667 = coord(2/3)
    
  3. Chen, H.; Fan, H.; Chau, M.; Zeng, D.: MetaSpider : meta-searching and categorization on the Web (2001) 1.91
    1.906518 = sum of:
      1.906518 = product of:
        2.859777 = sum of:
          0.65183544 = weight(author_txt:chen in 849) [ClassicSimilarity], result of:
            0.65183544 = score(doc=849,freq=1.0), product of:
              0.3398878 = queryWeight, product of:
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.055383857 = queryNorm
              1.917796 = fieldWeight in 849, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.3125 = fieldNorm(doc=849)
          2.2079415 = weight(author_txt:chau in 849) [ClassicSimilarity], result of:
            2.2079415 = score(doc=849,freq=1.0), product of:
              0.7665997 = queryWeight, product of:
                1.5018153 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.055383857 = queryNorm
              2.8801754 = fieldWeight in 849, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.3125 = fieldNorm(doc=849)
        0.6666667 = coord(2/3)
    
  4. Chen, H.; Lally, A.M.; Zhu, B.; Chau, M.: HelpfulMed : Intelligent searching for medical information over the Internet (2003) 1.91
    1.906518 = sum of:
      1.906518 = product of:
        2.859777 = sum of:
          0.65183544 = weight(author_txt:chen in 2615) [ClassicSimilarity], result of:
            0.65183544 = score(doc=2615,freq=1.0), product of:
              0.3398878 = queryWeight, product of:
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.055383857 = queryNorm
              1.917796 = fieldWeight in 2615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.3125 = fieldNorm(doc=2615)
          2.2079415 = weight(author_txt:chau in 2615) [ClassicSimilarity], result of:
            2.2079415 = score(doc=2615,freq=1.0), product of:
              0.7665997 = queryWeight, product of:
                1.5018153 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.055383857 = queryNorm
              2.8801754 = fieldWeight in 2615, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.3125 = fieldNorm(doc=2615)
        0.6666667 = coord(2/3)
    
  5. Chau, M.; Shiu, B.; Chan, M.; Chen, H.: Redips: backlink search and analysis on the Web for business intelligence analysis (2007) 1.91
    1.906518 = sum of:
      1.906518 = product of:
        2.859777 = sum of:
          0.65183544 = weight(author_txt:chen in 1142) [ClassicSimilarity], result of:
            0.65183544 = score(doc=1142,freq=1.0), product of:
              0.3398878 = queryWeight, product of:
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.055383857 = queryNorm
              1.917796 = fieldWeight in 1142, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.3125 = fieldNorm(doc=1142)
          2.2079415 = weight(author_txt:chau in 1142) [ClassicSimilarity], result of:
            2.2079415 = score(doc=1142,freq=1.0), product of:
              0.7665997 = queryWeight, product of:
                1.5018153 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.055383857 = queryNorm
              2.8801754 = fieldWeight in 1142, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.3125 = fieldNorm(doc=1142)
        0.6666667 = coord(2/3)
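
The score trees above can be reproduced by hand. The following is a minimal sketch, assuming the formulas that the listed numbers are consistent with: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and a final multiplication by coord (matched terms / query terms). The function names `idf` and `term_score` are hypothetical helpers for illustration, not part of any library, and results may differ from the listing in the last digits because the listing appears to use single-precision arithmetic.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as inferred from the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, max_docs: int,
               query_norm: float, field_norm: float,
               boost: float = 1.0) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


# Values copied from entry 2 (doc 5242) of the author-based list.
MAX_DOCS = 44421
QUERY_NORM = 0.055383857

chen = term_score(1.0, 260, MAX_DOCS, QUERY_NORM, field_norm=0.5)
chau = term_score(1.0, 11, MAX_DOCS, QUERY_NORM, field_norm=0.5,
                  boost=1.5018153)

# Two of the three query terms (chen, zhou, chau) match this document.
coord = 2 / 3
total = (chen + chau) * coord

print(f"chen={chen:.4f} chau={chau:.4f} total={total:.4f}")
```

Under these assumptions the total comes out at roughly 3.05, in line with the score shown for entry 2. Entry 1 matches all three author terms, so no coord factor appears in its tree.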
    

Similar documents (content)

  1. Chen, J.: A lexical knowledge base approach for English-Chinese cross-language information retrieval (2006) 0.63
    0.63212454 = sum of:
      0.63212454 = product of:
        1.3169261 = sum of:
          0.04085963 = weight(abstract_txt:query in 5923) [ClassicSimilarity], result of:
            0.04085963 = score(doc=5923,freq=2.0), product of:
              0.097228885 = queryWeight, product of:
                1.00542 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.02033966 = queryNorm
              0.42024165 = fieldWeight in 5923, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=5923)
          0.04723875 = weight(abstract_txt:cross in 5923) [ClassicSimilarity], result of:
            0.04723875 = score(doc=5923,freq=1.0), product of:
              0.13494018 = queryWeight, product of:
                1.1844603 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.02033966 = queryNorm
              0.3500718 = fieldWeight in 5923, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.0625 = fieldNorm(doc=5923)
          0.04222811 = weight(abstract_txt:approach in 5923) [ClassicSimilarity], result of:
            0.04222811 = score(doc=5923,freq=4.0), product of:
              0.09029989 = queryWeight, product of:
                1.186695 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.02033966 = queryNorm
              0.467643 = fieldWeight in 5923, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=5923)
          0.05809561 = weight(abstract_txt:achieved in 5923) [ClassicSimilarity], result of:
            0.05809561 = score(doc=5923,freq=1.0), product of:
              0.15489519 = queryWeight, product of:
                1.269021 = boost
                6.0010242 = idf(docFreq=298, maxDocs=44421)
                0.02033966 = queryNorm
              0.37506402 = fieldWeight in 5923, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0010242 = idf(docFreq=298, maxDocs=44421)
                0.0625 = fieldNorm(doc=5923)
          0.062485088 = weight(abstract_txt:improvement in 5923) [ClassicSimilarity], result of:
            0.062485088 = score(doc=5923,freq=1.0), product of:
              0.16260228 = queryWeight, product of:
                1.300209 = boost
                6.148508 = idf(docFreq=257, maxDocs=44421)
                0.02033966 = queryNorm
              0.38428175 = fieldWeight in 5923, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.148508 = idf(docFreq=257, maxDocs=44421)
                0.0625 = fieldNorm(doc=5923)
          0.037537195 = weight(abstract_txt:techniques in 5923) [ClassicSimilarity], result of:
            0.037537195 = score(doc=5923,freq=1.0), product of:
              0.13251977 = queryWeight, product of:
                1.4375925 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.02033966 = queryNorm
              0.28325734 = fieldWeight in 5923, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.0625 = fieldNorm(doc=5923)
          0.031947646 = weight(abstract_txt:retrieval in 5923) [ClassicSimilarity], result of:
            0.031947646 = score(doc=5923,freq=2.0), product of:
              0.10396846 = queryWeight, product of:
                1.4703329 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.02033966 = queryNorm
              0.3072821 = fieldWeight in 5923, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=5923)
          0.33203322 = weight(abstract_txt:clir in 5923) [ClassicSimilarity], result of:
            0.33203322 = score(doc=5923,freq=5.0), product of:
              0.289561 = queryWeight, product of:
                1.7350816 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.02033966 = queryNorm
              1.146678 = fieldWeight in 5923, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.0625 = fieldNorm(doc=5923)
          0.05517328 = weight(abstract_txt:language in 5923) [ClassicSimilarity], result of:
            0.05517328 = score(doc=5923,freq=2.0), product of:
              0.14965627 = queryWeight, product of:
                1.7640558 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.02033966 = queryNorm
              0.3686667 = fieldWeight in 5923, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=5923)
          0.14250985 = weight(abstract_txt:chinese in 5923) [ClassicSimilarity], result of:
            0.14250985 = score(doc=5923,freq=2.0), product of:
              0.25597215 = queryWeight, product of:
                1.9979833 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.02033966 = queryNorm
              0.5567397 = fieldWeight in 5923, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0625 = fieldNorm(doc=5923)
          0.13172033 = weight(abstract_txt:english in 5923) [ClassicSimilarity], result of:
            0.13172033 = score(doc=5923,freq=2.0), product of:
              0.26732787 = queryWeight, product of:
                2.3576915 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.02033966 = queryNorm
              0.4927295 = fieldWeight in 5923, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.0625 = fieldNorm(doc=5923)
          0.33509743 = weight(abstract_txt:translation in 5923) [ClassicSimilarity], result of:
            0.33509743 = score(doc=5923,freq=7.0), product of:
              0.3281208 = queryWeight, product of:
                2.6120515 = boost
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.02033966 = queryNorm
              1.0212624 = fieldWeight in 5923, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.0625 = fieldNorm(doc=5923)
        0.48 = coord(12/25)
    
  2. Kim, S.; Ko, Y.; Oard, D.W.: Combining lexical and statistical translation evidence for cross-language information retrieval (2015) 0.53
    0.53041077 = sum of:
      0.53041077 = product of:
        1.0200207 = sum of:
          0.04085963 = weight(abstract_txt:query in 2606) [ClassicSimilarity], result of:
            0.04085963 = score(doc=2606,freq=2.0), product of:
              0.097228885 = queryWeight, product of:
                1.00542 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.02033966 = queryNorm
              0.42024165 = fieldWeight in 2606, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=2606)
          0.01320563 = weight(abstract_txt:have in 2606) [ClassicSimilarity], result of:
            0.01320563 = score(doc=2606,freq=1.0), product of:
              0.06604078 = queryWeight, product of:
                1.0148493 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.02033966 = queryNorm
              0.19996175 = fieldWeight in 2606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0625 = fieldNorm(doc=2606)
          0.035693217 = weight(abstract_txt:queries in 2606) [ClassicSimilarity], result of:
            0.035693217 = score(doc=2606,freq=1.0), product of:
              0.111943655 = queryWeight, product of:
                1.0788215 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.02033966 = queryNorm
              0.31884983 = fieldWeight in 2606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0625 = fieldNorm(doc=2606)
          0.04723875 = weight(abstract_txt:cross in 2606) [ClassicSimilarity], result of:
            0.04723875 = score(doc=2606,freq=1.0), product of:
              0.13494018 = queryWeight, product of:
                1.1844603 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.02033966 = queryNorm
              0.3500718 = fieldWeight in 2606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.0625 = fieldNorm(doc=2606)
          0.021114055 = weight(abstract_txt:approach in 2606) [ClassicSimilarity], result of:
            0.021114055 = score(doc=2606,freq=1.0), product of:
              0.09029989 = queryWeight, product of:
                1.186695 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.02033966 = queryNorm
              0.2338215 = fieldWeight in 2606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=2606)
          0.08657926 = weight(abstract_txt:expansion in 2606) [ClassicSimilarity], result of:
            0.08657926 = score(doc=2606,freq=2.0), product of:
              0.16040145 = queryWeight, product of:
                1.2913798 = boost
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.02033966 = queryNorm
              0.5397661 = fieldWeight in 2606, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.0625 = fieldNorm(doc=2606)
          0.08330181 = weight(abstract_txt:occurrence in 2606) [ClassicSimilarity], result of:
            0.08330181 = score(doc=2606,freq=1.0), product of:
              0.19696029 = queryWeight, product of:
                1.4309986 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.02033966 = queryNorm
              0.4229371 = fieldWeight in 2606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.0625 = fieldNorm(doc=2606)
          0.037537195 = weight(abstract_txt:techniques in 2606) [ClassicSimilarity], result of:
            0.037537195 = score(doc=2606,freq=1.0), product of:
              0.13251977 = queryWeight, product of:
                1.4375925 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.02033966 = queryNorm
              0.28325734 = fieldWeight in 2606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.0625 = fieldNorm(doc=2606)
          0.022590397 = weight(abstract_txt:retrieval in 2606) [ClassicSimilarity], result of:
            0.022590397 = score(doc=2606,freq=1.0), product of:
              0.10396846 = queryWeight, product of:
                1.4703329 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.02033966 = queryNorm
              0.21728125 = fieldWeight in 2606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=2606)
          0.14848977 = weight(abstract_txt:clir in 2606) [ClassicSimilarity], result of:
            0.14848977 = score(doc=2606,freq=1.0), product of:
              0.289561 = queryWeight, product of:
                1.7350816 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.02033966 = queryNorm
              0.51281 = fieldWeight in 2606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.0625 = fieldNorm(doc=2606)
          0.05517328 = weight(abstract_txt:language in 2606) [ClassicSimilarity], result of:
            0.05517328 = score(doc=2606,freq=2.0), product of:
              0.14965627 = queryWeight, product of:
                1.7640558 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.02033966 = queryNorm
              0.3686667 = fieldWeight in 2606, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=2606)
          0.09314034 = weight(abstract_txt:english in 2606) [ClassicSimilarity], result of:
            0.09314034 = score(doc=2606,freq=1.0), product of:
              0.26732787 = queryWeight, product of:
                2.3576915 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.02033966 = queryNorm
              0.3484124 = fieldWeight in 2606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.0625 = fieldNorm(doc=2606)
          0.33509743 = weight(abstract_txt:translation in 2606) [ClassicSimilarity], result of:
            0.33509743 = score(doc=2606,freq=7.0), product of:
              0.3281208 = queryWeight, product of:
                2.6120515 = boost
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.02033966 = queryNorm
              1.0212624 = fieldWeight in 2606, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.0625 = fieldNorm(doc=2606)
        0.52 = coord(13/25)
    
  3. Yang, C.C.; Li, K.W.: Automatic construction of English/Chinese parallel corpora (2003) 0.45
    0.4466418 = sum of:
      0.4466418 = product of:
        1.015095 = sum of:
          0.035177093 = weight(abstract_txt:domain in 2683) [ClassicSimilarity], result of:
            0.035177093 = score(doc=2683,freq=2.0), product of:
              0.096183434 = queryWeight, product of:
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.02033966 = queryNorm
              0.3657292 = fieldWeight in 2683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.03534531 = weight(abstract_txt:applications in 2683) [ClassicSimilarity], result of:
            0.03534531 = score(doc=2683,freq=2.0), product of:
              0.096489824 = queryWeight, product of:
                1.0015914 = boost
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.02033966 = queryNorm
              0.36631125 = fieldWeight in 2683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.016341131 = weight(abstract_txt:have in 2683) [ClassicSimilarity], result of:
            0.016341131 = score(doc=2683,freq=2.0), product of:
              0.06604078 = queryWeight, product of:
                1.0148493 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.02033966 = queryNorm
              0.24744003 = fieldWeight in 2683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.082667805 = weight(abstract_txt:cross in 2683) [ClassicSimilarity], result of:
            0.082667805 = score(doc=2683,freq=4.0), product of:
              0.13494018 = queryWeight, product of:
                1.1844603 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.02033966 = queryNorm
              0.6126256 = fieldWeight in 2683, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.08313074 = weight(abstract_txt:multilingual in 2683) [ClassicSimilarity], result of:
            0.08313074 = score(doc=2683,freq=2.0), product of:
              0.17064808 = queryWeight, product of:
                1.3319888 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.02033966 = queryNorm
              0.4871472 = fieldWeight in 2683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.027954191 = weight(abstract_txt:retrieval in 2683) [ClassicSimilarity], result of:
            0.027954191 = score(doc=2683,freq=2.0), product of:
              0.10396846 = queryWeight, product of:
                1.4703329 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.02033966 = queryNorm
              0.26887184 = fieldWeight in 2683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.098279126 = weight(abstract_txt:word in 2683) [ClassicSimilarity], result of:
            0.098279126 = score(doc=2683,freq=3.0), product of:
              0.19079539 = queryWeight, product of:
                1.7249616 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.02033966 = queryNorm
              0.5151022 = fieldWeight in 2683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.068273455 = weight(abstract_txt:language in 2683) [ClassicSimilarity], result of:
            0.068273455 = score(doc=2683,freq=4.0), product of:
              0.14965627 = queryWeight, product of:
                1.7640558 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.02033966 = queryNorm
              0.45620176 = fieldWeight in 2683, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.17634694 = weight(abstract_txt:chinese in 2683) [ClassicSimilarity], result of:
            0.17634694 = score(doc=2683,freq=4.0), product of:
              0.25597215 = queryWeight, product of:
                1.9979833 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.02033966 = queryNorm
              0.6889302 = fieldWeight in 2683, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.19962803 = weight(abstract_txt:english in 2683) [ClassicSimilarity], result of:
            0.19962803 = score(doc=2683,freq=6.0), product of:
              0.26732787 = queryWeight, product of:
                2.3576915 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.02033966 = queryNorm
              0.7467535 = fieldWeight in 2683, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.19195117 = weight(abstract_txt:translation in 2683) [ClassicSimilarity], result of:
            0.19195117 = score(doc=2683,freq=3.0), product of:
              0.3281208 = queryWeight, product of:
                2.6120515 = boost
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.02033966 = queryNorm
              0.5850015 = fieldWeight in 2683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
        0.44 = coord(11/25)
    
  4. Ye, Z.; Huang, J.X.; He, B.; Lin, H.: Mining a multilingual association dictionary from Wikipedia for cross-language information retrieval (2012) 0.41
    0.4107348 = sum of:
      0.4107348 = product of:
        0.9334882 = sum of:
          0.049473125 = weight(abstract_txt:applications in 1513) [ClassicSimilarity], result of:
            0.049473125 = score(doc=1513,freq=3.0), product of:
              0.096489824 = queryWeight, product of:
                1.0015914 = boost
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.02033966 = queryNorm
              0.5127289 = fieldWeight in 1513, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.0625 = fieldNorm(doc=1513)
          0.02889212 = weight(abstract_txt:query in 1513) [ClassicSimilarity], result of:
            0.02889212 = score(doc=1513,freq=1.0), product of:
              0.097228885 = queryWeight, product of:
                1.00542 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.02033966 = queryNorm
              0.29715574 = fieldWeight in 1513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=1513)
          0.0944775 = weight(abstract_txt:cross in 1513) [ClassicSimilarity], result of:
            0.0944775 = score(doc=1513,freq=4.0), product of:
              0.13494018 = queryWeight, product of:
                1.1844603 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.02033966 = queryNorm
              0.7001436 = fieldWeight in 1513, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.0625 = fieldNorm(doc=1513)
          0.021114055 = weight(abstract_txt:approach in 1513) [ClassicSimilarity], result of:
            0.021114055 = score(doc=1513,freq=1.0), product of:
              0.09029989 = queryWeight, product of:
                1.186695 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.02033966 = queryNorm
              0.2338215 = fieldWeight in 1513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=1513)
          0.061220784 = weight(abstract_txt:expansion in 1513) [ClassicSimilarity], result of:
            0.061220784 = score(doc=1513,freq=1.0), product of:
              0.16040145 = queryWeight, product of:
                1.2913798 = boost
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.02033966 = queryNorm
              0.38167226 = fieldWeight in 1513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.0625 = fieldNorm(doc=1513)
          0.067179784 = weight(abstract_txt:multilingual in 1513) [ClassicSimilarity], result of:
            0.067179784 = score(doc=1513,freq=1.0), product of:
              0.17064808 = queryWeight, product of:
                1.3319888 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.02033966 = queryNorm
              0.3936744 = fieldWeight in 1513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0625 = fieldNorm(doc=1513)
          0.031947646 = weight(abstract_txt:retrieval in 1513) [ClassicSimilarity], result of:
            0.031947646 = score(doc=1513,freq=2.0), product of:
              0.10396846 = queryWeight, product of:
                1.4703329 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.02033966 = queryNorm
              0.3072821 = fieldWeight in 1513, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=1513)
          0.06484741 = weight(abstract_txt:word in 1513) [ClassicSimilarity], result of:
            0.06484741 = score(doc=1513,freq=1.0), product of:
              0.19079539 = queryWeight, product of:
                1.7249616 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.02033966 = queryNorm
              0.33987933 = fieldWeight in 1513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0625 = fieldNorm(doc=1513)
          0.25719184 = weight(abstract_txt:clir in 1513) [ClassicSimilarity], result of:
            0.25719184 = score(doc=1513,freq=3.0), product of:
              0.289561 = queryWeight, product of:
                1.7350816 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.02033966 = queryNorm
              0.8882129 = fieldWeight in 1513, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.0625 = fieldNorm(doc=1513)
          0.0780268 = weight(abstract_txt:language in 1513) [ClassicSimilarity], result of:
            0.0780268 = score(doc=1513,freq=4.0), product of:
              0.14965627 = queryWeight, product of:
                1.7640558 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.02033966 = queryNorm
              0.52137345 = fieldWeight in 1513, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=1513)
          0.17911713 = weight(abstract_txt:translation in 1513) [ClassicSimilarity], result of:
            0.17911713 = score(doc=1513,freq=2.0), product of:
              0.3281208 = queryWeight, product of:
                2.6120515 = boost
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.02033966 = queryNorm
              0.54588777 = fieldWeight in 1513, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.0625 = fieldNorm(doc=1513)
        0.44 = coord(11/25)
    
  5. Chen, K.-H.: Evaluating Chinese text retrieval with multilingual queries (2002) 0.39
    0.39173672 = sum of:
      0.39173672 = product of:
        0.9793418 = sum of:
          0.043338183 = weight(abstract_txt:query in 2851) [ClassicSimilarity], result of:
            0.043338183 = score(doc=2851,freq=1.0), product of:
              0.097228885 = queryWeight, product of:
                1.00542 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.02033966 = queryNorm
              0.4457336 = fieldWeight in 2851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.09375 = fieldNorm(doc=2851)
          0.053539824 = weight(abstract_txt:queries in 2851) [ClassicSimilarity], result of:
            0.053539824 = score(doc=2851,freq=1.0), product of:
              0.111943655 = queryWeight, product of:
                1.0788215 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.02033966 = queryNorm
              0.47827476 = fieldWeight in 2851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.09375 = fieldNorm(doc=2851)
          0.07085812 = weight(abstract_txt:cross in 2851) [ClassicSimilarity], result of:
            0.07085812 = score(doc=2851,freq=1.0), product of:
              0.13494018 = queryWeight, product of:
                1.1844603 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.02033966 = queryNorm
              0.5251077 = fieldWeight in 2851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.09375 = fieldNorm(doc=2851)
          0.09183118 = weight(abstract_txt:expansion in 2851) [ClassicSimilarity], result of:
            0.09183118 = score(doc=2851,freq=1.0), product of:
              0.16040145 = queryWeight, product of:
                1.2913798 = boost
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.02033966 = queryNorm
              0.5725084 = fieldWeight in 2851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.09375 = fieldNorm(doc=2851)
          0.14250985 = weight(abstract_txt:multilingual in 2851) [ClassicSimilarity], result of:
            0.14250985 = score(doc=2851,freq=2.0), product of:
              0.17064808 = queryWeight, product of:
                1.3319888 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.02033966 = queryNorm
              0.83510953 = fieldWeight in 2851, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.09375 = fieldNorm(doc=2851)
          0.056305792 = weight(abstract_txt:techniques in 2851) [ClassicSimilarity], result of:
            0.056305792 = score(doc=2851,freq=1.0), product of:
              0.13251977 = queryWeight, product of:
                1.4375925 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.02033966 = queryNorm
              0.424886 = fieldWeight in 2851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.09375 = fieldNorm(doc=2851)
          0.058691565 = weight(abstract_txt:retrieval in 2851) [ClassicSimilarity], result of:
            0.058691565 = score(doc=2851,freq=3.0), product of:
              0.10396846 = queryWeight, product of:
                1.4703329 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.02033966 = queryNorm
              0.5645132 = fieldWeight in 2851, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=2851)
          0.0585201 = weight(abstract_txt:language in 2851) [ClassicSimilarity], result of:
            0.0585201 = score(doc=2851,freq=1.0), product of:
              0.14965627 = queryWeight, product of:
                1.7640558 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.02033966 = queryNorm
              0.39103007 = fieldWeight in 2851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.09375 = fieldNorm(doc=2851)
          0.21376479 = weight(abstract_txt:chinese in 2851) [ClassicSimilarity], result of:
            0.21376479 = score(doc=2851,freq=2.0), product of:
              0.25597215 = queryWeight, product of:
                1.9979833 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.02033966 = queryNorm
              0.83510953 = fieldWeight in 2851, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.09375 = fieldNorm(doc=2851)
          0.18998241 = weight(abstract_txt:translation in 2851) [ClassicSimilarity], result of:
            0.18998241 = score(doc=2851,freq=1.0), product of:
              0.3281208 = queryWeight, product of:
                2.6120515 = boost
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.02033966 = queryNorm
              0.5790014 = fieldWeight in 2851, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.09375 = fieldNorm(doc=2851)
        0.4 = coord(10/25)
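
The arithmetic in the score trees above can be reproduced by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity definitions (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))), which agree with the numbers in the listing. The function names `idf`, `term_score`, and `document_score` are hypothetical helpers introduced for illustration and are not part of Lucene's API.

```python
import math


def idf(doc_freq, max_docs):
    # Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # score = queryWeight * fieldWeight, as in each "weight(...)" node.
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm             # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * term_idf * field_norm   # tf * idf * fieldNorm
    return query_weight * field_weight


def document_score(term_scores, matched, total):
    # Final score = sum of term scores * coord(matched/total).
    return sum(term_scores) * (matched / total)


# "translation" in doc 2851; the listing reports 0.18998241 for this term.
translation_2851 = term_score(
    freq=1.0, doc_freq=250, max_docs=44421,
    boost=2.6120515, query_norm=0.02033966, field_norm=0.09375,
)

# The ten term scores listed for doc 2851, combined with coord(10/25);
# the listing reports 0.39173672 for the document.
scores_2851 = [
    0.043338183, 0.053539824, 0.07085812, 0.09183118, 0.14250985,
    0.056305792, 0.058691565, 0.0585201, 0.21376479, 0.18998241,
]
total_2851 = document_score(scores_2851, matched=10, total=25)

print(translation_2851, total_2851)
```

Small differences in the last digits are expected, since the listing shows single-precision values while this sketch computes in double precision.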