Document (#32337)

Author
Chau, M.
Fang, X.
Rittman, C.C.
Title
Web searching in Chinese : a study of a search engine in Hong Kong
Source
Journal of the American Society for Information Science and Technology. 58(2007) no.7, S.1044-1054
Year
2007
Abstract
The number of non-English resources has been increasing rapidly on the Web. Although many studies have been conducted on the query logs in search engines that are primarily English-based (e.g., Excite and AltaVista), only a few of them have studied the information-seeking behavior on the Web in non-English languages. In this article, we report the analysis of the search-query logs of a search engine that focused on Chinese. Three months of search-query logs of Timway, a search engine based in Hong Kong, were collected and analyzed. Metrics on sessions, queries, search topics, and character usage are reported. N-gram analysis also has been applied to perform character-based analysis. Our analysis suggests that some characteristics identified in the search log, such as search topics and the mean number of queries per sessions, are similar to those in English search engines; however, other characteristics, such as the use of operators in query formulation, are significantly different. The analysis also shows that only a very small number of unique Chinese characters are used in search queries. We believe the findings from this study have provided some insights into further research in non-English Web searching.
Theme
Internet
Location
Hong Kong

Similar documents (author)

  1. Chau, M.; Fang, X.; Sheng, O.R.U.: Analysis of the query logs of a Web site search engine (2005) 4.67
    4.6728725 = sum of:
      4.6728725 = sum of:
        2.0156949 = weight(author_txt:fang in 5573) [ClassicSimilarity], result of:
          2.0156949 = score(doc=5573,freq=1.0), product of:
            0.63947445 = queryWeight, product of:
              8.405631 = idf(docFreq=26, maxDocs=44421)
              0.07607691 = queryNorm
            3.1521115 = fieldWeight in 5573, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.405631 = idf(docFreq=26, maxDocs=44421)
              0.375 = fieldNorm(doc=5573)
        2.6571777 = weight(author_txt:chau in 5573) [ClassicSimilarity], result of:
          2.6571777 = score(doc=5573,freq=1.0), product of:
            0.7688124 = queryWeight, product of:
              1.0964746 = boost
              9.216561 = idf(docFreq=11, maxDocs=44421)
              0.07607691 = queryNorm
            3.4562106 = fieldWeight in 5573, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.216561 = idf(docFreq=11, maxDocs=44421)
              0.375 = fieldNorm(doc=5573)
    
  2. Chau, M.; Lu, Y.; Fang, X.; Yang, C.C.: Characteristics of character usage in Chinese Web searching (2009) 3.89
    3.8940601 = sum of:
      3.8940601 = sum of:
        1.6797458 = weight(author_txt:fang in 3456) [ClassicSimilarity], result of:
          1.6797458 = score(doc=3456,freq=1.0), product of:
            0.63947445 = queryWeight, product of:
              8.405631 = idf(docFreq=26, maxDocs=44421)
              0.07607691 = queryNorm
            2.6267598 = fieldWeight in 3456, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.405631 = idf(docFreq=26, maxDocs=44421)
              0.3125 = fieldNorm(doc=3456)
        2.2143145 = weight(author_txt:chau in 3456) [ClassicSimilarity], result of:
          2.2143145 = score(doc=3456,freq=1.0), product of:
            0.7688124 = queryWeight, product of:
              1.0964746 = boost
              9.216561 = idf(docFreq=11, maxDocs=44421)
              0.07607691 = queryNorm
            2.8801754 = fieldWeight in 3456, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.216561 = idf(docFreq=11, maxDocs=44421)
              0.3125 = fieldNorm(doc=3456)
    
  3. Chau, M.Y.: Finding order in a chaotic world : a model for organized research using the World Wide Web (1997) 2.21
    2.2143145 = sum of:
      2.2143145 = product of:
        4.428629 = sum of:
          4.428629 = weight(author_txt:chau in 1529) [ClassicSimilarity], result of:
            4.428629 = score(doc=1529,freq=1.0), product of:
              0.7688124 = queryWeight, product of:
                1.0964746 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.07607691 = queryNorm
              5.7603507 = fieldWeight in 1529, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.625 = fieldNorm(doc=1529)
        0.5 = coord(1/2)
    
  4. Chen, H.; Chau, M.: Web mining : machine learning for Web applications (2003) 1.77
    1.7714517 = sum of:
      1.7714517 = product of:
        3.5429034 = sum of:
          3.5429034 = weight(author_txt:chau in 5242) [ClassicSimilarity], result of:
            3.5429034 = score(doc=5242,freq=1.0), product of:
              0.7688124 = queryWeight, product of:
                1.0964746 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.07607691 = queryNorm
              4.6082807 = fieldWeight in 5242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.5 = fieldNorm(doc=5242)
        0.5 = coord(1/2)
    
  5. Fang, L.: ¬A developing search service : heterogeneous resources integration and retrieval system (2004) 1.68
    1.6797458 = sum of:
      1.6797458 = product of:
        3.3594916 = sum of:
          3.3594916 = weight(author_txt:fang in 2193) [ClassicSimilarity], result of:
            3.3594916 = score(doc=2193,freq=1.0), product of:
              0.63947445 = queryWeight, product of:
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.07607691 = queryNorm
              5.2535195 = fieldWeight in 2193, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.625 = fieldNorm(doc=2193)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Chau, M.; Lu, Y.; Fang, X.; Yang, C.C.: Characteristics of character usage in Chinese Web searching (2009) 1.24
    1.238504 = sum of:
      1.238504 = product of:
        1.7201445 = sum of:
          0.05766508 = weight(abstract_txt:characters in 3456) [ClassicSimilarity], result of:
            0.05766508 = score(doc=3456,freq=1.0), product of:
              0.12495177 = queryWeight, product of:
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.01692201 = queryNorm
              0.4614987 = fieldWeight in 3456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.03190303 = weight(abstract_txt:searching in 3456) [ClassicSimilarity], result of:
            0.03190303 = score(doc=3456,freq=2.0), product of:
              0.08420834 = queryWeight, product of:
                1.1609709 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.01692201 = queryNorm
              0.3788583 = fieldWeight in 3456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.015156159 = weight(abstract_txt:that in 3456) [ClassicSimilarity], result of:
            0.015156159 = score(doc=3456,freq=4.0), product of:
              0.0512696 = queryWeight, product of:
                1.2811168 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01692201 = queryNorm
              0.2956169 = fieldWeight in 3456, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.014072248 = weight(abstract_txt:have in 3456) [ClassicSimilarity], result of:
            0.014072248 = score(doc=3456,freq=1.0), product of:
              0.0703747 = queryWeight, product of:
                1.2998633 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.01692201 = queryNorm
              0.19996175 = fieldWeight in 3456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.033254113 = weight(abstract_txt:characteristics in 3456) [ClassicSimilarity], result of:
            0.033254113 = score(doc=3456,freq=1.0), product of:
              0.109070525 = queryWeight, product of:
                1.321288 = boost
                4.8781815 = idf(docFreq=918, maxDocs=44421)
                0.01692201 = queryNorm
              0.30488634 = fieldWeight in 3456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8781815 = idf(docFreq=918, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.059960764 = weight(abstract_txt:engines in 3456) [ClassicSimilarity], result of:
            0.059960764 = score(doc=3456,freq=2.0), product of:
              0.12824643 = queryWeight, product of:
                1.4327368 = boost
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.01692201 = queryNorm
              0.46754336 = fieldWeight in 3456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.020290213 = weight(abstract_txt:been in 3456) [ClassicSimilarity], result of:
            0.020290213 = score(doc=3456,freq=1.0), product of:
              0.089818396 = queryWeight, product of:
                1.4684936 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.01692201 = queryNorm
              0.22590263 = fieldWeight in 3456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.11227061 = weight(abstract_txt:character in 3456) [ClassicSimilarity], result of:
            0.11227061 = score(doc=3456,freq=2.0), product of:
              0.19482493 = queryWeight, product of:
                1.7658998 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.01692201 = queryNorm
              0.5762641 = fieldWeight in 3456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.08068564 = weight(abstract_txt:queries in 3456) [ClassicSimilarity], result of:
            0.08068564 = score(doc=3456,freq=2.0), product of:
              0.1789349 = queryWeight, product of:
                2.0727024 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.01692201 = queryNorm
              0.45092174 = fieldWeight in 3456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.07200702 = weight(abstract_txt:engine in 3456) [ClassicSimilarity], result of:
            0.07200702 = score(doc=3456,freq=1.0), product of:
              0.2089733 = queryWeight, product of:
                2.239932 = boost
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.01692201 = queryNorm
              0.34457523 = fieldWeight in 3456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.17738307 = weight(abstract_txt:kong in 3456) [ClassicSimilarity], result of:
            0.17738307 = score(doc=3456,freq=1.0), product of:
              0.33298033 = queryWeight, product of:
                2.3086233 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.01692201 = queryNorm
              0.53271335 = fieldWeight in 3456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.17738307 = weight(abstract_txt:hong in 3456) [ClassicSimilarity], result of:
            0.17738307 = score(doc=3456,freq=1.0), product of:
              0.33298033 = queryWeight, product of:
                2.3086233 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.01692201 = queryNorm
              0.53271335 = fieldWeight in 3456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.034701277 = weight(abstract_txt:analysis in 3456) [ClassicSimilarity], result of:
            0.034701277 = score(doc=3456,freq=1.0), product of:
              0.15229563 = queryWeight, product of:
                2.4686387 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.01692201 = queryNorm
              0.2278547 = fieldWeight in 3456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.2630328 = weight(abstract_txt:chinese in 3456) [ClassicSimilarity], result of:
            0.2630328 = score(doc=3456,freq=6.0), product of:
              0.2727703 = queryWeight, product of:
                2.5591042 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.01692201 = queryNorm
              0.96430147 = fieldWeight in 3456, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.06157633 = weight(abstract_txt:query in 3456) [ClassicSimilarity], result of:
            0.06157633 = score(doc=3456,freq=1.0), product of:
              0.20721905 = queryWeight, product of:
                2.5755715 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.01692201 = queryNorm
              0.29715574 = fieldWeight in 3456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.12987672 = weight(abstract_txt:logs in 3456) [ClassicSimilarity], result of:
            0.12987672 = score(doc=3456,freq=1.0), product of:
              0.30964336 = queryWeight, product of:
                2.726593 = boost
                6.7110353 = idf(docFreq=146, maxDocs=44421)
                0.01692201 = queryNorm
              0.4194397 = fieldWeight in 3456, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7110353 = idf(docFreq=146, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.1754556 = weight(abstract_txt:english in 3456) [ClassicSimilarity], result of:
            0.1754556 = score(doc=3456,freq=2.0), product of:
              0.3560891 = queryWeight, product of:
                3.7747927 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.01692201 = queryNorm
              0.4927295 = fieldWeight in 3456, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
          0.20347077 = weight(abstract_txt:search in 3456) [ClassicSimilarity], result of:
            0.20347077 = score(doc=3456,freq=7.0), product of:
              0.33669245 = queryWeight, product of:
                5.444297 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.01692201 = queryNorm
              0.6043223 = fieldWeight in 3456, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=3456)
        0.72 = coord(18/25)
    
  2. Chung, W.; Zhang, Y.; Huang, Z.; Wang, G.; Ong, T.-H.; Chen, H.: Internet searching and browsing in a multilingual world : an experiment an the Chinese Business Intelligence Portal (CBizPort) (2004) 0.60
    0.6037596 = sum of:
      0.6037596 = product of:
        1.2578325 = sum of:
          0.050443117 = weight(abstract_txt:searching in 3393) [ClassicSimilarity], result of:
            0.050443117 = score(doc=3393,freq=5.0), product of:
              0.08420834 = queryWeight, product of:
                1.1609709 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.01692201 = queryNorm
              0.5990276 = fieldWeight in 3393, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.0625 = fieldNorm(doc=3393)
          0.015156159 = weight(abstract_txt:that in 3393) [ClassicSimilarity], result of:
            0.015156159 = score(doc=3393,freq=4.0), product of:
              0.0512696 = queryWeight, product of:
                1.2811168 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01692201 = queryNorm
              0.2956169 = fieldWeight in 3393, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=3393)
          0.013858093 = weight(abstract_txt:based in 3393) [ClassicSimilarity], result of:
            0.013858093 = score(doc=3393,freq=1.0), product of:
              0.06965889 = queryWeight, product of:
                1.2932358 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.01692201 = queryNorm
              0.1989422 = fieldWeight in 3393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=3393)
          0.014072248 = weight(abstract_txt:have in 3393) [ClassicSimilarity], result of:
            0.014072248 = score(doc=3393,freq=1.0), product of:
              0.0703747 = queryWeight, product of:
                1.2998633 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.01692201 = queryNorm
              0.19996175 = fieldWeight in 3393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0625 = fieldNorm(doc=3393)
          0.08479733 = weight(abstract_txt:engines in 3393) [ClassicSimilarity], result of:
            0.08479733 = score(doc=3393,freq=4.0), product of:
              0.12824643 = queryWeight, product of:
                1.4327368 = boost
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.01692201 = queryNorm
              0.6612062 = fieldWeight in 3393, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.0625 = fieldNorm(doc=3393)
          0.07200702 = weight(abstract_txt:engine in 3393) [ClassicSimilarity], result of:
            0.07200702 = score(doc=3393,freq=1.0), product of:
              0.2089733 = queryWeight, product of:
                2.239932 = boost
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.01692201 = queryNorm
              0.34457523 = fieldWeight in 3393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.0625 = fieldNorm(doc=3393)
          0.17738307 = weight(abstract_txt:kong in 3393) [ClassicSimilarity], result of:
            0.17738307 = score(doc=3393,freq=1.0), product of:
              0.33298033 = queryWeight, product of:
                2.3086233 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.01692201 = queryNorm
              0.53271335 = fieldWeight in 3393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=3393)
          0.17738307 = weight(abstract_txt:hong in 3393) [ClassicSimilarity], result of:
            0.17738307 = score(doc=3393,freq=1.0), product of:
              0.33298033 = queryWeight, product of:
                2.3086233 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.01692201 = queryNorm
              0.53271335 = fieldWeight in 3393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=3393)
          0.034701277 = weight(abstract_txt:analysis in 3393) [ClassicSimilarity], result of:
            0.034701277 = score(doc=3393,freq=1.0), product of:
              0.15229563 = queryWeight, product of:
                2.4686387 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.01692201 = queryNorm
              0.2278547 = fieldWeight in 3393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=3393)
          0.21476535 = weight(abstract_txt:chinese in 3393) [ClassicSimilarity], result of:
            0.21476535 = score(doc=3393,freq=4.0), product of:
              0.2727703 = queryWeight, product of:
                2.5591042 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.01692201 = queryNorm
              0.7873488 = fieldWeight in 3393, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0625 = fieldNorm(doc=3393)
          0.21488835 = weight(abstract_txt:english in 3393) [ClassicSimilarity], result of:
            0.21488835 = score(doc=3393,freq=3.0), product of:
              0.3560891 = queryWeight, product of:
                3.7747927 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.01692201 = queryNorm
              0.60346794 = fieldWeight in 3393, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.0625 = fieldNorm(doc=3393)
          0.18837734 = weight(abstract_txt:search in 3393) [ClassicSimilarity], result of:
            0.18837734 = score(doc=3393,freq=6.0), product of:
              0.33669245 = queryWeight, product of:
                5.444297 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.01692201 = queryNorm
              0.5594938 = fieldWeight in 3393, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=3393)
        0.48 = coord(12/25)
    
  3. Ozmutlu, H.C.; Cavdur, F.; Ozmutlu, S.: Cross-validation of neural network applications for automatic new topic identification (2008) 0.47
    0.47292534 = sum of:
      0.47292534 = product of:
        1.0748303 = sum of:
          0.09360372 = weight(abstract_txt:excite in 2364) [ClassicSimilarity], result of:
            0.09360372 = score(doc=2364,freq=2.0), product of:
              0.13697855 = queryWeight, product of:
                1.0470202 = boost
                7.731176 = idf(docFreq=52, maxDocs=44421)
                0.01692201 = queryNorm
              0.68334585 = fieldWeight in 2364, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.731176 = idf(docFreq=52, maxDocs=44421)
                0.0625 = fieldNorm(doc=2364)
          0.01856243 = weight(abstract_txt:that in 2364) [ClassicSimilarity], result of:
            0.01856243 = score(doc=2364,freq=6.0), product of:
              0.0512696 = queryWeight, product of:
                1.2811168 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01692201 = queryNorm
              0.3620553 = fieldWeight in 2364, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2364)
          0.04702842 = weight(abstract_txt:characteristics in 2364) [ClassicSimilarity], result of:
            0.04702842 = score(doc=2364,freq=2.0), product of:
              0.109070525 = queryWeight, product of:
                1.321288 = boost
                4.8781815 = idf(docFreq=918, maxDocs=44421)
                0.01692201 = queryNorm
              0.4311744 = fieldWeight in 2364, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8781815 = idf(docFreq=918, maxDocs=44421)
                0.0625 = fieldNorm(doc=2364)
          0.03752642 = weight(abstract_txt:topics in 2364) [ClassicSimilarity], result of:
            0.03752642 = score(doc=2364,freq=1.0), product of:
              0.11822299 = queryWeight, product of:
                1.3756082 = boost
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.01692201 = queryNorm
              0.3174207 = fieldWeight in 2364, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.0625 = fieldNorm(doc=2364)
          0.059960764 = weight(abstract_txt:engines in 2364) [ClassicSimilarity], result of:
            0.059960764 = score(doc=2364,freq=2.0), product of:
              0.12824643 = queryWeight, product of:
                1.4327368 = boost
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.01692201 = queryNorm
              0.46754336 = fieldWeight in 2364, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.0625 = fieldNorm(doc=2364)
          0.04298498 = weight(abstract_txt:number in 2364) [ClassicSimilarity], result of:
            0.04298498 = score(doc=2364,freq=2.0), product of:
              0.11759136 = queryWeight, product of:
                1.6802624 = boost
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.01692201 = queryNorm
              0.36554542 = fieldWeight in 2364, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.0625 = fieldNorm(doc=2364)
          0.08606247 = weight(abstract_txt:sessions in 2364) [ClassicSimilarity], result of:
            0.08606247 = score(doc=2364,freq=1.0), product of:
              0.20559837 = queryWeight, product of:
                1.8140682 = boost
                6.697521 = idf(docFreq=148, maxDocs=44421)
                0.01692201 = queryNorm
              0.41859508 = fieldWeight in 2364, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.697521 = idf(docFreq=148, maxDocs=44421)
                0.0625 = fieldNorm(doc=2364)
          0.057053365 = weight(abstract_txt:queries in 2364) [ClassicSimilarity], result of:
            0.057053365 = score(doc=2364,freq=1.0), product of:
              0.1789349 = queryWeight, product of:
                2.0727024 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.01692201 = queryNorm
              0.31884983 = fieldWeight in 2364, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0625 = fieldNorm(doc=2364)
          0.17638047 = weight(abstract_txt:engine in 2364) [ClassicSimilarity], result of:
            0.17638047 = score(doc=2364,freq=6.0), product of:
              0.2089733 = queryWeight, product of:
                2.239932 = boost
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.01692201 = queryNorm
              0.84403354 = fieldWeight in 2364, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.0625 = fieldNorm(doc=2364)
          0.22495307 = weight(abstract_txt:logs in 2364) [ClassicSimilarity], result of:
            0.22495307 = score(doc=2364,freq=3.0), product of:
              0.30964336 = queryWeight, product of:
                2.726593 = boost
                6.7110353 = idf(docFreq=146, maxDocs=44421)
                0.01692201 = queryNorm
              0.72649086 = fieldWeight in 2364, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7110353 = idf(docFreq=146, maxDocs=44421)
                0.0625 = fieldNorm(doc=2364)
          0.23071416 = weight(abstract_txt:search in 2364) [ClassicSimilarity], result of:
            0.23071416 = score(doc=2364,freq=9.0), product of:
              0.33669245 = queryWeight, product of:
                5.444297 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.01692201 = queryNorm
              0.6852371 = fieldWeight in 2364, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=2364)
        0.44 = coord(11/25)
    
  4. Koshman, S.; Spink, A.; Jansen, B.J.: Web searching on the Vivisimo search engine (2006) 0.47
    0.46843675 = sum of:
      0.46843675 = product of:
        0.9008399 = sum of:
          0.039073072 = weight(abstract_txt:searching in 341) [ClassicSimilarity], result of:
            0.039073072 = score(doc=341,freq=3.0), product of:
              0.08420834 = queryWeight, product of:
                1.1609709 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.01692201 = queryNorm
              0.46400478 = fieldWeight in 341, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.0625 = fieldNorm(doc=341)
          0.015156159 = weight(abstract_txt:that in 341) [ClassicSimilarity], result of:
            0.015156159 = score(doc=341,freq=4.0), product of:
              0.0512696 = queryWeight, product of:
                1.2811168 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01692201 = queryNorm
              0.2956169 = fieldWeight in 341, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=341)
          0.013858093 = weight(abstract_txt:based in 341) [ClassicSimilarity], result of:
            0.013858093 = score(doc=341,freq=1.0), product of:
              0.06965889 = queryWeight, product of:
                1.2932358 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.01692201 = queryNorm
              0.1989422 = fieldWeight in 341, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=341)
          0.014072248 = weight(abstract_txt:have in 341) [ClassicSimilarity], result of:
            0.014072248 = score(doc=341,freq=1.0), product of:
              0.0703747 = queryWeight, product of:
                1.2998633 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.01692201 = queryNorm
              0.19996175 = fieldWeight in 341, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0625 = fieldNorm(doc=341)
          0.033254113 = weight(abstract_txt:characteristics in 341) [ClassicSimilarity], result of:
            0.033254113 = score(doc=341,freq=1.0), product of:
              0.109070525 = queryWeight, product of:
                1.321288 = boost
                4.8781815 = idf(docFreq=918, maxDocs=44421)
                0.01692201 = queryNorm
              0.30488634 = fieldWeight in 341, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8781815 = idf(docFreq=918, maxDocs=44421)
                0.0625 = fieldNorm(doc=341)
          0.03752642 = weight(abstract_txt:topics in 341) [ClassicSimilarity], result of:
            0.03752642 = score(doc=341,freq=1.0), product of:
              0.11822299 = queryWeight, product of:
                1.3756082 = boost
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.01692201 = queryNorm
              0.3174207 = fieldWeight in 341, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.0625 = fieldNorm(doc=341)
          0.020290213 = weight(abstract_txt:been in 341) [ClassicSimilarity], result of:
            0.020290213 = score(doc=341,freq=1.0), product of:
              0.089818396 = queryWeight, product of:
                1.4684936 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.01692201 = queryNorm
              0.22590263 = fieldWeight in 341, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.0625 = fieldNorm(doc=341)
          0.14906456 = weight(abstract_txt:sessions in 341) [ClassicSimilarity], result of:
            0.14906456 = score(doc=341,freq=3.0), product of:
              0.20559837 = queryWeight, product of:
                1.8140682 = boost
                6.697521 = idf(docFreq=148, maxDocs=44421)
                0.01692201 = queryNorm
              0.7250279 = fieldWeight in 341, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.697521 = idf(docFreq=148, maxDocs=44421)
                0.0625 = fieldNorm(doc=341)
          0.08068564 = weight(abstract_txt:queries in 341) [ClassicSimilarity], result of:
            0.08068564 = score(doc=341,freq=2.0), product of:
              0.1789349 = queryWeight, product of:
                2.0727024 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.01692201 = queryNorm
              0.45092174 = fieldWeight in 341, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0625 = fieldNorm(doc=341)
          0.14401405 = weight(abstract_txt:engine in 341) [ClassicSimilarity], result of:
            0.14401405 = score(doc=341,freq=4.0), product of:
              0.2089733 = queryWeight, product of:
                2.239932 = boost
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.01692201 = queryNorm
              0.68915045 = fieldWeight in 341, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.0625 = fieldNorm(doc=341)
          0.049075015 = weight(abstract_txt:analysis in 341) [ClassicSimilarity], result of:
            0.049075015 = score(doc=341,freq=2.0), product of:
              0.15229563 = queryWeight, product of:
                2.4686387 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.01692201 = queryNorm
              0.3222352 = fieldWeight in 341, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=341)
          0.06157633 = weight(abstract_txt:query in 341) [ClassicSimilarity], result of:
            0.06157633 = score(doc=341,freq=1.0), product of:
              0.20721905 = queryWeight, product of:
                2.5755715 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.01692201 = queryNorm
              0.29715574 = fieldWeight in 341, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=341)
          0.24319407 = weight(abstract_txt:search in 341) [ClassicSimilarity], result of:
            0.24319407 = score(doc=341,freq=10.0), product of:
              0.33669245 = queryWeight, product of:
                5.444297 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.01692201 = queryNorm
              0.72230333 = fieldWeight in 341, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=341)
        0.52 = coord(13/25)
    
  5. Pu, H.-T.; Chuang, S.-L.; Yang, C.: Subject categorization of query terms for exploring Web users' search interests (2002) 0.40
    0.40345287 = sum of:
      0.40345287 = product of:
        0.8405268 = sum of:
          0.03190303 = weight(abstract_txt:searching in 1587) [ClassicSimilarity], result of:
            0.03190303 = score(doc=1587,freq=2.0), product of:
              0.08420834 = queryWeight, product of:
                1.1609709 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.01692201 = queryNorm
              0.3788583 = fieldWeight in 1587, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.010717023 = weight(abstract_txt:that in 1587) [ClassicSimilarity], result of:
            0.010717023 = score(doc=1587,freq=2.0), product of:
              0.0512696 = queryWeight, product of:
                1.2811168 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01692201 = queryNorm
              0.20903271 = fieldWeight in 1587, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.013858093 = weight(abstract_txt:based in 1587) [ClassicSimilarity], result of:
            0.013858093 = score(doc=1587,freq=1.0), product of:
              0.06965889 = queryWeight, product of:
                1.2932358 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.01692201 = queryNorm
              0.1989422 = fieldWeight in 1587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.03752642 = weight(abstract_txt:topics in 1587) [ClassicSimilarity], result of:
            0.03752642 = score(doc=1587,freq=1.0), product of:
              0.11822299 = queryWeight, product of:
                1.3756082 = boost
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.01692201 = queryNorm
              0.3174207 = fieldWeight in 1587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.042398665 = weight(abstract_txt:engines in 1587) [ClassicSimilarity], result of:
            0.042398665 = score(doc=1587,freq=1.0), product of:
              0.12824643 = queryWeight, product of:
                1.4327368 = boost
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.01692201 = queryNorm
              0.3306031 = fieldWeight in 1587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.020290213 = weight(abstract_txt:been in 1587) [ClassicSimilarity], result of:
            0.020290213 = score(doc=1587,freq=1.0), product of:
              0.089818396 = queryWeight, product of:
                1.4684936 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.01692201 = queryNorm
              0.22590263 = fieldWeight in 1587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.08068564 = weight(abstract_txt:queries in 1587) [ClassicSimilarity], result of:
            0.08068564 = score(doc=1587,freq=2.0), product of:
              0.1789349 = queryWeight, product of:
                2.0727024 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.01692201 = queryNorm
              0.45092174 = fieldWeight in 1587, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.07200702 = weight(abstract_txt:engine in 1587) [ClassicSimilarity], result of:
            0.07200702 = score(doc=1587,freq=1.0), product of:
              0.2089733 = queryWeight, product of:
                2.239932 = boost
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.01692201 = queryNorm
              0.34457523 = fieldWeight in 1587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.060104374 = weight(abstract_txt:analysis in 1587) [ClassicSimilarity], result of:
            0.060104374 = score(doc=1587,freq=3.0), product of:
              0.15229563 = queryWeight, product of:
                2.4686387 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.01692201 = queryNorm
              0.3946559 = fieldWeight in 1587, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.13768886 = weight(abstract_txt:query in 1587) [ClassicSimilarity], result of:
            0.13768886 = score(doc=1587,freq=5.0), product of:
              0.20721905 = queryWeight, product of:
                2.5755715 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.01692201 = queryNorm
              0.6644604 = fieldWeight in 1587, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.12987672 = weight(abstract_txt:logs in 1587) [ClassicSimilarity], result of:
            0.12987672 = score(doc=1587,freq=1.0), product of:
              0.30964336 = queryWeight, product of:
                2.726593 = boost
                6.7110353 = idf(docFreq=146, maxDocs=44421)
                0.01692201 = queryNorm
              0.4194397 = fieldWeight in 1587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7110353 = idf(docFreq=146, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.20347077 = weight(abstract_txt:search in 1587) [ClassicSimilarity], result of:
            0.20347077 = score(doc=1587,freq=7.0), product of:
              0.33669245 = queryWeight, product of:
                5.444297 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.01692201 = queryNorm
              0.6043223 = fieldWeight in 1587, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
        0.48 = coord(12/25)