Document (#32608)

Author
Liu, Y.
Zhang, M.
Cen, R.
Ru, L.
Ma, S.
Title
Data cleansing for Web information retrieval using query independent features
Source
Journal of the American Society for Information Science and Technology. 58(2007) no.12, S.1884-1898
Year
2007
Abstract
Understanding what kinds of Web pages are the most useful for Web search engine users is a critical task in Web information retrieval (IR). Most previous works used hyperlink analysis algorithms to solve this problem. However, little research has been focused on query-independent Web data cleansing for Web IR. In this paper, we first provide analysis of the differences between retrieval target pages and ordinary ones based on more than 30 million Web pages obtained from both the Text Retrieval Conference (TREC) and a widely used Chinese search engine, SOGOU (www.sogou.com). We further propose a learning-based data cleansing algorithm for reducing Web pages that are unlikely to be useful for user requests. We found that there exists a large proportion of low-quality Web pages in both the English and the Chinese Web page corpus, and retrieval target pages can be identified using query-independent features and cleansing algorithms. The experimental results showed that our algorithm is effective in reducing a large portion of Web pages with a small loss in retrieval target pages. It makes it possible for Web IR tools to meet a large fraction of users' needs with only a small part of pages on the Web. These results may help Web search engines make better use of their limited storage and computation resources to improve search performance.
Footnote
Beitrag eines Themenschwerpunktes "Mining Web resources for enhancing information retrieval"
Theme
Data Mining
Suchmaschinen
Object
WWW

Similar documents (author)

  1. Zhang, M.; Zhang, Y.: Professional organizations in Twittersphere : an empirical study of U.S. library and information science professional organizations-related Tweets (2020) 4.53
    4.5277104 = sum of:
      4.5277104 = weight(author_txt:zhang in 775) [ClassicSimilarity], result of:
        4.5277104 = score(doc=775,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.15617312 = queryNorm
          4.527711 = fieldWeight in 775, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.5 = fieldNorm(doc=775)
    
  2. Zhang, Y.; Zhang, C.: Enhancing keyphrase extraction from microblogs using human reading time (2021) 4.53
    4.5277104 = sum of:
      4.5277104 = weight(author_txt:zhang in 1238) [ClassicSimilarity], result of:
        4.5277104 = score(doc=1238,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.15617312 = queryNorm
          4.527711 = fieldWeight in 1238, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.5 = fieldNorm(doc=1238)
    
  3. Zhang, J.: TOFIR: A tool of facilitating information retrieval : introduce a visual retrieval model (2001) 4.00
    4.0019684 = sum of:
      4.0019684 = weight(author_txt:zhang in 7710) [ClassicSimilarity], result of:
        4.0019684 = score(doc=7710,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.15617312 = queryNorm
          4.001969 = fieldWeight in 7710, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.625 = fieldNorm(doc=7710)
    
  4. Zhang, A.: Multimedia file formats on the Internet : a beginner's guide for PC users (1995) 4.00
    4.0019684 = sum of:
      4.0019684 = weight(author_txt:zhang in 3280) [ClassicSimilarity], result of:
        4.0019684 = score(doc=3280,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.15617312 = queryNorm
          4.001969 = fieldWeight in 3280, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.625 = fieldNorm(doc=3280)
    
  5. Zhang, J.: ¬A representational analysis of relational information displays (1996) 4.00
    4.0019684 = sum of:
      4.0019684 = weight(author_txt:zhang in 6471) [ClassicSimilarity], result of:
        4.0019684 = score(doc=6471,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.15617312 = queryNorm
          4.001969 = fieldWeight in 6471, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.625 = fieldNorm(doc=6471)
    

Similar documents (content)

  1. Souza, J.; Carvalho, A.; Cristo, M.; Moura, E.; Calado, P.; Chirita, P.-A.; Nejdl, W.: Using site-level connections to estimate link confidence (2012) 0.26
    0.25907022 = sum of:
      0.25907022 = product of:
        0.6476755 = sum of:
          0.01332976 = weight(abstract_txt:most in 1498) [ClassicSimilarity], result of:
            0.01332976 = score(doc=1498,freq=1.0), product of:
              0.054099698 = queryWeight, product of:
                1.0677928 = boost
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.012851695 = queryNorm
              0.2463925 = fieldWeight in 1498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.0625 = fieldNorm(doc=1498)
          0.02880321 = weight(abstract_txt:features in 1498) [ClassicSimilarity], result of:
            0.02880321 = score(doc=1498,freq=2.0), product of:
              0.071767956 = queryWeight, product of:
                1.2298577 = boost
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.012851695 = queryNorm
              0.40133804 = fieldWeight in 1498, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.0625 = fieldNorm(doc=1498)
          0.03645793 = weight(abstract_txt:engine in 1498) [ClassicSimilarity], result of:
            0.03645793 = score(doc=1498,freq=1.0), product of:
              0.105805434 = queryWeight, product of:
                1.4932879 = boost
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.012851695 = queryNorm
              0.34457523 = fieldWeight in 1498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.0625 = fieldNorm(doc=1498)
          0.04029179 = weight(abstract_txt:algorithms in 1498) [ClassicSimilarity], result of:
            0.04029179 = score(doc=1498,freq=1.0), product of:
              0.11309871 = queryWeight, product of:
                1.5438973 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.012851695 = queryNorm
              0.3562533 = fieldWeight in 1498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.0625 = fieldNorm(doc=1498)
          0.040397126 = weight(abstract_txt:algorithm in 1498) [ClassicSimilarity], result of:
            0.040397126 = score(doc=1498,freq=1.0), product of:
              0.11329575 = queryWeight, product of:
                1.5452415 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.012851695 = queryNorm
              0.35656348 = fieldWeight in 1498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0625 = fieldNorm(doc=1498)
          0.028609475 = weight(abstract_txt:large in 1498) [ClassicSimilarity], result of:
            0.028609475 = score(doc=1498,freq=1.0), product of:
              0.10304264 = queryWeight, product of:
                1.8048607 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.012851695 = queryNorm
              0.27764696 = fieldWeight in 1498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0625 = fieldNorm(doc=1498)
          0.04960192 = weight(abstract_txt:query in 1498) [ClassicSimilarity], result of:
            0.04960192 = score(doc=1498,freq=2.0), product of:
              0.1180319 = queryWeight, product of:
                1.9316787 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.012851695 = queryNorm
              0.42024165 = fieldWeight in 1498, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=1498)
          0.047491267 = weight(abstract_txt:search in 1498) [ClassicSimilarity], result of:
            0.047491267 = score(doc=1498,freq=5.0), product of:
              0.09298419 = queryWeight, product of:
                1.9797444 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.012851695 = queryNorm
              0.5107456 = fieldWeight in 1498, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=1498)
          0.06399705 = weight(abstract_txt:independent in 1498) [ClassicSimilarity], result of:
            0.06399705 = score(doc=1498,freq=1.0), product of:
              0.1762451 = queryWeight, product of:
                2.360444 = boost
                5.8098235 = idf(docFreq=361, maxDocs=44421)
                0.012851695 = queryNorm
              0.36311397 = fieldWeight in 1498, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8098235 = idf(docFreq=361, maxDocs=44421)
                0.0625 = fieldNorm(doc=1498)
          0.29869604 = weight(abstract_txt:pages in 1498) [ClassicSimilarity], result of:
            0.29869604 = score(doc=1498,freq=3.0), product of:
              0.49222466 = queryWeight, product of:
                6.8324666 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.012851695 = queryNorm
              0.6068287 = fieldWeight in 1498, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.0625 = fieldNorm(doc=1498)
        0.4 = coord(10/25)
    
  2. Wang, F.L.; Yang, C.C.: Mining Web data for Chinese segmentation (2007) 0.18
    0.1841865 = sum of:
      0.1841865 = product of:
        0.51162916 = sum of:
          0.01332976 = weight(abstract_txt:most in 1604) [ClassicSimilarity], result of:
            0.01332976 = score(doc=1604,freq=1.0), product of:
              0.054099698 = queryWeight, product of:
                1.0677928 = boost
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.012851695 = queryNorm
              0.2463925 = fieldWeight in 1604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.017045395 = weight(abstract_txt:data in 1604) [ClassicSimilarity], result of:
            0.017045395 = score(doc=1604,freq=2.0), product of:
              0.057907984 = queryWeight, product of:
                1.3530207 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.012851695 = queryNorm
              0.29435313 = fieldWeight in 1604, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.06978742 = weight(abstract_txt:algorithms in 1604) [ClassicSimilarity], result of:
            0.06978742 = score(doc=1604,freq=3.0), product of:
              0.11309871 = queryWeight, product of:
                1.5438973 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.012851695 = queryNorm
              0.6170488 = fieldWeight in 1604, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.098952346 = weight(abstract_txt:algorithm in 1604) [ClassicSimilarity], result of:
            0.098952346 = score(doc=1604,freq=6.0), product of:
              0.11329575 = queryWeight, product of:
                1.5452415 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.012851695 = queryNorm
              0.8733986 = fieldWeight in 1604, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.14384687 = weight(abstract_txt:chinese in 1604) [ClassicSimilarity], result of:
            0.14384687 = score(doc=1604,freq=7.0), product of:
              0.13810652 = queryWeight, product of:
                1.7060694 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.012851695 = queryNorm
              1.0415646 = fieldWeight in 1604, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.04045991 = weight(abstract_txt:large in 1604) [ClassicSimilarity], result of:
            0.04045991 = score(doc=1604,freq=2.0), product of:
              0.10304264 = queryWeight, product of:
                1.8048607 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.012851695 = queryNorm
              0.3926521 = fieldWeight in 1604, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.03678658 = weight(abstract_txt:search in 1604) [ClassicSimilarity], result of:
            0.03678658 = score(doc=1604,freq=3.0), product of:
              0.09298419 = queryWeight, product of:
                1.9797444 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.012851695 = queryNorm
              0.39562184 = fieldWeight in 1604, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.06399705 = weight(abstract_txt:independent in 1604) [ClassicSimilarity], result of:
            0.06399705 = score(doc=1604,freq=1.0), product of:
              0.1762451 = queryWeight, product of:
                2.360444 = boost
                5.8098235 = idf(docFreq=361, maxDocs=44421)
                0.012851695 = queryNorm
              0.36311397 = fieldWeight in 1604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8098235 = idf(docFreq=361, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.02742382 = weight(abstract_txt:retrieval in 1604) [ClassicSimilarity], result of:
            0.02742382 = score(doc=1604,freq=1.0), product of:
              0.12621346 = queryWeight, product of:
                2.8248997 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.012851695 = queryNorm
              0.21728125 = fieldWeight in 1604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
        0.36 = coord(9/25)
    
  3. Thelwall, M.; Vaughan, L.: New versions of PageRank employing alternative Web document models (2004) 0.17
    0.16638993 = sum of:
      0.16638993 = product of:
        0.5942497 = sum of:
          0.01332976 = weight(abstract_txt:most in 799) [ClassicSimilarity], result of:
            0.01332976 = score(doc=799,freq=1.0), product of:
              0.054099698 = queryWeight, product of:
                1.0677928 = boost
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.012851695 = queryNorm
              0.2463925 = fieldWeight in 799, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.0625 = fieldNorm(doc=799)
          0.03645793 = weight(abstract_txt:engine in 799) [ClassicSimilarity], result of:
            0.03645793 = score(doc=799,freq=1.0), product of:
              0.105805434 = queryWeight, product of:
                1.4932879 = boost
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.012851695 = queryNorm
              0.34457523 = fieldWeight in 799, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.0625 = fieldNorm(doc=799)
          0.06978742 = weight(abstract_txt:algorithms in 799) [ClassicSimilarity], result of:
            0.06978742 = score(doc=799,freq=3.0), product of:
              0.11309871 = queryWeight, product of:
                1.5438973 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.012851695 = queryNorm
              0.6170488 = fieldWeight in 799, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.0625 = fieldNorm(doc=799)
          0.040397126 = weight(abstract_txt:algorithm in 799) [ClassicSimilarity], result of:
            0.040397126 = score(doc=799,freq=1.0), product of:
              0.11329575 = queryWeight, product of:
                1.5452415 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.012851695 = queryNorm
              0.35656348 = fieldWeight in 799, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0625 = fieldNorm(doc=799)
          0.02123874 = weight(abstract_txt:search in 799) [ClassicSimilarity], result of:
            0.02123874 = score(doc=799,freq=1.0), product of:
              0.09298419 = queryWeight, product of:
                1.9797444 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.012851695 = queryNorm
              0.22841237 = fieldWeight in 799, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=799)
          0.02742382 = weight(abstract_txt:retrieval in 799) [ClassicSimilarity], result of:
            0.02742382 = score(doc=799,freq=1.0), product of:
              0.12621346 = queryWeight, product of:
                2.8248997 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.012851695 = queryNorm
              0.21728125 = fieldWeight in 799, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=799)
          0.38561493 = weight(abstract_txt:pages in 799) [ClassicSimilarity], result of:
            0.38561493 = score(doc=799,freq=5.0), product of:
              0.49222466 = queryWeight, product of:
                6.8324666 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.012851695 = queryNorm
              0.78341246 = fieldWeight in 799, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.0625 = fieldNorm(doc=799)
        0.28 = coord(7/25)
    
  4. Austin, D.: How Google finds your needle in the Web's haystack : as we'll see, the trick is to ask the web itself to rank the importance of pages... (2006) 0.16
    0.16478689 = sum of:
      0.16478689 = product of:
        0.5885246 = sum of:
          0.0166622 = weight(abstract_txt:most in 218) [ClassicSimilarity], result of:
            0.0166622 = score(doc=218,freq=4.0), product of:
              0.054099698 = queryWeight, product of:
                1.0677928 = boost
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.012851695 = queryNorm
              0.30799064 = fieldWeight in 218, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.015384586 = weight(abstract_txt:useful in 218) [ClassicSimilarity], result of:
            0.015384586 = score(doc=218,freq=1.0), product of:
              0.081429884 = queryWeight, product of:
                1.3100307 = boost
                4.83662 = idf(docFreq=957, maxDocs=44421)
                0.012851695 = queryNorm
              0.18893047 = fieldWeight in 218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.83662 = idf(docFreq=957, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.03946687 = weight(abstract_txt:engine in 218) [ClassicSimilarity], result of:
            0.03946687 = score(doc=218,freq=3.0), product of:
              0.105805434 = queryWeight, product of:
                1.4932879 = boost
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.012851695 = queryNorm
              0.37301362 = fieldWeight in 218, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.035706352 = weight(abstract_txt:algorithm in 218) [ClassicSimilarity], result of:
            0.035706352 = score(doc=218,freq=2.0), product of:
              0.11329575 = queryWeight, product of:
                1.5452415 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.012851695 = queryNorm
              0.31516057 = fieldWeight in 218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.017880922 = weight(abstract_txt:large in 218) [ClassicSimilarity], result of:
            0.017880922 = score(doc=218,freq=1.0), product of:
              0.10304264 = queryWeight, product of:
                1.8048607 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.012851695 = queryNorm
              0.17352936 = fieldWeight in 218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.04598322 = weight(abstract_txt:search in 218) [ClassicSimilarity], result of:
            0.04598322 = score(doc=218,freq=12.0), product of:
              0.09298419 = queryWeight, product of:
                1.9797444 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.012851695 = queryNorm
              0.49452728 = fieldWeight in 218, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.4174404 = weight(abstract_txt:pages in 218) [ClassicSimilarity], result of:
            0.4174404 = score(doc=218,freq=15.0), product of:
              0.49222466 = queryWeight, product of:
                6.8324666 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.012851695 = queryNorm
              0.8480689 = fieldWeight in 218, product of:
                3.8729835 = tf(freq=15.0), with freq of:
                  15.0 = termFreq=15.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
        0.28 = coord(7/25)
    
  5. Lawrence, S.; Giles, C.L.: Inquirus, the NECI meta search engine (1998) 0.16
    0.1625048 = sum of:
      0.1625048 = product of:
        0.6771034 = sum of:
          0.01799237 = weight(abstract_txt:both in 4604) [ClassicSimilarity], result of:
            0.01799237 = score(doc=4604,freq=1.0), product of:
              0.050424863 = queryWeight, product of:
                1.030889 = boost
                3.8060317 = idf(docFreq=2684, maxDocs=44421)
                0.012851695 = queryNorm
              0.35681546 = fieldWeight in 4604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8060317 = idf(docFreq=2684, maxDocs=44421)
                0.09375 = fieldNorm(doc=4604)
          0.036923006 = weight(abstract_txt:useful in 4604) [ClassicSimilarity], result of:
            0.036923006 = score(doc=4604,freq=1.0), product of:
              0.081429884 = queryWeight, product of:
                1.3100307 = boost
                4.83662 = idf(docFreq=957, maxDocs=44421)
                0.012851695 = queryNorm
              0.4534331 = fieldWeight in 4604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.83662 = idf(docFreq=957, maxDocs=44421)
                0.09375 = fieldNorm(doc=4604)
          0.0546869 = weight(abstract_txt:engine in 4604) [ClassicSimilarity], result of:
            0.0546869 = score(doc=4604,freq=1.0), product of:
              0.105805434 = queryWeight, product of:
                1.4932879 = boost
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.012851695 = queryNorm
              0.51686287 = fieldWeight in 4604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.09375 = fieldNorm(doc=4604)
          0.07440288 = weight(abstract_txt:query in 4604) [ClassicSimilarity], result of:
            0.07440288 = score(doc=4604,freq=2.0), product of:
              0.1180319 = queryWeight, product of:
                1.9316787 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.012851695 = queryNorm
              0.6303625 = fieldWeight in 4604, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.09375 = fieldNorm(doc=4604)
          0.04505417 = weight(abstract_txt:search in 4604) [ClassicSimilarity], result of:
            0.04505417 = score(doc=4604,freq=2.0), product of:
              0.09298419 = queryWeight, product of:
                1.9797444 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.012851695 = queryNorm
              0.4845358 = fieldWeight in 4604, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.09375 = fieldNorm(doc=4604)
          0.44804406 = weight(abstract_txt:pages in 4604) [ClassicSimilarity], result of:
            0.44804406 = score(doc=4604,freq=3.0), product of:
              0.49222466 = queryWeight, product of:
                6.8324666 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.012851695 = queryNorm
              0.91024303 = fieldWeight in 4604, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.09375 = fieldNorm(doc=4604)
        0.24 = coord(6/25)