Document (#37438)

Author
Blanco, L.
Bronzi, M.
Crescenzi, V.
Merialdo, P.
Papotti, P.
Title
Flint: from Web pages to probabilistic semantic data
Source
Semantic search over the Web. Eds.: R. De Virgilio et al.
Imprint
Berlin : Springer
Year
2012
Pages
S.333-359
Series
Data-centric systems and applications
Abstract
The Web is a surprisingly extensive source of information: it offers a huge number of sites containing data about a disparate range of topics. Although Web pages are built for human consumption, not for automatic processing of the data, we observe that an increasing number of Web sites deliver pages containing structured information about recognizable concepts, relevant to specific application domains, such as movies, finance, sport, products, etc. The development of scalable techniques to discover, extract, and integrate data from fairly structured large corpora available on the Web is a challenging issue because, to cope with Web scale, these activities should be accomplished automatically by domain-independent techniques. To cope with the complexity and the heterogeneity of Web data, state-of-the-art approaches focus on information organized according to specific patterns that frequently occur on the Web. Meaningful examples are WebTables, which focuses on data published in HTML tables, and information extraction systems, such as TextRunner, which exploits lexical-syntactic patterns. As noticed by Cafarella et al., even if only a small fraction of the Web is organized according to these patterns, due to the Web scale, the amount of data involved is impressive. In this chapter, we focus on methods and techniques to wring out value from the data delivered by large data-intensive Web sites.
Theme
Semantic Web
Object
Flint

Similar documents (author)

  1. Blanco, E. González- => González-Blanco, E.: 4.89
    4.8878193 = sum of:
      4.8878193 = weight(author_txt:blanco in 1095) [ClassicSimilarity], result of:
        4.8878193 = fieldWeight in 1095, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.375 = fieldNorm(doc=1095)
    
  2. Blanco, E. González- => González-Blanco, E.: 4.89
    4.8878193 = sum of:
      4.8878193 = weight(author_txt:blanco in 1468) [ClassicSimilarity], result of:
        4.8878193 = fieldWeight in 1468, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.375 = fieldNorm(doc=1468)
    
  3. Blanco, E.; Moldovan, D.: ¬A model for composing semantic relations (2011) 4.61
    4.6082807 = sum of:
      4.6082807 = weight(author_txt:blanco in 762) [ClassicSimilarity], result of:
        4.6082807 = fieldWeight in 762, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.5 = fieldNorm(doc=762)
    
  4. Blanco, E.; Cankaya, H.C.; Moldovan, D.: Composition of semantic relations : model and applications (2010) 3.46
    3.4562106 = sum of:
      3.4562106 = weight(author_txt:blanco in 761) [ClassicSimilarity], result of:
        3.4562106 = fieldWeight in 761, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.375 = fieldNorm(doc=761)
    
  5. Blanco, R.; Matthews, M.; Mika, P.: Ranking of daily deals with concept expansion (2015) 3.46
    3.4562106 = sum of:
      3.4562106 = weight(author_txt:blanco in 3663) [ClassicSimilarity], result of:
        3.4562106 = fieldWeight in 3663, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.375 = fieldNorm(doc=3663)
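
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming the classic TF-IDF formulation tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which matches the printed values to the displayed precision. The function names are illustrative and are not part of any library API; fieldNorm is taken as given from the listing.

```python
import math

def classic_tf(freq: float) -> float:
    # Assumed term-frequency factor: square root of the raw frequency.
    return math.sqrt(freq)

def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Assumed inverse document frequency: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as shown in the explain trees.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values copied from the listing (term "blanco": docFreq=11, maxDocs=44421).
w_entry_1 = field_weight(2.0, 11, 44421, 0.375)  # entries 1 and 2
w_entry_3 = field_weight(1.0, 11, 44421, 0.5)    # entry 3
w_entry_4 = field_weight(1.0, 11, 44421, 0.375)  # entries 4 and 5

for label, value in [("1/2", w_entry_1), ("3", w_entry_3), ("4/5", w_entry_4)]:
    print(f"entry {label}: {value:.4f}")
```

Entries 1 and 2 outrank entry 3 although their fieldNorm is lower, because the term occurs twice in their author field (tf = sqrt(2)).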
    

Similar documents (content)

  1. Bizer, C.; Mendes, P.N.; Jentzsch, A.: Topology of the Web of Data (2012) 0.23
    0.22799711 = sum of:
      0.22799711 = product of:
        0.712491 = sum of:
          0.020214045 = weight(abstract_txt:from in 1425) [ClassicSimilarity], result of:
            0.020214045 = score(doc=1425,freq=4.0), product of:
              0.06697623 = queryWeight, product of:
                1.0425589 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.023281211 = queryNorm
              0.30180925 = fieldWeight in 1425, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1425)
          0.025599632 = weight(abstract_txt:specific in 1425) [ClassicSimilarity], result of:
            0.025599632 = score(doc=1425,freq=1.0), product of:
              0.10871695 = queryWeight, product of:
                1.0845343 = boost
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.023281211 = queryNorm
              0.23547049 = fieldWeight in 1425, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1425)
          0.009077621 = weight(abstract_txt:information in 1425) [ClassicSimilarity], result of:
            0.009077621 = score(doc=1425,freq=1.0), product of:
              0.06862243 = queryWeight, product of:
                1.2185482 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.023281211 = queryNorm
              0.13228357 = fieldWeight in 1425, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1425)
          0.11471886 = weight(abstract_txt:structured in 1425) [ClassicSimilarity], result of:
            0.11471886 = score(doc=1425,freq=5.0), product of:
              0.1728123 = queryWeight, product of:
                1.3673571 = boost
                5.428591 = idf(docFreq=529, maxDocs=44421)
                0.023281211 = queryNorm
              0.66383505 = fieldWeight in 1425, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.428591 = idf(docFreq=529, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1425)
          0.044779904 = weight(abstract_txt:techniques in 1425) [ClassicSimilarity], result of:
            0.044779904 = score(doc=1425,freq=1.0), product of:
              0.18067329 = queryWeight, product of:
                1.7123291 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.023281211 = queryNorm
              0.24785016 = fieldWeight in 1425, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1425)
          0.15185958 = weight(abstract_txt:sites in 1425) [ClassicSimilarity], result of:
            0.15185958 = score(doc=1425,freq=4.0), product of:
              0.2569094 = queryWeight, product of:
                2.0418804 = boost
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.023281211 = queryNorm
              0.5911017 = fieldWeight in 1425, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1425)
          0.16946737 = weight(abstract_txt:pages in 1425) [ClassicSimilarity], result of:
            0.16946737 = score(doc=1425,freq=4.0), product of:
              0.2764029 = queryWeight, product of:
                2.1179297 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.023281211 = queryNorm
              0.6131172 = fieldWeight in 1425, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1425)
          0.17677397 = weight(abstract_txt:data in 1425) [ClassicSimilarity], result of:
            0.17677397 = score(doc=1425,freq=11.0), product of:
              0.29265788 = queryWeight, product of:
                3.7746875 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.023281211 = queryNorm
              0.6040294 = fieldWeight in 1425, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1425)
        0.32 = coord(8/25)
    
  2. Brunetti, J.M.; Roberto García, R.: User-centered design and evaluation of overview components for semantic data exploration (2014) 0.22
    0.215969 = sum of:
      0.215969 = product of:
        0.5399225 = sum of:
          0.015005036 = weight(abstract_txt:from in 2626) [ClassicSimilarity], result of:
            0.015005036 = score(doc=2626,freq=3.0), product of:
              0.06697623 = queryWeight, product of:
                1.0425589 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.023281211 = queryNorm
              0.22403526 = fieldWeight in 2626, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.046875 = fieldNorm(doc=2626)
          0.034079675 = weight(abstract_txt:large in 2626) [ClassicSimilarity], result of:
            0.034079675 = score(doc=2626,freq=2.0), product of:
              0.11572474 = queryWeight, product of:
                1.1189425 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.023281211 = queryNorm
              0.2944891 = fieldWeight in 2626, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.046875 = fieldNorm(doc=2626)
          0.017398436 = weight(abstract_txt:information in 2626) [ClassicSimilarity], result of:
            0.017398436 = score(doc=2626,freq=5.0), product of:
              0.06862243 = queryWeight, product of:
                1.2185482 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.023281211 = queryNorm
              0.2535386 = fieldWeight in 2626, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.046875 = fieldNorm(doc=2626)
          0.03146786 = weight(abstract_txt:focus in 2626) [ClassicSimilarity], result of:
            0.03146786 = score(doc=2626,freq=1.0), product of:
              0.13825603 = queryWeight, product of:
                1.2230288 = boost
                4.855588 = idf(docFreq=939, maxDocs=44421)
                0.023281211 = queryNorm
              0.22760569 = fieldWeight in 2626, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.855588 = idf(docFreq=939, maxDocs=44421)
                0.046875 = fieldNorm(doc=2626)
          0.043974716 = weight(abstract_txt:structured in 2626) [ClassicSimilarity], result of:
            0.043974716 = score(doc=2626,freq=1.0), product of:
              0.1728123 = queryWeight, product of:
                1.3673571 = boost
                5.428591 = idf(docFreq=529, maxDocs=44421)
                0.023281211 = queryNorm
              0.2544652 = fieldWeight in 2626, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.428591 = idf(docFreq=529, maxDocs=44421)
                0.046875 = fieldNorm(doc=2626)
          0.044964533 = weight(abstract_txt:scale in 2626) [ClassicSimilarity], result of:
            0.044964533 = score(doc=2626,freq=1.0), product of:
              0.17539586 = queryWeight, product of:
                1.3775403 = boost
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.023281211 = queryNorm
              0.2563603 = fieldWeight in 2626, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.046875 = fieldNorm(doc=2626)
          0.038382772 = weight(abstract_txt:techniques in 2626) [ClassicSimilarity], result of:
            0.038382772 = score(doc=2626,freq=1.0), product of:
              0.18067329 = queryWeight, product of:
                1.7123291 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.023281211 = queryNorm
              0.212443 = fieldWeight in 2626, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.046875 = fieldNorm(doc=2626)
          0.06508268 = weight(abstract_txt:sites in 2626) [ClassicSimilarity], result of:
            0.06508268 = score(doc=2626,freq=1.0), product of:
              0.2569094 = queryWeight, product of:
                2.0418804 = boost
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.023281211 = queryNorm
              0.2533293 = fieldWeight in 2626, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.046875 = fieldNorm(doc=2626)
          0.07262887 = weight(abstract_txt:pages in 2626) [ClassicSimilarity], result of:
            0.07262887 = score(doc=2626,freq=1.0), product of:
              0.2764029 = queryWeight, product of:
                2.1179297 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.023281211 = queryNorm
              0.2627645 = fieldWeight in 2626, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.046875 = fieldNorm(doc=2626)
          0.17693788 = weight(abstract_txt:data in 2626) [ClassicSimilarity], result of:
            0.17693788 = score(doc=2626,freq=15.0), product of:
              0.29265788 = queryWeight, product of:
                3.7746875 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.023281211 = queryNorm
              0.60458946 = fieldWeight in 2626, product of:
                3.8729835 = tf(freq=15.0), with freq of:
                  15.0 = termFreq=15.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.046875 = fieldNorm(doc=2626)
        0.4 = coord(10/25)
    
  3. Brin, S.; Page, L.: ¬The anatomy of a large-scale hypertextual Web search engine (1998) 0.16
    0.15684468 = sum of:
      0.15684468 = product of:
        0.49013963 = sum of:
          0.025925046 = weight(abstract_txt:number in 1947) [ClassicSimilarity], result of:
            0.025925046 = score(doc=1947,freq=1.0), product of:
              0.1002982 = queryWeight, product of:
                1.0416965 = boost
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.023281211 = queryNorm
              0.25847965 = fieldWeight in 1947, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.0625 = fieldNorm(doc=1947)
          0.016335415 = weight(abstract_txt:from in 1947) [ClassicSimilarity], result of:
            0.016335415 = score(doc=1947,freq=2.0), product of:
              0.06697623 = queryWeight, product of:
                1.0425589 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.023281211 = queryNorm
              0.2438987 = fieldWeight in 1947, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=1947)
          0.06426124 = weight(abstract_txt:large in 1947) [ClassicSimilarity], result of:
            0.06426124 = score(doc=1947,freq=4.0), product of:
              0.11572474 = queryWeight, product of:
                1.1189425 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.023281211 = queryNorm
              0.5552939 = fieldWeight in 1947, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0625 = fieldNorm(doc=1947)
          0.01467165 = weight(abstract_txt:information in 1947) [ClassicSimilarity], result of:
            0.01467165 = score(doc=1947,freq=2.0), product of:
              0.06862243 = queryWeight, product of:
                1.2185482 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.023281211 = queryNorm
              0.21380253 = fieldWeight in 1947, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=1947)
          0.11990542 = weight(abstract_txt:scale in 1947) [ClassicSimilarity], result of:
            0.11990542 = score(doc=1947,freq=4.0), product of:
              0.17539586 = queryWeight, product of:
                1.3775403 = boost
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.023281211 = queryNorm
              0.6836274 = fieldWeight in 1947, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.0625 = fieldNorm(doc=1947)
          0.051177032 = weight(abstract_txt:techniques in 1947) [ClassicSimilarity], result of:
            0.051177032 = score(doc=1947,freq=1.0), product of:
              0.18067329 = queryWeight, product of:
                1.7123291 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.023281211 = queryNorm
              0.28325734 = fieldWeight in 1947, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.0625 = fieldNorm(doc=1947)
          0.13695031 = weight(abstract_txt:pages in 1947) [ClassicSimilarity], result of:
            0.13695031 = score(doc=1947,freq=2.0), product of:
              0.2764029 = queryWeight, product of:
                2.1179297 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.023281211 = queryNorm
              0.49547353 = fieldWeight in 1947, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.0625 = fieldNorm(doc=1947)
          0.060913544 = weight(abstract_txt:data in 1947) [ClassicSimilarity], result of:
            0.060913544 = score(doc=1947,freq=1.0), product of:
              0.29265788 = queryWeight, product of:
                3.7746875 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.023281211 = queryNorm
              0.20813909 = fieldWeight in 1947, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=1947)
        0.32 = coord(8/25)
    
  4. Zhu, B.; Chen, H.: Information visualization (2004) 0.16
    0.15537967 = sum of:
      0.15537967 = product of:
        0.43161017 = sum of:
          0.014438604 = weight(abstract_txt:from in 5276) [ClassicSimilarity], result of:
            0.014438604 = score(doc=5276,freq=4.0), product of:
              0.06697623 = queryWeight, product of:
                1.0425589 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.023281211 = queryNorm
              0.21557805 = fieldWeight in 5276, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5276)
          0.018285451 = weight(abstract_txt:specific in 5276) [ClassicSimilarity], result of:
            0.018285451 = score(doc=5276,freq=1.0), product of:
              0.10871695 = queryWeight, product of:
                1.0845343 = boost
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.023281211 = queryNorm
              0.1681932 = fieldWeight in 5276, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5276)
          0.04016328 = weight(abstract_txt:large in 5276) [ClassicSimilarity], result of:
            0.04016328 = score(doc=5276,freq=4.0), product of:
              0.11572474 = queryWeight, product of:
                1.1189425 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.023281211 = queryNorm
              0.3470587 = fieldWeight in 5276, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5276)
          0.028263167 = weight(abstract_txt:information in 5276) [ClassicSimilarity], result of:
            0.028263167 = score(doc=5276,freq=19.0), product of:
              0.06862243 = queryWeight, product of:
                1.2185482 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.023281211 = queryNorm
              0.41186482 = fieldWeight in 5276, product of:
                4.358899 = tf(freq=19.0), with freq of:
                  19.0 = termFreq=19.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5276)
          0.052991208 = weight(abstract_txt:scale in 5276) [ClassicSimilarity], result of:
            0.052991208 = score(doc=5276,freq=2.0), product of:
              0.17539586 = queryWeight, product of:
                1.3775403 = boost
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.023281211 = queryNorm
              0.3021235 = fieldWeight in 5276, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5276)
          0.063971296 = weight(abstract_txt:techniques in 5276) [ClassicSimilarity], result of:
            0.063971296 = score(doc=5276,freq=4.0), product of:
              0.18067329 = queryWeight, product of:
                1.7123291 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.023281211 = queryNorm
              0.35407168 = fieldWeight in 5276, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5276)
          0.08703223 = weight(abstract_txt:patterns in 5276) [ClassicSimilarity], result of:
            0.08703223 = score(doc=5276,freq=3.0), product of:
              0.24415755 = queryWeight, product of:
                1.9905604 = boost
                5.2685275 = idf(docFreq=621, maxDocs=44421)
                0.023281211 = queryNorm
              0.3564593 = fieldWeight in 5276, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2685275 = idf(docFreq=621, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5276)
          0.060524065 = weight(abstract_txt:pages in 5276) [ClassicSimilarity], result of:
            0.060524065 = score(doc=5276,freq=1.0), product of:
              0.2764029 = queryWeight, product of:
                2.1179297 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.023281211 = queryNorm
              0.21897045 = fieldWeight in 5276, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5276)
          0.06594085 = weight(abstract_txt:data in 5276) [ClassicSimilarity], result of:
            0.06594085 = score(doc=5276,freq=3.0), product of:
              0.29265788 = queryWeight, product of:
                3.7746875 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.023281211 = queryNorm
              0.22531718 = fieldWeight in 5276, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5276)
        0.36 = coord(9/25)
    
  5. Spink, A.; Wolfram, D.; Jansen, B.J.; Saracevic, T.: Searching the Web : the public and their queries (2001) 0.16
    0.15519024 = sum of:
      0.15519024 = product of:
        0.431084 = sum of:
          0.019443784 = weight(abstract_txt:number in 980) [ClassicSimilarity], result of:
            0.019443784 = score(doc=980,freq=1.0), product of:
              0.1002982 = queryWeight, product of:
                1.0416965 = boost
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.023281211 = queryNorm
              0.19385974 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1356745 = idf(docFreq=1930, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.024503123 = weight(abstract_txt:from in 980) [ClassicSimilarity], result of:
            0.024503123 = score(doc=980,freq=8.0), product of:
              0.06697623 = queryWeight, product of:
                1.0425589 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.023281211 = queryNorm
              0.36584806 = fieldWeight in 980, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.031031441 = weight(abstract_txt:specific in 980) [ClassicSimilarity], result of:
            0.031031441 = score(doc=980,freq=2.0), product of:
              0.10871695 = queryWeight, product of:
                1.0845343 = boost
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.023281211 = queryNorm
              0.28543332 = fieldWeight in 980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.034079675 = weight(abstract_txt:large in 980) [ClassicSimilarity], result of:
            0.034079675 = score(doc=980,freq=2.0), product of:
              0.11572474 = queryWeight, product of:
                1.1189425 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.023281211 = queryNorm
              0.2944891 = fieldWeight in 980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.01347677 = weight(abstract_txt:information in 980) [ClassicSimilarity], result of:
            0.01347677 = score(doc=980,freq=3.0), product of:
              0.06862243 = queryWeight, product of:
                1.2185482 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.023281211 = queryNorm
              0.19639015 = fieldWeight in 980, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.068110496 = weight(abstract_txt:containing in 980) [ClassicSimilarity], result of:
            0.068110496 = score(doc=980,freq=1.0), product of:
              0.23133889 = queryWeight, product of:
                1.5820456 = boost
                6.2809324 = idf(docFreq=225, maxDocs=44421)
                0.023281211 = queryNorm
              0.2944187 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2809324 = idf(docFreq=225, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.0920408 = weight(abstract_txt:sites in 980) [ClassicSimilarity], result of:
            0.0920408 = score(doc=980,freq=2.0), product of:
              0.2569094 = queryWeight, product of:
                2.0418804 = boost
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.023281211 = queryNorm
              0.3582617 = fieldWeight in 980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.10271274 = weight(abstract_txt:pages in 980) [ClassicSimilarity], result of:
            0.10271274 = score(doc=980,freq=2.0), product of:
              0.2764029 = queryWeight, product of:
                2.1179297 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.023281211 = queryNorm
              0.37160516 = fieldWeight in 980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.04568516 = weight(abstract_txt:data in 980) [ClassicSimilarity], result of:
            0.04568516 = score(doc=980,freq=1.0), product of:
              0.29265788 = queryWeight, product of:
                3.7746875 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.023281211 = queryNorm
              0.15610433 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
        0.36 = coord(9/25)
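
The content scores combine several per-term scores. As a hedged sketch of the arithmetic shown in the trees above: each term score is queryWeight (boost * idf * queryNorm) times fieldWeight (sqrt(freq) * idf * fieldNorm), the term scores are summed, and the sum is scaled by coord (matched terms / query terms). The helper names are hypothetical; all numbers are copied from entry 5 (doc=980).

```python
import math

QUERY_NORM = 0.023281211  # queryNorm from the listing
FIELD_NORM = 0.046875     # fieldNorm(doc=980) from the listing

def term_score(boost: float, idf: float, freq: float,
               query_norm: float = QUERY_NORM,
               field_norm: float = FIELD_NORM) -> float:
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf * query_norm
    # fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight

# (term, boost, idf, freq) for the nine matching terms of entry 5.
terms = [
    ("number",      1.0416965, 4.1356745, 1.0),
    ("from",        1.0425589, 2.759399,  8.0),
    ("specific",    1.0845343, 4.305746,  2.0),
    ("large",       1.1189425, 4.4423513, 2.0),
    ("information", 1.2185482, 2.4188995, 3.0),
    ("containing",  1.5820456, 6.2809324, 1.0),
    ("sites",       2.0418804, 5.4043584, 2.0),
    ("pages",       2.1179297, 5.6056433, 2.0),
    ("data",        3.7746875, 3.3302255, 1.0),
]

raw_sum = sum(term_score(boost, idf, freq) for _, boost, idf, freq in terms)
coord = len(terms) / 25  # 9 of 25 query terms matched
final_score = raw_sum * coord

print(f"sum of term scores: {raw_sum:.6f}")
print(f"final score:        {final_score:.6f}")
```

The coord factor explains why a document matching more query terms can outrank one with a larger raw sum: entry 2 (10 of 25 terms) and entry 1 (8 of 25 terms) are scaled by 0.4 and 0.32 respectively.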