Document (#33216)

Author
Warnick, W.L.
Lederman, A.
Scott, R.L.
Spence, K.J.
Johnson, L.A.
Allen, V.S.
Title
Searching the deep Web : directed query engine applications at the Department of Energy
Source
D-Lib Magazine. 7(2001) no.1, xx pp.
Year
2001
Abstract
Directed Query Engines, an emerging class of search engine specifically designed to access distributed resources on the deep web, offer the opportunity to create inexpensive digital libraries. Already, one such engine, Distributed Explorer, has been used to select and assemble high quality information resources and incorporate them into publicly available systems for the physical sciences. By nesting Directed Query Engines so that one query launches several other engines in a cascading fashion, enormous virtual collections may soon be assembled to form a comprehensive information infrastructure for the physical sciences. Once a Directed Query Engine has been configured for a set of information resources, distributed alerts tools can provide patrons with personalized, profile-based notices of recent additions to any of the selected resources. Due to the potentially enormous size and scope of Directed Query Engine applications, consideration must be given to issues surrounding the representation of large quantities of information from multiple, heterogeneous sources.
Footnote
Cf.: http://www.dlib.org/dlib/january01/warnick/01warnick.html.
Theme
Internet
Search engines
Object
WWW

Similar documents (author)

  1. Scott, D.S.: Subject classification and natural-language processing for retrieval in large databases (1989) 1.93
    1.934928 = sum of:
      1.934928 = product of:
        3.869856 = sum of:
          3.869856 = weight(author_txt:scott in 966) [ClassicSimilarity], result of:
            3.869856 = score(doc=966,freq=1.0), product of:
              0.7573931 = queryWeight, product of:
                1.0770049 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.08602214 = queryNorm
              5.1094418 = fieldWeight in 966, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.625 = fieldNorm(doc=966)
        0.5 = coord(1/2)
    
  2. Scott, E.: The evolution of bibliographic systems in the USA, 1876-1945 (1976/77) 1.93
    1.934928 = sum of:
      1.934928 = product of:
        3.869856 = sum of:
          3.869856 = weight(author_txt:scott in 4364) [ClassicSimilarity], result of:
            3.869856 = score(doc=4364,freq=1.0), product of:
              0.7573931 = queryWeight, product of:
                1.0770049 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.08602214 = queryNorm
              5.1094418 = fieldWeight in 4364, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.625 = fieldNorm(doc=4364)
        0.5 = coord(1/2)
    
  3. Scott, P.: Hypertext: information at your fingertips (1993) 1.93
    1.934928 = sum of:
      1.934928 = product of:
        3.869856 = sum of:
          3.869856 = weight(author_txt:scott in 6191) [ClassicSimilarity], result of:
            3.869856 = score(doc=6191,freq=1.0), product of:
              0.7573931 = queryWeight, product of:
                1.0770049 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.08602214 = queryNorm
              5.1094418 = fieldWeight in 6191, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.625 = fieldNorm(doc=6191)
        0.5 = coord(1/2)
    
  4. Scott, T.: Sherlock - MUSLS' black box public user interface (1993) 1.93
    1.934928 = sum of:
      1.934928 = product of:
        3.869856 = sum of:
          3.869856 = weight(author_txt:scott in 6580) [ClassicSimilarity], result of:
            3.869856 = score(doc=6580,freq=1.0), product of:
              0.7573931 = queryWeight, product of:
                1.0770049 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.08602214 = queryNorm
              5.1094418 = fieldWeight in 6580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.625 = fieldNorm(doc=6580)
        0.5 = coord(1/2)
    
  5. Scott, P.: Hypertext ... information at your fingertips (1993) 1.93
    1.934928 = sum of:
      1.934928 = product of:
        3.869856 = sum of:
          3.869856 = weight(author_txt:scott in 74) [ClassicSimilarity], result of:
            3.869856 = score(doc=74,freq=1.0), product of:
              0.7573931 = queryWeight, product of:
                1.0770049 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.08602214 = queryNorm
              5.1094418 = fieldWeight in 74, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.625 = fieldNorm(doc=74)
        0.5 = coord(1/2)
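
The five author matches above all score the same, because each tree multiplies the same factors. The following is a minimal sketch that recomputes one of them, assuming the idf and tf formulas that reproduce the figures shown here (idf = 1 + ln(maxDocs / (docFreq + 1)), tf = sqrt(freq)). The function names are illustrative and are not part of any library API.

```python
import math

def classic_idf(doc_freq, max_docs):
    # Assumed idf form; it reproduces 8.175107 for docFreq=33, maxDocs=44421.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq):
    # Term-frequency factor as shown in the trees: tf(freq=1.0) = 1.0.
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm              # "queryWeight" node
    field_weight = classic_tf(freq) * idf * field_norm   # "fieldWeight" node
    return query_weight * field_weight                   # "score" node

# Values copied from the author_txt:scott explanation.
score = term_score(freq=1.0, doc_freq=33, max_docs=44421,
                   boost=1.0770049, query_norm=0.08602214, field_norm=0.625)

# coord(1/2): one of the two query clauses matched.
final = score * (1 / 2)
print(f"term score = {score:.4f}, final score = {final:.4f}")
```

The result should agree with the listed 3.869856 and 1.934928 up to floating-point rounding, since the engine works in single precision.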
    

Similar documents (content)

  1. Shokouhi, M.; Zobel, J.; Tahaghoghi, S.; Scholer, F.: Using query logs to establish vocabularies in distributed information retrieval (2007) 0.18
    0.17622179 = sum of:
      0.17622179 = product of:
        0.62936354 = sum of:
          0.0336109 = weight(abstract_txt:applications in 1901) [ClassicSimilarity], result of:
            0.0336109 = score(doc=1901,freq=1.0), product of:
              0.09083284 = queryWeight, product of:
                1.2452506 = boost
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.015400646 = queryNorm
              0.37003025 = fieldWeight in 1901, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.078125 = fieldNorm(doc=1901)
          0.012663012 = weight(abstract_txt:information in 1901) [ClassicSimilarity], result of:
            0.012663012 = score(doc=1901,freq=2.0), product of:
              0.047382087 = queryWeight, product of:
                1.2719129 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.015400646 = queryNorm
              0.26725316 = fieldWeight in 1901, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.078125 = fieldNorm(doc=1901)
          0.070228025 = weight(abstract_txt:engines in 1901) [ClassicSimilarity], result of:
            0.070228025 = score(doc=1901,freq=1.0), product of:
              0.16993918 = queryWeight, product of:
                2.0860643 = boost
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.015400646 = queryNorm
              0.41325387 = fieldWeight in 1901, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.078125 = fieldNorm(doc=1901)
          0.04767582 = weight(abstract_txt:resources in 1901) [ClassicSimilarity], result of:
            0.04767582 = score(doc=1901,freq=1.0), product of:
              0.14447685 = queryWeight, product of:
                2.2210045 = boost
                4.2238636 = idf(docFreq=1767, maxDocs=44421)
                0.015400646 = queryNorm
              0.32998934 = fieldWeight in 1901, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2238636 = idf(docFreq=1767, maxDocs=44421)
                0.078125 = fieldNorm(doc=1901)
          0.12867622 = weight(abstract_txt:distributed in 1901) [ClassicSimilarity], result of:
            0.12867622 = score(doc=1901,freq=2.0), product of:
              0.20196469 = queryWeight, product of:
                2.2741477 = boost
                5.7665734 = idf(docFreq=377, maxDocs=44421)
                0.015400646 = queryNorm
              0.6371224 = fieldWeight in 1901, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7665734 = idf(docFreq=377, maxDocs=44421)
                0.078125 = fieldNorm(doc=1901)
          0.13252279 = weight(abstract_txt:engine in 1901) [ClassicSimilarity], result of:
            0.13252279 = score(doc=1901,freq=1.0), product of:
              0.30767807 = queryWeight, product of:
                3.6237113 = boost
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.015400646 = queryNorm
              0.43071902 = fieldWeight in 1901, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.078125 = fieldNorm(doc=1901)
          0.20398681 = weight(abstract_txt:query in 1901) [ClassicSimilarity], result of:
            0.20398681 = score(doc=1901,freq=4.0), product of:
              0.27458572 = queryWeight, product of:
                3.7500315 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.015400646 = queryNorm
              0.74288934 = fieldWeight in 1901, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=1901)
        0.28 = coord(7/25)
    
  2. Callery, A.; Tracy-Proulx, D.: Yahoo! : Cataloging the Web (1997) 0.13
    0.12727249 = sum of:
      0.12727249 = product of:
        0.63636243 = sum of:
          0.014326563 = weight(abstract_txt:information in 4405) [ClassicSimilarity], result of:
            0.014326563 = score(doc=4405,freq=1.0), product of:
              0.047382087 = queryWeight, product of:
                1.2719129 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.015400646 = queryNorm
              0.30236244 = fieldWeight in 4405, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.125 = fieldNorm(doc=4405)
          0.2213532 = weight(abstract_txt:enormous in 4405) [ClassicSimilarity], result of:
            0.2213532 = score(doc=4405,freq=1.0), product of:
              0.2332921 = queryWeight, product of:
                1.9956543 = boost
                7.590594 = idf(docFreq=60, maxDocs=44421)
                0.015400646 = queryNorm
              0.9488242 = fieldWeight in 4405, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.590594 = idf(docFreq=60, maxDocs=44421)
                0.125 = fieldNorm(doc=4405)
          0.112364836 = weight(abstract_txt:engines in 4405) [ClassicSimilarity], result of:
            0.112364836 = score(doc=4405,freq=1.0), product of:
              0.16993918 = queryWeight, product of:
                2.0860643 = boost
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.015400646 = queryNorm
              0.6612062 = fieldWeight in 4405, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.125 = fieldNorm(doc=4405)
          0.07628131 = weight(abstract_txt:resources in 4405) [ClassicSimilarity], result of:
            0.07628131 = score(doc=4405,freq=1.0), product of:
              0.14447685 = queryWeight, product of:
                2.2210045 = boost
                4.2238636 = idf(docFreq=1767, maxDocs=44421)
                0.015400646 = queryNorm
              0.52798295 = fieldWeight in 4405, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2238636 = idf(docFreq=1767, maxDocs=44421)
                0.125 = fieldNorm(doc=4405)
          0.21203649 = weight(abstract_txt:engine in 4405) [ClassicSimilarity], result of:
            0.21203649 = score(doc=4405,freq=1.0), product of:
              0.30767807 = queryWeight, product of:
                3.6237113 = boost
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.015400646 = queryNorm
              0.68915045 = fieldWeight in 4405, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.125 = fieldNorm(doc=4405)
        0.2 = coord(5/25)
    
  3. IAC launches LifeCenter, shows InfoTrac Total Access (1998) 0.10
    0.10051076 = sum of:
      0.10051076 = product of:
        0.8375897 = sum of:
          0.6303452 = weight(title_txt:launches in 4300) [ClassicSimilarity], result of:
            0.6303452 = score(doc=4300,freq=1.0), product of:
              0.1788426 = queryWeight, product of:
                1.2355372 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.015400646 = queryNorm
              3.524581 = fieldWeight in 4300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.375 = fieldNorm(doc=4300)
          0.021712543 = weight(abstract_txt:information in 4300) [ClassicSimilarity], result of:
            0.021712543 = score(doc=4300,freq=3.0), product of:
              0.047382087 = queryWeight, product of:
                1.2719129 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.015400646 = queryNorm
              0.4582437 = fieldWeight in 4300, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.109375 = fieldNorm(doc=4300)
          0.18553193 = weight(abstract_txt:engine in 4300) [ClassicSimilarity], result of:
            0.18553193 = score(doc=4300,freq=1.0), product of:
              0.30767807 = queryWeight, product of:
                3.6237113 = boost
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.015400646 = queryNorm
              0.60300666 = fieldWeight in 4300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.109375 = fieldNorm(doc=4300)
        0.12 = coord(3/25)
    
  4. Summann, F.; Lossau, N.: Search engine technology and digital libraries : moving from theory to practice (2004) 0.10
    0.100267775 = sum of:
      0.100267775 = product of:
        0.4177824 = sum of:
          0.048738018 = weight(abstract_txt:soon in 2196) [ClassicSimilarity], result of:
            0.048738018 = score(doc=2196,freq=1.0), product of:
              0.117154606 = queryWeight, product of:
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.015400646 = queryNorm
              0.41601452 = fieldWeight in 2196, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2196)
          0.0743493 = weight(abstract_txt:configured in 2196) [ClassicSimilarity], result of:
            0.0743493 = score(doc=2196,freq=1.0), product of:
              0.15525015 = queryWeight, product of:
                1.1511617 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.015400646 = queryNorm
              0.47890002 = fieldWeight in 2196, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2196)
          0.0062678717 = weight(abstract_txt:information in 2196) [ClassicSimilarity], result of:
            0.0062678717 = score(doc=2196,freq=1.0), product of:
              0.047382087 = queryWeight, product of:
                1.2719129 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.015400646 = queryNorm
              0.13228357 = fieldWeight in 2196, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2196)
          0.069522195 = weight(abstract_txt:engines in 2196) [ClassicSimilarity], result of:
            0.069522195 = score(doc=2196,freq=2.0), product of:
              0.16993918 = queryWeight, product of:
                2.0860643 = boost
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.015400646 = queryNorm
              0.40910044 = fieldWeight in 2196, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2196)
          0.033373073 = weight(abstract_txt:resources in 2196) [ClassicSimilarity], result of:
            0.033373073 = score(doc=2196,freq=1.0), product of:
              0.14447685 = queryWeight, product of:
                2.2210045 = boost
                4.2238636 = idf(docFreq=1767, maxDocs=44421)
                0.015400646 = queryNorm
              0.23099254 = fieldWeight in 2196, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2238636 = idf(docFreq=1767, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2196)
          0.18553193 = weight(abstract_txt:engine in 2196) [ClassicSimilarity], result of:
            0.18553193 = score(doc=2196,freq=4.0), product of:
              0.30767807 = queryWeight, product of:
                3.6237113 = boost
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.015400646 = queryNorm
              0.60300666 = fieldWeight in 2196, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2196)
        0.24 = coord(6/25)
    
  5. Yilmaz, T.; Ozcan, R.; Altingovde, I.S.; Ulusoy, Ö.: Improving educational web search for question-like queries through subject classification (2019) 0.09
    0.09431525 = sum of:
      0.09431525 = product of:
        0.47157627 = sum of:
          0.0071632816 = weight(abstract_txt:information in 41) [ClassicSimilarity], result of:
            0.0071632816 = score(doc=41,freq=1.0), product of:
              0.047382087 = queryWeight, product of:
                1.2719129 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.015400646 = queryNorm
              0.15118122 = fieldWeight in 41, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=41)
          0.07945393 = weight(abstract_txt:engines in 41) [ClassicSimilarity], result of:
            0.07945393 = score(doc=41,freq=2.0), product of:
              0.16993918 = queryWeight, product of:
                2.0860643 = boost
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.015400646 = queryNorm
              0.46754336 = fieldWeight in 41, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2896495 = idf(docFreq=608, maxDocs=44421)
                0.0625 = fieldNorm(doc=41)
          0.038140655 = weight(abstract_txt:resources in 41) [ClassicSimilarity], result of:
            0.038140655 = score(doc=41,freq=1.0), product of:
              0.14447685 = queryWeight, product of:
                2.2210045 = boost
                4.2238636 = idf(docFreq=1767, maxDocs=44421)
                0.015400646 = queryNorm
              0.26399148 = fieldWeight in 41, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2238636 = idf(docFreq=1767, maxDocs=44421)
                0.0625 = fieldNorm(doc=41)
          0.18362898 = weight(abstract_txt:engine in 41) [ClassicSimilarity], result of:
            0.18362898 = score(doc=41,freq=3.0), product of:
              0.30767807 = queryWeight, product of:
                3.6237113 = boost
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.015400646 = queryNorm
              0.5968218 = fieldWeight in 41, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5132036 = idf(docFreq=486, maxDocs=44421)
                0.0625 = fieldNorm(doc=41)
          0.16318944 = weight(abstract_txt:query in 41) [ClassicSimilarity], result of:
            0.16318944 = score(doc=41,freq=4.0), product of:
              0.27458572 = queryWeight, product of:
                3.7500315 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.015400646 = queryNorm
              0.5943115 = fieldWeight in 41, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=41)
        0.2 = coord(5/25)
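
The content-based scores combine several matching terms and then scale the sum by a coordination factor. As a hedged sketch, the code below recomputes the total for the second entry (doc 4405) from the per-term inputs listed in its tree, under the same assumed idf and tf formulas inferred from this page. The helper names and the tuple layout are illustrative only.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.015400646   # queryNorm shared by all abstract_txt terms
FIELD_NORM = 0.125         # fieldNorm(doc=4405)

def classic_idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed idf form that matches the idf values printed in the trees.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, boost):
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight

# (term, freq, docFreq, boost) copied from the explanation for doc 4405.
terms = [
    ("information", 1.0, 10748, 1.2719129),
    ("enormous",    1.0,    60, 1.9956543),
    ("engines",     1.0,   608, 2.0860643),
    ("resources",   1.0,  1767, 2.2210045),
    ("engine",      1.0,   486, 3.6237113),
]

weights = {term: term_score(freq, df, boost) for term, freq, df, boost in terms}
raw_sum = sum(weights.values())

# coord(5/25): 5 of the 25 query terms occur in this document.
total = raw_sum * (5 / 25)

for term, weight in weights.items():
    print(f"{term:12s} {weight:.6f}")
print(f"sum = {raw_sum:.6f}, total = {total:.6f}")
```

The rare term "enormous" (docFreq=60) contributes about as much as the heavily boosted "engine", which shows how idf enters the score twice, once through queryWeight and once through fieldWeight.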