Document (#14192)

Author
Joss, M.W.
Wszola, S.
Title
¬The engines that can : text search and retrieval software, their strategies, and vendors
Source
CD-ROM professional. 9(1996) no.6, S.30+(14 S.)
Year
1996
Abstract
Traces the development of text searching and retrieval software designed to cope with the increasing demands made by the storage and handling of large amounts of data, recorded on high data storage media, from CD-ROM to multi gigabyte storage media and online information services, with particular reference to the need to cope with graphics as well as conventional ASCII text. Includes details of: Boolean searching, fuzzy searching and matching; relevance ranking; proximity searching and improved strategies for dealing with text searching in very large databases. Concludes that the best searching tools for CD-ROM publishers are those optimized for searching and retrieval on CD-ROM. CD-ROM drives have relatively lower random seek times than hard discs and so the software most appropriate to the medium is that which can effectively arrange the indexes and text on the CD-ROM to avoid continuous random access searching. Lists and reviews a selection of software packages designed to achieve the sort of results required for rapid CD-ROM searching
Theme
Retrievalalgorithmen

Similar documents (content)

  1. Moffat, A.; Bell, T.A.H.: In situ generation of compressed inverted files (1995) 0.23
    0.22962306 = sum of:
      0.22962306 = product of:
        0.7175721 = sum of:
          0.012385799 = weight(abstract_txt:that in 2716) [ClassicSimilarity], result of:
            0.012385799 = score(doc=2716,freq=2.0), product of:
              0.047402333 = queryWeight, product of:
                1.0134008 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.019778768 = queryNorm
              0.2612909 = fieldWeight in 2716, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=2716)
          0.088859 = weight(abstract_txt:sort in 2716) [ClassicSimilarity], result of:
            0.088859 = score(doc=2716,freq=1.0), product of:
              0.15403554 = queryWeight, product of:
                1.0547055 = boost
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.019778768 = queryNorm
              0.57687336 = fieldWeight in 2716, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3839793 = idf(docFreq=74, maxDocs=44421)
                0.078125 = fieldNorm(doc=2716)
          0.06702847 = weight(abstract_txt:large in 2716) [ClassicSimilarity], result of:
            0.06702847 = score(doc=2716,freq=3.0), product of:
              0.11150537 = queryWeight, product of:
                1.2690643 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.019778768 = queryNorm
              0.6011233 = fieldWeight in 2716, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.078125 = fieldNorm(doc=2716)
          0.16833414 = weight(abstract_txt:gigabyte in 2716) [ClassicSimilarity], result of:
            0.16833414 = score(doc=2716,freq=1.0), product of:
              0.23583129 = queryWeight, product of:
                1.3050328 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.019778768 = queryNorm
              0.71379054 = fieldWeight in 2716, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.078125 = fieldNorm(doc=2716)
          0.053249862 = weight(abstract_txt:designed in 2716) [ClassicSimilarity], result of:
            0.053249862 = score(doc=2716,freq=1.0), product of:
              0.13794595 = queryWeight, product of:
                1.4115304 = boost
                4.941053 = idf(docFreq=862, maxDocs=44421)
                0.019778768 = queryNorm
              0.38601977 = fieldWeight in 2716, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.941053 = idf(docFreq=862, maxDocs=44421)
                0.078125 = fieldNorm(doc=2716)
          0.12296904 = weight(abstract_txt:random in 2716) [ClassicSimilarity], result of:
            0.12296904 = score(doc=2716,freq=1.0), product of:
              0.2410056 = queryWeight, product of:
                1.8657322 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.019778768 = queryNorm
              0.5102331 = fieldWeight in 2716, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.078125 = fieldNorm(doc=2716)
          0.13192947 = weight(abstract_txt:storage in 2716) [ClassicSimilarity], result of:
            0.13192947 = score(doc=2716,freq=1.0), product of:
              0.28912675 = queryWeight, product of:
                2.5027964 = boost
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.019778768 = queryNorm
              0.45630324 = fieldWeight in 2716, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.078125 = fieldNorm(doc=2716)
          0.07281628 = weight(abstract_txt:text in 2716) [ClassicSimilarity], result of:
            0.07281628 = score(doc=2716,freq=1.0), product of:
              0.2306547 = queryWeight, product of:
                2.8859375 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019778768 = queryNorm
              0.3156939 = fieldWeight in 2716, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=2716)
        0.32 = coord(8/25)
    
  2. Tenopir, C.: Full-text retrieval : systems and files (1994) 0.22
    0.22133972 = sum of:
      0.22133972 = product of:
        0.92224884 = sum of:
          0.012261314 = weight(abstract_txt:that in 2492) [ClassicSimilarity], result of:
            0.012261314 = score(doc=2492,freq=1.0), product of:
              0.047402333 = queryWeight, product of:
                1.0134008 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.019778768 = queryNorm
              0.2586648 = fieldWeight in 2492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.109375 = fieldNorm(doc=2492)
          0.1706819 = weight(abstract_txt:discs in 2492) [ClassicSimilarity], result of:
            0.1706819 = score(doc=2492,freq=1.0), product of:
              0.1901923 = queryWeight, product of:
                1.1719719 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.019778768 = queryNorm
              0.8974175 = fieldWeight in 2492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.109375 = fieldNorm(doc=2492)
          0.03894987 = weight(abstract_txt:retrieval in 2492) [ClassicSimilarity], result of:
            0.03894987 = score(doc=2492,freq=1.0), product of:
              0.10243437 = queryWeight, product of:
                1.4897186 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019778768 = queryNorm
              0.3802422 = fieldWeight in 2492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.109375 = fieldNorm(doc=2492)
          0.10230387 = weight(abstract_txt:software in 2492) [ClassicSimilarity], result of:
            0.10230387 = score(doc=2492,freq=1.0), product of:
              0.21462616 = queryWeight, product of:
                2.4899583 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.019778768 = queryNorm
              0.47666076 = fieldWeight in 2492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.109375 = fieldNorm(doc=2492)
          0.2883378 = weight(abstract_txt:text in 2492) [ClassicSimilarity], result of:
            0.2883378 = score(doc=2492,freq=8.0), product of:
              0.2306547 = queryWeight, product of:
                2.8859375 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019778768 = queryNorm
              1.2500842 = fieldWeight in 2492, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.109375 = fieldNorm(doc=2492)
          0.30971405 = weight(abstract_txt:searching in 2492) [ClassicSimilarity], result of:
            0.30971405 = score(doc=2492,freq=2.0), product of:
              0.46713895 = queryWeight, product of:
                5.5101705 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.019778768 = queryNorm
              0.663002 = fieldWeight in 2492, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.109375 = fieldNorm(doc=2492)
        0.24 = coord(6/25)
    
  3. Flanders, B.: On-line books : an advanced technology electronic library system (1992) 0.18
    0.17683034 = sum of:
      0.17683034 = product of:
        0.7367931 = sum of:
          0.008758081 = weight(abstract_txt:that in 2660) [ClassicSimilarity], result of:
            0.008758081 = score(doc=2660,freq=1.0), product of:
              0.047402333 = queryWeight, product of:
                1.0134008 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.019778768 = queryNorm
              0.18476056 = fieldWeight in 2660, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=2660)
          0.12191564 = weight(abstract_txt:discs in 2660) [ClassicSimilarity], result of:
            0.12191564 = score(doc=2660,freq=1.0), product of:
              0.1901923 = queryWeight, product of:
                1.1719719 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.019778768 = queryNorm
              0.6410125 = fieldWeight in 2660, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.078125 = fieldNorm(doc=2660)
          0.027821334 = weight(abstract_txt:retrieval in 2660) [ClassicSimilarity], result of:
            0.027821334 = score(doc=2660,freq=1.0), product of:
              0.10243437 = queryWeight, product of:
                1.4897186 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019778768 = queryNorm
              0.27160156 = fieldWeight in 2660, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=2660)
          0.34905258 = weight(abstract_txt:storage in 2660) [ClassicSimilarity], result of:
            0.34905258 = score(doc=2660,freq=7.0), product of:
              0.28912675 = queryWeight, product of:
                2.5027964 = boost
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.019778768 = queryNorm
              1.2072649 = fieldWeight in 2660, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.078125 = fieldNorm(doc=2660)
          0.07281628 = weight(abstract_txt:text in 2660) [ClassicSimilarity], result of:
            0.07281628 = score(doc=2660,freq=1.0), product of:
              0.2306547 = queryWeight, product of:
                2.8859375 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019778768 = queryNorm
              0.3156939 = fieldWeight in 2660, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=2660)
          0.15642923 = weight(abstract_txt:searching in 2660) [ClassicSimilarity], result of:
            0.15642923 = score(doc=2660,freq=1.0), product of:
              0.46713895 = queryWeight, product of:
                5.5101705 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.019778768 = queryNorm
              0.3348666 = fieldWeight in 2660, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.078125 = fieldNorm(doc=2660)
        0.24 = coord(6/25)
    
  4. Casale, M.: Full text retrieval for the Web (1996) 0.17
    0.17101271 = sum of:
      0.17101271 = product of:
        0.6107597 = sum of:
          0.014862957 = weight(abstract_txt:that in 6825) [ClassicSimilarity], result of:
            0.014862957 = score(doc=6825,freq=2.0), product of:
              0.047402333 = queryWeight, product of:
                1.0134008 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.019778768 = queryNorm
              0.31354907 = fieldWeight in 6825, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=6825)
          0.09587096 = weight(abstract_txt:vendors in 6825) [ClassicSimilarity], result of:
            0.09587096 = score(doc=6825,freq=1.0), product of:
              0.14349073 = queryWeight, product of:
                1.0179646 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.019778768 = queryNorm
              0.66813344 = fieldWeight in 6825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.09375 = fieldNorm(doc=6825)
          0.016476994 = weight(abstract_txt:with in 6825) [ClassicSimilarity], result of:
            0.016476994 = score(doc=6825,freq=1.0), product of:
              0.07041056 = queryWeight, product of:
                1.4261645 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.019778768 = queryNorm
              0.23401311 = fieldWeight in 6825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.09375 = fieldNorm(doc=6825)
          0.033385605 = weight(abstract_txt:retrieval in 6825) [ClassicSimilarity], result of:
            0.033385605 = score(doc=6825,freq=1.0), product of:
              0.10243437 = queryWeight, product of:
                1.4897186 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019778768 = queryNorm
              0.3259219 = fieldWeight in 6825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=6825)
          0.08768903 = weight(abstract_txt:software in 6825) [ClassicSimilarity], result of:
            0.08768903 = score(doc=6825,freq=1.0), product of:
              0.21462616 = queryWeight, product of:
                2.4899583 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.019778768 = queryNorm
              0.40856636 = fieldWeight in 6825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.09375 = fieldNorm(doc=6825)
          0.17475909 = weight(abstract_txt:text in 6825) [ClassicSimilarity], result of:
            0.17475909 = score(doc=6825,freq=4.0), product of:
              0.2306547 = queryWeight, product of:
                2.8859375 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019778768 = queryNorm
              0.7576654 = fieldWeight in 6825, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=6825)
          0.18771507 = weight(abstract_txt:searching in 6825) [ClassicSimilarity], result of:
            0.18771507 = score(doc=6825,freq=1.0), product of:
              0.46713895 = queryWeight, product of:
                5.5101705 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.019778768 = queryNorm
              0.4018399 = fieldWeight in 6825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.09375 = fieldNorm(doc=6825)
        0.28 = coord(7/25)
    
  5. Schmidt, J.: Full-text searching : as seen from a non-bibliographic searcher's point of view (1989) 0.16
    0.16184337 = sum of:
      0.16184337 = product of:
        0.6743474 = sum of:
          0.0235004 = weight(abstract_txt:that in 2944) [ClassicSimilarity], result of:
            0.0235004 = score(doc=2944,freq=5.0), product of:
              0.047402333 = queryWeight, product of:
                1.0134008 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.019778768 = queryNorm
              0.4957646 = fieldWeight in 2944, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=2944)
          0.11067775 = weight(abstract_txt:designed in 2944) [ClassicSimilarity], result of:
            0.11067775 = score(doc=2944,freq=3.0), product of:
              0.13794595 = queryWeight, product of:
                1.4115304 = boost
                4.941053 = idf(docFreq=862, maxDocs=44421)
                0.019778768 = queryNorm
              0.8023269 = fieldWeight in 2944, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.941053 = idf(docFreq=862, maxDocs=44421)
                0.09375 = fieldNorm(doc=2944)
          0.016476994 = weight(abstract_txt:with in 2944) [ClassicSimilarity], result of:
            0.016476994 = score(doc=2944,freq=1.0), product of:
              0.07041056 = queryWeight, product of:
                1.4261645 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.019778768 = queryNorm
              0.23401311 = fieldWeight in 2944, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.09375 = fieldNorm(doc=2944)
          0.04721437 = weight(abstract_txt:retrieval in 2944) [ClassicSimilarity], result of:
            0.04721437 = score(doc=2944,freq=2.0), product of:
              0.10243437 = queryWeight, product of:
                1.4897186 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019778768 = queryNorm
              0.46092314 = fieldWeight in 2944, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=2944)
          0.1513458 = weight(abstract_txt:text in 2944) [ClassicSimilarity], result of:
            0.1513458 = score(doc=2944,freq=3.0), product of:
              0.2306547 = queryWeight, product of:
                2.8859375 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019778768 = queryNorm
              0.6561575 = fieldWeight in 2944, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=2944)
          0.32513207 = weight(abstract_txt:searching in 2944) [ClassicSimilarity], result of:
            0.32513207 = score(doc=2944,freq=3.0), product of:
              0.46713895 = queryWeight, product of:
                5.5101705 = boost
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.019778768 = queryNorm
              0.6960072 = fieldWeight in 2944, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2862926 = idf(docFreq=1660, maxDocs=44421)
                0.09375 = fieldNorm(doc=2944)
        0.24 = coord(6/25)