Document (#21500)

Author
Faloutsos, C.
Title
Signature files
Source
Information retrieval: data structures and algorithms. Ed.: W.B. Frakes u. R. Baeza-Yates
Imprint
Englewood Cliffs, NJ : Prentice Hall
Year
1992
Pages
S.44-65
Abstract
Presents a survey and discussion on signature-based text retrieval methods. It describes the main idea behind the signature approach and its advantages over other text retrieval methods, it provides a classification of the signature methods that have appeared in the literature, it describes the main representatives of each class, together with the relative advantages and drawbacks, and it gives a list of applications as well as commercial or university prototypes that use the signature approach
Theme
Retrievalalgorithmen

Similar documents (content)

  1. Lam, W.; Wong, K.-F.; Wong, C.-Y.: Chinese document indexing based on new partitioned signature file : model and evaluation (2001) 0.39
    0.38975814 = sum of:
      0.38975814 = product of:
        1.3919934 = sum of:
          0.004016825 = weight(abstract_txt:that in 1303) [ClassicSimilarity], result of:
            0.004016825 = score(doc=1303,freq=1.0), product of:
              0.027175881 = queryWeight, product of:
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.011491174 = queryNorm
              0.14780845 = fieldWeight in 1303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=1303)
          0.040296115 = weight(abstract_txt:files in 1303) [ClassicSimilarity], result of:
            0.040296115 = score(doc=1303,freq=2.0), product of:
              0.079631306 = queryWeight, product of:
                1.2104173 = boost
                5.7251167 = idf(docFreq=393, maxDocs=44421)
                0.011491174 = queryNorm
              0.5060336 = fieldWeight in 1303, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7251167 = idf(docFreq=393, maxDocs=44421)
                0.0625 = fieldNorm(doc=1303)
          0.02210103 = weight(abstract_txt:retrieval in 1303) [ClassicSimilarity], result of:
            0.02210103 = score(doc=1303,freq=3.0), product of:
              0.058725893 = queryWeight, product of:
                1.4700192 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.011491174 = queryNorm
              0.37634215 = fieldWeight in 1303, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=1303)
          0.015901512 = weight(abstract_txt:approach in 1303) [ClassicSimilarity], result of:
            0.015901512 = score(doc=1303,freq=1.0), product of:
              0.06800706 = queryWeight, product of:
                1.5819224 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.011491174 = queryNorm
              0.2338215 = fieldWeight in 1303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=1303)
          0.02003797 = weight(abstract_txt:text in 1303) [ClassicSimilarity], result of:
            0.02003797 = score(doc=1303,freq=1.0), product of:
              0.07934097 = queryWeight, product of:
                1.7086651 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.011491174 = queryNorm
              0.25255513 = fieldWeight in 1303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=1303)
          0.045827366 = weight(abstract_txt:methods in 1303) [ClassicSimilarity], result of:
            0.045827366 = score(doc=1303,freq=2.0), product of:
              0.12513115 = queryWeight, product of:
                2.6280675 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.011491174 = queryNorm
              0.3662347 = fieldWeight in 1303, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=1303)
          1.2438126 = weight(abstract_txt:signature in 1303) [ClassicSimilarity], result of:
            1.2438126 = score(doc=1303,freq=7.0), product of:
              0.88249516 = queryWeight, product of:
                9.010199 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.011491174 = queryNorm
              1.409427 = fieldWeight in 1303, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=1303)
        0.28 = coord(7/25)
    
  2. Kelledy, F.; Smeaton, A.F.: Signature files and beyond (1996) 0.36
    0.3571577 = sum of:
      0.3571577 = product of:
        1.4881572 = sum of:
          0.0071008108 = weight(abstract_txt:that in 42) [ClassicSimilarity], result of:
            0.0071008108 = score(doc=42,freq=2.0), product of:
              0.027175881 = queryWeight, product of:
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.011491174 = queryNorm
              0.2612909 = fieldWeight in 42, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=42)
          0.071234144 = weight(abstract_txt:files in 42) [ClassicSimilarity], result of:
            0.071234144 = score(doc=42,freq=4.0), product of:
              0.079631306 = queryWeight, product of:
                1.2104173 = boost
                5.7251167 = idf(docFreq=393, maxDocs=44421)
                0.011491174 = queryNorm
              0.8945495 = fieldWeight in 42, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7251167 = idf(docFreq=393, maxDocs=44421)
                0.078125 = fieldNorm(doc=42)
          0.01987689 = weight(abstract_txt:approach in 42) [ClassicSimilarity], result of:
            0.01987689 = score(doc=42,freq=1.0), product of:
              0.06800706 = queryWeight, product of:
                1.5819224 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.011491174 = queryNorm
              0.29227686 = fieldWeight in 42, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=42)
          0.03542246 = weight(abstract_txt:text in 42) [ClassicSimilarity], result of:
            0.03542246 = score(doc=42,freq=2.0), product of:
              0.07934097 = queryWeight, product of:
                1.7086651 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.011491174 = queryNorm
              0.4464586 = fieldWeight in 42, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=42)
          0.040506054 = weight(abstract_txt:methods in 42) [ClassicSimilarity], result of:
            0.040506054 = score(doc=42,freq=1.0), product of:
              0.12513115 = queryWeight, product of:
                2.6280675 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.011491174 = queryNorm
              0.3237088 = fieldWeight in 42, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.078125 = fieldNorm(doc=42)
          1.3140168 = weight(abstract_txt:signature in 42) [ClassicSimilarity], result of:
            1.3140168 = score(doc=42,freq=5.0), product of:
              0.88249516 = queryWeight, product of:
                9.010199 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.011491174 = queryNorm
              1.4889791 = fieldWeight in 42, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.078125 = fieldNorm(doc=42)
        0.24 = coord(6/25)
    
  3. Lee, D.L.; Ren, L.: Document ranking on weight-partitioned signature files (1996) 0.34
    0.33555254 = sum of:
      0.33555254 = product of:
        1.6777627 = sum of:
          0.0060252375 = weight(abstract_txt:that in 3417) [ClassicSimilarity], result of:
            0.0060252375 = score(doc=3417,freq=1.0), product of:
              0.027175881 = queryWeight, product of:
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.011491174 = queryNorm
              0.22171268 = fieldWeight in 3417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=3417)
          0.03303671 = weight(abstract_txt:together in 3417) [ClassicSimilarity], result of:
            0.03303671 = score(doc=3417,freq=1.0), product of:
              0.06706903 = queryWeight, product of:
                1.1108469 = boost
                5.254162 = idf(docFreq=630, maxDocs=44421)
                0.011491174 = queryNorm
              0.49257767 = fieldWeight in 3417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.254162 = idf(docFreq=630, maxDocs=44421)
                0.09375 = fieldNorm(doc=3417)
          0.042740487 = weight(abstract_txt:files in 3417) [ClassicSimilarity], result of:
            0.042740487 = score(doc=3417,freq=1.0), product of:
              0.079631306 = queryWeight, product of:
                1.2104173 = boost
                5.7251167 = idf(docFreq=393, maxDocs=44421)
                0.011491174 = queryNorm
              0.5367297 = fieldWeight in 3417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7251167 = idf(docFreq=393, maxDocs=44421)
                0.09375 = fieldNorm(doc=3417)
          0.019140054 = weight(abstract_txt:retrieval in 3417) [ClassicSimilarity], result of:
            0.019140054 = score(doc=3417,freq=1.0), product of:
              0.058725893 = queryWeight, product of:
                1.4700192 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.011491174 = queryNorm
              0.3259219 = fieldWeight in 3417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=3417)
          1.5768203 = weight(abstract_txt:signature in 3417) [ClassicSimilarity], result of:
            1.5768203 = score(doc=3417,freq=5.0), product of:
              0.88249516 = queryWeight, product of:
                9.010199 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.011491174 = queryNorm
              1.786775 = fieldWeight in 3417, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.09375 = fieldNorm(doc=3417)
        0.2 = coord(5/25)
    
  4. Robertson, A.M.; Willett, P.: Applications of n-grams in textual information systems (1998) 0.24
    0.23514087 = sum of:
      0.23514087 = product of:
        0.9797537 = sum of:
          0.0070294435 = weight(abstract_txt:that in 5715) [ClassicSimilarity], result of:
            0.0070294435 = score(doc=5715,freq=1.0), product of:
              0.027175881 = queryWeight, product of:
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.011491174 = queryNorm
              0.2586648 = fieldWeight in 5715, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.109375 = fieldNorm(doc=5715)
          0.028234158 = weight(abstract_txt:applications in 5715) [ClassicSimilarity], result of:
            0.028234158 = score(doc=5715,freq=1.0), product of:
              0.05450164 = queryWeight, product of:
                1.0013778 = boost
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.011491174 = queryNorm
              0.5180423 = fieldWeight in 5715, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.109375 = fieldNorm(doc=5715)
          0.0498639 = weight(abstract_txt:files in 5715) [ClassicSimilarity], result of:
            0.0498639 = score(doc=5715,freq=1.0), product of:
              0.079631306 = queryWeight, product of:
                1.2104173 = boost
                5.7251167 = idf(docFreq=393, maxDocs=44421)
                0.011491174 = queryNorm
              0.62618464 = fieldWeight in 5715, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7251167 = idf(docFreq=393, maxDocs=44421)
                0.109375 = fieldNorm(doc=5715)
          0.022330062 = weight(abstract_txt:retrieval in 5715) [ClassicSimilarity], result of:
            0.022330062 = score(doc=5715,freq=1.0), product of:
              0.058725893 = queryWeight, product of:
                1.4700192 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.011491174 = queryNorm
              0.3802422 = fieldWeight in 5715, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.109375 = fieldNorm(doc=5715)
          0.049591444 = weight(abstract_txt:text in 5715) [ClassicSimilarity], result of:
            0.049591444 = score(doc=5715,freq=2.0), product of:
              0.07934097 = queryWeight, product of:
                1.7086651 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.011491174 = queryNorm
              0.6250421 = fieldWeight in 5715, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.109375 = fieldNorm(doc=5715)
          0.8227047 = weight(abstract_txt:signature in 5715) [ClassicSimilarity], result of:
            0.8227047 = score(doc=5715,freq=1.0), product of:
              0.88249516 = queryWeight, product of:
                9.010199 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.011491174 = queryNorm
              0.93224835 = fieldWeight in 5715, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.109375 = fieldNorm(doc=5715)
        0.24 = coord(6/25)
    
  5. Carterette, B.; Can, F.: Comparing inverted files and signature files for searching a large lexicon (2005) 0.18
    0.18461153 = sum of:
      0.18461153 = product of:
        1.5384295 = sum of:
          0.061690576 = weight(abstract_txt:files in 2029) [ClassicSimilarity], result of:
            0.061690576 = score(doc=2029,freq=3.0), product of:
              0.079631306 = queryWeight, product of:
                1.2104173 = boost
                5.7251167 = idf(docFreq=393, maxDocs=44421)
                0.011491174 = queryNorm
              0.77470255 = fieldWeight in 2029, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7251167 = idf(docFreq=393, maxDocs=44421)
                0.078125 = fieldNorm(doc=2029)
          0.037305605 = weight(abstract_txt:main in 2029) [ClassicSimilarity], result of:
            0.037305605 = score(doc=2029,freq=1.0), product of:
              0.10347555 = queryWeight, product of:
                1.9513135 = boost
                4.61473 = idf(docFreq=1195, maxDocs=44421)
                0.011491174 = queryNorm
              0.3605258 = fieldWeight in 2029, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.61473 = idf(docFreq=1195, maxDocs=44421)
                0.078125 = fieldNorm(doc=2029)
          1.4394333 = weight(abstract_txt:signature in 2029) [ClassicSimilarity], result of:
            1.4394333 = score(doc=2029,freq=6.0), product of:
              0.88249516 = queryWeight, product of:
                9.010199 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.011491174 = queryNorm
              1.6310949 = fieldWeight in 2029, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.078125 = fieldNorm(doc=2029)
        0.12 = coord(3/25)