Document (#21511)

Author
Wartik, S.
Fox, E.
Heath, L.
Chen, Q.-F.
Title
Hashing algorithms
Source
Information retrieval: data structures and algorithms. Ed.: W.B. Frakes u. R. Baeza-Yates
Imprint
Englewood Cliffs, NJ : Prentice Hall
Year
1992
Pages
S.293-362
Abstract
Discusses hashing, an information storage and retrieval technique useful for implementing many of the other structures in this book. The concepts underlying hashing are presented, along with 2 implementation strategies. The chapter also contains an extensive discussion of perfect hashing, an important optimization in information retrieval, and an O(n) algorithm to find minimal perfect hash functions for a set of keys
Theme
Retrievalalgorithmen

Similar documents (author)

  1. Heath, F.: Libraries, information technology, and the future (1995) 2.89
    2.8914335 = sum of:
      2.8914335 = product of:
        5.782867 = sum of:
          5.782867 = weight(author_txt:heath in 3732) [ClassicSimilarity], result of:
            5.782867 = score(doc=3732,freq=1.0), product of:
              0.93368924 = queryWeight, product of:
                1.6147618 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.05834895 = queryNorm
              6.1935673 = fieldWeight in 3732, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.625 = fieldNorm(doc=3732)
        0.5 = coord(1/2)
    
  2. Bizer, C.; Heath, T.: Linked Data : evolving the web into a global data space (2011) 2.31
    2.3131468 = sum of:
      2.3131468 = product of:
        4.6262937 = sum of:
          4.6262937 = weight(author_txt:heath in 725) [ClassicSimilarity], result of:
            4.6262937 = score(doc=725,freq=1.0), product of:
              0.93368924 = queryWeight, product of:
                1.6147618 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.05834895 = queryNorm
              4.954854 = fieldWeight in 725, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.5 = fieldNorm(doc=725)
        0.5 = coord(1/2)
    
  3. Vikor, D.L.; Gaumond, G.; Heath, F.M.: Building electronic cooperation in the 1990s : the Maryland, Georgia, and Texas experiences (1997) 1.73
    1.7348602 = sum of:
      1.7348602 = product of:
        3.4697204 = sum of:
          3.4697204 = weight(author_txt:heath in 2680) [ClassicSimilarity], result of:
            3.4697204 = score(doc=2680,freq=1.0), product of:
              0.93368924 = queryWeight, product of:
                1.6147618 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.05834895 = queryNorm
              3.7161405 = fieldWeight in 2680, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.375 = fieldNorm(doc=2680)
        0.5 = coord(1/2)
    
  4. Bizer, C.; Cyganiak, R.; Heath, T.: How to publish Linked Data on the Web (2007) 1.73
    1.7348602 = sum of:
      1.7348602 = product of:
        3.4697204 = sum of:
          3.4697204 = weight(author_txt:heath in 778) [ClassicSimilarity], result of:
            3.4697204 = score(doc=778,freq=1.0), product of:
              0.93368924 = queryWeight, product of:
                1.6147618 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.05834895 = queryNorm
              3.7161405 = fieldWeight in 778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.375 = fieldNorm(doc=778)
        0.5 = coord(1/2)
    
  5. Chen, Y.N.; Chen, S.J.: ¬A metadata practice of the OFLA FRBR model : a case study for the National Palace Museum in Taipai (2004) 0.78
    0.7769495 = sum of:
      0.7769495 = product of:
        1.553899 = sum of:
          1.553899 = weight(author_txt:chen in 4384) [ClassicSimilarity], result of:
            1.553899 = score(doc=4384,freq=2.0), product of:
              0.3580844 = queryWeight, product of:
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.05834895 = queryNorm
              4.339477 = fieldWeight in 4384, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.5 = fieldNorm(doc=4384)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Wartik, S.: Boolean operators (1992) 0.18
    0.1825571 = sum of:
      0.1825571 = product of:
        1.1409819 = sum of:
          0.00813865 = weight(abstract_txt:information in 4509) [ClassicSimilarity], result of:
            0.00813865 = score(doc=4509,freq=1.0), product of:
              0.026916869 = queryWeight, product of:
                1.0002438 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.011125022 = queryNorm
              0.30236244 = fieldWeight in 4509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.125 = fieldNorm(doc=4509)
          0.040893387 = weight(abstract_txt:implementation in 4509) [ClassicSimilarity], result of:
            0.040893387 = score(doc=4509,freq=1.0), product of:
              0.062673174 = queryWeight, product of:
                1.0792435 = boost
                5.2198906 = idf(docFreq=652, maxDocs=44421)
                0.011125022 = queryNorm
              0.6524863 = fieldWeight in 4509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2198906 = idf(docFreq=652, maxDocs=44421)
                0.125 = fieldNorm(doc=4509)
          0.024161598 = weight(abstract_txt:retrieval in 4509) [ClassicSimilarity], result of:
            0.024161598 = score(doc=4509,freq=1.0), product of:
              0.05559982 = queryWeight, product of:
                1.4375743 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.011125022 = queryNorm
              0.4345625 = fieldWeight in 4509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.125 = fieldNorm(doc=4509)
          1.0677882 = weight(abstract_txt:hashing in 4509) [ClassicSimilarity], result of:
            1.0677882 = score(doc=4509,freq=1.0), product of:
              0.87563485 = queryWeight, product of:
                8.068078 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.011125022 = queryNorm
              1.2194446 = fieldWeight in 4509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.125 = fieldNorm(doc=4509)
        0.16 = coord(4/25)
    
  2. Nelson, M.J.: ¬A prefix trie index for inverted files (1997) 0.17
    0.1707814 = sum of:
      0.1707814 = product of:
        1.0673838 = sum of:
          0.005086656 = weight(abstract_txt:information in 1495) [ClassicSimilarity], result of:
            0.005086656 = score(doc=1495,freq=1.0), product of:
              0.026916869 = queryWeight, product of:
                1.0002438 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.011125022 = queryNorm
              0.18897653 = fieldWeight in 1495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.078125 = fieldNorm(doc=1495)
          0.021356037 = weight(abstract_txt:retrieval in 1495) [ClassicSimilarity], result of:
            0.021356037 = score(doc=1495,freq=2.0), product of:
              0.05559982 = queryWeight, product of:
                1.4375743 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.011125022 = queryNorm
              0.3841026 = fieldWeight in 1495, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=1495)
          0.09714069 = weight(abstract_txt:keys in 1495) [ClassicSimilarity], result of:
            0.09714069 = score(doc=1495,freq=1.0), product of:
              0.15263721 = queryWeight, product of:
                1.6842587 = boost
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.011125022 = queryNorm
              0.63641554 = fieldWeight in 1495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.078125 = fieldNorm(doc=1495)
          0.9438004 = weight(abstract_txt:hashing in 1495) [ClassicSimilarity], result of:
            0.9438004 = score(doc=1495,freq=2.0), product of:
              0.87563485 = queryWeight, product of:
                8.068078 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.011125022 = queryNorm
              1.077847 = fieldWeight in 1495, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.078125 = fieldNorm(doc=1495)
        0.16 = coord(4/25)
    
  3. Hoad, T.C.; Zobel, J.: Methods for identifying versioned and plagiarized documents (2003) 0.17
    0.16520298 = sum of:
      0.16520298 = product of:
        0.8260149 = sum of:
          0.004069325 = weight(abstract_txt:information in 159) [ClassicSimilarity], result of:
            0.004069325 = score(doc=159,freq=1.0), product of:
              0.026916869 = queryWeight, product of:
                1.0002438 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.011125022 = queryNorm
              0.15118122 = fieldWeight in 159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=159)
          0.019226497 = weight(abstract_txt:strategies in 159) [ClassicSimilarity], result of:
            0.019226497 = score(doc=159,freq=1.0), product of:
              0.06015426 = queryWeight, product of:
                1.057333 = boost
                5.113918 = idf(docFreq=725, maxDocs=44421)
                0.011125022 = queryNorm
              0.31961986 = fieldWeight in 159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.113918 = idf(docFreq=725, maxDocs=44421)
                0.0625 = fieldNorm(doc=159)
          0.035597943 = weight(abstract_txt:technique in 159) [ClassicSimilarity], result of:
            0.035597943 = score(doc=159,freq=2.0), product of:
              0.071990125 = queryWeight, product of:
                1.1566849 = boost
                5.5944448 = idf(docFreq=448, maxDocs=44421)
                0.011125022 = queryNorm
              0.4944837 = fieldWeight in 159, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5944448 = idf(docFreq=448, maxDocs=44421)
                0.0625 = fieldNorm(doc=159)
          0.012080799 = weight(abstract_txt:retrieval in 159) [ClassicSimilarity], result of:
            0.012080799 = score(doc=159,freq=1.0), product of:
              0.05559982 = queryWeight, product of:
                1.4375743 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.011125022 = queryNorm
              0.21728125 = fieldWeight in 159, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=159)
          0.7550403 = weight(abstract_txt:hashing in 159) [ClassicSimilarity], result of:
            0.7550403 = score(doc=159,freq=2.0), product of:
              0.87563485 = queryWeight, product of:
                8.068078 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.011125022 = queryNorm
              0.86227757 = fieldWeight in 159, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0625 = fieldNorm(doc=159)
        0.2 = coord(5/25)
    
  4. Ford, D.A.; Christodoukalis, S.: File organizations for optical disks (1992) 0.15
    0.14658627 = sum of:
      0.14658627 = product of:
        0.9161642 = sum of:
          0.028933315 = weight(abstract_txt:structures in 4501) [ClassicSimilarity], result of:
            0.028933315 = score(doc=4501,freq=1.0), product of:
              0.0602843 = queryWeight, product of:
                1.0584753 = boost
                5.1194425 = idf(docFreq=721, maxDocs=44421)
                0.011125022 = queryNorm
              0.47994775 = fieldWeight in 4501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1194425 = idf(docFreq=721, maxDocs=44421)
                0.09375 = fieldNorm(doc=4501)
          0.060762513 = weight(abstract_txt:storage in 4501) [ClassicSimilarity], result of:
            0.060762513 = score(doc=4501,freq=2.0), product of:
              0.0784668 = queryWeight, product of:
                1.2075957 = boost
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.011125022 = queryNorm
              0.7743722 = fieldWeight in 4501, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.09375 = fieldNorm(doc=4501)
          0.025627242 = weight(abstract_txt:retrieval in 4501) [ClassicSimilarity], result of:
            0.025627242 = score(doc=4501,freq=2.0), product of:
              0.05559982 = queryWeight, product of:
                1.4375743 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.011125022 = queryNorm
              0.46092314 = fieldWeight in 4501, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=4501)
          0.80084115 = weight(abstract_txt:hashing in 4501) [ClassicSimilarity], result of:
            0.80084115 = score(doc=4501,freq=1.0), product of:
              0.87563485 = queryWeight, product of:
                8.068078 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.011125022 = queryNorm
              0.91458344 = fieldWeight in 4501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.09375 = fieldNorm(doc=4501)
        0.16 = coord(4/25)
    
  5. Lam, W.; Wong, K.-F.; Wong, C.-Y.: Chinese document indexing based on new partitioned signature file : model and evaluation (2001) 0.13
    0.12938847 = sum of:
      0.12938847 = product of:
        0.8086779 = sum of:
          0.004069325 = weight(abstract_txt:information in 1303) [ClassicSimilarity], result of:
            0.004069325 = score(doc=1303,freq=1.0), product of:
              0.026916869 = queryWeight, product of:
                1.0002438 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.011125022 = queryNorm
              0.15118122 = fieldWeight in 1303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=1303)
          0.028643725 = weight(abstract_txt:storage in 1303) [ClassicSimilarity], result of:
            0.028643725 = score(doc=1303,freq=1.0), product of:
              0.0784668 = queryWeight, product of:
                1.2075957 = boost
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.011125022 = queryNorm
              0.3650426 = fieldWeight in 1303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8406816 = idf(docFreq=350, maxDocs=44421)
                0.0625 = fieldNorm(doc=1303)
          0.020924555 = weight(abstract_txt:retrieval in 1303) [ClassicSimilarity], result of:
            0.020924555 = score(doc=1303,freq=3.0), product of:
              0.05559982 = queryWeight, product of:
                1.4375743 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.011125022 = queryNorm
              0.37634215 = fieldWeight in 1303, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=1303)
          0.7550403 = weight(abstract_txt:hashing in 1303) [ClassicSimilarity], result of:
            0.7550403 = score(doc=1303,freq=2.0), product of:
              0.87563485 = queryWeight, product of:
                8.068078 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.011125022 = queryNorm
              0.86227757 = fieldWeight in 1303, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0625 = fieldNorm(doc=1303)
        0.16 = coord(4/25)