Document (#21501)

Author
Gonnet, G.H.
Snider, T.
Baeza-Yates, R.A.
Title
New indices for text : PAT trees and PAT arrays
Source
Information retrieval: data structures and algorithms. Ed.: W.B. Frakes u. R. Baeza-Yates
Imprint
Englewood Cliffs, NJ : Prentice Hall
Year
1992
Pages
S.66-82
Abstract
We survey new indices for text, with emphasis on PAT arrays (also called suffic arrays). A PAT array is an index based on a new model of text that does not use the concept of word and does not need to know the structure of text
Theme
Retrievalalgorithmen

Similar documents (author)

  1. Baeza-Yates, R.A.: Introduction to data structures and algorithms related to information retrieval (1992) 6.30
    6.3001256 = sum of:
      6.3001256 = sum of:
        2.9882364 = weight(author_txt:yates in 4082) [ClassicSimilarity], result of:
          2.9882364 = score(doc=4082,freq=1.0), product of:
            0.68247724 = queryWeight, product of:
              8.757029 = idf(docFreq=18, maxDocs=44421)
              0.077934794 = queryNorm
            4.3785143 = fieldWeight in 4082, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.757029 = idf(docFreq=18, maxDocs=44421)
              0.5 = fieldNorm(doc=4082)
        3.3118892 = weight(author_txt:baeza in 4082) [ClassicSimilarity], result of:
          3.3118892 = score(doc=4082,freq=1.0), product of:
            0.7309069 = queryWeight, product of:
              1.0348728 = boost
              9.06241 = idf(docFreq=13, maxDocs=44421)
              0.077934794 = queryNorm
            4.531205 = fieldWeight in 4082, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.06241 = idf(docFreq=13, maxDocs=44421)
              0.5 = fieldNorm(doc=4082)
    
  2. Baeza-Yates, R.A.: String searching algorithms (1992) 6.30
    6.3001256 = sum of:
      6.3001256 = sum of:
        2.9882364 = weight(author_txt:yates in 4505) [ClassicSimilarity], result of:
          2.9882364 = score(doc=4505,freq=1.0), product of:
            0.68247724 = queryWeight, product of:
              8.757029 = idf(docFreq=18, maxDocs=44421)
              0.077934794 = queryNorm
            4.3785143 = fieldWeight in 4505, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.757029 = idf(docFreq=18, maxDocs=44421)
              0.5 = fieldNorm(doc=4505)
        3.3118892 = weight(author_txt:baeza in 4505) [ClassicSimilarity], result of:
          3.3118892 = score(doc=4505,freq=1.0), product of:
            0.7309069 = queryWeight, product of:
              1.0348728 = boost
              9.06241 = idf(docFreq=13, maxDocs=44421)
              0.077934794 = queryNorm
            4.531205 = fieldWeight in 4505, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.06241 = idf(docFreq=13, maxDocs=44421)
              0.5 = fieldNorm(doc=4505)
    
  3. Baeza-Yates, R.; Navarro, G.: Block addressing indices for approximate text retrieval (2000) 5.51
    5.51261 = sum of:
      5.51261 = sum of:
        2.6147068 = weight(author_txt:yates in 5295) [ClassicSimilarity], result of:
          2.6147068 = score(doc=5295,freq=1.0), product of:
            0.68247724 = queryWeight, product of:
              8.757029 = idf(docFreq=18, maxDocs=44421)
              0.077934794 = queryNorm
            3.8312001 = fieldWeight in 5295, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.757029 = idf(docFreq=18, maxDocs=44421)
              0.4375 = fieldNorm(doc=5295)
        2.8979032 = weight(author_txt:baeza in 5295) [ClassicSimilarity], result of:
          2.8979032 = score(doc=5295,freq=1.0), product of:
            0.7309069 = queryWeight, product of:
              1.0348728 = boost
              9.06241 = idf(docFreq=13, maxDocs=44421)
              0.077934794 = queryNorm
            3.9648046 = fieldWeight in 5295, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.06241 = idf(docFreq=13, maxDocs=44421)
              0.4375 = fieldNorm(doc=5295)
    
  4. Baeza-Yates, R.; Navarro, G.: XQL and proximal nodes (2002) 5.51
    5.51261 = sum of:
      5.51261 = sum of:
        2.6147068 = weight(author_txt:yates in 1454) [ClassicSimilarity], result of:
          2.6147068 = score(doc=1454,freq=1.0), product of:
            0.68247724 = queryWeight, product of:
              8.757029 = idf(docFreq=18, maxDocs=44421)
              0.077934794 = queryNorm
            3.8312001 = fieldWeight in 1454, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.757029 = idf(docFreq=18, maxDocs=44421)
              0.4375 = fieldNorm(doc=1454)
        2.8979032 = weight(author_txt:baeza in 1454) [ClassicSimilarity], result of:
          2.8979032 = score(doc=1454,freq=1.0), product of:
            0.7309069 = queryWeight, product of:
              1.0348728 = boost
              9.06241 = idf(docFreq=13, maxDocs=44421)
              0.077934794 = queryNorm
            3.9648046 = fieldWeight in 1454, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.06241 = idf(docFreq=13, maxDocs=44421)
              0.4375 = fieldNorm(doc=1454)
    
  5. Castillo, C.; Baeza-Yates, R.: Web retrieval and mining (2009) 5.51
    5.51261 = sum of:
      5.51261 = sum of:
        2.6147068 = weight(author_txt:yates in 891) [ClassicSimilarity], result of:
          2.6147068 = score(doc=891,freq=1.0), product of:
            0.68247724 = queryWeight, product of:
              8.757029 = idf(docFreq=18, maxDocs=44421)
              0.077934794 = queryNorm
            3.8312001 = fieldWeight in 891, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.757029 = idf(docFreq=18, maxDocs=44421)
              0.4375 = fieldNorm(doc=891)
        2.8979032 = weight(author_txt:baeza in 891) [ClassicSimilarity], result of:
          2.8979032 = score(doc=891,freq=1.0), product of:
            0.7309069 = queryWeight, product of:
              1.0348728 = boost
              9.06241 = idf(docFreq=13, maxDocs=44421)
              0.077934794 = queryNorm
            3.9648046 = fieldWeight in 891, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.06241 = idf(docFreq=13, maxDocs=44421)
              0.4375 = fieldNorm(doc=891)
    

Similar documents (content)

  1. Will, L.: ¬The ISO 25964 data model for the structure of an information retrieval thesaurus (2012) 0.23
    0.22756723 = sum of:
      0.22756723 = product of:
        0.7585574 = sum of:
          0.0041369847 = weight(abstract_txt:that in 1862) [ClassicSimilarity], result of:
            0.0041369847 = score(doc=1862,freq=1.0), product of:
              0.018659215 = queryWeight, product of:
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.007889948 = queryNorm
              0.22171268 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.012238939 = weight(abstract_txt:also in 1862) [ClassicSimilarity], result of:
            0.012238939 = score(doc=1862,freq=1.0), product of:
              0.038453273 = queryWeight, product of:
                1.4355555 = boost
                3.3949955 = idf(docFreq=4049, maxDocs=44421)
                0.007889948 = queryNorm
              0.31828082 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3949955 = idf(docFreq=4049, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.019759947 = weight(abstract_txt:model in 1862) [ClassicSimilarity], result of:
            0.019759947 = score(doc=1862,freq=1.0), product of:
              0.052920993 = queryWeight, product of:
                1.6840978 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.007889948 = queryNorm
              0.37338582 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.0258881 = weight(abstract_txt:structure in 1862) [ClassicSimilarity], result of:
            0.0258881 = score(doc=1862,freq=1.0), product of:
              0.06336327 = queryWeight, product of:
                1.8427742 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.007889948 = queryNorm
              0.40856636 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.028592179 = weight(abstract_txt:concept in 1862) [ClassicSimilarity], result of:
            0.028592179 = score(doc=1862,freq=1.0), product of:
              0.06770212 = queryWeight, product of:
                1.9048223 = boost
                4.5047812 = idf(docFreq=1334, maxDocs=44421)
                0.007889948 = queryNorm
              0.42232323 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5047812 = idf(docFreq=1334, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.6679412 = weight(abstract_txt:arrays in 1862) [ClassicSimilarity], result of:
            0.6679412 = score(doc=1862,freq=1.0), product of:
              0.7979396 = queryWeight, product of:
                11.326584 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.007889948 = queryNorm
              0.8370824 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
        0.3 = coord(6/20)
    
  2. Harman, D.; Fox, E.; Baeza-Yates, R.; Lee, W.: Inverted files (1992) 0.19
    0.19114377 = sum of:
      0.19114377 = product of:
        0.9557188 = sum of:
          0.007800773 = weight(abstract_txt:that in 4497) [ClassicSimilarity], result of:
            0.007800773 = score(doc=4497,freq=2.0), product of:
              0.018659215 = queryWeight, product of:
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.007889948 = queryNorm
              0.41806543 = fieldWeight in 4497, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.125 = fieldNorm(doc=4497)
          0.00648592 = weight(abstract_txt:with in 4497) [ClassicSimilarity], result of:
            0.00648592 = score(doc=4497,freq=1.0), product of:
              0.020787042 = queryWeight, product of:
                1.0554792 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.007889948 = queryNorm
              0.31201747 = fieldWeight in 4497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.125 = fieldNorm(doc=4497)
          0.05084389 = weight(abstract_txt:survey in 4497) [ClassicSimilarity], result of:
            0.05084389 = score(doc=4497,freq=1.0), product of:
              0.08202964 = queryWeight, product of:
                2.0967116 = boost
                4.958587 = idf(docFreq=847, maxDocs=44421)
                0.007889948 = queryNorm
              0.6198234 = fieldWeight in 4497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.958587 = idf(docFreq=847, maxDocs=44421)
                0.125 = fieldNorm(doc=4497)
          0.8905882 = weight(abstract_txt:arrays in 4497) [ClassicSimilarity], result of:
            0.8905882 = score(doc=4497,freq=1.0), product of:
              0.7979396 = queryWeight, product of:
                11.326584 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.007889948 = queryNorm
              1.1161098 = fieldWeight in 4497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.125 = fieldNorm(doc=4497)
        0.2 = coord(4/20)
    
  3. Hartman, J.H.; Proebsting, T.A.; Sundaram, R.: Index-based hyperlinks (1997) 0.18
    0.17747176 = sum of:
      0.17747176 = product of:
        0.709887 = sum of:
          0.004826482 = weight(abstract_txt:that in 3723) [ClassicSimilarity], result of:
            0.004826482 = score(doc=3723,freq=1.0), product of:
              0.018659215 = queryWeight, product of:
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.007889948 = queryNorm
              0.2586648 = fieldWeight in 3723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.109375 = fieldNorm(doc=3723)
          0.011768297 = weight(abstract_txt:based in 3723) [ClassicSimilarity], result of:
            0.011768297 = score(doc=3723,freq=1.0), product of:
              0.033802487 = queryWeight, product of:
                1.3459461 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.007889948 = queryNorm
              0.34814885 = fieldWeight in 3723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.109375 = fieldNorm(doc=3723)
          0.014278763 = weight(abstract_txt:also in 3723) [ClassicSimilarity], result of:
            0.014278763 = score(doc=3723,freq=1.0), product of:
              0.038453273 = queryWeight, product of:
                1.4355555 = boost
                3.3949955 = idf(docFreq=4049, maxDocs=44421)
                0.007889948 = queryNorm
              0.37132764 = fieldWeight in 3723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3949955 = idf(docFreq=4049, maxDocs=44421)
                0.109375 = fieldNorm(doc=3723)
          0.03909941 = weight(abstract_txt:index in 3723) [ClassicSimilarity], result of:
            0.03909941 = score(doc=3723,freq=1.0), product of:
              0.07526384 = queryWeight, product of:
                2.0083828 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.007889948 = queryNorm
              0.51949793 = fieldWeight in 3723, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.109375 = fieldNorm(doc=3723)
          0.6399141 = weight(abstract_txt:indices in 3723) [ClassicSimilarity], result of:
            0.6399141 = score(doc=3723,freq=5.0), product of:
              0.3574709 = queryWeight, product of:
                6.189972 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.007889948 = queryNorm
              1.7901152 = fieldWeight in 3723, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.109375 = fieldNorm(doc=3723)
        0.25 = coord(5/20)
    
  4. Ibáñez, A.; Armañanzas, R.; Bielza, C.; Larrañaga, P.: Genetic algorithms and Gaussian Bayesian networks to uncover the predictive core set of bibliometric indices (2016) 0.15
    0.15063056 = sum of:
      0.15063056 = product of:
        0.50210184 = sum of:
          0.006167053 = weight(abstract_txt:that in 4041) [ClassicSimilarity], result of:
            0.006167053 = score(doc=4041,freq=5.0), product of:
              0.018659215 = queryWeight, product of:
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.007889948 = queryNorm
              0.33050975 = fieldWeight in 4041, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=4041)
          0.00324296 = weight(abstract_txt:with in 4041) [ClassicSimilarity], result of:
            0.00324296 = score(doc=4041,freq=1.0), product of:
              0.020787042 = queryWeight, product of:
                1.0554792 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.007889948 = queryNorm
              0.15600874 = fieldWeight in 4041, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=4041)
          0.008159293 = weight(abstract_txt:also in 4041) [ClassicSimilarity], result of:
            0.008159293 = score(doc=4041,freq=1.0), product of:
              0.038453273 = queryWeight, product of:
                1.4355555 = boost
                3.3949955 = idf(docFreq=4049, maxDocs=44421)
                0.007889948 = queryNorm
              0.21218722 = fieldWeight in 4041, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3949955 = idf(docFreq=4049, maxDocs=44421)
                0.0625 = fieldNorm(doc=4041)
          0.013173299 = weight(abstract_txt:model in 4041) [ClassicSimilarity], result of:
            0.013173299 = score(doc=4041,freq=1.0), product of:
              0.052920993 = queryWeight, product of:
                1.6840978 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.007889948 = queryNorm
              0.24892388 = fieldWeight in 4041, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=4041)
          0.03869838 = weight(abstract_txt:index in 4041) [ClassicSimilarity], result of:
            0.03869838 = score(doc=4041,freq=3.0), product of:
              0.07526384 = queryWeight, product of:
                2.0083828 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.007889948 = queryNorm
              0.5141696 = fieldWeight in 4041, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.0625 = fieldNorm(doc=4041)
          0.43266088 = weight(abstract_txt:indices in 4041) [ClassicSimilarity], result of:
            0.43266088 = score(doc=4041,freq=7.0), product of:
              0.3574709 = queryWeight, product of:
                6.189972 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.007889948 = queryNorm
              1.2103387 = fieldWeight in 4041, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.0625 = fieldNorm(doc=4041)
        0.3 = coord(6/20)
    
  5. Rousseau, R.; Jin, B.: ¬The age-dependent h-type AR**2-index : basic properties and a case study (2008) 0.15
    0.1468398 = sum of:
      0.1468398 = product of:
        0.48946595 = sum of:
          0.0048754835 = weight(abstract_txt:that in 3638) [ClassicSimilarity], result of:
            0.0048754835 = score(doc=3638,freq=2.0), product of:
              0.018659215 = queryWeight, product of:
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.007889948 = queryNorm
              0.2612909 = fieldWeight in 3638, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=3638)
          0.0040537002 = weight(abstract_txt:with in 3638) [ClassicSimilarity], result of:
            0.0040537002 = score(doc=3638,freq=1.0), product of:
              0.020787042 = queryWeight, product of:
                1.0554792 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.007889948 = queryNorm
              0.19501092 = fieldWeight in 3638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.078125 = fieldNorm(doc=3638)
          0.062449247 = weight(abstract_txt:index in 3638) [ClassicSimilarity], result of:
            0.062449247 = score(doc=3638,freq=5.0), product of:
              0.07526384 = queryWeight, product of:
                2.0083828 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.007889948 = queryNorm
              0.82973766 = fieldWeight in 3638, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.078125 = fieldNorm(doc=3638)
          0.054862678 = weight(abstract_txt:called in 3638) [ClassicSimilarity], result of:
            0.054862678 = score(doc=3638,freq=2.0), product of:
              0.09369857 = queryWeight, product of:
                2.2408862 = boost
                5.2995505 = idf(docFreq=602, maxDocs=44421)
                0.007889948 = queryNorm
              0.5855231 = fieldWeight in 3638, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2995505 = idf(docFreq=602, maxDocs=44421)
                0.078125 = fieldNorm(doc=3638)
          0.07414112 = weight(abstract_txt:does in 3638) [ClassicSimilarity], result of:
            0.07414112 = score(doc=3638,freq=1.0), product of:
              0.18180579 = queryWeight, product of:
                4.414405 = boost
                5.2198906 = idf(docFreq=652, maxDocs=44421)
                0.007889948 = queryNorm
              0.40780395 = fieldWeight in 3638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2198906 = idf(docFreq=652, maxDocs=44421)
                0.078125 = fieldNorm(doc=3638)
          0.28908372 = weight(abstract_txt:indices in 3638) [ClassicSimilarity], result of:
            0.28908372 = score(doc=3638,freq=2.0), product of:
              0.3574709 = queryWeight, product of:
                6.189972 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.007889948 = queryNorm
              0.8086916 = fieldWeight in 3638, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.078125 = fieldNorm(doc=3638)
        0.3 = coord(6/20)