Document (#6191)

Author
Keen, E.M.
Title
Some aspects of proximity searching in text retrieval systems
Source
Journal of information science. 18(1992), pp. 89-98
Year
1992
Abstract
Describes and evaluates the proximity search facilities in external online systems and in-house retrieval software. Discusses and illustrates capabilities, syntax and circumstances of use. Presents measurements of the overheads that proximity requires in storage, record input time and search time. The narrowing effect of proximity on search strategies is illustrated by recall and precision test results. Usage and problems lead to a number of design ideas for better implementation: some based on existing Boolean strategies, one on the use of weighted proximity to automatically produce ranked output. A comparison of Boolean, quorum and proximate term pair distance searching is included.
Theme
Retrieval studies
Search tactics

Similar documents (author)

  1. Keen, E.M.: The Aberystwyth index languages tests (1973) 5.38
    5.378652 = sum of:
      5.378652 = weight(author_txt:keen in 773) [ClassicSimilarity], result of:
        5.378652 = score(doc=773,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.1162001 = queryNorm
          5.3786526 = fieldWeight in 773, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.625 = fieldNorm(doc=773)
    
  2. Keen, E.M.: Prospects for classification suggested by evaluation tests (1976) 5.38
    5.378652 = sum of:
      5.378652 = weight(author_txt:keen in 1277) [ClassicSimilarity], result of:
        5.378652 = score(doc=1277,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.1162001 = queryNorm
          5.3786526 = fieldWeight in 1277, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.625 = fieldNorm(doc=1277)
    
  3. Keen, E.M.: On the generation and searching of entries in printed subject indexes (1977) 5.38
    5.378652 = sum of:
      5.378652 = weight(author_txt:keen in 2302) [ClassicSimilarity], result of:
        5.378652 = score(doc=2302,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.1162001 = queryNorm
          5.3786526 = fieldWeight in 2302, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.625 = fieldNorm(doc=2302)
    
  4. Keen, E.M.: Presenting results of experimental retrieval comparisons (1992) 5.38
    5.378652 = sum of:
      5.378652 = weight(author_txt:keen in 3644) [ClassicSimilarity], result of:
        5.378652 = score(doc=3644,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.1162001 = queryNorm
          5.3786526 = fieldWeight in 3644, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.625 = fieldNorm(doc=3644)
    
  5. Keen, M.: Query reformulation in ranked output interaction (1994) 5.38
    5.378652 = sum of:
      5.378652 = weight(author_txt:keen in 1065) [ClassicSimilarity], result of:
        5.378652 = score(doc=1065,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.1162001 = queryNorm
          5.3786526 = fieldWeight in 1065, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.6058445 = idf(docFreq=21, maxDocs=44218)
            0.625 = fieldNorm(doc=1065)
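
The five author-match explanations above share the same arithmetic, so it may help to see it reproduced once. The following is a minimal sketch, not the search engine's own code: the function names are hypothetical, and the idf and tf formulas are assumed to be the classic TF-IDF forms (idf = 1 + ln(maxDocs / (docFreq + 1)), tf = sqrt(freq)), chosen because they reproduce the figures shown in the listing.

```python
import math

def classic_idf(doc_freq, max_docs):
    # Assumed classic form; it reproduces idf(docFreq=21, maxDocs=44218) = 8.6058445.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq):
    # Assumed classic form: square root of the term frequency.
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm              # "queryWeight" line
    field_weight = classic_tf(freq) * idf * field_norm   # "fieldWeight" line
    return query_weight * field_weight                   # "score" line

# Values copied from the explanation for author_txt:keen
author_score = term_score(
    freq=1.0, doc_freq=21, max_docs=44218,
    query_norm=0.1162001, field_norm=0.625,
)
print(author_score)  # should land close to the listed 5.378652
```

Because every author match has freq=1.0, the same docFreq and the same fieldNorm, all five documents receive an identical score, which is why the list cannot rank them against each other.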
    

Similar documents (content)

  1. Boeri, R.J.; Hensel, M.: Set up a winning text retrieval system : carefully (1995) 0.18
    0.1761665 = sum of:
      0.1761665 = product of:
        0.8808325 = sum of:
          0.0280636 = weight(abstract_txt:systems in 2809) [ClassicSimilarity], result of:
            0.0280636 = score(doc=2809,freq=1.0), product of:
              0.06580211 = queryWeight, product of:
                1.1528106 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.016729707 = queryNorm
              0.4264848 = fieldWeight in 2809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.125 = fieldNorm(doc=2809)
          0.11408276 = weight(abstract_txt:house in 2809) [ClassicSimilarity], result of:
            0.11408276 = score(doc=2809,freq=1.0), product of:
              0.13302939 = queryWeight, product of:
                1.1590358 = boost
                6.8606052 = idf(docFreq=125, maxDocs=44218)
                0.016729707 = queryNorm
              0.85757565 = fieldWeight in 2809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8606052 = idf(docFreq=125, maxDocs=44218)
                0.125 = fieldNorm(doc=2809)
          0.041936718 = weight(abstract_txt:retrieval in 2809) [ClassicSimilarity], result of:
            0.041936718 = score(doc=2809,freq=2.0), product of:
              0.068264864 = queryWeight, product of:
                1.1741854 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016729707 = queryNorm
              0.6143236 = fieldWeight in 2809, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.125 = fieldNorm(doc=2809)
          0.03515394 = weight(abstract_txt:some in 2809) [ClassicSimilarity], result of:
            0.03515394 = score(doc=2809,freq=1.0), product of:
              0.07646457 = queryWeight, product of:
                1.2427053 = boost
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.016729707 = queryNorm
              0.45974156 = fieldWeight in 2809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.125 = fieldNorm(doc=2809)
          0.66159546 = weight(abstract_txt:proximity in 2809) [ClassicSimilarity], result of:
            0.66159546 = score(doc=2809,freq=1.0), product of:
              0.7342646 = queryWeight, product of:
                6.088837 = boost
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.016729707 = queryNorm
              0.9010314 = fieldWeight in 2809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.125 = fieldNorm(doc=2809)
        0.2 = coord(5/25)
    
  2. Ojala, M.: Who's hosting this search? (1995) 0.15
    0.15469803 = sum of:
      0.15469803 = product of:
        0.96686274 = sum of:
          0.080868945 = weight(abstract_txt:input in 2676) [ClassicSimilarity], result of:
            0.080868945 = score(doc=2676,freq=1.0), product of:
              0.10576016 = queryWeight, product of:
                1.0334373 = boost
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.016729707 = queryNorm
              0.7646447 = fieldWeight in 2676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1171575 = idf(docFreq=264, maxDocs=44218)
                0.125 = fieldNorm(doc=2676)
          0.05188046 = weight(abstract_txt:search in 2676) [ClassicSimilarity], result of:
            0.05188046 = score(doc=2676,freq=1.0), product of:
              0.11346029 = queryWeight, product of:
                1.8539824 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.016729707 = queryNorm
              0.45725656 = fieldWeight in 2676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.125 = fieldNorm(doc=2676)
          0.17251785 = weight(abstract_txt:boolean in 2676) [ClassicSimilarity], result of:
            0.17251785 = score(doc=2676,freq=1.0), product of:
              0.22081755 = queryWeight, product of:
                2.11181 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.016729707 = queryNorm
              0.7812687 = fieldWeight in 2676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.125 = fieldNorm(doc=2676)
          0.66159546 = weight(abstract_txt:proximity in 2676) [ClassicSimilarity], result of:
            0.66159546 = score(doc=2676,freq=1.0), product of:
              0.7342646 = queryWeight, product of:
                6.088837 = boost
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.016729707 = queryNorm
              0.9010314 = fieldWeight in 2676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.125 = fieldNorm(doc=2676)
        0.16 = coord(4/25)
    
  3. Milstead, J.L.: Specifications for thesaurus software (1991) 0.15
    0.15288842 = sum of:
      0.15288842 = product of:
        0.6370351 = sum of:
          0.04721562 = weight(abstract_txt:capabilities in 2291) [ClassicSimilarity], result of:
            0.04721562 = score(doc=2291,freq=1.0), product of:
              0.10106591 = queryWeight, product of:
                1.010242 = boost
                5.9798594 = idf(docFreq=303, maxDocs=44218)
                0.016729707 = queryNorm
              0.4671765 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9798594 = idf(docFreq=303, maxDocs=44218)
                0.078125 = fieldNorm(doc=2291)
          0.017539749 = weight(abstract_txt:systems in 2291) [ClassicSimilarity], result of:
            0.017539749 = score(doc=2291,freq=1.0), product of:
              0.06580211 = queryWeight, product of:
                1.1528106 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.016729707 = queryNorm
              0.26655298 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.078125 = fieldNorm(doc=2291)
          0.018533587 = weight(abstract_txt:retrieval in 2291) [ClassicSimilarity], result of:
            0.018533587 = score(doc=2291,freq=1.0), product of:
              0.068264864 = queryWeight, product of:
                1.1741854 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016729707 = queryNorm
              0.27149525 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.078125 = fieldNorm(doc=2291)
          0.032425288 = weight(abstract_txt:search in 2291) [ClassicSimilarity], result of:
            0.032425288 = score(doc=2291,freq=1.0), product of:
              0.11346029 = queryWeight, product of:
                1.8539824 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.016729707 = queryNorm
              0.28578535 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.078125 = fieldNorm(doc=2291)
          0.10782365 = weight(abstract_txt:boolean in 2291) [ClassicSimilarity], result of:
            0.10782365 = score(doc=2291,freq=1.0), product of:
              0.22081755 = queryWeight, product of:
                2.11181 = boost
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.016729707 = queryNorm
              0.48829293 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2501497 = idf(docFreq=231, maxDocs=44218)
                0.078125 = fieldNorm(doc=2291)
          0.41349718 = weight(abstract_txt:proximity in 2291) [ClassicSimilarity], result of:
            0.41349718 = score(doc=2291,freq=1.0), product of:
              0.7342646 = queryWeight, product of:
                6.088837 = boost
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.016729707 = queryNorm
              0.5631446 = fieldWeight in 2291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.078125 = fieldNorm(doc=2291)
        0.24 = coord(6/25)
    
  4. Clarke, S.J.: Search engines for the World Wide Web : an evaluation of recent developments (2000) 0.14
    0.13902988 = sum of:
      0.13902988 = product of:
        0.8689368 = sum of:
          0.07554499 = weight(abstract_txt:capabilities in 6107) [ClassicSimilarity], result of:
            0.07554499 = score(doc=6107,freq=1.0), product of:
              0.10106591 = queryWeight, product of:
                1.010242 = boost
                5.9798594 = idf(docFreq=303, maxDocs=44218)
                0.016729707 = queryNorm
              0.7474824 = fieldWeight in 6107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9798594 = idf(docFreq=303, maxDocs=44218)
                0.125 = fieldNorm(doc=6107)
          0.041936718 = weight(abstract_txt:retrieval in 6107) [ClassicSimilarity], result of:
            0.041936718 = score(doc=6107,freq=2.0), product of:
              0.068264864 = queryWeight, product of:
                1.1741854 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016729707 = queryNorm
              0.6143236 = fieldWeight in 6107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.125 = fieldNorm(doc=6107)
          0.08985959 = weight(abstract_txt:search in 6107) [ClassicSimilarity], result of:
            0.08985959 = score(doc=6107,freq=3.0), product of:
              0.11346029 = queryWeight, product of:
                1.8539824 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.016729707 = queryNorm
              0.7919916 = fieldWeight in 6107, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.125 = fieldNorm(doc=6107)
          0.66159546 = weight(abstract_txt:proximity in 6107) [ClassicSimilarity], result of:
            0.66159546 = score(doc=6107,freq=1.0), product of:
              0.7342646 = queryWeight, product of:
                6.088837 = boost
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.016729707 = queryNorm
              0.9010314 = fieldWeight in 6107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.208251 = idf(docFreq=88, maxDocs=44218)
                0.125 = fieldNorm(doc=6107)
        0.16 = coord(4/25)
    
  5. Vaughan, L.: New measurements for search engine evaluation proposed and tested (2004) 0.12
    0.121233955 = sum of:
      0.121233955 = product of:
        0.5051415 = sum of:
          0.07024055 = weight(abstract_txt:ranked in 2535) [ClassicSimilarity], result of:
            0.07024055 = score(doc=2535,freq=1.0), product of:
              0.11663225 = queryWeight, product of:
                1.0852565 = boost
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.016729707 = queryNorm
              0.6022395 = fieldWeight in 2535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4238877 = idf(docFreq=194, maxDocs=44218)
                0.09375 = fieldNorm(doc=2535)
          0.029765943 = weight(abstract_txt:systems in 2535) [ClassicSimilarity], result of:
            0.029765943 = score(doc=2535,freq=2.0), product of:
              0.06580211 = queryWeight, product of:
                1.1528106 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.016729707 = queryNorm
              0.45235544 = fieldWeight in 2535, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.09375 = fieldNorm(doc=2535)
          0.03145254 = weight(abstract_txt:retrieval in 2535) [ClassicSimilarity], result of:
            0.03145254 = score(doc=2535,freq=2.0), product of:
              0.068264864 = queryWeight, product of:
                1.1741854 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.016729707 = queryNorm
              0.4607427 = fieldWeight in 2535, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=2535)
          0.026365455 = weight(abstract_txt:some in 2535) [ClassicSimilarity], result of:
            0.026365455 = score(doc=2535,freq=1.0), product of:
              0.07646457 = queryWeight, product of:
                1.2427053 = boost
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.016729707 = queryNorm
              0.34480616 = fieldWeight in 2535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.09375 = fieldNorm(doc=2535)
          0.26031083 = weight(abstract_txt:measurements in 2535) [ClassicSimilarity], result of:
            0.26031083 = score(doc=2535,freq=4.0), product of:
              0.17595498 = queryWeight, product of:
                1.3329806 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.016729707 = queryNorm
              1.4794172 = fieldWeight in 2535, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.09375 = fieldNorm(doc=2535)
          0.08700618 = weight(abstract_txt:search in 2535) [ClassicSimilarity], result of:
            0.08700618 = score(doc=2535,freq=5.0), product of:
              0.11346029 = queryWeight, product of:
                1.8539824 = boost
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.016729707 = queryNorm
              0.7668426 = fieldWeight in 2535, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.6580524 = idf(docFreq=3098, maxDocs=44218)
                0.09375 = fieldNorm(doc=2535)
        0.24 = coord(6/25)
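
The content-based scores add two factors that the author matches lack: a per-term boost and a coordination factor, coord(matched/total), that scales the sum by the share of query terms found in the document. The sketch below recomputes the last entry (doc 2535) from the numbers in its explanation. It is an illustration under assumptions: the helper name is hypothetical, and the idf and tf formulas are the classic TF-IDF forms that happen to reproduce the listed values.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.016729707
FIELD_NORM = 0.09375  # fieldNorm(doc=2535)

# (term, boost, docFreq, termFreq), copied from the explanation for doc 2535
matched_terms = [
    ("ranked",       1.0852565,  194, 1.0),
    ("systems",      1.1528106, 3963, 2.0),
    ("retrieval",    1.1741854, 3720, 2.0),
    ("some",         1.2427053, 3037, 1.0),
    ("measurements", 1.3329806,   44, 4.0),
    ("search",       1.8539824, 3098, 5.0),
]

def term_score(boost, doc_freq, freq):
    # Assumed classic forms: idf = 1 + ln(maxDocs / (docFreq + 1)), tf = sqrt(freq)
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight

raw_sum = sum(term_score(boost, df, freq) for _, boost, df, freq in matched_terms)
coord = 6 / 25                 # 6 of the 25 query terms matched
final_score = raw_sum * coord

print(raw_sum, final_score)    # should land close to the listed 0.5051415 and 0.121233955
```

The coord factor explains why a document with a higher raw sum can still rank lower: doc 2809 sums to 0.8808325 but matches only 5 of 25 terms, while doc 2291 sums to 0.6370351 and matches 6.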