Document (#6190)

Author
Keen, E.M.
Title
Some aspects of proximity searching in text retrieval systems
Source
Journal of information science. 18(1992), S.89-98
Year
1992
Abstract
Describes and evaluates the proximity search facilities in external online systems and in-house retrieval software. Discusses and illustrates capabilities, syntax and circumstances of use. Presents measurements of the overheads required by proximity for storage, record input time and search time. The search strategy narrowing effect of proximity is illustrated by recall and precision test results. Usage and problems lead to a number of design ideas for better implementation: some based on existing Boolean strategies, one on the use of weighted proximity to automatically produce ranked output. A comparison of Boolean, quorum and proximate term pairs distance is included
Theme
Retrievalstudien
Suchtaktik

Similar documents (author)

  1. Keen, E.M.: ¬The Aberystwyth index languages tests (1973) 5.38
    5.3815155 = sum of:
      5.3815155 = weight(author_txt:keen in 772) [ClassicSimilarity], result of:
        5.3815155 = fieldWeight in 772, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.610425 = idf(docFreq=21, maxDocs=44421)
          0.625 = fieldNorm(doc=772)
    
  2. Keen, E.M.: Prospects for classification suggested by evaluation tests (1976) 5.38
    5.3815155 = sum of:
      5.3815155 = weight(author_txt:keen in 1276) [ClassicSimilarity], result of:
        5.3815155 = fieldWeight in 1276, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.610425 = idf(docFreq=21, maxDocs=44421)
          0.625 = fieldNorm(doc=1276)
    
  3. Keen, E.M.: On the generation and searching of entries in printed subject indexes (1977) 5.38
    5.3815155 = sum of:
      5.3815155 = weight(author_txt:keen in 2301) [ClassicSimilarity], result of:
        5.3815155 = fieldWeight in 2301, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.610425 = idf(docFreq=21, maxDocs=44421)
          0.625 = fieldNorm(doc=2301)
    
  4. Keen, E.M.: Presenting results of experimental retrieval comparisons (1992) 5.38
    5.3815155 = sum of:
      5.3815155 = weight(author_txt:keen in 3643) [ClassicSimilarity], result of:
        5.3815155 = fieldWeight in 3643, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.610425 = idf(docFreq=21, maxDocs=44421)
          0.625 = fieldNorm(doc=3643)
    
  5. Keen, M.: Query reformulation in ranked output interaction (1994) 5.38
    5.3815155 = sum of:
      5.3815155 = weight(author_txt:keen in 1133) [ClassicSimilarity], result of:
        5.3815155 = fieldWeight in 1133, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.610425 = idf(docFreq=21, maxDocs=44421)
          0.625 = fieldNorm(doc=1133)
    

Similar documents (content)

  1. Boeri, R.J.; Hensel, M.: Set up a winning text retrieval system : carefully (1995) 0.18
    0.17633858 = sum of:
      0.17633858 = product of:
        0.8816929 = sum of:
          0.028024623 = weight(abstract_txt:systems in 2877) [ClassicSimilarity], result of:
            0.028024623 = score(doc=2877,freq=1.0), product of:
              0.06572427 = queryWeight, product of:
                1.1522831 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.016721012 = queryNorm
              0.42639688 = fieldWeight in 2877, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.125 = fieldNorm(doc=2877)
          0.11422329 = weight(abstract_txt:house in 2877) [ClassicSimilarity], result of:
            0.11422329 = score(doc=2877,freq=1.0), product of:
              0.13310438 = queryWeight, product of:
                1.159518 = boost
                6.8651857 = idf(docFreq=125, maxDocs=44421)
                0.016721012 = queryNorm
              0.8581482 = fieldWeight in 2877, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8651857 = idf(docFreq=125, maxDocs=44421)
                0.125 = fieldNorm(doc=2877)
          0.041953627 = weight(abstract_txt:retrieval in 2877) [ClassicSimilarity], result of:
            0.041953627 = score(doc=2877,freq=2.0), product of:
              0.068265654 = queryWeight, product of:
                1.1743497 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.016721012 = queryNorm
              0.6145642 = fieldWeight in 2877, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.125 = fieldNorm(doc=2877)
          0.035145126 = weight(abstract_txt:some in 2877) [ClassicSimilarity], result of:
            0.035145126 = score(doc=2877,freq=1.0), product of:
              0.07643213 = queryWeight, product of:
                1.2426084 = boost
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.016721012 = queryNorm
              0.45982134 = fieldWeight in 2877, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.125 = fieldNorm(doc=2877)
          0.6623462 = weight(abstract_txt:proximity in 2877) [ClassicSimilarity], result of:
            0.6623462 = score(doc=2877,freq=1.0), product of:
              0.73463106 = queryWeight, product of:
                6.091173 = boost
                7.212831 = idf(docFreq=88, maxDocs=44421)
                0.016721012 = queryNorm
              0.9016039 = fieldWeight in 2877, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.212831 = idf(docFreq=88, maxDocs=44421)
                0.125 = fieldNorm(doc=2877)
        0.2 = coord(5/25)
    
  2. Ojala, M.: Who's hosting this search? (1995) 0.15
    0.15477541 = sum of:
      0.15477541 = product of:
        0.9673463 = sum of:
          0.08054228 = weight(abstract_txt:input in 2744) [ClassicSimilarity], result of:
            0.08054228 = score(doc=2744,freq=1.0), product of:
              0.105448045 = queryWeight, product of:
                1.0320497 = boost
                6.110481 = idf(docFreq=267, maxDocs=44421)
                0.016721012 = queryNorm
              0.7638101 = fieldWeight in 2744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.110481 = idf(docFreq=267, maxDocs=44421)
                0.125 = fieldNorm(doc=2744)
          0.051693726 = weight(abstract_txt:search in 2744) [ClassicSimilarity], result of:
            0.051693726 = score(doc=2744,freq=1.0), product of:
              0.11315877 = queryWeight, product of:
                1.8517658 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.016721012 = queryNorm
              0.45682475 = fieldWeight in 2744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.125 = fieldNorm(doc=2744)
          0.1727641 = weight(abstract_txt:boolean in 2744) [ClassicSimilarity], result of:
            0.1727641 = score(doc=2744,freq=1.0), product of:
              0.22097081 = queryWeight, product of:
                2.112826 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.016721012 = queryNorm
              0.7818413 = fieldWeight in 2744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.125 = fieldNorm(doc=2744)
          0.6623462 = weight(abstract_txt:proximity in 2744) [ClassicSimilarity], result of:
            0.6623462 = score(doc=2744,freq=1.0), product of:
              0.73463106 = queryWeight, product of:
                6.091173 = boost
                7.212831 = idf(docFreq=88, maxDocs=44421)
                0.016721012 = queryNorm
              0.9016039 = fieldWeight in 2744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.212831 = idf(docFreq=88, maxDocs=44421)
                0.125 = fieldNorm(doc=2744)
        0.16 = coord(4/25)
    
  3. Milstead, J.L.: Specifications for thesaurus software (1991) 0.15
    0.15293065 = sum of:
      0.15293065 = product of:
        0.637211 = sum of:
          0.04690205 = weight(abstract_txt:capabilities in 2290) [ClassicSimilarity], result of:
            0.04690205 = score(doc=2290,freq=1.0), product of:
              0.10059208 = queryWeight, product of:
                1.0080062 = boost
                5.9681263 = idf(docFreq=308, maxDocs=44421)
                0.016721012 = queryNorm
              0.46625987 = fieldWeight in 2290, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9681263 = idf(docFreq=308, maxDocs=44421)
                0.078125 = fieldNorm(doc=2290)
          0.01751539 = weight(abstract_txt:systems in 2290) [ClassicSimilarity], result of:
            0.01751539 = score(doc=2290,freq=1.0), product of:
              0.06572427 = queryWeight, product of:
                1.1522831 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.016721012 = queryNorm
              0.26649806 = fieldWeight in 2290, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.078125 = fieldNorm(doc=2290)
          0.018541059 = weight(abstract_txt:retrieval in 2290) [ClassicSimilarity], result of:
            0.018541059 = score(doc=2290,freq=1.0), product of:
              0.068265654 = queryWeight, product of:
                1.1743497 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.016721012 = queryNorm
              0.27160156 = fieldWeight in 2290, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=2290)
          0.03230858 = weight(abstract_txt:search in 2290) [ClassicSimilarity], result of:
            0.03230858 = score(doc=2290,freq=1.0), product of:
              0.11315877 = queryWeight, product of:
                1.8517658 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.016721012 = queryNorm
              0.28551546 = fieldWeight in 2290, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.078125 = fieldNorm(doc=2290)
          0.10797756 = weight(abstract_txt:boolean in 2290) [ClassicSimilarity], result of:
            0.10797756 = score(doc=2290,freq=1.0), product of:
              0.22097081 = queryWeight, product of:
                2.112826 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.016721012 = queryNorm
              0.4886508 = fieldWeight in 2290, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=2290)
          0.4139664 = weight(abstract_txt:proximity in 2290) [ClassicSimilarity], result of:
            0.4139664 = score(doc=2290,freq=1.0), product of:
              0.73463106 = queryWeight, product of:
                6.091173 = boost
                7.212831 = idf(docFreq=88, maxDocs=44421)
                0.016721012 = queryNorm
              0.56350243 = fieldWeight in 2290, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.212831 = idf(docFreq=88, maxDocs=44421)
                0.078125 = fieldNorm(doc=2290)
        0.24 = coord(6/25)
    
  4. Clarke, S.J.: Search engines for the World Wide Web : an evaluation of recent developments (2000) 0.14
    0.13902068 = sum of:
      0.13902068 = product of:
        0.86887926 = sum of:
          0.075043276 = weight(abstract_txt:capabilities in 107) [ClassicSimilarity], result of:
            0.075043276 = score(doc=107,freq=1.0), product of:
              0.10059208 = queryWeight, product of:
                1.0080062 = boost
                5.9681263 = idf(docFreq=308, maxDocs=44421)
                0.016721012 = queryNorm
              0.7460158 = fieldWeight in 107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9681263 = idf(docFreq=308, maxDocs=44421)
                0.125 = fieldNorm(doc=107)
          0.041953627 = weight(abstract_txt:retrieval in 107) [ClassicSimilarity], result of:
            0.041953627 = score(doc=107,freq=2.0), product of:
              0.068265654 = queryWeight, product of:
                1.1743497 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.016721012 = queryNorm
              0.6145642 = fieldWeight in 107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.125 = fieldNorm(doc=107)
          0.08953616 = weight(abstract_txt:search in 107) [ClassicSimilarity], result of:
            0.08953616 = score(doc=107,freq=3.0), product of:
              0.11315877 = queryWeight, product of:
                1.8517658 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.016721012 = queryNorm
              0.7912437 = fieldWeight in 107, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.125 = fieldNorm(doc=107)
          0.6623462 = weight(abstract_txt:proximity in 107) [ClassicSimilarity], result of:
            0.6623462 = score(doc=107,freq=1.0), product of:
              0.73463106 = queryWeight, product of:
                6.091173 = boost
                7.212831 = idf(docFreq=88, maxDocs=44421)
                0.016721012 = queryNorm
              0.9016039 = fieldWeight in 107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.212831 = idf(docFreq=88, maxDocs=44421)
                0.125 = fieldNorm(doc=107)
        0.16 = coord(4/25)
    
  5. Vaughan, L.: New measurements for search engine evaluation proposed and tested (2004) 0.12
    0.121233985 = sum of:
      0.121233985 = product of:
        0.5051416 = sum of:
          0.070336625 = weight(abstract_txt:ranked in 3535) [ClassicSimilarity], result of:
            0.070336625 = score(doc=3535,freq=1.0), product of:
              0.11670858 = queryWeight, product of:
                1.0857571 = boost
                6.428468 = idf(docFreq=194, maxDocs=44421)
                0.016721012 = queryNorm
              0.6026689 = fieldWeight in 3535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.428468 = idf(docFreq=194, maxDocs=44421)
                0.09375 = fieldNorm(doc=3535)
          0.029724602 = weight(abstract_txt:systems in 3535) [ClassicSimilarity], result of:
            0.029724602 = score(doc=3535,freq=2.0), product of:
              0.06572427 = queryWeight, product of:
                1.1522831 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.016721012 = queryNorm
              0.4522622 = fieldWeight in 3535, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.09375 = fieldNorm(doc=3535)
          0.03146522 = weight(abstract_txt:retrieval in 3535) [ClassicSimilarity], result of:
            0.03146522 = score(doc=3535,freq=2.0), product of:
              0.068265654 = queryWeight, product of:
                1.1743497 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.016721012 = queryNorm
              0.46092314 = fieldWeight in 3535, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=3535)
          0.026358845 = weight(abstract_txt:some in 3535) [ClassicSimilarity], result of:
            0.026358845 = score(doc=3535,freq=1.0), product of:
              0.07643213 = queryWeight, product of:
                1.2426084 = boost
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.016721012 = queryNorm
              0.344866 = fieldWeight in 3535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.09375 = fieldNorm(doc=3535)
          0.26056328 = weight(abstract_txt:measurements in 3535) [ClassicSimilarity], result of:
            0.26056328 = score(doc=3535,freq=4.0), product of:
              0.17602345 = queryWeight, product of:
                1.3334188 = boost
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.016721012 = queryNorm
              1.4802759 = fieldWeight in 3535, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.09375 = fieldNorm(doc=3535)
          0.08669302 = weight(abstract_txt:search in 3535) [ClassicSimilarity], result of:
            0.08669302 = score(doc=3535,freq=5.0), product of:
              0.11315877 = queryWeight, product of:
                1.8517658 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.016721012 = queryNorm
              0.7661184 = fieldWeight in 3535, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.09375 = fieldNorm(doc=3535)
        0.24 = coord(6/25)