Document (#19317)

Author
Keen, E.M.
Hartley, R.J.
Title
Phrase processing in text retrieval
Source
Journal of document and text management. 2(1994) no.1, pp.23-34
Year
1994
Abstract
After introducing types of records, queries and text processing options, the authors identify the features needed in software for phrase processing and enumerate the different approaches taken in current text retrieval research in the Text Retrieval Conference (TREC) projects. Eight observations on issues in phrase searching follow, relating both to practice and to research and giving the authors' selection of crucial and controversial issues, supported by 21 references.
Theme
Retrieval studies
Object
TREC

Similar documents (author)

  1. Keen, E.M.: The Aberystwyth index languages tests (1973) 1.98
    1.97831 = sum of:
      1.97831 = product of:
        3.95662 = sum of:
          3.95662 = weight(author_txt:keen in 772) [ClassicSimilarity], result of:
            3.95662 = score(doc=772,freq=1.0), product of:
              0.7352241 = queryWeight, product of:
                1.0414811 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.08198677 = queryNorm
              5.3815155 = fieldWeight in 772, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.625 = fieldNorm(doc=772)
        0.5 = coord(1/2)
    
  2. Keen, E.M.: Prospects for classification suggested by evaluation tests (1976) 1.98
    1.97831 = sum of:
      1.97831 = product of:
        3.95662 = sum of:
          3.95662 = weight(author_txt:keen in 1276) [ClassicSimilarity], result of:
            3.95662 = score(doc=1276,freq=1.0), product of:
              0.7352241 = queryWeight, product of:
                1.0414811 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.08198677 = queryNorm
              5.3815155 = fieldWeight in 1276, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.625 = fieldNorm(doc=1276)
        0.5 = coord(1/2)
    
  3. Keen, E.M.: On the generation and searching of entries in printed subject indexes (1977) 1.98
    1.97831 = sum of:
      1.97831 = product of:
        3.95662 = sum of:
          3.95662 = weight(author_txt:keen in 2301) [ClassicSimilarity], result of:
            3.95662 = score(doc=2301,freq=1.0), product of:
              0.7352241 = queryWeight, product of:
                1.0414811 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.08198677 = queryNorm
              5.3815155 = fieldWeight in 2301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.625 = fieldNorm(doc=2301)
        0.5 = coord(1/2)
    
  4. Keen, E.M.: Presenting results of experimental retrieval comparisons (1992) 1.98
    1.97831 = sum of:
      1.97831 = product of:
        3.95662 = sum of:
          3.95662 = weight(author_txt:keen in 3643) [ClassicSimilarity], result of:
            3.95662 = score(doc=3643,freq=1.0), product of:
              0.7352241 = queryWeight, product of:
                1.0414811 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.08198677 = queryNorm
              5.3815155 = fieldWeight in 3643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.625 = fieldNorm(doc=3643)
        0.5 = coord(1/2)
    
  5. Keen, E.M.: Some aspects of proximity searching in text retrieval systems (1992) 1.98
    1.97831 = sum of:
      1.97831 = product of:
        3.95662 = sum of:
          3.95662 = weight(author_txt:keen in 6189) [ClassicSimilarity], result of:
            3.95662 = score(doc=6189,freq=1.0), product of:
              0.7352241 = queryWeight, product of:
                1.0414811 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.08198677 = queryNorm
              5.3815155 = fieldWeight in 6189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.625 = fieldNorm(doc=6189)
        0.5 = coord(1/2)
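
The five author matches above share one score tree, and its figures can be recomputed from the inputs it lists. The following is a minimal sketch, assuming the usual ClassicSimilarity definitions idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq); the function names are illustrative and are not part of the catalogue software.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, assumed to be 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, max_docs: int,
               boost: float, query_norm: float, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    tf = math.sqrt(freq)                      # tf(freq=1.0) = 1.0
    query_weight = boost * idf * query_norm   # 0.7352241 in the tree above
    field_weight = tf * idf * field_norm      # 5.3815155 in the tree above
    return query_weight * field_weight


# Inputs copied from the explain output for author_txt:keen.
score = term_score(freq=1.0, doc_freq=21, max_docs=44421,
                   boost=1.0414811, query_norm=0.08198677, field_norm=0.625)

# coord(1/2): one of the two query clauses matched, so the sum is halved.
final_score = score * (1 / 2)

print(f"term score  = {score:.5f}")
print(f"final score = {final_score:.5f}")
```

Small differences in the last digits are to be expected, since the listed values were presumably computed in single precision.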
    

Similar documents (content)

  1. Fagan, J.L.: The effectiveness of a nonsyntactic approach to automatic phrase indexing for document retrieval (1989) 0.16
    0.16266692 = sum of:
      0.16266692 = product of:
        0.8133346 = sum of:
          0.034565486 = weight(abstract_txt:needed in 2845) [ClassicSimilarity], result of:
            0.034565486 = score(doc=2845,freq=1.0), product of:
              0.105004005 = queryWeight, product of:
                1.0324062 = boost
                5.266921 = idf(docFreq=622, maxDocs=44421)
                0.019310718 = queryNorm
              0.32918257 = fieldWeight in 2845, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.266921 = idf(docFreq=622, maxDocs=44421)
                0.0625 = fieldNorm(doc=2845)
          0.014924203 = weight(abstract_txt:research in 2845) [ClassicSimilarity], result of:
            0.014924203 = score(doc=2845,freq=1.0), product of:
              0.07557558 = queryWeight, product of:
                1.2386638 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.019310718 = queryNorm
              0.19747387 = fieldWeight in 2845, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0625 = fieldNorm(doc=2845)
          0.051651258 = weight(abstract_txt:retrieval in 2845) [ClassicSimilarity], result of:
            0.051651258 = score(doc=2845,freq=3.0), product of:
              0.13724548 = queryWeight, product of:
                2.04436 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019310718 = queryNorm
              0.37634215 = fieldWeight in 2845, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=2845)
          0.088303074 = weight(abstract_txt:text in 2845) [ClassicSimilarity], result of:
            0.088303074 = score(doc=2845,freq=2.0), product of:
              0.24723198 = queryWeight, product of:
                3.168327 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019310718 = queryNorm
              0.3571669 = fieldWeight in 2845, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=2845)
          0.6238906 = weight(abstract_txt:phrase in 2845) [ClassicSimilarity], result of:
            0.6238906 = score(doc=2845,freq=6.0), product of:
              0.5734643 = queryWeight, product of:
                4.1788955 = boost
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.019310718 = queryNorm
              1.0879328 = fieldWeight in 2845, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.0625 = fieldNorm(doc=2845)
        0.2 = coord(5/25)
    
  2. Fidel, R.; Efthimiadis, E.N.: Terminological knowledge structure for intermediary expert systems (1995) 0.15
    0.14502376 = sum of:
      0.14502376 = product of:
        0.6042657 = sum of:
          0.045929823 = weight(abstract_txt:selection in 6695) [ClassicSimilarity], result of:
            0.045929823 = score(doc=6695,freq=1.0), product of:
              0.1093706 = queryWeight, product of:
                1.053654 = boost
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.019310718 = queryNorm
              0.41994673 = fieldWeight in 6695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.078125 = fieldNorm(doc=6695)
          0.018655254 = weight(abstract_txt:research in 6695) [ClassicSimilarity], result of:
            0.018655254 = score(doc=6695,freq=1.0), product of:
              0.07557558 = queryWeight, product of:
                1.2386638 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.019310718 = queryNorm
              0.24684234 = fieldWeight in 6695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.078125 = fieldNorm(doc=6695)
          0.037276085 = weight(abstract_txt:retrieval in 6695) [ClassicSimilarity], result of:
            0.037276085 = score(doc=6695,freq=1.0), product of:
              0.13724548 = queryWeight, product of:
                2.04436 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019310718 = queryNorm
              0.27160156 = fieldWeight in 6695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=6695)
          0.10597704 = weight(abstract_txt:processing in 6695) [ClassicSimilarity], result of:
            0.10597704 = score(doc=6695,freq=1.0), product of:
              0.2754349 = queryWeight, product of:
                2.8961284 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.019310718 = queryNorm
              0.38476256 = fieldWeight in 6695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.078125 = fieldNorm(doc=6695)
          0.07804963 = weight(abstract_txt:text in 6695) [ClassicSimilarity], result of:
            0.07804963 = score(doc=6695,freq=1.0), product of:
              0.24723198 = queryWeight, product of:
                3.168327 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019310718 = queryNorm
              0.3156939 = fieldWeight in 6695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=6695)
          0.31837785 = weight(abstract_txt:phrase in 6695) [ClassicSimilarity], result of:
            0.31837785 = score(doc=6695,freq=1.0), product of:
              0.5734643 = queryWeight, product of:
                4.1788955 = boost
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.019310718 = queryNorm
              0.5551834 = fieldWeight in 6695, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.078125 = fieldNorm(doc=6695)
        0.24 = coord(6/25)
    
  3. TREC: experiment and evaluation in information retrieval (2005) 0.14
    0.14129192 = sum of:
      0.14129192 = product of:
        0.5887163 = sum of:
          0.040744867 = weight(abstract_txt:conference in 761) [ClassicSimilarity], result of:
            0.040744867 = score(doc=761,freq=1.0), product of:
              0.117172584 = queryWeight, product of:
                1.090588 = boost
                5.5637407 = idf(docFreq=462, maxDocs=44421)
                0.019310718 = queryNorm
              0.3477338 = fieldWeight in 761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5637407 = idf(docFreq=462, maxDocs=44421)
                0.0625 = fieldNorm(doc=761)
          0.033371534 = weight(abstract_txt:research in 761) [ClassicSimilarity], result of:
            0.033371534 = score(doc=761,freq=5.0), product of:
              0.07557558 = queryWeight, product of:
                1.2386638 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.019310718 = queryNorm
              0.441565 = fieldWeight in 761, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0625 = fieldNorm(doc=761)
          0.20163634 = weight(abstract_txt:trec in 761) [ClassicSimilarity], result of:
            0.20163634 = score(doc=761,freq=8.0), product of:
              0.17013486 = queryWeight, product of:
                1.3141482 = boost
                6.704255 = idf(docFreq=147, maxDocs=44421)
                0.019310718 = queryNorm
              1.185156 = fieldWeight in 761, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.704255 = idf(docFreq=147, maxDocs=44421)
                0.0625 = fieldNorm(doc=761)
          0.103302516 = weight(abstract_txt:retrieval in 761) [ClassicSimilarity], result of:
            0.103302516 = score(doc=761,freq=12.0), product of:
              0.13724548 = queryWeight, product of:
                2.04436 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019310718 = queryNorm
              0.7526843 = fieldWeight in 761, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=761)
          0.08478163 = weight(abstract_txt:processing in 761) [ClassicSimilarity], result of:
            0.08478163 = score(doc=761,freq=1.0), product of:
              0.2754349 = queryWeight, product of:
                2.8961284 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.019310718 = queryNorm
              0.30781004 = fieldWeight in 761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.0625 = fieldNorm(doc=761)
          0.124879405 = weight(abstract_txt:text in 761) [ClassicSimilarity], result of:
            0.124879405 = score(doc=761,freq=4.0), product of:
              0.24723198 = queryWeight, product of:
                3.168327 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019310718 = queryNorm
              0.50511026 = fieldWeight in 761, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=761)
        0.24 = coord(6/25)
    
  4. Frohmann, B.: Rules of indexing : a critique of mentalism in information retrieval theory (1990) 0.14
    0.14054972 = sum of:
      0.14054972 = product of:
        0.7027486 = sum of:
          0.022386303 = weight(abstract_txt:research in 3907) [ClassicSimilarity], result of:
            0.022386303 = score(doc=3907,freq=1.0), product of:
              0.07557558 = queryWeight, product of:
                1.2386638 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.019310718 = queryNorm
              0.2962108 = fieldWeight in 3907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.09375 = fieldNorm(doc=3907)
          0.07747688 = weight(abstract_txt:retrieval in 3907) [ClassicSimilarity], result of:
            0.07747688 = score(doc=3907,freq=3.0), product of:
              0.13724548 = queryWeight, product of:
                2.04436 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019310718 = queryNorm
              0.5645132 = fieldWeight in 3907, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=3907)
          0.12717244 = weight(abstract_txt:processing in 3907) [ClassicSimilarity], result of:
            0.12717244 = score(doc=3907,freq=1.0), product of:
              0.2754349 = queryWeight, product of:
                2.8961284 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.019310718 = queryNorm
              0.46171504 = fieldWeight in 3907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.09375 = fieldNorm(doc=3907)
          0.09365956 = weight(abstract_txt:text in 3907) [ClassicSimilarity], result of:
            0.09365956 = score(doc=3907,freq=1.0), product of:
              0.24723198 = queryWeight, product of:
                3.168327 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019310718 = queryNorm
              0.3788327 = fieldWeight in 3907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=3907)
          0.3820534 = weight(abstract_txt:phrase in 3907) [ClassicSimilarity], result of:
            0.3820534 = score(doc=3907,freq=1.0), product of:
              0.5734643 = queryWeight, product of:
                4.1788955 = boost
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.019310718 = queryNorm
              0.66622007 = fieldWeight in 3907, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.09375 = fieldNorm(doc=3907)
        0.2 = coord(5/25)
    
  5. Harman, D.: The Text REtrieval Conferences (TRECs) : providing a test-bed for information retrieval systems (1998) 0.14
    0.14047997 = sum of:
      0.14047997 = product of:
        0.5853332 = sum of:
          0.052669592 = weight(abstract_txt:projects in 2314) [ClassicSimilarity], result of:
            0.052669592 = score(doc=2314,freq=1.0), product of:
              0.10611006 = queryWeight, product of:
                1.0378294 = boost
                5.2945876 = idf(docFreq=605, maxDocs=44421)
                0.019310718 = queryNorm
              0.49636757 = fieldWeight in 2314, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2945876 = idf(docFreq=605, maxDocs=44421)
                0.09375 = fieldNorm(doc=2314)
          0.105858274 = weight(abstract_txt:conference in 2314) [ClassicSimilarity], result of:
            0.105858274 = score(doc=2314,freq=3.0), product of:
              0.117172584 = queryWeight, product of:
                1.090588 = boost
                5.5637407 = idf(docFreq=462, maxDocs=44421)
                0.019310718 = queryNorm
              0.9034389 = fieldWeight in 2314, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5637407 = idf(docFreq=462, maxDocs=44421)
                0.09375 = fieldNorm(doc=2314)
          0.031659015 = weight(abstract_txt:research in 2314) [ClassicSimilarity], result of:
            0.031659015 = score(doc=2314,freq=2.0), product of:
              0.07557558 = queryWeight, product of:
                1.2386638 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.019310718 = queryNorm
              0.41890535 = fieldWeight in 2314, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.09375 = fieldNorm(doc=2314)
          0.18521482 = weight(abstract_txt:trec in 2314) [ClassicSimilarity], result of:
            0.18521482 = score(doc=2314,freq=3.0), product of:
              0.17013486 = queryWeight, product of:
                1.3141482 = boost
                6.704255 = idf(docFreq=147, maxDocs=44421)
                0.019310718 = queryNorm
              1.0886353 = fieldWeight in 2314, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.704255 = idf(docFreq=147, maxDocs=44421)
                0.09375 = fieldNorm(doc=2314)
          0.07747688 = weight(abstract_txt:retrieval in 2314) [ClassicSimilarity], result of:
            0.07747688 = score(doc=2314,freq=3.0), product of:
              0.13724548 = queryWeight, product of:
                2.04436 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019310718 = queryNorm
              0.5645132 = fieldWeight in 2314, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=2314)
          0.13245462 = weight(abstract_txt:text in 2314) [ClassicSimilarity], result of:
            0.13245462 = score(doc=2314,freq=2.0), product of:
              0.24723198 = queryWeight, product of:
                3.168327 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019310718 = queryNorm
              0.5357503 = fieldWeight in 2314, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=2314)
        0.24 = coord(6/25)
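
The content-based scores above combine several matching abstract terms: each term contributes queryWeight * fieldWeight, the contributions are summed, and the sum is scaled by coord (matched terms over the 25 query terms). The sketch below recomputes the score of the first content match (doc 2845) from the listed inputs, again assuming idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq); the variable and function names are illustrative only.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.019310718
FIELD_NORM = 0.0625        # fieldNorm(doc=2845)
QUERY_TERMS = 25           # denominator of coord(5/25)

# (term, freq, docFreq, boost) copied from the explain output for doc 2845.
matches = [
    ("needed",    1.0,  622, 1.0324062),
    ("research",  1.0, 5124, 1.2386638),
    ("retrieval", 3.0, 3732, 2.04436),
    ("text",      2.0, 2122, 3.168327),
    ("phrase",    6.0,   98, 4.1788955),
]


def term_weight(freq: float, doc_freq: int, boost: float) -> float:
    """Contribution of one matching term: queryWeight * fieldWeight."""
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


raw_sum = sum(term_weight(freq, df, boost) for _, freq, df, boost in matches)
coord = len(matches) / QUERY_TERMS   # 5 of 25 query terms matched
total = raw_sum * coord

for term, freq, df, boost in matches:
    print(f"{term:<10} {term_weight(freq, df, boost):.6f}")
print(f"sum = {raw_sum:.6f}, coord = {coord}, score = {total:.6f}")
```

The rare term "phrase" (docFreq=98, freq=6.0) dominates the sum, which is why this record ranks first despite matching only five of the 25 query terms.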