Document (#12600)

Author
Nakkouzi, Z.S.
Eastman, C.M.
Title
Query formulation for handling negation in information retrieval systems
Source
Journal of the American Society for Information Science. 41(1990) no.3, S.171-182
Year
1990
Abstract
Queries containing negation are widely recognised as presenting problems for both users and systems. In information retrieval systems such problems usually manifest themselves in the use of the NOT operator. Describes an algorithm to transform Boolean queries with negated terms into queries without negation; the transformation process is based on the use of a hierarchical thesaurus. Examines a set of user requests submitted to the Thomas Cooper Library at the University of South Carolina to determine the pattern and frequency of use of negation.
Theme
Retrievalalgorithmen

Similar documents (author)

  1. Eastman, C.M.: Overlaps in postings to thesaurus terms : a preliminary study (1988) 5.81
    5.814733 = sum of:
      5.814733 = weight(author_txt:eastman in 3623) [ClassicSimilarity], result of:
        5.814733 = fieldWeight in 3623, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.625 = fieldNorm(doc=3623)
    
  2. Eastman, C.M.: 30,000 hits may be better than 300 : precision anomalies in Internet searches (2002) 5.81
    5.814733 = sum of:
      5.814733 = weight(author_txt:eastman in 231) [ClassicSimilarity], result of:
        5.814733 = fieldWeight in 231, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.625 = fieldNorm(doc=231)
    
  3. Chang, Y.F.; Eastman, C.M.: ¬An information retrieval system for reusable software (1993) 4.65
    4.6517863 = sum of:
      4.6517863 = weight(author_txt:eastman in 6347) [ClassicSimilarity], result of:
        4.6517863 = fieldWeight in 6347, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.5 = fieldNorm(doc=6347)
    
  4. Eastman, C.M.; Carter, R.M.: Anthropological perspectives on classification schemes (1994) 4.65
    4.6517863 = sum of:
      4.6517863 = weight(author_txt:eastman in 502) [ClassicSimilarity], result of:
        4.6517863 = fieldWeight in 502, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.5 = fieldNorm(doc=502)
    
  5. Rose, J.R.; Eastman, C.M.: Hierarchical classification as an aid to browsing (1994) 4.65
    4.6517863 = sum of:
      4.6517863 = weight(author_txt:eastman in 508) [ClassicSimilarity], result of:
        4.6517863 = fieldWeight in 508, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.5 = fieldNorm(doc=508)
    

Similar documents (content)

  1. Klein, S.T.: On the use of negation in Boolean IR queries. (2009) 0.32
    0.32003844 = sum of:
      0.32003844 = product of:
        1.3334935 = sum of:
          0.05619666 = weight(abstract_txt:boolean in 914) [ClassicSimilarity], result of:
            0.05619666 = score(doc=914,freq=1.0), product of:
              0.09583643 = queryWeight, product of:
                1.0514104 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.014573026 = queryNorm
              0.58638096 = fieldWeight in 914, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.09375 = fieldNorm(doc=914)
          0.019299297 = weight(abstract_txt:retrieval in 914) [ClassicSimilarity], result of:
            0.019299297 = score(doc=914,freq=1.0), product of:
              0.059214484 = queryWeight, product of:
                1.1687886 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.014573026 = queryNorm
              0.3259219 = fieldWeight in 914, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=914)
          0.12828957 = weight(abstract_txt:operator in 914) [ClassicSimilarity], result of:
            0.12828957 = score(doc=914,freq=1.0), product of:
              0.16615671 = queryWeight, product of:
                1.3844137 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.014573026 = queryNorm
              0.77209985 = fieldWeight in 914, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.09375 = fieldNorm(doc=914)
          0.35436022 = weight(abstract_txt:negated in 914) [ClassicSimilarity], result of:
            0.35436022 = score(doc=914,freq=3.0), product of:
              0.22680183 = queryWeight, product of:
                1.6174477 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.014573026 = queryNorm
              1.5624223 = fieldWeight in 914, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.09375 = fieldNorm(doc=914)
          0.1584474 = weight(abstract_txt:queries in 914) [ClassicSimilarity], result of:
            0.1584474 = score(doc=914,freq=3.0), product of:
              0.19127008 = queryWeight, product of:
                2.5727117 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.014573026 = queryNorm
              0.82839614 = fieldWeight in 914, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.09375 = fieldNorm(doc=914)
          0.6169004 = weight(abstract_txt:negation in 914) [ClassicSimilarity], result of:
            0.6169004 = score(doc=914,freq=1.0), product of:
              0.75142735 = queryWeight, product of:
                5.8881717 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.014573026 = queryNorm
              0.8209714 = fieldWeight in 914, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.09375 = fieldNorm(doc=914)
        0.24 = coord(6/25)
    
  2. Young, C.W.; Eastman, C.M.; Oakman, R.L.: ¬An analysis of ill-formed input in natural language queries to document retrieval systems (1991) 0.26
    0.26084122 = sum of:
      0.26084122 = product of:
        0.72455895 = sum of:
          0.040291402 = weight(abstract_txt:frequency in 6263) [ClassicSimilarity], result of:
            0.040291402 = score(doc=6263,freq=1.0), product of:
              0.086693406 = queryWeight, product of:
                5.948895 = idf(docFreq=314, maxDocs=44421)
                0.014573026 = queryNorm
              0.4647574 = fieldWeight in 6263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.948895 = idf(docFreq=314, maxDocs=44421)
                0.078125 = fieldNorm(doc=6263)
          0.06348382 = weight(abstract_txt:requests in 6263) [ClassicSimilarity], result of:
            0.06348382 = score(doc=6263,freq=1.0), product of:
              0.11738695 = queryWeight, product of:
                1.1636353 = boost
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.014573026 = queryNorm
              0.54080814 = fieldWeight in 6263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.078125 = fieldNorm(doc=6263)
          0.016082747 = weight(abstract_txt:retrieval in 6263) [ClassicSimilarity], result of:
            0.016082747 = score(doc=6263,freq=1.0), product of:
              0.059214484 = queryWeight, product of:
                1.1687886 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.014573026 = queryNorm
              0.27160156 = fieldWeight in 6263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=6263)
          0.06567217 = weight(abstract_txt:south in 6263) [ClassicSimilarity], result of:
            0.06567217 = score(doc=6263,freq=1.0), product of:
              0.12006931 = queryWeight, product of:
                1.1768551 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.014573026 = queryNorm
              0.5469521 = fieldWeight in 6263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.078125 = fieldNorm(doc=6263)
          0.07747135 = weight(abstract_txt:thomas in 6263) [ClassicSimilarity], result of:
            0.07747135 = score(doc=6263,freq=1.0), product of:
              0.13405156 = queryWeight, product of:
                1.2434918 = boost
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.014573026 = queryNorm
              0.57792205 = fieldWeight in 6263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.078125 = fieldNorm(doc=6263)
          0.10814914 = weight(abstract_txt:carolina in 6263) [ClassicSimilarity], result of:
            0.10814914 = score(doc=6263,freq=1.0), product of:
              0.16744025 = queryWeight, product of:
                1.3897507 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.014573026 = queryNorm
              0.6458969 = fieldWeight in 6263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.078125 = fieldNorm(doc=6263)
          0.030450573 = weight(abstract_txt:problems in 6263) [ClassicSimilarity], result of:
            0.030450573 = score(doc=6263,freq=1.0), product of:
              0.09062572 = queryWeight, product of:
                1.4459314 = boost
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.014573026 = queryNorm
              0.33600366 = fieldWeight in 6263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.078125 = fieldNorm(doc=6263)
          0.17049165 = weight(abstract_txt:cooper in 6263) [ClassicSimilarity], result of:
            0.17049165 = score(doc=6263,freq=1.0), product of:
              0.22680183 = queryWeight, product of:
                1.6174477 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.014573026 = queryNorm
              0.7517208 = fieldWeight in 6263, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.078125 = fieldNorm(doc=6263)
          0.15246609 = weight(abstract_txt:queries in 6263) [ClassicSimilarity], result of:
            0.15246609 = score(doc=6263,freq=4.0), product of:
              0.19127008 = queryWeight, product of:
                2.5727117 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.014573026 = queryNorm
              0.79712456 = fieldWeight in 6263, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.078125 = fieldNorm(doc=6263)
        0.36 = coord(9/25)
    
  3. McQuire, A.R.; Eastman, C.M.: ¬The ambiguity of negation in natural language queries to information retrieval systems (1998) 0.21
    0.21318422 = sum of:
      0.21318422 = product of:
        1.0659211 = sum of:
          0.012866197 = weight(abstract_txt:retrieval in 2147) [ClassicSimilarity], result of:
            0.012866197 = score(doc=2147,freq=1.0), product of:
              0.059214484 = queryWeight, product of:
                1.1687886 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.014573026 = queryNorm
              0.21728125 = fieldWeight in 2147, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=2147)
          0.23624016 = weight(abstract_txt:negated in 2147) [ClassicSimilarity], result of:
            0.23624016 = score(doc=2147,freq=3.0), product of:
              0.22680183 = queryWeight, product of:
                1.6174477 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.014573026 = queryNorm
              1.0416149 = fieldWeight in 2147, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.0625 = fieldNorm(doc=2147)
          0.018231682 = weight(abstract_txt:systems in 2147) [ClassicSimilarity], result of:
            0.018231682 = score(doc=2147,freq=1.0), product of:
              0.08551508 = queryWeight, product of:
                1.7202396 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.014573026 = queryNorm
              0.21319844 = fieldWeight in 2147, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.0625 = fieldNorm(doc=2147)
          0.08624784 = weight(abstract_txt:queries in 2147) [ClassicSimilarity], result of:
            0.08624784 = score(doc=2147,freq=2.0), product of:
              0.19127008 = queryWeight, product of:
                2.5727117 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.014573026 = queryNorm
              0.45092174 = fieldWeight in 2147, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0625 = fieldNorm(doc=2147)
          0.71233517 = weight(abstract_txt:negation in 2147) [ClassicSimilarity], result of:
            0.71233517 = score(doc=2147,freq=3.0), product of:
              0.75142735 = queryWeight, product of:
                5.8881717 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.014573026 = queryNorm
              0.9479761 = fieldWeight in 2147, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0625 = fieldNorm(doc=2147)
        0.2 = coord(5/25)
    
  4. Lucas, W.; Topi, H.: Form and function : the impact of query term and operator usage on Web search results (2002) 0.10
    0.099112995 = sum of:
      0.099112995 = product of:
        0.4129708 = sum of:
          0.03746444 = weight(abstract_txt:boolean in 1198) [ClassicSimilarity], result of:
            0.03746444 = score(doc=1198,freq=1.0), product of:
              0.09583643 = queryWeight, product of:
                1.0514104 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.014573026 = queryNorm
              0.39092064 = fieldWeight in 1198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.0625 = fieldNorm(doc=1198)
          0.012866197 = weight(abstract_txt:retrieval in 1198) [ClassicSimilarity], result of:
            0.012866197 = score(doc=1198,freq=1.0), product of:
              0.059214484 = queryWeight, product of:
                1.1687886 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.014573026 = queryNorm
              0.21728125 = fieldWeight in 1198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=1198)
          0.07429957 = weight(abstract_txt:submitted in 1198) [ClassicSimilarity], result of:
            0.07429957 = score(doc=1198,freq=2.0), product of:
              0.12006931 = queryWeight, product of:
                1.1768551 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.014573026 = queryNorm
              0.61880565 = fieldWeight in 1198, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0625 = fieldNorm(doc=1198)
          0.14813605 = weight(abstract_txt:operator in 1198) [ClassicSimilarity], result of:
            0.14813605 = score(doc=1198,freq=3.0), product of:
              0.16615671 = queryWeight, product of:
                1.3844137 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.014573026 = queryNorm
              0.89154416 = fieldWeight in 1198, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.0625 = fieldNorm(doc=1198)
          0.018231682 = weight(abstract_txt:systems in 1198) [ClassicSimilarity], result of:
            0.018231682 = score(doc=1198,freq=1.0), product of:
              0.08551508 = queryWeight, product of:
                1.7202396 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.014573026 = queryNorm
              0.21319844 = fieldWeight in 1198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.0625 = fieldNorm(doc=1198)
          0.12197287 = weight(abstract_txt:queries in 1198) [ClassicSimilarity], result of:
            0.12197287 = score(doc=1198,freq=4.0), product of:
              0.19127008 = queryWeight, product of:
                2.5727117 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.014573026 = queryNorm
              0.63769966 = fieldWeight in 1198, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0625 = fieldNorm(doc=1198)
        0.24 = coord(6/25)
    
  5. Spink, A.; Wolfram, D.; Jansen, B.J.; Saracevic, T.: Searching the Web : the public and their queries (2001) 0.08
    0.08342753 = sum of:
      0.08342753 = product of:
        0.34761474 = sum of:
          0.024174843 = weight(abstract_txt:frequency in 980) [ClassicSimilarity], result of:
            0.024174843 = score(doc=980,freq=1.0), product of:
              0.086693406 = queryWeight, product of:
                5.948895 = idf(docFreq=314, maxDocs=44421)
                0.014573026 = queryNorm
              0.27885446 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.948895 = idf(docFreq=314, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.039737035 = weight(abstract_txt:boolean in 980) [ClassicSimilarity], result of:
            0.039737035 = score(doc=980,freq=2.0), product of:
              0.09583643 = queryWeight, product of:
                1.0514104 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.014573026 = queryNorm
              0.41463393 = fieldWeight in 980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.028452935 = weight(abstract_txt:containing in 980) [ClassicSimilarity], result of:
            0.028452935 = score(doc=980,freq=1.0), product of:
              0.09664106 = queryWeight, product of:
                1.055815 = boost
                6.2809324 = idf(docFreq=225, maxDocs=44421)
                0.014573026 = queryNorm
              0.2944187 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2809324 = idf(docFreq=225, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.039403297 = weight(abstract_txt:submitted in 980) [ClassicSimilarity], result of:
            0.039403297 = score(doc=980,freq=1.0), product of:
              0.12006931 = queryWeight, product of:
                1.1768551 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.014573026 = queryNorm
              0.32817125 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.06414478 = weight(abstract_txt:operator in 980) [ClassicSimilarity], result of:
            0.06414478 = score(doc=980,freq=1.0), product of:
              0.16615671 = queryWeight, product of:
                1.3844137 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.014573026 = queryNorm
              0.38604993 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.15170184 = weight(abstract_txt:queries in 980) [ClassicSimilarity], result of:
            0.15170184 = score(doc=980,freq=11.0), product of:
              0.19127008 = queryWeight, product of:
                2.5727117 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.014573026 = queryNorm
              0.79312897 = fieldWeight in 980, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
        0.24 = coord(6/25)