Document (#35928)

Author
Klein, S.T.
Title
On the use of negation in Boolean IR queries.
Source
Information processing and management. 45(2009) no.2, S.298-311
Year
2009
Abstract
The negation operator, in various forms in which it appears in Information Retrieval queries, is investigated. The applications include negated terms in Boolean queries, more specifically in the presence of metrical constraints, but also negated characters used in the definition of extended keywords by means of regular expressions. Exact definitions are suggested and their usefulness is shown on several examples. Finally, some implementation issues are discussed, in particular as to the order in which the terms of long queries, with or without negated keywords, should be processed, and efficient heuristics for choosing a good order are suggested.
Theme
Retrievalalgorithmen

Similar documents (author)

  1. Klein, W.: Organisation des Wissens durch Sprache : Konsequenzen für die maschinelle Sprachanalyse (1977) 4.96
    4.9626675 = sum of:
      4.9626675 = weight(author_txt:klein in 1747) [ClassicSimilarity], result of:
        4.9626675 = fieldWeight in 1747, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.9402676 = idf(docFreq=42, maxDocs=44421)
          0.625 = fieldNorm(doc=1747)
    
  2. Klein, H.: GENIOS jetzt mit Thesaurus-Suche (1993) 4.96
    4.9626675 = sum of:
      4.9626675 = weight(author_txt:klein in 7536) [ClassicSimilarity], result of:
        4.9626675 = fieldWeight in 7536, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.9402676 = idf(docFreq=42, maxDocs=44421)
          0.625 = fieldNorm(doc=7536)
    
  3. Klein, R.D.: ¬The problem of cataloguing world literature using the Nippon Decimal Classification (1994) 4.96
    4.9626675 = sum of:
      4.9626675 = weight(author_txt:klein in 935) [ClassicSimilarity], result of:
        4.9626675 = fieldWeight in 935, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.9402676 = idf(docFreq=42, maxDocs=44421)
          0.625 = fieldNorm(doc=935)
    
  4. Klein, G.M.: Is there a standard default keyword operator? : a bibliometric analysis of processing options chosen by libraries to execute keyword searches in online public access catalogs (1994) 4.96
    4.9626675 = sum of:
      4.9626675 = weight(author_txt:klein in 2268) [ClassicSimilarity], result of:
        4.9626675 = fieldWeight in 2268, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.9402676 = idf(docFreq=42, maxDocs=44421)
          0.625 = fieldNorm(doc=2268)
    
  5. Klein, J.T.: Interdisciplinary needs : the current context (1996) 4.96
    4.9626675 = sum of:
      4.9626675 = weight(author_txt:klein in 245) [ClassicSimilarity], result of:
        4.9626675 = fieldWeight in 245, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.9402676 = idf(docFreq=42, maxDocs=44421)
          0.625 = fieldNorm(doc=245)
    

Similar documents (content)

  1. Nakkouzi, Z.S.; Eastman, C.M.: Query formulation for handling negation in information retrieval systems (1990) 0.38
    0.38330126 = sum of:
      0.38330126 = product of:
        1.5970886 = sum of:
          0.029751357 = weight(abstract_txt:terms in 3599) [ClassicSimilarity], result of:
            0.029751357 = score(doc=3599,freq=1.0), product of:
              0.07847933 = queryWeight, product of:
                1.3933473 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.013928862 = queryNorm
              0.379098 = fieldWeight in 3599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.09375 = fieldNorm(doc=3599)
          0.1256732 = weight(abstract_txt:operator in 3599) [ClassicSimilarity], result of:
            0.1256732 = score(doc=3599,freq=1.0), product of:
              0.16276808 = queryWeight, product of:
                1.4188986 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.013928862 = queryNorm
              0.77209985 = fieldWeight in 3599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.09375 = fieldNorm(doc=3599)
          0.11010114 = weight(abstract_txt:boolean in 3599) [ClassicSimilarity], result of:
            0.11010114 = score(doc=3599,freq=1.0), product of:
              0.18776384 = queryWeight, product of:
                2.1552007 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.013928862 = queryNorm
              0.58638096 = fieldWeight in 3599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.09375 = fieldNorm(doc=3599)
          0.52335566 = weight(abstract_txt:negation in 3599) [ClassicSimilarity], result of:
            0.52335566 = score(doc=3599,freq=3.0), product of:
              0.36805123 = queryWeight, product of:
                3.0174208 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.013928862 = queryNorm
              1.4219642 = fieldWeight in 3599, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.09375 = fieldNorm(doc=3599)
          0.20695464 = weight(abstract_txt:queries in 3599) [ClassicSimilarity], result of:
            0.20695464 = score(doc=3599,freq=3.0), product of:
              0.2498257 = queryWeight, product of:
                3.5157282 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.013928862 = queryNorm
              0.82839614 = fieldWeight in 3599, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.09375 = fieldNorm(doc=3599)
          0.6012525 = weight(abstract_txt:negated in 3599) [ClassicSimilarity], result of:
            0.6012525 = score(doc=3599,freq=1.0), product of:
              0.66652906 = queryWeight, product of:
                4.9732113 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.013928862 = queryNorm
              0.902065 = fieldWeight in 3599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.09375 = fieldNorm(doc=3599)
        0.24 = coord(6/25)
    
  2. McQuire, A.R.; Eastman, C.M.: ¬The ambiguity of negation in natural language queries to information retrieval systems (1998) 0.19
    0.18611889 = sum of:
      0.18611889 = product of:
        1.163243 = sum of:
          0.0074207736 = weight(abstract_txt:which in 2147) [ClassicSimilarity], result of:
            0.0074207736 = score(doc=2147,freq=1.0), product of:
              0.040748443 = queryWeight, product of:
                1.0040082 = boost
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.013928862 = queryNorm
              0.18211183 = fieldWeight in 2147, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.0625 = fieldNorm(doc=2147)
          0.34890378 = weight(abstract_txt:negation in 2147) [ClassicSimilarity], result of:
            0.34890378 = score(doc=2147,freq=3.0), product of:
              0.36805123 = queryWeight, product of:
                3.0174208 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.013928862 = queryNorm
              0.9479761 = fieldWeight in 2147, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0625 = fieldNorm(doc=2147)
          0.11265184 = weight(abstract_txt:queries in 2147) [ClassicSimilarity], result of:
            0.11265184 = score(doc=2147,freq=2.0), product of:
              0.2498257 = queryWeight, product of:
                3.5157282 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.013928862 = queryNorm
              0.45092174 = fieldWeight in 2147, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0625 = fieldNorm(doc=2147)
          0.6942666 = weight(abstract_txt:negated in 2147) [ClassicSimilarity], result of:
            0.6942666 = score(doc=2147,freq=3.0), product of:
              0.66652906 = queryWeight, product of:
                4.9732113 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.013928862 = queryNorm
              1.0416149 = fieldWeight in 2147, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.0625 = fieldNorm(doc=2147)
        0.16 = coord(4/25)
    
  3. Klein, S.T.: Processing queries with metrical constraints in XML-based IR systems (2008) 0.15
    0.15262006 = sum of:
      0.15262006 = product of:
        0.54507166 = sum of:
          0.043993518 = weight(abstract_txt:investigated in 2342) [ClassicSimilarity], result of:
            0.043993518 = score(doc=2342,freq=1.0), product of:
              0.08084749 = queryWeight, product of:
                5.8043137 = idf(docFreq=363, maxDocs=44421)
                0.013928862 = queryNorm
              0.5441544 = fieldWeight in 2342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8043137 = idf(docFreq=363, maxDocs=44421)
                0.09375 = fieldNorm(doc=2342)
          0.01113116 = weight(abstract_txt:which in 2342) [ClassicSimilarity], result of:
            0.01113116 = score(doc=2342,freq=1.0), product of:
              0.040748443 = queryWeight, product of:
                1.0040082 = boost
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.013928862 = queryNorm
              0.27316773 = fieldWeight in 2342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.09375 = fieldNorm(doc=2342)
          0.049794585 = weight(abstract_txt:usefulness in 2342) [ClassicSimilarity], result of:
            0.049794585 = score(doc=2342,freq=1.0), product of:
              0.08780693 = queryWeight, product of:
                1.0421522 = boost
                6.0489783 = idf(docFreq=284, maxDocs=44421)
                0.013928862 = queryNorm
              0.5670917 = fieldWeight in 2342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0489783 = idf(docFreq=284, maxDocs=44421)
                0.09375 = fieldNorm(doc=2342)
          0.06971453 = weight(abstract_txt:constraints in 2342) [ClassicSimilarity], result of:
            0.06971453 = score(doc=2342,freq=1.0), product of:
              0.10988952 = queryWeight, product of:
                1.1658559 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.013928862 = queryNorm
              0.6344056 = fieldWeight in 2342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.09375 = fieldNorm(doc=2342)
          0.042074773 = weight(abstract_txt:terms in 2342) [ClassicSimilarity], result of:
            0.042074773 = score(doc=2342,freq=2.0), product of:
              0.07847933 = queryWeight, product of:
                1.3933473 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.013928862 = queryNorm
              0.53612554 = fieldWeight in 2342, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.09375 = fieldNorm(doc=2342)
          0.20887779 = weight(abstract_txt:metrical in 2342) [ClassicSimilarity], result of:
            0.20887779 = score(doc=2342,freq=1.0), product of:
              0.2283857 = queryWeight, product of:
                1.6807426 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.013928862 = queryNorm
              0.91458344 = fieldWeight in 2342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.09375 = fieldNorm(doc=2342)
          0.119485326 = weight(abstract_txt:queries in 2342) [ClassicSimilarity], result of:
            0.119485326 = score(doc=2342,freq=1.0), product of:
              0.2498257 = queryWeight, product of:
                3.5157282 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.013928862 = queryNorm
              0.47827476 = fieldWeight in 2342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.09375 = fieldNorm(doc=2342)
        0.28 = coord(7/25)
    
  4. Kim, Y.W.; Kim, J.H.: ¬A model of knowledge based information retrieval with hierarchical concept graph (1990) 0.15
    0.14734082 = sum of:
      0.14734082 = product of:
        0.7367041 = sum of:
          0.009275967 = weight(abstract_txt:which in 3908) [ClassicSimilarity], result of:
            0.009275967 = score(doc=3908,freq=1.0), product of:
              0.040748443 = queryWeight, product of:
                1.0040082 = boost
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.013928862 = queryNorm
              0.2276398 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
          0.03506231 = weight(abstract_txt:terms in 3908) [ClassicSimilarity], result of:
            0.03506231 = score(doc=3908,freq=2.0), product of:
              0.07847933 = queryWeight, product of:
                1.3933473 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.013928862 = queryNorm
              0.44677126 = fieldWeight in 3908, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
          0.09175095 = weight(abstract_txt:boolean in 3908) [ClassicSimilarity], result of:
            0.09175095 = score(doc=3908,freq=1.0), product of:
              0.18776384 = queryWeight, product of:
                2.1552007 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.013928862 = queryNorm
              0.4886508 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
          0.0995711 = weight(abstract_txt:queries in 3908) [ClassicSimilarity], result of:
            0.0995711 = score(doc=3908,freq=1.0), product of:
              0.2498257 = queryWeight, product of:
                3.5157282 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.013928862 = queryNorm
              0.39856228 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
          0.50104374 = weight(abstract_txt:negated in 3908) [ClassicSimilarity], result of:
            0.50104374 = score(doc=3908,freq=1.0), product of:
              0.66652906 = queryWeight, product of:
                4.9732113 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.013928862 = queryNorm
              0.7517208 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
        0.2 = coord(5/25)
    
  5. Spink, A.; Wolfram, D.; Jansen, B.J.; Saracevic, T.: Searching the Web : the public and their queries (2001) 0.10
    0.10146253 = sum of:
      0.10146253 = product of:
        0.42276055 = sum of:
          0.030855818 = weight(abstract_txt:appears in 980) [ClassicSimilarity], result of:
            0.030855818 = score(doc=980,freq=1.0), product of:
              0.101309955 = queryWeight, product of:
                1.1194193 = boost
                6.497461 = idf(docFreq=181, maxDocs=44421)
                0.013928862 = queryNorm
              0.30456847 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.497461 = idf(docFreq=181, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.033263028 = weight(abstract_txt:terms in 980) [ClassicSimilarity], result of:
            0.033263028 = score(doc=980,freq=5.0), product of:
              0.07847933 = queryWeight, product of:
                1.3933473 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.013928862 = queryNorm
              0.42384446 = fieldWeight in 980, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.0628366 = weight(abstract_txt:operator in 980) [ClassicSimilarity], result of:
            0.0628366 = score(doc=980,freq=1.0), product of:
              0.16276808 = queryWeight, product of:
                1.4188986 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.013928862 = queryNorm
              0.38604993 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.01980783 = weight(abstract_txt:order in 980) [ClassicSimilarity], result of:
            0.01980783 = score(doc=980,freq=1.0), product of:
              0.09498653 = queryWeight, product of:
                1.5328962 = boost
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.013928862 = queryNorm
              0.20853305 = fieldWeight in 980, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.07785326 = weight(abstract_txt:boolean in 980) [ClassicSimilarity], result of:
            0.07785326 = score(doc=980,freq=2.0), product of:
              0.18776384 = queryWeight, product of:
                2.1552007 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.013928862 = queryNorm
              0.41463393 = fieldWeight in 980, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
          0.198144 = weight(abstract_txt:queries in 980) [ClassicSimilarity], result of:
            0.198144 = score(doc=980,freq=11.0), product of:
              0.2498257 = queryWeight, product of:
                3.5157282 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.013928862 = queryNorm
              0.79312897 = fieldWeight in 980, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.046875 = fieldNorm(doc=980)
        0.24 = coord(6/25)