Document (#21513)

Author
Fox, E.
Betrabet, S.
Koushik, M.
Lee, W.
Title
Extended Boolean models
Source
Information retrieval: data structures and algorithms. Ed.: W.B. Frakes u. R. Baeza-Yates
Imprint
Englewood Cliffs, NJ : Prentice Hall
Year
1992
Pages
S.393-418
Abstract
The classical interpretation of Boolean operators in an information retrieval system is in general too strict. A standard Boolean query rarely comes close to retrieving all and only those documents which are relevant to a query. Many models have been proposed with the aim of softening the interpretation of the Boolean operators in order to improve the precision and recall of the search results. This chapter discusses 3 such models: the Mixed Min and Max (MMM), the Paice, and the P-noem models. The MMM and Paice models are essentially variations of the classical fuzzy-set model, while the P-norm scheme is a distance-based approach. Our experimental results indicate that each of the above models provide better performance than the classical Boolean model in terms of retrieval effectiveness
Theme
Retrievalalgorithmen

Similar documents (content)

  1. Lee, J.H.; Kim, M.H.; Lee, Y.J.: Information retrieval based on conceptual distance in is-a hierarchies (1993) 0.26
    0.2635735 = sum of:
      0.2635735 = product of:
        0.94133395 = sum of:
          0.07083941 = weight(abstract_txt:extended in 6728) [ClassicSimilarity], result of:
            0.07083941 = score(doc=6728,freq=2.0), product of:
              0.10531034 = queryWeight, product of:
                1.1022564 = boost
                6.0883393 = idf(docFreq=273, maxDocs=44421)
                0.015692405 = queryNorm
              0.67267287 = fieldWeight in 6728, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0883393 = idf(docFreq=273, maxDocs=44421)
                0.078125 = fieldNorm(doc=6728)
          0.059975527 = weight(abstract_txt:distance in 6728) [ClassicSimilarity], result of:
            0.059975527 = score(doc=6728,freq=1.0), product of:
              0.11874458 = queryWeight, product of:
                1.1704532 = boost
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.015692405 = queryNorm
              0.5050801 = fieldWeight in 6728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.078125 = fieldNorm(doc=6728)
          0.01865177 = weight(abstract_txt:retrieval in 6728) [ClassicSimilarity], result of:
            0.01865177 = score(doc=6728,freq=1.0), product of:
              0.06867328 = queryWeight, product of:
                1.258798 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015692405 = queryNorm
              0.27160156 = fieldWeight in 6728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=6728)
          0.039661393 = weight(abstract_txt:model in 6728) [ClassicSimilarity], result of:
            0.039661393 = score(doc=6728,freq=2.0), product of:
              0.09013146 = queryWeight, product of:
                1.4421165 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.015692405 = queryNorm
              0.4400394 = fieldWeight in 6728, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.078125 = fieldNorm(doc=6728)
          0.047709584 = weight(abstract_txt:query in 6728) [ClassicSimilarity], result of:
            0.047709584 = score(doc=6728,freq=1.0), product of:
              0.12844332 = queryWeight, product of:
                1.7215431 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.015692405 = queryNorm
              0.37144467 = fieldWeight in 6728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=6728)
          0.16138464 = weight(abstract_txt:operators in 6728) [ClassicSimilarity], result of:
            0.16138464 = score(doc=6728,freq=1.0), product of:
              0.28943378 = queryWeight, product of:
                2.5842633 = boost
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.015692405 = queryNorm
              0.55758744 = fieldWeight in 6728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.078125 = fieldNorm(doc=6728)
          0.5431116 = weight(abstract_txt:boolean in 6728) [ClassicSimilarity], result of:
            0.5431116 = score(doc=6728,freq=4.0), product of:
              0.5557257 = queryWeight, product of:
                5.6619024 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.015692405 = queryNorm
              0.9773016 = fieldWeight in 6728, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=6728)
        0.28 = coord(7/25)
    
  2. Lucas, W.; Topi, H.: Form and function : the impact of query term and operator usage on Web search results (2002) 0.22
    0.21878026 = sum of:
      0.21878026 = product of:
        0.68368834 = sum of:
          0.0149214165 = weight(abstract_txt:retrieval in 1198) [ClassicSimilarity], result of:
            0.0149214165 = score(doc=1198,freq=1.0), product of:
              0.06867328 = queryWeight, product of:
                1.258798 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015692405 = queryNorm
              0.21728125 = fieldWeight in 1198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=1198)
          0.02989812 = weight(abstract_txt:results in 1198) [ClassicSimilarity], result of:
            0.02989812 = score(doc=1198,freq=4.0), product of:
              0.06875807 = queryWeight, product of:
                1.2595749 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.015692405 = queryNorm
              0.4348307 = fieldWeight in 1198, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.0625 = fieldNorm(doc=1198)
          0.062152695 = weight(abstract_txt:rarely in 1198) [ClassicSimilarity], result of:
            0.062152695 = score(doc=1198,freq=1.0), product of:
              0.14110565 = queryWeight, product of:
                1.275908 = boost
                7.0475073 = idf(docFreq=104, maxDocs=44421)
                0.015692405 = queryNorm
              0.4404692 = fieldWeight in 1198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0475073 = idf(docFreq=104, maxDocs=44421)
                0.0625 = fieldNorm(doc=1198)
          0.022435874 = weight(abstract_txt:model in 1198) [ClassicSimilarity], result of:
            0.022435874 = score(doc=1198,freq=1.0), product of:
              0.09013146 = queryWeight, product of:
                1.4421165 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.015692405 = queryNorm
              0.24892388 = fieldWeight in 1198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=1198)
          0.038167667 = weight(abstract_txt:query in 1198) [ClassicSimilarity], result of:
            0.038167667 = score(doc=1198,freq=1.0), product of:
              0.12844332 = queryWeight, product of:
                1.7215431 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.015692405 = queryNorm
              0.29715574 = fieldWeight in 1198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=1198)
          0.075246826 = weight(abstract_txt:interpretation in 1198) [ClassicSimilarity], result of:
            0.075246826 = score(doc=1198,freq=1.0), product of:
              0.20194815 = queryWeight, product of:
                2.1586492 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.015692405 = queryNorm
              0.37260467 = fieldWeight in 1198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.0625 = fieldNorm(doc=1198)
          0.2236211 = weight(abstract_txt:operators in 1198) [ClassicSimilarity], result of:
            0.2236211 = score(doc=1198,freq=3.0), product of:
              0.28943378 = queryWeight, product of:
                2.5842633 = boost
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.015692405 = queryNorm
              0.7726158 = fieldWeight in 1198, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.0625 = fieldNorm(doc=1198)
          0.21724464 = weight(abstract_txt:boolean in 1198) [ClassicSimilarity], result of:
            0.21724464 = score(doc=1198,freq=1.0), product of:
              0.5557257 = queryWeight, product of:
                5.6619024 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.015692405 = queryNorm
              0.39092064 = fieldWeight in 1198, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.0625 = fieldNorm(doc=1198)
        0.32 = coord(8/25)
    
  3. Petry, F.E.; Buckles, B.P.; Prabhu, D.: Fuzzy information retrieval using genetic algorithms and relevance feedback (1993) 0.20
    0.1973049 = sum of:
      0.1973049 = product of:
        0.70466036 = sum of:
          0.03740347 = weight(abstract_txt:precision in 7961) [ClassicSimilarity], result of:
            0.03740347 = score(doc=7961,freq=1.0), product of:
              0.086677365 = queryWeight, product of:
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.015692405 = queryNorm
              0.43152526 = fieldWeight in 7961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
          0.042213734 = weight(abstract_txt:recall in 7961) [ClassicSimilarity], result of:
            0.042213734 = score(doc=7961,freq=1.0), product of:
              0.09395797 = queryWeight, product of:
                1.0411515 = boost
                5.750825 = idf(docFreq=383, maxDocs=44421)
                0.015692405 = queryNorm
              0.44928318 = fieldWeight in 7961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.750825 = idf(docFreq=383, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
          0.10052219 = weight(abstract_txt:fuzzy in 7961) [ClassicSimilarity], result of:
            0.10052219 = score(doc=7961,freq=2.0), product of:
              0.13298288 = queryWeight, product of:
                1.2386397 = boost
                6.8416553 = idf(docFreq=128, maxDocs=44421)
                0.015692405 = queryNorm
              0.75590324 = fieldWeight in 7961, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8416553 = idf(docFreq=128, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
          0.026377587 = weight(abstract_txt:retrieval in 7961) [ClassicSimilarity], result of:
            0.026377587 = score(doc=7961,freq=2.0), product of:
              0.06867328 = queryWeight, product of:
                1.258798 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015692405 = queryNorm
              0.3841026 = fieldWeight in 7961, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
          0.018686326 = weight(abstract_txt:results in 7961) [ClassicSimilarity], result of:
            0.018686326 = score(doc=7961,freq=1.0), product of:
              0.06875807 = queryWeight, product of:
                1.2595749 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.015692405 = queryNorm
              0.2717692 = fieldWeight in 7961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
          0.09541917 = weight(abstract_txt:query in 7961) [ClassicSimilarity], result of:
            0.09541917 = score(doc=7961,freq=4.0), product of:
              0.12844332 = queryWeight, product of:
                1.7215431 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.015692405 = queryNorm
              0.74288934 = fieldWeight in 7961, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
          0.38403788 = weight(abstract_txt:boolean in 7961) [ClassicSimilarity], result of:
            0.38403788 = score(doc=7961,freq=2.0), product of:
              0.5557257 = queryWeight, product of:
                5.6619024 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.015692405 = queryNorm
              0.69105655 = fieldWeight in 7961, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
        0.28 = coord(7/25)
    
  4. Kim, Y.W.; Kim, J.H.: ¬A model of knowledge based information retrieval with hierarchical concept graph (1990) 0.19
    0.18643594 = sum of:
      0.18643594 = product of:
        0.66584265 = sum of:
          0.059975527 = weight(abstract_txt:distance in 3908) [ClassicSimilarity], result of:
            0.059975527 = score(doc=3908,freq=1.0), product of:
              0.11874458 = queryWeight, product of:
                1.1704532 = boost
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.015692405 = queryNorm
              0.5050801 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
          0.01865177 = weight(abstract_txt:retrieval in 3908) [ClassicSimilarity], result of:
            0.01865177 = score(doc=3908,freq=1.0), product of:
              0.06867328 = queryWeight, product of:
                1.258798 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015692405 = queryNorm
              0.27160156 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
          0.032365665 = weight(abstract_txt:results in 3908) [ClassicSimilarity], result of:
            0.032365665 = score(doc=3908,freq=3.0), product of:
              0.06875807 = queryWeight, product of:
                1.2595749 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.015692405 = queryNorm
              0.47071803 = fieldWeight in 3908, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
          0.07419968 = weight(abstract_txt:model in 3908) [ClassicSimilarity], result of:
            0.07419968 = score(doc=3908,freq=7.0), product of:
              0.09013146 = queryWeight, product of:
                1.4421165 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.015692405 = queryNorm
              0.8232384 = fieldWeight in 3908, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
          0.047709584 = weight(abstract_txt:query in 3908) [ClassicSimilarity], result of:
            0.047709584 = score(doc=3908,freq=1.0), product of:
              0.12844332 = queryWeight, product of:
                1.7215431 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.015692405 = queryNorm
              0.37144467 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
          0.16138464 = weight(abstract_txt:operators in 3908) [ClassicSimilarity], result of:
            0.16138464 = score(doc=3908,freq=1.0), product of:
              0.28943378 = queryWeight, product of:
                2.5842633 = boost
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.015692405 = queryNorm
              0.55758744 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1371193 = idf(docFreq=95, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
          0.2715558 = weight(abstract_txt:boolean in 3908) [ClassicSimilarity], result of:
            0.2715558 = score(doc=3908,freq=1.0), product of:
              0.5557257 = queryWeight, product of:
                5.6619024 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.015692405 = queryNorm
              0.4886508 = fieldWeight in 3908, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=3908)
        0.28 = coord(7/25)
    
  5. Losee, R.M.: Upper bounds for retrieval performance and their user measuring performance and generating optimal queries : can it get any better than this? (1994) 0.18
    0.1829421 = sum of:
      0.1829421 = product of:
        0.65336466 = sum of:
          0.03740347 = weight(abstract_txt:precision in 7417) [ClassicSimilarity], result of:
            0.03740347 = score(doc=7417,freq=1.0), product of:
              0.086677365 = queryWeight, product of:
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.015692405 = queryNorm
              0.43152526 = fieldWeight in 7417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.078125 = fieldNorm(doc=7417)
          0.042213734 = weight(abstract_txt:recall in 7417) [ClassicSimilarity], result of:
            0.042213734 = score(doc=7417,freq=1.0), product of:
              0.09395797 = queryWeight, product of:
                1.0411515 = boost
                5.750825 = idf(docFreq=383, maxDocs=44421)
                0.015692405 = queryNorm
              0.44928318 = fieldWeight in 7417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.750825 = idf(docFreq=383, maxDocs=44421)
                0.078125 = fieldNorm(doc=7417)
          0.0578644 = weight(abstract_txt:close in 7417) [ClassicSimilarity], result of:
            0.0578644 = score(doc=7417,freq=1.0), product of:
              0.11594145 = queryWeight, product of:
                1.1565555 = boost
                6.388262 = idf(docFreq=202, maxDocs=44421)
                0.015692405 = queryNorm
              0.49908295 = fieldWeight in 7417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.388262 = idf(docFreq=202, maxDocs=44421)
                0.078125 = fieldNorm(doc=7417)
          0.045687325 = weight(abstract_txt:retrieval in 7417) [ClassicSimilarity], result of:
            0.045687325 = score(doc=7417,freq=6.0), product of:
              0.06867328 = queryWeight, product of:
                1.258798 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015692405 = queryNorm
              0.6652853 = fieldWeight in 7417, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=7417)
          0.018686326 = weight(abstract_txt:results in 7417) [ClassicSimilarity], result of:
            0.018686326 = score(doc=7417,freq=1.0), product of:
              0.06875807 = queryWeight, product of:
                1.2595749 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.015692405 = queryNorm
              0.2717692 = fieldWeight in 7417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.078125 = fieldNorm(doc=7417)
          0.067471534 = weight(abstract_txt:query in 7417) [ClassicSimilarity], result of:
            0.067471534 = score(doc=7417,freq=2.0), product of:
              0.12844332 = queryWeight, product of:
                1.7215431 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.015692405 = queryNorm
              0.52530205 = fieldWeight in 7417, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=7417)
          0.38403788 = weight(abstract_txt:boolean in 7417) [ClassicSimilarity], result of:
            0.38403788 = score(doc=7417,freq=2.0), product of:
              0.5557257 = queryWeight, product of:
                5.6619024 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.015692405 = queryNorm
              0.69105655 = fieldWeight in 7417, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=7417)
        0.28 = coord(7/25)