Document (#5503)

Author
Waller, W.G.
Kraft, D.H.
Title
¬A mathematical model of a weighted Boolean retrieval system
Source
Information processing and management. 15(1979), S.235-245
Year
1979
Abstract
The use of weights to denote a query representation and/or the indexing of a document is analysed as a generalization of a Boolean retrieval system. Criteria are given for the functions used to evaluate the relevance of the records to a specific query, including self-consistency. Various mechnaisms suggested in the literature for evaluating the relevance of records with regard to a given query are tested and found to be less than satisfactory. A new approach is suggested to avoid some of the perils of a weighted Boolean retrieval system

Similar documents (author)

  1. Kraft, A.: Mit silbernen Scheibchen will sich der Buchhandel seine Zukunft vergolden : CD-ROMs sind auch bei der eher innovationsscheuen Branche auf dem Vormarsch, doch Experten warnen vor unübersichtlichem Markt mit minderwertigen Angeboten (1995) 5.81
    5.814733 = sum of:
      5.814733 = weight(author_txt:kraft in 1926) [ClassicSimilarity], result of:
        5.814733 = fieldWeight in 1926, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.625 = fieldNorm(doc=1926)
    
  2. Kraft, U.: Wo Gott wohnt : Religion (2002) 5.81
    5.814733 = sum of:
      5.814733 = weight(author_txt:kraft in 1953) [ClassicSimilarity], result of:
        5.814733 = fieldWeight in 1953, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.625 = fieldNorm(doc=1953)
    
  3. Kraft, M.: Juristische Online-Datenbanken : Eine Einkaufshilfe (2005) 5.81
    5.814733 = sum of:
      5.814733 = weight(author_txt:kraft in 4054) [ClassicSimilarity], result of:
        5.814733 = fieldWeight in 4054, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.625 = fieldNorm(doc=4054)
    
  4. Born, J.; Kraft, U.: Lernen im Schlaf - kein Traum (2004) 4.65
    4.6517863 = sum of:
      4.6517863 = weight(author_txt:kraft in 3892) [ClassicSimilarity], result of:
        4.6517863 = fieldWeight in 3892, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.5 = fieldNorm(doc=3892)
    
  5. Colvin, E.; Kraft, D.H.: Fuzzy retrieval for software reuse (2016) 4.65
    4.6517863 = sum of:
      4.6517863 = weight(author_txt:kraft in 4119) [ClassicSimilarity], result of:
        4.6517863 = fieldWeight in 4119, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.5 = fieldNorm(doc=4119)
    

Similar documents (content)

  1. Petry, F.E.; Buckles, B.P.; Prabhu, D.: Fuzzy information retrieval using genetic algorithms and relevance feedback (1993) 0.45
    0.45193186 = sum of:
      0.45193186 = product of:
        1.2553662 = sum of:
          0.045563854 = weight(abstract_txt:functions in 7961) [ClassicSimilarity], result of:
            0.045563854 = score(doc=7961,freq=1.0), product of:
              0.10637085 = queryWeight, product of:
                1.0313367 = boost
                5.4828677 = idf(docFreq=501, maxDocs=44421)
                0.018811109 = queryNorm
              0.42834905 = fieldWeight in 7961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4828677 = idf(docFreq=501, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
          0.054918617 = weight(abstract_txt:tested in 7961) [ClassicSimilarity], result of:
            0.054918617 = score(doc=7961,freq=1.0), product of:
              0.12047273 = queryWeight, product of:
                1.0975733 = boost
                5.8349996 = idf(docFreq=352, maxDocs=44421)
                0.018811109 = queryNorm
              0.45585933 = fieldWeight in 7961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8349996 = idf(docFreq=352, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
          0.10521885 = weight(abstract_txt:weights in 7961) [ClassicSimilarity], result of:
            0.10521885 = score(doc=7961,freq=1.0), product of:
              0.1858395 = queryWeight, product of:
                1.3631957 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.018811109 = queryNorm
              0.5661813 = fieldWeight in 7961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
          0.13245678 = weight(abstract_txt:relevance in 7961) [ClassicSimilarity], result of:
            0.13245678 = score(doc=7961,freq=4.0), product of:
              0.17196833 = queryWeight, product of:
                1.8545066 = boost
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.018811109 = queryNorm
              0.77023935 = fieldWeight in 7961, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
          0.03181856 = weight(abstract_txt:system in 7961) [ClassicSimilarity], result of:
            0.03181856 = score(doc=7961,freq=1.0), product of:
              0.12075444 = queryWeight, product of:
                1.9032742 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.018811109 = queryNorm
              0.26349807 = fieldWeight in 7961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
          0.0492788 = weight(abstract_txt:retrieval in 7961) [ClassicSimilarity], result of:
            0.0492788 = score(doc=7961,freq=2.0), product of:
              0.12829593 = queryWeight, product of:
                1.9618068 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.018811109 = queryNorm
              0.3841026 = fieldWeight in 7961, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
          0.37086305 = weight(abstract_txt:weighted in 7961) [ClassicSimilarity], result of:
            0.37086305 = score(doc=7961,freq=4.0), product of:
              0.3416185 = queryWeight, product of:
                2.613815 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.018811109 = queryNorm
              1.0856059 = fieldWeight in 7961, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
          0.17826279 = weight(abstract_txt:query in 7961) [ClassicSimilarity], result of:
            0.17826279 = score(doc=7961,freq=4.0), product of:
              0.23995873 = queryWeight, product of:
                2.6829839 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.018811109 = queryNorm
              0.74288934 = fieldWeight in 7961, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
          0.28698495 = weight(abstract_txt:boolean in 7961) [ClassicSimilarity], result of:
            0.28698495 = score(doc=7961,freq=2.0), product of:
              0.4152843 = queryWeight, product of:
                3.529576 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.018811109 = queryNorm
              0.69105655 = fieldWeight in 7961, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=7961)
        0.36 = coord(9/25)
    
  2. Bordogna, G.; Pasi, G.: ¬A fuzzy linguistic approach generalizing Boolean information retrieval : a model and its evaluation (1993) 0.23
    0.22913456 = sum of:
      0.22913456 = product of:
        1.1456728 = sum of:
          0.16835016 = weight(abstract_txt:weights in 3569) [ClassicSimilarity], result of:
            0.16835016 = score(doc=3569,freq=1.0), product of:
              0.1858395 = queryWeight, product of:
                1.3631957 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.018811109 = queryNorm
              0.90589005 = fieldWeight in 3569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.125 = fieldNorm(doc=3569)
          0.07884608 = weight(abstract_txt:retrieval in 3569) [ClassicSimilarity], result of:
            0.07884608 = score(doc=3569,freq=2.0), product of:
              0.12829593 = queryWeight, product of:
                1.9618068 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.018811109 = queryNorm
              0.6145642 = fieldWeight in 3569, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.125 = fieldNorm(doc=3569)
          0.29669043 = weight(abstract_txt:weighted in 3569) [ClassicSimilarity], result of:
            0.29669043 = score(doc=3569,freq=1.0), product of:
              0.3416185 = queryWeight, product of:
                2.613815 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.018811109 = queryNorm
              0.8684847 = fieldWeight in 3569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.125 = fieldNorm(doc=3569)
          0.14261022 = weight(abstract_txt:query in 3569) [ClassicSimilarity], result of:
            0.14261022 = score(doc=3569,freq=1.0), product of:
              0.23995873 = queryWeight, product of:
                2.6829839 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.018811109 = queryNorm
              0.5943115 = fieldWeight in 3569, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.125 = fieldNorm(doc=3569)
          0.4591759 = weight(abstract_txt:boolean in 3569) [ClassicSimilarity], result of:
            0.4591759 = score(doc=3569,freq=2.0), product of:
              0.4152843 = queryWeight, product of:
                3.529576 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.018811109 = queryNorm
              1.1056905 = fieldWeight in 3569, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.125 = fieldNorm(doc=3569)
        0.2 = coord(5/25)
    
  3. Harman, D.: Ranking algorithms (1992) 0.19
    0.19093883 = sum of:
      0.19093883 = product of:
        0.7955785 = sum of:
          0.094793886 = weight(abstract_txt:records in 4511) [ClassicSimilarity], result of:
            0.094793886 = score(doc=4511,freq=2.0), product of:
              0.13851938 = queryWeight, product of:
                1.6644065 = boost
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.018811109 = queryNorm
              0.68433666 = fieldWeight in 4511, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.109375 = fieldNorm(doc=4511)
          0.09271975 = weight(abstract_txt:relevance in 4511) [ClassicSimilarity], result of:
            0.09271975 = score(doc=4511,freq=1.0), product of:
              0.17196833 = queryWeight, product of:
                1.8545066 = boost
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.018811109 = queryNorm
              0.5391676 = fieldWeight in 4511, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.109375 = fieldNorm(doc=4511)
          0.062997535 = weight(abstract_txt:system in 4511) [ClassicSimilarity], result of:
            0.062997535 = score(doc=4511,freq=2.0), product of:
              0.12075444 = queryWeight, product of:
                1.9032742 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.018811109 = queryNorm
              0.5216995 = fieldWeight in 4511, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.109375 = fieldNorm(doc=4511)
          0.084495544 = weight(abstract_txt:retrieval in 4511) [ClassicSimilarity], result of:
            0.084495544 = score(doc=4511,freq=3.0), product of:
              0.12829593 = queryWeight, product of:
                1.9618068 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.018811109 = queryNorm
              0.6585988 = fieldWeight in 4511, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.109375 = fieldNorm(doc=4511)
          0.17647114 = weight(abstract_txt:query in 4511) [ClassicSimilarity], result of:
            0.17647114 = score(doc=4511,freq=2.0), product of:
              0.23995873 = queryWeight, product of:
                2.6829839 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.018811109 = queryNorm
              0.7354229 = fieldWeight in 4511, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.109375 = fieldNorm(doc=4511)
          0.28410062 = weight(abstract_txt:boolean in 4511) [ClassicSimilarity], result of:
            0.28410062 = score(doc=4511,freq=1.0), product of:
              0.4152843 = queryWeight, product of:
                3.529576 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.018811109 = queryNorm
              0.6841111 = fieldWeight in 4511, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.109375 = fieldNorm(doc=4511)
        0.24 = coord(6/25)
    
  4. Smith, M.P.; Smith, M.: ¬The use of genetic programming to build Boolean queries for text retrieval through relevance feedback (1997) 0.18
    0.18246774 = sum of:
      0.18246774 = product of:
        0.7602823 = sum of:
          0.0810477 = weight(abstract_txt:given in 1761) [ClassicSimilarity], result of:
            0.0810477 = score(doc=1761,freq=2.0), product of:
              0.1561599 = queryWeight, product of:
                1.767213 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.018811109 = queryNorm
              0.5190046 = fieldWeight in 1761, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.078125 = fieldNorm(doc=1761)
          0.06622839 = weight(abstract_txt:relevance in 1761) [ClassicSimilarity], result of:
            0.06622839 = score(doc=1761,freq=1.0), product of:
              0.17196833 = queryWeight, product of:
                1.8545066 = boost
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.018811109 = queryNorm
              0.38511968 = fieldWeight in 1761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.078125 = fieldNorm(doc=1761)
          0.03181856 = weight(abstract_txt:system in 1761) [ClassicSimilarity], result of:
            0.03181856 = score(doc=1761,freq=1.0), product of:
              0.12075444 = queryWeight, product of:
                1.9032742 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.018811109 = queryNorm
              0.26349807 = fieldWeight in 1761, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.078125 = fieldNorm(doc=1761)
          0.0492788 = weight(abstract_txt:retrieval in 1761) [ClassicSimilarity], result of:
            0.0492788 = score(doc=1761,freq=2.0), product of:
              0.12829593 = queryWeight, product of:
                1.9618068 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.018811109 = queryNorm
              0.3841026 = fieldWeight in 1761, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=1761)
          0.12605081 = weight(abstract_txt:query in 1761) [ClassicSimilarity], result of:
            0.12605081 = score(doc=1761,freq=2.0), product of:
              0.23995873 = queryWeight, product of:
                2.6829839 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.018811109 = queryNorm
              0.52530205 = fieldWeight in 1761, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=1761)
          0.405858 = weight(abstract_txt:boolean in 1761) [ClassicSimilarity], result of:
            0.405858 = score(doc=1761,freq=4.0), product of:
              0.4152843 = queryWeight, product of:
                3.529576 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.018811109 = queryNorm
              0.9773016 = fieldWeight in 1761, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=1761)
        0.24 = coord(6/25)
    
  5. Losee, R.M.: Upper bounds for retrieval performance and their user measuring performance and generating optimal queries : can it get any better than this? (1994) 0.15
    0.15193942 = sum of:
      0.15193942 = product of:
        0.63308096 = sum of:
          0.045563854 = weight(abstract_txt:functions in 7417) [ClassicSimilarity], result of:
            0.045563854 = score(doc=7417,freq=1.0), product of:
              0.10637085 = queryWeight, product of:
                1.0313367 = boost
                5.4828677 = idf(docFreq=501, maxDocs=44421)
                0.018811109 = queryNorm
              0.42834905 = fieldWeight in 7417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4828677 = idf(docFreq=501, maxDocs=44421)
                0.078125 = fieldNorm(doc=7417)
          0.057309385 = weight(abstract_txt:given in 7417) [ClassicSimilarity], result of:
            0.057309385 = score(doc=7417,freq=1.0), product of:
              0.1561599 = queryWeight, product of:
                1.767213 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.018811109 = queryNorm
              0.3669917 = fieldWeight in 7417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.078125 = fieldNorm(doc=7417)
          0.03181856 = weight(abstract_txt:system in 7417) [ClassicSimilarity], result of:
            0.03181856 = score(doc=7417,freq=1.0), product of:
              0.12075444 = queryWeight, product of:
                1.9032742 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.018811109 = queryNorm
              0.26349807 = fieldWeight in 7417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.078125 = fieldNorm(doc=7417)
          0.0853534 = weight(abstract_txt:retrieval in 7417) [ClassicSimilarity], result of:
            0.0853534 = score(doc=7417,freq=6.0), product of:
              0.12829593 = queryWeight, product of:
                1.9618068 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.018811109 = queryNorm
              0.6652853 = fieldWeight in 7417, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=7417)
          0.12605081 = weight(abstract_txt:query in 7417) [ClassicSimilarity], result of:
            0.12605081 = score(doc=7417,freq=2.0), product of:
              0.23995873 = queryWeight, product of:
                2.6829839 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.018811109 = queryNorm
              0.52530205 = fieldWeight in 7417, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=7417)
          0.28698495 = weight(abstract_txt:boolean in 7417) [ClassicSimilarity], result of:
            0.28698495 = score(doc=7417,freq=2.0), product of:
              0.4152843 = queryWeight, product of:
                3.529576 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.018811109 = queryNorm
              0.69105655 = fieldWeight in 7417, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.078125 = fieldNorm(doc=7417)
        0.24 = coord(6/25)