Document (#2294)

Author
Paice, C.D.
Title
¬A thesaural model of information retrieval
Source
Information processing and management. 27(1991) no.5, S.433-447
Year
1991
Abstract
In an information retrieval system both queries and document representatives can be viewed as representations of topics. The central activity of such a system is thus the comparison of one topic representation with another. The set of terms contained in a typical topic representation may be adjusted or extended by reference to a domain thesaurus. This paper proposes that topic representations should actually consist of excerpts from a domain thesaurus, generated by a spreading activation technique. An algorithm for generating excerpts is outlines and exemplified, and the problem of assessing the resemblance between two excerpts is discussed. The paper questions whether existing thesauri would be adequate for this purpose, and offers some ideas on how suitable thesauri might be constructed

Similar documents (author)

  1. Paice, C.D.: Expert systems for information retrieval? (1986) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:paice in 1100) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 1100, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=1100)
    
  2. Paice, C.D.: Automatic abstracting (1994) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:paice in 985) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 985, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=985)
    
  3. Paice, C.D.: Automatic abstracting (1994) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:paice in 1323) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 1323, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=1323)
    
  4. Paice, C.D.: Method for evaluation of stemming algorithms based on error counting (1996) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:paice in 5867) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 5867, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=5867)
    
  5. Paice, C.D.: Soft evaluation of Boolean search queries in information retrieval systems (1984) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:paice in 1789) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 1789, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=1789)
    

Similar documents (content)

  1. Diaz, I.: Semi-automatic construction of thesaurus applying domain analysis techniques (1998) 0.14
    0.14185736 = sum of:
      0.14185736 = product of:
        0.5910723 = sum of:
          0.02301783 = weight(abstract_txt:system in 4744) [ClassicSimilarity], result of:
            0.02301783 = score(doc=4744,freq=1.0), product of:
              0.062396314 = queryWeight, product of:
                1.0591642 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.017466595 = queryNorm
              0.36889726 = fieldWeight in 4744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.109375 = fieldNorm(doc=4744)
          0.12688255 = weight(abstract_txt:domain in 4744) [ClassicSimilarity], result of:
            0.12688255 = score(doc=4744,freq=4.0), product of:
              0.1226584 = queryWeight, product of:
                1.4850205 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.017466595 = queryNorm
              1.0344384 = fieldWeight in 4744, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.109375 = fieldNorm(doc=4744)
          0.071715385 = weight(abstract_txt:representation in 4744) [ClassicSimilarity], result of:
            0.071715385 = score(doc=4744,freq=1.0), product of:
              0.13310394 = queryWeight, product of:
                1.5469606 = boost
                4.9261017 = idf(docFreq=875, maxDocs=44421)
                0.017466595 = queryNorm
              0.5387924 = fieldWeight in 4744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9261017 = idf(docFreq=875, maxDocs=44421)
                0.109375 = fieldNorm(doc=4744)
          0.14364254 = weight(abstract_txt:thesaurus in 4744) [ClassicSimilarity], result of:
            0.14364254 = score(doc=4744,freq=3.0), product of:
              0.146644 = queryWeight, product of:
                1.623738 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.017466595 = queryNorm
              0.97953236 = fieldWeight in 4744, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.109375 = fieldNorm(doc=4744)
          0.09637856 = weight(abstract_txt:thesauri in 4744) [ClassicSimilarity], result of:
            0.09637856 = score(doc=4744,freq=1.0), product of:
              0.162095 = queryWeight, product of:
                1.707138 = boost
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.017466595 = queryNorm
              0.5945807 = fieldWeight in 4744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.109375 = fieldNorm(doc=4744)
          0.12943542 = weight(abstract_txt:representations in 4744) [ClassicSimilarity], result of:
            0.12943542 = score(doc=4744,freq=1.0), product of:
              0.19731106 = queryWeight, product of:
                1.8834736 = boost
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.017466595 = queryNorm
              0.6559968 = fieldWeight in 4744, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.109375 = fieldNorm(doc=4744)
        0.24 = coord(6/25)
    
  2. Salton, G.; Buckley, C.: Approaches to global text analysis (1990) 0.13
    0.12700714 = sum of:
      0.12700714 = product of:
        1.0583929 = sum of:
          0.02301783 = weight(abstract_txt:system in 4900) [ClassicSimilarity], result of:
            0.02301783 = score(doc=4900,freq=1.0), product of:
              0.062396314 = queryWeight, product of:
                1.0591642 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.017466595 = queryNorm
              0.36889726 = fieldWeight in 4900, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.109375 = fieldNorm(doc=4900)
          0.12943542 = weight(abstract_txt:representations in 4900) [ClassicSimilarity], result of:
            0.12943542 = score(doc=4900,freq=1.0), product of:
              0.19731106 = queryWeight, product of:
                1.8834736 = boost
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.017466595 = queryNorm
              0.6559968 = fieldWeight in 4900, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.109375 = fieldNorm(doc=4900)
          0.90593964 = weight(abstract_txt:excerpts in 4900) [ClassicSimilarity], result of:
            0.90593964 = score(doc=4900,freq=2.0), product of:
              0.6559478 = queryWeight, product of:
                4.20595 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.017466595 = queryNorm
              1.3811154 = fieldWeight in 4900, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.109375 = fieldNorm(doc=4900)
        0.12 = coord(3/25)
    
  3. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.12
    0.123520516 = sum of:
      0.123520516 = product of:
        0.4411447 = sum of:
          0.011508915 = weight(abstract_txt:system in 1175) [ClassicSimilarity], result of:
            0.011508915 = score(doc=1175,freq=1.0), product of:
              0.062396314 = queryWeight, product of:
                1.0591642 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.017466595 = queryNorm
              0.18444863 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.012603726 = weight(abstract_txt:retrieval in 1175) [ClassicSimilarity], result of:
            0.012603726 = score(doc=1175,freq=1.0), product of:
              0.06629315 = queryWeight, product of:
                1.0917373 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.017466595 = queryNorm
              0.1901211 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.100719266 = weight(abstract_txt:activation in 1175) [ClassicSimilarity], result of:
            0.100719266 = score(doc=1175,freq=1.0), product of:
              0.21031377 = queryWeight, product of:
                1.3749999 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.017466595 = queryNorm
              0.47890002 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.10259638 = weight(abstract_txt:spreading in 1175) [ClassicSimilarity], result of:
            0.10259638 = score(doc=1175,freq=1.0), product of:
              0.2129188 = queryWeight, product of:
                1.3834894 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.017466595 = queryNorm
              0.48185682 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.035857692 = weight(abstract_txt:representation in 1175) [ClassicSimilarity], result of:
            0.035857692 = score(doc=1175,freq=1.0), product of:
              0.13310394 = queryWeight, product of:
                1.5469606 = boost
                4.9261017 = idf(docFreq=875, maxDocs=44421)
                0.017466595 = queryNorm
              0.2693962 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9261017 = idf(docFreq=875, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.1097088 = weight(abstract_txt:thesaurus in 1175) [ClassicSimilarity], result of:
            0.1097088 = score(doc=1175,freq=7.0), product of:
              0.146644 = queryWeight, product of:
                1.623738 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.017466595 = queryNorm
              0.7481302 = fieldWeight in 1175, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.06814993 = weight(abstract_txt:thesauri in 1175) [ClassicSimilarity], result of:
            0.06814993 = score(doc=1175,freq=2.0), product of:
              0.162095 = queryWeight, product of:
                1.707138 = boost
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.017466595 = queryNorm
              0.42043203 = fieldWeight in 1175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
        0.28 = coord(7/25)
    
  4. Mazzocchi, F.; Tiberi, M.: Knowledge organization in the philosophical domain : dealing with polysemy in thesaurus building (2009) 0.11
    0.10849545 = sum of:
      0.10849545 = product of:
        0.38748375 = sum of:
          0.016441308 = weight(abstract_txt:system in 254) [ClassicSimilarity], result of:
            0.016441308 = score(doc=254,freq=1.0), product of:
              0.062396314 = queryWeight, product of:
                1.0591642 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.017466595 = queryNorm
              0.26349807 = fieldWeight in 254, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.078125 = fieldNorm(doc=254)
          0.017774966 = weight(abstract_txt:paper in 254) [ClassicSimilarity], result of:
            0.017774966 = score(doc=254,freq=1.0), product of:
              0.065726504 = queryWeight, product of:
                1.0870614 = boost
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.017466595 = queryNorm
              0.2704383 = fieldWeight in 254, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.078125 = fieldNorm(doc=254)
          0.018005323 = weight(abstract_txt:retrieval in 254) [ClassicSimilarity], result of:
            0.018005323 = score(doc=254,freq=1.0), product of:
              0.06629315 = queryWeight, product of:
                1.0917373 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.017466595 = queryNorm
              0.27160156 = fieldWeight in 254, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=254)
          0.12897092 = weight(abstract_txt:thesaural in 254) [ClassicSimilarity], result of:
            0.12897092 = score(doc=254,freq=1.0), product of:
              0.19551761 = queryWeight, product of:
                1.3257504 = boost
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.017466595 = queryNorm
              0.65963835 = fieldWeight in 254, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.078125 = fieldNorm(doc=254)
          0.06408537 = weight(abstract_txt:domain in 254) [ClassicSimilarity], result of:
            0.06408537 = score(doc=254,freq=2.0), product of:
              0.1226584 = queryWeight, product of:
                1.4850205 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.017466595 = queryNorm
              0.5224703 = fieldWeight in 254, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.078125 = fieldNorm(doc=254)
          0.059237186 = weight(abstract_txt:thesaurus in 254) [ClassicSimilarity], result of:
            0.059237186 = score(doc=254,freq=1.0), product of:
              0.146644 = queryWeight, product of:
                1.623738 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.017466595 = queryNorm
              0.40395233 = fieldWeight in 254, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.078125 = fieldNorm(doc=254)
          0.08296868 = weight(abstract_txt:topic in 254) [ClassicSimilarity], result of:
            0.08296868 = score(doc=254,freq=1.0), product of:
              0.2101396 = queryWeight, product of:
                2.3805833 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.017466595 = queryNorm
              0.3948265 = fieldWeight in 254, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.078125 = fieldNorm(doc=254)
        0.28 = coord(7/25)
    
  5. Tudhope, D.; Binding, C.; Blocks, D.; Cuncliffe, D.: Representation and retrieval in faceted systems (2003) 0.11
    0.10517838 = sum of:
      0.10517838 = product of:
        0.37563705 = sum of:
          0.018601215 = weight(abstract_txt:system in 3703) [ClassicSimilarity], result of:
            0.018601215 = score(doc=3703,freq=2.0), product of:
              0.062396314 = queryWeight, product of:
                1.0591642 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.017466595 = queryNorm
              0.298114 = fieldWeight in 3703, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0625 = fieldNorm(doc=3703)
          0.020110076 = weight(abstract_txt:paper in 3703) [ClassicSimilarity], result of:
            0.020110076 = score(doc=3703,freq=2.0), product of:
              0.065726504 = queryWeight, product of:
                1.0870614 = boost
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.017466595 = queryNorm
              0.30596602 = fieldWeight in 3703, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.0625 = fieldNorm(doc=3703)
          0.014404259 = weight(abstract_txt:retrieval in 3703) [ClassicSimilarity], result of:
            0.014404259 = score(doc=3703,freq=1.0), product of:
              0.06629315 = queryWeight, product of:
                1.0917373 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.017466595 = queryNorm
              0.21728125 = fieldWeight in 3703, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=3703)
          0.057954784 = weight(abstract_txt:representation in 3703) [ClassicSimilarity], result of:
            0.057954784 = score(doc=3703,freq=2.0), product of:
              0.13310394 = queryWeight, product of:
                1.5469606 = boost
                4.9261017 = idf(docFreq=875, maxDocs=44421)
                0.017466595 = queryNorm
              0.43541 = fieldWeight in 3703, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9261017 = idf(docFreq=875, maxDocs=44421)
                0.0625 = fieldNorm(doc=3703)
          0.08208145 = weight(abstract_txt:thesaurus in 3703) [ClassicSimilarity], result of:
            0.08208145 = score(doc=3703,freq=3.0), product of:
              0.146644 = queryWeight, product of:
                1.623738 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.017466595 = queryNorm
              0.5597328 = fieldWeight in 3703, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.0625 = fieldNorm(doc=3703)
          0.077885635 = weight(abstract_txt:thesauri in 3703) [ClassicSimilarity], result of:
            0.077885635 = score(doc=3703,freq=2.0), product of:
              0.162095 = queryWeight, product of:
                1.707138 = boost
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.017466595 = queryNorm
              0.48049375 = fieldWeight in 3703, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.0625 = fieldNorm(doc=3703)
          0.10459961 = weight(abstract_txt:representations in 3703) [ClassicSimilarity], result of:
            0.10459961 = score(doc=3703,freq=2.0), product of:
              0.19731106 = queryWeight, product of:
                1.8834736 = boost
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.017466595 = queryNorm
              0.53012544 = fieldWeight in 3703, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.0625 = fieldNorm(doc=3703)
        0.28 = coord(7/25)