Document (#5467)

Author
Salton, G.
Title
Mathematics and information retrieval
Source
Journal of documentation. 35(1979) no.1, pp.1-29
Year
1979
Abstract
The development of a given discipline in science and technology often depends on the availability of theories capable of describing the processes which control the field and of modelling the interactions between the processes. The absence of an accepted theory of information retrieval has been blamed for the relative disorder and the lack of technical advances in the area. The main mathematical approaches to information retrieval are examined in this study, including both algebraic and probabilistic models, and the difficulties which impede the formalization of information retrieval processes are described. A number of developments are covered where new theoretical understandings have directly led to the improvement of retrieval techniques and operations.

Similar documents (author)

  1. Salton, G.: Another look at automatic text-retrieval systems (1986) 4.87
    4.8684025 = sum of:
      4.8684025 = weight(author_txt:salton in 1355) [ClassicSimilarity], result of:
        4.8684025 = score(doc=1355,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.12837885 = queryNorm
          4.868403 = fieldWeight in 1355, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.625 = fieldNorm(doc=1355)
    
  2. Salton, G.: A new comparison between conventional indexing (MEDLARS) and automatic text processing (SMART) (1972) 4.87
    4.8684025 = sum of:
      4.8684025 = weight(author_txt:salton in 2324) [ClassicSimilarity], result of:
        4.8684025 = score(doc=2324,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.12837885 = queryNorm
          4.868403 = fieldWeight in 2324, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.625 = fieldNorm(doc=2324)
    
  3. Salton, G.: Future prospects for text-based information retrieval (1990) 4.87
    4.8684025 = sum of:
      4.8684025 = weight(author_txt:salton in 2326) [ClassicSimilarity], result of:
        4.8684025 = score(doc=2326,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.12837885 = queryNorm
          4.868403 = fieldWeight in 2326, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.625 = fieldNorm(doc=2326)
    
  4. Salton, G.: Fast document classification in automatic information retrieval (1978) 4.87
    4.8684025 = sum of:
      4.8684025 = weight(author_txt:salton in 2330) [ClassicSimilarity], result of:
        4.8684025 = score(doc=2330,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.12837885 = queryNorm
          4.868403 = fieldWeight in 2330, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.625 = fieldNorm(doc=2330)
    
  5. Salton, G.: Expert systems and information retrieval (1987) 4.87
    4.8684025 = sum of:
      4.8684025 = weight(author_txt:salton in 2836) [ClassicSimilarity], result of:
        4.8684025 = score(doc=2836,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.12837885 = queryNorm
          4.868403 = fieldWeight in 2836, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.7894444 = idf(docFreq=49, maxDocs=44421)
            0.625 = fieldNorm(doc=2836)
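
The five author matches above all carry the same explanation tree, so one recomputation covers them. The following is a minimal sketch, assuming the classic Lucene TF-IDF formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the idf value printed in the tree. The function names are hypothetical and are not part of this record or of any library.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Assumed idf formula: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Assumed tf formula: square root of the term frequency."""
    return math.sqrt(freq)


def single_term_score(freq: float, doc_freq: int, max_docs: int,
                      field_norm: float, boost: float = 1.0) -> float:
    """Score of a one-term query: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    # With a single query term the normaliser reduces to 1 / (boost * idf),
    # which is why queryWeight comes out at (almost exactly) 1.0 in the tree.
    query_norm = 1.0 / math.sqrt((boost * idf) ** 2)
    query_weight = boost * idf * query_norm
    field_weight = classic_tf(freq) * idf * field_norm
    return query_weight * field_weight


# Values copied from the explanation tree for author_txt:salton.
score = single_term_score(freq=1.0, doc_freq=49, max_docs=44421,
                          field_norm=0.625)
print(round(score, 4))
```

Under these assumptions the printed value should agree with the 4.8684025 reported for each author match, up to single-precision rounding in the original output.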
    

Similar documents (content)

  1. Egghe, L.: Vector retrieval, fuzzy retrieval and the universal fuzzy IR surface for IR evaluation (2004) 0.16
    0.16202591 = sum of:
      0.16202591 = product of:
        0.8101295 = sum of:
          0.17249922 = weight(abstract_txt:operations in 3531) [ClassicSimilarity], result of:
            0.17249922 = score(doc=3531,freq=3.0), product of:
              0.16265848 = queryWeight, product of:
                1.096085 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.022722386 = queryNorm
              1.0604994 = fieldWeight in 3531, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.09375 = fieldNorm(doc=3531)
          0.15667263 = weight(abstract_txt:probabilistic in 3531) [ClassicSimilarity], result of:
            0.15667263 = score(doc=3531,freq=2.0), product of:
              0.17462687 = queryWeight, product of:
                1.1356941 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.022722386 = queryNorm
              0.8971851 = fieldWeight in 3531, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.09375 = fieldNorm(doc=3531)
          0.38560975 = weight(abstract_txt:algebraic in 3531) [ClassicSimilarity], result of:
            0.38560975 = score(doc=3531,freq=2.0), product of:
              0.31833252 = queryWeight, product of:
                1.5333679 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.022722386 = queryNorm
              1.2113426 = fieldWeight in 3531, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.09375 = fieldNorm(doc=3531)
          0.02023971 = weight(abstract_txt:information in 3531) [ClassicSimilarity], result of:
            0.02023971 = score(doc=3531,freq=1.0), product of:
              0.08925144 = queryWeight, product of:
                1.6238408 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.022722386 = queryNorm
              0.22677183 = fieldWeight in 3531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.09375 = fieldNorm(doc=3531)
          0.07510824 = weight(abstract_txt:retrieval in 3531) [ClassicSimilarity], result of:
            0.07510824 = score(doc=3531,freq=1.0), product of:
              0.23044857 = queryWeight, product of:
                2.9172783 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.022722386 = queryNorm
              0.3259219 = fieldWeight in 3531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=3531)
        0.2 = coord(5/25)
    
  2. Burrell, Q.L.: "Ambiguity" and scientometric measurement : a dissenting view (2001) 0.12
    0.11538035 = sum of:
      0.11538035 = product of:
        0.48075145 = sum of:
          0.07626945 = weight(abstract_txt:mathematical in 981) [ClassicSimilarity], result of:
            0.07626945 = score(doc=981,freq=1.0), product of:
              0.15374945 = queryWeight, product of:
                1.0656452 = boost
                6.3496094 = idf(docFreq=210, maxDocs=44421)
                0.022722386 = queryNorm
              0.49606323 = fieldWeight in 981, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3496094 = idf(docFreq=210, maxDocs=44421)
                0.078125 = fieldNorm(doc=981)
          0.086436264 = weight(abstract_txt:modelling in 981) [ClassicSimilarity], result of:
            0.086436264 = score(doc=981,freq=1.0), product of:
              0.16712593 = queryWeight, product of:
                1.1110351 = boost
                6.6200633 = idf(docFreq=160, maxDocs=44421)
                0.022722386 = queryNorm
              0.5171924 = fieldWeight in 981, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6200633 = idf(docFreq=160, maxDocs=44421)
                0.078125 = fieldNorm(doc=981)
          0.08844804 = weight(abstract_txt:mathematics in 981) [ClassicSimilarity], result of:
            0.08844804 = score(doc=981,freq=1.0), product of:
              0.16970918 = queryWeight, product of:
                1.1195887 = boost
                6.6710296 = idf(docFreq=152, maxDocs=44421)
                0.022722386 = queryNorm
              0.5211742 = fieldWeight in 981, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6710296 = idf(docFreq=152, maxDocs=44421)
                0.078125 = fieldNorm(doc=981)
          0.09232023 = weight(abstract_txt:probabilistic in 981) [ClassicSimilarity], result of:
            0.09232023 = score(doc=981,freq=1.0), product of:
              0.17462687 = queryWeight, product of:
                1.1356941 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.022722386 = queryNorm
              0.5286714 = fieldWeight in 981, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.078125 = fieldNorm(doc=981)
          0.016866427 = weight(abstract_txt:information in 981) [ClassicSimilarity], result of:
            0.016866427 = score(doc=981,freq=1.0), product of:
              0.08925144 = queryWeight, product of:
                1.6238408 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.022722386 = queryNorm
              0.18897653 = fieldWeight in 981, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.078125 = fieldNorm(doc=981)
          0.12041102 = weight(abstract_txt:processes in 981) [ClassicSimilarity], result of:
            0.12041102 = score(doc=981,freq=1.0), product of:
              0.3006522 = queryWeight, product of:
                2.5810635 = boost
                5.126392 = idf(docFreq=716, maxDocs=44421)
                0.022722386 = queryNorm
              0.40049937 = fieldWeight in 981, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.126392 = idf(docFreq=716, maxDocs=44421)
                0.078125 = fieldNorm(doc=981)
        0.24 = coord(6/25)
    
  3. Cooper, W.S.: Some inconsistencies and misidentified modelling assumptions in probabilistic information retrieval (1995) 0.11
    0.10703441 = sum of:
      0.10703441 = product of:
        0.53517205 = sum of:
          0.12203112 = weight(abstract_txt:mathematical in 2007) [ClassicSimilarity], result of:
            0.12203112 = score(doc=2007,freq=1.0), product of:
              0.15374945 = queryWeight, product of:
                1.0656452 = boost
                6.3496094 = idf(docFreq=210, maxDocs=44421)
                0.022722386 = queryNorm
              0.7937012 = fieldWeight in 2007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3496094 = idf(docFreq=210, maxDocs=44421)
                0.125 = fieldNorm(doc=2007)
          0.13829802 = weight(abstract_txt:modelling in 2007) [ClassicSimilarity], result of:
            0.13829802 = score(doc=2007,freq=1.0), product of:
              0.16712593 = queryWeight, product of:
                1.1110351 = boost
                6.6200633 = idf(docFreq=160, maxDocs=44421)
                0.022722386 = queryNorm
              0.8275079 = fieldWeight in 2007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6200633 = idf(docFreq=160, maxDocs=44421)
                0.125 = fieldNorm(doc=2007)
          0.14771236 = weight(abstract_txt:probabilistic in 2007) [ClassicSimilarity], result of:
            0.14771236 = score(doc=2007,freq=1.0), product of:
              0.17462687 = queryWeight, product of:
                1.1356941 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.022722386 = queryNorm
              0.8458742 = fieldWeight in 2007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.125 = fieldNorm(doc=2007)
          0.026986282 = weight(abstract_txt:information in 2007) [ClassicSimilarity], result of:
            0.026986282 = score(doc=2007,freq=1.0), product of:
              0.08925144 = queryWeight, product of:
                1.6238408 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.022722386 = queryNorm
              0.30236244 = fieldWeight in 2007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.125 = fieldNorm(doc=2007)
          0.10014431 = weight(abstract_txt:retrieval in 2007) [ClassicSimilarity], result of:
            0.10014431 = score(doc=2007,freq=1.0), product of:
              0.23044857 = queryWeight, product of:
                2.9172783 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.022722386 = queryNorm
              0.4345625 = fieldWeight in 2007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.125 = fieldNorm(doc=2007)
        0.2 = coord(5/25)
    
  4. Wong, S.K.M.: On modelling information retrieval with probabilistic inference (1995) 0.10
    0.10045009 = sum of:
      0.10045009 = product of:
        0.50225043 = sum of:
          0.09152334 = weight(abstract_txt:mathematical in 2006) [ClassicSimilarity], result of:
            0.09152334 = score(doc=2006,freq=1.0), product of:
              0.15374945 = queryWeight, product of:
                1.0656452 = boost
                6.3496094 = idf(docFreq=210, maxDocs=44421)
                0.022722386 = queryNorm
              0.5952759 = fieldWeight in 2006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3496094 = idf(docFreq=210, maxDocs=44421)
                0.09375 = fieldNorm(doc=2006)
          0.10372352 = weight(abstract_txt:modelling in 2006) [ClassicSimilarity], result of:
            0.10372352 = score(doc=2006,freq=1.0), product of:
              0.16712593 = queryWeight, product of:
                1.1110351 = boost
                6.6200633 = idf(docFreq=160, maxDocs=44421)
                0.022722386 = queryNorm
              0.6206309 = fieldWeight in 2006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6200633 = idf(docFreq=160, maxDocs=44421)
                0.09375 = fieldNorm(doc=2006)
          0.15667263 = weight(abstract_txt:probabilistic in 2006) [ClassicSimilarity], result of:
            0.15667263 = score(doc=2006,freq=2.0), product of:
              0.17462687 = queryWeight, product of:
                1.1356941 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.022722386 = queryNorm
              0.8971851 = fieldWeight in 2006, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.09375 = fieldNorm(doc=2006)
          0.02023971 = weight(abstract_txt:information in 2006) [ClassicSimilarity], result of:
            0.02023971 = score(doc=2006,freq=1.0), product of:
              0.08925144 = queryWeight, product of:
                1.6238408 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.022722386 = queryNorm
              0.22677183 = fieldWeight in 2006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.09375 = fieldNorm(doc=2006)
          0.13009126 = weight(abstract_txt:retrieval in 2006) [ClassicSimilarity], result of:
            0.13009126 = score(doc=2006,freq=3.0), product of:
              0.23044857 = queryWeight, product of:
                2.9172783 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.022722386 = queryNorm
              0.5645132 = fieldWeight in 2006, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=2006)
        0.2 = coord(5/25)
    
  5. Dominich, S.: A unified mathematical definition of classical information retrieval (2000) 0.10
    0.09524977 = sum of:
      0.09524977 = product of:
        0.5953111 = sum of:
          0.18304668 = weight(abstract_txt:mathematical in 5768) [ClassicSimilarity], result of:
            0.18304668 = score(doc=5768,freq=1.0), product of:
              0.15374945 = queryWeight, product of:
                1.0656452 = boost
                6.3496094 = idf(docFreq=210, maxDocs=44421)
                0.022722386 = queryNorm
              1.1905518 = fieldWeight in 5768, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3496094 = idf(docFreq=210, maxDocs=44421)
                0.1875 = fieldNorm(doc=5768)
          0.22156854 = weight(abstract_txt:probabilistic in 5768) [ClassicSimilarity], result of:
            0.22156854 = score(doc=5768,freq=1.0), product of:
              0.17462687 = queryWeight, product of:
                1.1356941 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.022722386 = queryNorm
              1.2688112 = fieldWeight in 5768, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.1875 = fieldNorm(doc=5768)
          0.04047942 = weight(abstract_txt:information in 5768) [ClassicSimilarity], result of:
            0.04047942 = score(doc=5768,freq=1.0), product of:
              0.08925144 = queryWeight, product of:
                1.6238408 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.022722386 = queryNorm
              0.45354366 = fieldWeight in 5768, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.1875 = fieldNorm(doc=5768)
          0.15021648 = weight(abstract_txt:retrieval in 5768) [ClassicSimilarity], result of:
            0.15021648 = score(doc=5768,freq=1.0), product of:
              0.23044857 = queryWeight, product of:
                2.9172783 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.022722386 = queryNorm
              0.6518438 = fieldWeight in 5768, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.1875 = fieldNorm(doc=5768)
        0.16 = coord(4/25)
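
The content-based scores combine several boosted terms and then scale the sum by a coordination factor (matched terms / query terms). As an illustration, the sketch below recomputes the first content match (Egghe, doc 3531) from the boost, docFreq, freq, fieldNorm, queryNorm and coord values printed in its explanation tree. It assumes the same tf and idf formulas as the classic Lucene TF-IDF similarity, and the helper names are hypothetical.

```python
import math

# Constants copied from the explanation tree for doc 3531.
MAX_DOCS = 44421
QUERY_NORM = 0.022722386
FIELD_NORM = 0.09375
COORD = 5 / 25  # 5 of the 25 query terms matched

# (term, boost, docFreq, freq) as printed for abstract_txt.
TERMS = [
    ("operations", 1.096085, 175, 3.0),
    ("probabilistic", 1.1356941, 138, 2.0),
    ("algebraic", 1.5333679, 12, 2.0),
    ("information", 1.6238408, 10748, 1.0),
    ("retrieval", 2.9172783, 3732, 1.0),
]


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Assumed idf formula: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float) -> float:
    """Per-term score: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * FIELD_NORM
    return query_weight * field_weight


# Sum the per-term scores, then apply the coordination factor.
total = COORD * sum(term_score(b, df, f) for _, b, df, f in TERMS)
print(f"{total:.4f}")
```

Under these assumptions the result should land close to the 0.16202591 shown for that document. The rare term "algebraic" (docFreq=12) contributes the largest share, while the common term "information" contributes almost nothing despite its higher boost.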