Document (#29423)

Author
Sparck Jones, K.
Title
IDF term weighting and IR research lessons
Source
Journal of documentation. 60(2004) no.5, S.521-523
Year
2004
Abstract
Robertson comments on the theoretical status of IDF term weighting. Its history illustrates how ideas develop in a specific research context, in theory/experiment interaction, and in operational practice.
Footnote
Vgl. auch unter:http://www.emeraldinsight.com/10.1108/00220410410560591.
Theme
Retrievalalgorithmen
Object
IDF

Similar documents (author)

  1. Sparck Jones, K.: Fashionable trends and feasible strategies in information management (1988) 5.31
    5.3145585 = sum of:
      5.3145585 = sum of:
        2.0560415 = weight(author_txt:jones in 816) [ClassicSimilarity], result of:
          2.0560415 = score(doc=816,freq=1.0), product of:
            0.5925794 = queryWeight, product of:
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.08539477 = queryNorm
            3.469647 = fieldWeight in 816, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.5 = fieldNorm(doc=816)
        3.258517 = weight(author_txt:sparck in 816) [ClassicSimilarity], result of:
          3.258517 = score(doc=816,freq=1.0), product of:
            0.80551195 = queryWeight, product of:
              1.1659038 = boost
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.08539477 = queryNorm
            4.0452747 = fieldWeight in 816, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.5 = fieldNorm(doc=816)
    
  2. Sparck Jones, K.: Automatic classification (1976) 5.31
    5.3145585 = sum of:
      5.3145585 = sum of:
        2.0560415 = weight(author_txt:jones in 2907) [ClassicSimilarity], result of:
          2.0560415 = score(doc=2907,freq=1.0), product of:
            0.5925794 = queryWeight, product of:
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.08539477 = queryNorm
            3.469647 = fieldWeight in 2907, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.5 = fieldNorm(doc=2907)
        3.258517 = weight(author_txt:sparck in 2907) [ClassicSimilarity], result of:
          3.258517 = score(doc=2907,freq=1.0), product of:
            0.80551195 = queryWeight, product of:
              1.1659038 = boost
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.08539477 = queryNorm
            4.0452747 = fieldWeight in 2907, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.5 = fieldNorm(doc=2907)
    
  3. Sparck Jones, K.: ¬The role of artificial intelligence in information retrieval (1991) 5.31
    5.3145585 = sum of:
      5.3145585 = sum of:
        2.0560415 = weight(author_txt:jones in 4810) [ClassicSimilarity], result of:
          2.0560415 = score(doc=4810,freq=1.0), product of:
            0.5925794 = queryWeight, product of:
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.08539477 = queryNorm
            3.469647 = fieldWeight in 4810, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.5 = fieldNorm(doc=4810)
        3.258517 = weight(author_txt:sparck in 4810) [ClassicSimilarity], result of:
          3.258517 = score(doc=4810,freq=1.0), product of:
            0.80551195 = queryWeight, product of:
              1.1659038 = boost
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.08539477 = queryNorm
            4.0452747 = fieldWeight in 4810, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.5 = fieldNorm(doc=4810)
    
  4. Sparck Jones, K.: Automatic keyword classification for information retrieval (1971) 5.31
    5.3145585 = sum of:
      5.3145585 = sum of:
        2.0560415 = weight(author_txt:jones in 5175) [ClassicSimilarity], result of:
          2.0560415 = score(doc=5175,freq=1.0), product of:
            0.5925794 = queryWeight, product of:
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.08539477 = queryNorm
            3.469647 = fieldWeight in 5175, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.5 = fieldNorm(doc=5175)
        3.258517 = weight(author_txt:sparck in 5175) [ClassicSimilarity], result of:
          3.258517 = score(doc=5175,freq=1.0), product of:
            0.80551195 = queryWeight, product of:
              1.1659038 = boost
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.08539477 = queryNorm
            4.0452747 = fieldWeight in 5175, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.5 = fieldNorm(doc=5175)
    
  5. Sparck Jones, K.: ¬A statistical interpretation of term specifity and its application in retrieval (1972) 5.31
    5.3145585 = sum of:
      5.3145585 = sum of:
        2.0560415 = weight(author_txt:jones in 5186) [ClassicSimilarity], result of:
          2.0560415 = score(doc=5186,freq=1.0), product of:
            0.5925794 = queryWeight, product of:
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.08539477 = queryNorm
            3.469647 = fieldWeight in 5186, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.939294 = idf(docFreq=116, maxDocs=44421)
              0.5 = fieldNorm(doc=5186)
        3.258517 = weight(author_txt:sparck in 5186) [ClassicSimilarity], result of:
          3.258517 = score(doc=5186,freq=1.0), product of:
            0.80551195 = queryWeight, product of:
              1.1659038 = boost
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.08539477 = queryNorm
            4.0452747 = fieldWeight in 5186, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.090549 = idf(docFreq=36, maxDocs=44421)
              0.5 = fieldNorm(doc=5186)
    

Similar documents (content)

  1. Wong, S.K.M.: On modelling information retrieval with probabilistic inference (1995) 0.22
    0.22198637 = sum of:
      0.22198637 = product of:
        0.70295686 = sum of:
          0.04197686 = weight(abstract_txt:context in 2006) [ClassicSimilarity], result of:
            0.04197686 = score(doc=2006,freq=1.0), product of:
              0.10333256 = queryWeight, product of:
                1.0063593 = boost
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.023696411 = queryNorm
              0.40623075 = fieldWeight in 2006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.333128 = idf(docFreq=1584, maxDocs=44421)
                0.09375 = fieldNorm(doc=2006)
          0.0479563 = weight(abstract_txt:theory in 2006) [ClassicSimilarity], result of:
            0.0479563 = score(doc=2006,freq=1.0), product of:
              0.112926096 = queryWeight, product of:
                1.0520386 = boost
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.023696411 = queryNorm
              0.42466977 = fieldWeight in 2006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.09375 = fieldNorm(doc=2006)
          0.05837559 = weight(abstract_txt:theoretical in 2006) [ClassicSimilarity], result of:
            0.05837559 = score(doc=2006,freq=1.0), product of:
              0.12874135 = queryWeight, product of:
                1.1232942 = boost
                4.83662 = idf(docFreq=957, maxDocs=44421)
                0.023696411 = queryNorm
              0.4534331 = fieldWeight in 2006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.83662 = idf(docFreq=957, maxDocs=44421)
                0.09375 = fieldNorm(doc=2006)
          0.090882815 = weight(abstract_txt:ideas in 2006) [ClassicSimilarity], result of:
            0.090882815 = score(doc=2006,freq=1.0), product of:
              0.17293586 = queryWeight, product of:
                1.3018982 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.023696411 = queryNorm
              0.525529 = fieldWeight in 2006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.09375 = fieldNorm(doc=2006)
          0.11374264 = weight(abstract_txt:term in 2006) [ClassicSimilarity], result of:
            0.11374264 = score(doc=2006,freq=1.0), product of:
              0.25304013 = queryWeight, product of:
                2.227123 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.023696411 = queryNorm
              0.44950435 = fieldWeight in 2006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.09375 = fieldNorm(doc=2006)
          0.35002264 = weight(abstract_txt:weighting in 2006) [ClassicSimilarity], result of:
            0.35002264 = score(doc=2006,freq=1.0), product of:
              0.5353502 = queryWeight, product of:
                3.2394292 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.023696411 = queryNorm
              0.65382 = fieldWeight in 2006, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.09375 = fieldNorm(doc=2006)
        0.31578946 = coord(6/19)
    
  2. Robertson, S.E.: OKAPI at TREC-1 (1994) 0.20
    0.2011005 = sum of:
      0.2011005 = product of:
        1.2736366 = sum of:
          0.46197078 = weight(abstract_txt:robertson in 7952) [ClassicSimilarity], result of:
            0.46197078 = score(doc=7952,freq=1.0), product of:
              0.42203426 = queryWeight, product of:
                2.0338006 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.023696411 = queryNorm
              1.0946286 = fieldWeight in 7952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.125 = fieldNorm(doc=7952)
          0.15165685 = weight(abstract_txt:term in 7952) [ClassicSimilarity], result of:
            0.15165685 = score(doc=7952,freq=1.0), product of:
              0.25304013 = queryWeight, product of:
                2.227123 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.023696411 = queryNorm
              0.5993391 = fieldWeight in 7952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.125 = fieldNorm(doc=7952)
          0.660009 = weight(abstract_txt:weighting in 7952) [ClassicSimilarity], result of:
            0.660009 = score(doc=7952,freq=2.0), product of:
              0.5353502 = queryWeight, product of:
                3.2394292 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.023696411 = queryNorm
              1.2328547 = fieldWeight in 7952, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.125 = fieldNorm(doc=7952)
        0.15789473 = coord(3/19)
    
  3. Robertson, S.E.; Sparck Jones, K.: Relevance weighting of search terms (1976) 0.19
    0.18513772 = sum of:
      0.18513772 = product of:
        0.8794042 = sum of:
          0.048050452 = weight(abstract_txt:specific in 139) [ClassicSimilarity], result of:
            0.048050452 = score(doc=139,freq=1.0), product of:
              0.10203073 = queryWeight, product of:
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.023696411 = queryNorm
              0.47094098 = fieldWeight in 139, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.109375 = fieldNorm(doc=139)
          0.055949014 = weight(abstract_txt:theory in 139) [ClassicSimilarity], result of:
            0.055949014 = score(doc=139,freq=1.0), product of:
              0.112926096 = queryWeight, product of:
                1.0520386 = boost
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.023696411 = queryNorm
              0.49544805 = fieldWeight in 139, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.109375 = fieldNorm(doc=139)
          0.068104856 = weight(abstract_txt:theoretical in 139) [ClassicSimilarity], result of:
            0.068104856 = score(doc=139,freq=1.0), product of:
              0.12874135 = queryWeight, product of:
                1.1232942 = boost
                4.83662 = idf(docFreq=957, maxDocs=44421)
                0.023696411 = queryNorm
              0.5290053 = fieldWeight in 139, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.83662 = idf(docFreq=957, maxDocs=44421)
                0.109375 = fieldNorm(doc=139)
          0.7072998 = weight(abstract_txt:weighting in 139) [ClassicSimilarity], result of:
            0.7072998 = score(doc=139,freq=3.0), product of:
              0.5353502 = queryWeight, product of:
                3.2394292 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.023696411 = queryNorm
              1.321191 = fieldWeight in 139, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.109375 = fieldNorm(doc=139)
        0.21052632 = coord(4/19)
    
  4. Dominich, S.; Góth, J.; Kiezer, T.; Szlávik, Z.: ¬An entropy-based interpretation of retrieval status value-based retrieval, and its application to the computation of term and query discrimination value (2004) 0.17
    0.1665383 = sum of:
      0.1665383 = product of:
        0.6328455 = sum of:
          0.03383356 = weight(abstract_txt:practice in 3237) [ClassicSimilarity], result of:
            0.03383356 = score(doc=3237,freq=1.0), product of:
              0.12818912 = queryWeight, product of:
                1.1208825 = boost
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.023696411 = queryNorm
              0.26393473 = fieldWeight in 3237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3237)
          0.04815741 = weight(abstract_txt:theoretical in 3237) [ClassicSimilarity], result of:
            0.04815741 = score(doc=3237,freq=2.0), product of:
              0.12874135 = queryWeight, product of:
                1.1232942 = boost
                4.83662 = idf(docFreq=957, maxDocs=44421)
                0.023696411 = queryNorm
              0.37406325 = fieldWeight in 3237, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.83662 = idf(docFreq=957, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3237)
          0.06450491 = weight(abstract_txt:status in 3237) [ClassicSimilarity], result of:
            0.06450491 = score(doc=3237,freq=1.0), product of:
              0.19709753 = queryWeight, product of:
                1.3898729 = boost
                5.98444 = idf(docFreq=303, maxDocs=44421)
                0.023696411 = queryNorm
              0.32727405 = fieldWeight in 3237, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.98444 = idf(docFreq=303, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3237)
          0.13269976 = weight(abstract_txt:term in 3237) [ClassicSimilarity], result of:
            0.13269976 = score(doc=3237,freq=4.0), product of:
              0.25304013 = queryWeight, product of:
                2.227123 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.023696411 = queryNorm
              0.52442175 = fieldWeight in 3237, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3237)
          0.3536499 = weight(abstract_txt:weighting in 3237) [ClassicSimilarity], result of:
            0.3536499 = score(doc=3237,freq=3.0), product of:
              0.5353502 = queryWeight, product of:
                3.2394292 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.023696411 = queryNorm
              0.6605955 = fieldWeight in 3237, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3237)
        0.2631579 = coord(5/19)
    
  5. Li, X.; Zhang, A.; Li, C.; Ouyang, J.; Cai, Y.: Exploring coherent topics by topic modeling with term weighting (2018) 0.16
    0.16299082 = sum of:
      0.16299082 = product of:
        0.7742064 = sum of:
          0.027457401 = weight(abstract_txt:specific in 45) [ClassicSimilarity], result of:
            0.027457401 = score(doc=45,freq=1.0), product of:
              0.10203073 = queryWeight, product of:
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.023696411 = queryNorm
              0.26910913 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.0625 = fieldNorm(doc=45)
          0.043825634 = weight(abstract_txt:develop in 45) [ClassicSimilarity], result of:
            0.043825634 = score(doc=45,freq=1.0), product of:
              0.13935103 = queryWeight, product of:
                1.1686637 = boost
                5.0319695 = idf(docFreq=787, maxDocs=44421)
                0.023696411 = queryNorm
              0.3144981 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0319695 = idf(docFreq=787, maxDocs=44421)
                0.0625 = fieldNorm(doc=45)
          0.13133869 = weight(abstract_txt:term in 45) [ClassicSimilarity], result of:
            0.13133869 = score(doc=45,freq=3.0), product of:
              0.25304013 = queryWeight, product of:
                2.227123 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.023696411 = queryNorm
              0.5190429 = fieldWeight in 45, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0625 = fieldNorm(doc=45)
          0.57158464 = weight(abstract_txt:weighting in 45) [ClassicSimilarity], result of:
            0.57158464 = score(doc=45,freq=6.0), product of:
              0.5353502 = queryWeight, product of:
                3.2394292 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.023696411 = queryNorm
              1.0676836 = fieldWeight in 45, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.0625 = fieldNorm(doc=45)
        0.21052632 = coord(4/19)