Document (#34861)

Author
Bache, R.
Baillie, M.
Crestani, F.
Title
Measuring the likelihood property of scoring functions in general retrieval models
Source
Journal of the American Society for Information Science and Technology. 60(2009) no.6, p.1294-1297
Year
2009
Series
Brief Communications
Abstract
Although retrieval systems based on probabilistic models will rank the objects (e.g., documents) being retrieved according to the probability of some matching criterion (e.g., relevance), they rarely yield an actual probability, and the scoring function is interpreted to be purely ordinal within a given retrieval task. In this brief communication, it is shown that some scoring functions possess the likelihood property: the score indicates the likelihood of matching when compared across retrieval tasks. This is potentially more useful than pure ranking, although the score still cannot be interpreted as an actual probability. The property can be detected by using two modified effectiveness measures: entire precision and entire recall.

Similar documents (author)

  1. Crestani, F.: Combination of similarity measures for effective spoken document retrieval (2003) 1.69
    1.6947751 = sum of:
      1.6947751 = product of:
        3.3895502 = sum of:
          3.3895502 = weight(author_txt:crestani in 5690) [ClassicSimilarity], result of:
            3.3895502 = score(doc=5690,freq=1.0), product of:
              0.6229549 = queryWeight, product of:
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.071556844 = queryNorm
              5.4410844 = fieldWeight in 5690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.625 = fieldNorm(doc=5690)
        0.5 = coord(1/2)
    
  2. Lee, M.; Baillie, S.; Dell'Oro, J.: TML: a Thesaural Markup Language (200?) 1.43
    1.4308801 = sum of:
      1.4308801 = product of:
        2.8617601 = sum of:
          2.8617601 = weight(author_txt:baillie in 2622) [ClassicSimilarity], result of:
            2.8617601 = score(doc=2622,freq=1.0), product of:
              0.7822578 = queryWeight, product of:
                1.1205897 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.071556844 = queryNorm
              3.6583338 = fieldWeight in 2622, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.375 = fieldNorm(doc=2622)
        0.5 = coord(1/2)
    
  3. Ruthven, I.; Baillie, M.; Elsweiler, D.: The relative effects of knowledge, interest and confidence in assessing relevance (2007) 1.43
    1.4308801 = sum of:
      1.4308801 = product of:
        2.8617601 = sum of:
          2.8617601 = weight(author_txt:baillie in 1835) [ClassicSimilarity], result of:
            2.8617601 = score(doc=1835,freq=1.0), product of:
              0.7822578 = queryWeight, product of:
                1.1205897 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.071556844 = queryNorm
              3.6583338 = fieldWeight in 1835, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.375 = fieldNorm(doc=1835)
        0.5 = coord(1/2)
    
  4. Baillie, M.; Azzopardi, L.; Ruthven, I.: Evaluating epistemic uncertainty under incomplete assessments (2008) 1.43
    1.4308801 = sum of:
      1.4308801 = product of:
        2.8617601 = sum of:
          2.8617601 = weight(author_txt:baillie in 3065) [ClassicSimilarity], result of:
            2.8617601 = score(doc=3065,freq=1.0), product of:
              0.7822578 = queryWeight, product of:
                1.1205897 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.071556844 = queryNorm
              3.6583338 = fieldWeight in 3065, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.375 = fieldNorm(doc=3065)
        0.5 = coord(1/2)
    
  5. Htun, N.N.; Halvey, M.; Baillie, L.: Beyond traditional collaborative search : understanding the effect of awareness on multi-level collaborative information retrieval (2018) 1.43
    1.4308801 = sum of:
      1.4308801 = product of:
        2.8617601 = sum of:
          2.8617601 = weight(author_txt:baillie in 94) [ClassicSimilarity], result of:
            2.8617601 = score(doc=94,freq=1.0), product of:
              0.7822578 = queryWeight, product of:
                1.1205897 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.071556844 = queryNorm
              3.6583338 = fieldWeight in 94, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.375 = fieldNorm(doc=94)
        0.5 = coord(1/2)
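
The author-match trees above all follow the same single-term pattern: a query weight (boost x idf x queryNorm) multiplied by a field weight (tf x idf x fieldNorm), then scaled by coord. The sketch below recomputes two of the listed scores from the raw parameters. It assumes the classic Lucene TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the idf values printed in the trees. The function names are illustrative and not part of any Lucene API, and small differences in the last digits are expected because the listing appears to use single-precision arithmetic.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Assumed idf form; it matches the idf values printed in the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm           # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# Entry 1 (doc 5690): author_txt:crestani, no boost line in the tree.
crestani_raw = term_score(freq=1.0, doc_freq=19, max_docs=44421,
                          query_norm=0.071556844, field_norm=0.625)
crestani_final = crestani_raw * 0.5  # coord(1/2): one of two query terms matched

# Entry 2 (doc 2622): author_txt:baillie, with the listed boost.
baillie_raw = term_score(freq=1.0, doc_freq=6, max_docs=44421,
                         query_norm=0.071556844, field_norm=0.375,
                         boost=1.1205897)
baillie_final = baillie_raw * 0.5

print(crestani_final, baillie_final)
```

The two printed values can be compared against the top-level figures of entries 1 and 2 (1.6947751 and 1.4308801).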
    

Similar documents (content)

  1. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: A retrieval model family based on the probability ranking principle for ad hoc retrieval (2022) 0.15
    0.14900124 = sum of:
      0.14900124 = product of:
        0.6208385 = sum of:
          0.062431745 = weight(abstract_txt:probabilistic in 1639) [ClassicSimilarity], result of:
            0.062431745 = score(doc=1639,freq=1.0), product of:
              0.098409824 = queryWeight, product of:
                1.0278758 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.014148228 = queryNorm
              0.6344056 = fieldWeight in 1639, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.09375 = fieldNorm(doc=1639)
          0.020057904 = weight(abstract_txt:some in 1639) [ClassicSimilarity], result of:
            0.020057904 = score(doc=1639,freq=1.0), product of:
              0.05816144 = queryWeight, product of:
                1.1175166 = boost
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.014148228 = queryNorm
              0.344866 = fieldWeight in 1639, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.09375 = fieldNorm(doc=1639)
          0.0796316 = weight(abstract_txt:models in 1639) [ClassicSimilarity], result of:
            0.0796316 = score(doc=1639,freq=4.0), product of:
              0.091864645 = queryWeight, product of:
                1.404464 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.014148228 = queryNorm
              0.86683613 = fieldWeight in 1639, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.09375 = fieldNorm(doc=1639)
          0.06972349 = weight(abstract_txt:function in 1639) [ClassicSimilarity], result of:
            0.06972349 = score(doc=1639,freq=1.0), product of:
              0.13346402 = queryWeight, product of:
                1.6928501 = boost
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.014148228 = queryNorm
              0.5224141 = fieldWeight in 1639, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.09375 = fieldNorm(doc=1639)
          0.04788725 = weight(abstract_txt:retrieval in 1639) [ClassicSimilarity], result of:
            0.04788725 = score(doc=1639,freq=2.0), product of:
              0.10389422 = queryWeight, product of:
                2.1122587 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.014148228 = queryNorm
              0.46092314 = fieldWeight in 1639, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=1639)
          0.3411065 = weight(abstract_txt:probability in 1639) [ClassicSimilarity], result of:
            0.3411065 = score(doc=1639,freq=3.0), product of:
              0.3052775 = queryWeight, product of:
                3.1356635 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.014148228 = queryNorm
              1.1173654 = fieldWeight in 1639, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.09375 = fieldNorm(doc=1639)
        0.24 = coord(6/25)
    
  2. Fuhr, N.: Probabilistic datalog : implementing logical information retrieval for advanced applications (2000) 0.11
    0.11114567 = sum of:
      0.11114567 = product of:
        0.5557283 = sum of:
          0.08324233 = weight(abstract_txt:probabilistic in 5380) [ClassicSimilarity], result of:
            0.08324233 = score(doc=5380,freq=1.0), product of:
              0.098409824 = queryWeight, product of:
                1.0278758 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.014148228 = queryNorm
              0.8458742 = fieldWeight in 5380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.125 = fieldNorm(doc=5380)
          0.05308773 = weight(abstract_txt:models in 5380) [ClassicSimilarity], result of:
            0.05308773 = score(doc=5380,freq=1.0), product of:
              0.091864645 = queryWeight, product of:
                1.404464 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.014148228 = queryNorm
              0.57789075 = fieldWeight in 5380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.125 = fieldNorm(doc=5380)
          0.09296466 = weight(abstract_txt:function in 5380) [ClassicSimilarity], result of:
            0.09296466 = score(doc=5380,freq=1.0), product of:
              0.13346402 = queryWeight, product of:
                1.6928501 = boost
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.014148228 = queryNorm
              0.69655216 = fieldWeight in 5380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.125 = fieldNorm(doc=5380)
          0.063849665 = weight(abstract_txt:retrieval in 5380) [ClassicSimilarity], result of:
            0.063849665 = score(doc=5380,freq=2.0), product of:
              0.10389422 = queryWeight, product of:
                2.1122587 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.014148228 = queryNorm
              0.6145642 = fieldWeight in 5380, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.125 = fieldNorm(doc=5380)
          0.2625839 = weight(abstract_txt:probability in 5380) [ClassicSimilarity], result of:
            0.2625839 = score(doc=5380,freq=1.0), product of:
              0.3052775 = queryWeight, product of:
                3.1356635 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.014148228 = queryNorm
              0.86014825 = fieldWeight in 5380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.125 = fieldNorm(doc=5380)
        0.2 = coord(5/25)
    
  3. Liu, X.; Croft, W.B.: Statistical language modeling for information retrieval (2004) 0.10
    0.098253824 = sum of:
      0.098253824 = product of:
        0.40939093 = sum of:
          0.03641852 = weight(abstract_txt:probabilistic in 5277) [ClassicSimilarity], result of:
            0.03641852 = score(doc=5277,freq=1.0), product of:
              0.098409824 = queryWeight, product of:
                1.0278758 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.014148228 = queryNorm
              0.37006995 = fieldWeight in 5277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5277)
          0.032846358 = weight(abstract_txt:models in 5277) [ClassicSimilarity], result of:
            0.032846358 = score(doc=5277,freq=2.0), product of:
              0.091864645 = queryWeight, product of:
                1.404464 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.014148228 = queryNorm
              0.35755166 = fieldWeight in 5277, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5277)
          0.040672034 = weight(abstract_txt:function in 5277) [ClassicSimilarity], result of:
            0.040672034 = score(doc=5277,freq=1.0), product of:
              0.13346402 = queryWeight, product of:
                1.6928501 = boost
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.014148228 = queryNorm
              0.30474156 = fieldWeight in 5277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5277)
          0.034212302 = weight(abstract_txt:retrieval in 5277) [ClassicSimilarity], result of:
            0.034212302 = score(doc=5277,freq=3.0), product of:
              0.10389422 = queryWeight, product of:
                2.1122587 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.014148228 = queryNorm
              0.3292994 = fieldWeight in 5277, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5277)
          0.11488046 = weight(abstract_txt:probability in 5277) [ClassicSimilarity], result of:
            0.11488046 = score(doc=5277,freq=1.0), product of:
              0.3052775 = queryWeight, product of:
                3.1356635 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.014148228 = queryNorm
              0.37631485 = fieldWeight in 5277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5277)
          0.15036125 = weight(abstract_txt:likelihood in 5277) [ClassicSimilarity], result of:
            0.15036125 = score(doc=5277,freq=1.0), product of:
              0.36527616 = queryWeight, product of:
                3.4299889 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.014148228 = queryNorm
              0.41163722 = fieldWeight in 5277, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5277)
        0.24 = coord(6/25)
    
  4. Robertson, S.E.; Sparck Jones, K.: Simple, proven approaches to text retrieval (1997) 0.10
    0.09507667 = sum of:
      0.09507667 = product of:
        0.47538334 = sum of:
          0.013371936 = weight(abstract_txt:some in 5532) [ClassicSimilarity], result of:
            0.013371936 = score(doc=5532,freq=1.0), product of:
              0.05816144 = queryWeight, product of:
                1.1175166 = boost
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.014148228 = queryNorm
              0.22991067 = fieldWeight in 5532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.0625 = fieldNorm(doc=5532)
          0.05925104 = weight(abstract_txt:matching in 5532) [ClassicSimilarity], result of:
            0.05925104 = score(doc=5532,freq=1.0), product of:
              0.15690482 = queryWeight, product of:
                1.8355007 = boost
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.014148228 = queryNorm
              0.3776241 = fieldWeight in 5532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.0625 = fieldNorm(doc=5532)
          0.06084436 = weight(abstract_txt:actual in 5532) [ClassicSimilarity], result of:
            0.06084436 = score(doc=5532,freq=1.0), product of:
              0.15970525 = queryWeight, product of:
                1.8518083 = boost
                6.0956655 = idf(docFreq=271, maxDocs=44421)
                0.014148228 = queryNorm
              0.3809791 = fieldWeight in 5532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0956655 = idf(docFreq=271, maxDocs=44421)
                0.0625 = fieldNorm(doc=5532)
          0.045148533 = weight(abstract_txt:retrieval in 5532) [ClassicSimilarity], result of:
            0.045148533 = score(doc=5532,freq=4.0), product of:
              0.10389422 = queryWeight, product of:
                2.1122587 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.014148228 = queryNorm
              0.4345625 = fieldWeight in 5532, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=5532)
          0.29676747 = weight(abstract_txt:scoring in 5532) [ClassicSimilarity], result of:
            0.29676747 = score(doc=5532,freq=1.0), product of:
              0.57870847 = queryWeight, product of:
                4.9851856 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.014148228 = queryNorm
              0.51281 = fieldWeight in 5532, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.0625 = fieldNorm(doc=5532)
        0.2 = coord(5/25)
    
  5. López-Pujalte, C.; Guerrero-Bote, V.P.; Moya-Anegón, F. de: Order-based fitness functions for genetic algorithms applied to relevance feedback (2003) 0.09
    0.09351666 = sum of:
      0.09351666 = product of:
        0.4675833 = sum of:
          0.07748512 = weight(abstract_txt:functions in 154) [ClassicSimilarity], result of:
            0.07748512 = score(doc=154,freq=4.0), product of:
              0.12920892 = queryWeight, product of:
                1.6656458 = boost
                5.4828677 = idf(docFreq=501, maxDocs=44421)
                0.014148228 = queryNorm
              0.59968865 = fieldWeight in 154, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4828677 = idf(docFreq=501, maxDocs=44421)
                0.0546875 = fieldNorm(doc=154)
          0.057518944 = weight(abstract_txt:function in 154) [ClassicSimilarity], result of:
            0.057518944 = score(doc=154,freq=2.0), product of:
              0.13346402 = queryWeight, product of:
                1.6928501 = boost
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.014148228 = queryNorm
              0.43096966 = fieldWeight in 154, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5724173 = idf(docFreq=458, maxDocs=44421)
                0.0546875 = fieldNorm(doc=154)
          0.019752484 = weight(abstract_txt:retrieval in 154) [ClassicSimilarity], result of:
            0.019752484 = score(doc=154,freq=1.0), product of:
              0.10389422 = queryWeight, product of:
                2.1122587 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.014148228 = queryNorm
              0.1901211 = fieldWeight in 154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=154)
          0.1624655 = weight(abstract_txt:probability in 154) [ClassicSimilarity], result of:
            0.1624655 = score(doc=154,freq=2.0), product of:
              0.3052775 = queryWeight, product of:
                3.1356635 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.014148228 = queryNorm
              0.53218955 = fieldWeight in 154, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.0546875 = fieldNorm(doc=154)
          0.15036125 = weight(abstract_txt:likelihood in 154) [ClassicSimilarity], result of:
            0.15036125 = score(doc=154,freq=1.0), product of:
              0.36527616 = queryWeight, product of:
                3.4299889 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.014148228 = queryNorm
              0.41163722 = fieldWeight in 154, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0546875 = fieldNorm(doc=154)
        0.2 = coord(5/25)
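
The content-match trees combine several abstract_txt terms: the per-term scores are summed, and the sum is multiplied by coord(matched/total), here with 25 query terms in total. As a rough check, the sketch below rebuilds the score of entry 2 (doc 5380) from the boosts, document frequencies, and term frequencies copied from its tree. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the variable and function names are hypothetical, and the result should agree with the listing only up to single-precision rounding.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.014148228   # queryNorm shared by all abstract_txt terms
FIELD_NORM = 0.125         # fieldNorm(doc=5380)
QUERY_TERMS = 25           # denominator of coord(5/25)


def term_score(boost: float, doc_freq: int, freq: float) -> float:
    """queryWeight * fieldWeight for one matching term (assumed formula)."""
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


# term -> (boost, docFreq, freq), copied from the tree for doc 5380
terms_5380 = {
    "probabilistic": (1.0278758, 138, 1.0),
    "models": (1.404464, 1185, 1.0),
    "function": (1.6928501, 458, 1.0),
    "retrieval": (2.1122587, 3732, 2.0),
    "probability": (3.1356635, 123, 1.0),
}

raw_sum = sum(term_score(*params) for params in terms_5380.values())
coord = len(terms_5380) / QUERY_TERMS   # 5 of 25 query terms matched
total = raw_sum * coord

print(raw_sum, coord, total)
```

Because coord rewards documents that match more of the query's terms, entries 1 and 3 (6 of 25 terms) are scaled by 0.24, while entries 2, 4, and 5 (5 of 25 terms) are scaled by 0.2.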