Document (#34447)

Author
Bai, J.
Nie, J.-Y.
Title
Adapting information retrieval to query contexts
Source
Information processing and management. 44(2008) no.6, pp.1901-1922
Year
2008
Abstract
In current IR approaches, documents are retrieved only according to the terms specified in the query. The same answers are returned for the same query, whatever the user and the search goal are. In reality, many other contextual factors strongly influence a document's relevance, and they should be taken into account in IR operations. This paper proposes a method, based on language modeling, to integrate several contextual factors so that document ranking is adapted to the specific query context. We consider three contextual factors in this paper: the topic domain of the query, the characteristics of the document collection, and the context words within the query. Each contextual factor is used to generate a new query language model that specifies some aspect of the information need. All these query models are then combined to produce a more complete model of the underlying information need. Our experiments on TREC collections show that each contextual factor can positively influence IR effectiveness and that the combined model yields the highest effectiveness. This study shows that it is both beneficial and feasible to integrate more contextual factors into current IR practice.
Footnote
Contribution to a special issue on "Adaptive information retrieval"
Theme
Semantic context in indexing and retrieval

Similar documents (content)

  1. Lu, K.; Joo, S.; Lee, T.; Hu, R.: Factors that influence query reformulations and search performance in health information retrieval : a multilevel modeling approach (2017) 0.26
    0.2597146 = sum of:
      0.2597146 = product of:
        0.92755216 = sum of:
          0.021610063 = weight(abstract_txt:each in 4754) [ClassicSimilarity], result of:
            0.021610063 = score(doc=4754,freq=1.0), product of:
              0.08396921 = queryWeight, product of:
                1.1363717 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.01794501 = queryNorm
              0.25735697 = fieldWeight in 4754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.0625 = fieldNorm(doc=4754)
          0.033005927 = weight(abstract_txt:same in 4754) [ClassicSimilarity], result of:
            0.033005927 = score(doc=4754,freq=1.0), product of:
              0.1113638 = queryWeight, product of:
                1.3086768 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.01794501 = queryNorm
              0.29637933 = fieldWeight in 4754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=4754)
          0.06258775 = weight(abstract_txt:influence in 4754) [ClassicSimilarity], result of:
            0.06258775 = score(doc=4754,freq=2.0), product of:
              0.13541465 = queryWeight, product of:
                1.4430894 = boost
                5.229121 = idf(docFreq=646, maxDocs=44421)
                0.01794501 = queryNorm
              0.46219337 = fieldWeight in 4754, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.229121 = idf(docFreq=646, maxDocs=44421)
                0.0625 = fieldNorm(doc=4754)
          0.041481473 = weight(abstract_txt:model in 4754) [ClassicSimilarity], result of:
            0.041481473 = score(doc=4754,freq=2.0), product of:
              0.11783454 = queryWeight, product of:
                1.6487026 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.01794501 = queryNorm
              0.35203153 = fieldWeight in 4754, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=4754)
          0.13035138 = weight(abstract_txt:factors in 4754) [ClassicSimilarity], result of:
            0.13035138 = score(doc=4754,freq=3.0), product of:
              0.24306892 = queryWeight, product of:
                2.734262 = boost
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.01794501 = queryNorm
              0.53627336 = fieldWeight in 4754, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.0625 = fieldNorm(doc=4754)
          0.39919195 = weight(abstract_txt:query in 4754) [ClassicSimilarity], result of:
            0.39919195 = score(doc=4754,freq=9.0), product of:
              0.44779208 = queryWeight, product of:
                5.2484202 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.01794501 = queryNorm
              0.8914672 = fieldWeight in 4754, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=4754)
          0.23932366 = weight(abstract_txt:contextual in 4754) [ClassicSimilarity], result of:
            0.23932366 = score(doc=4754,freq=1.0), product of:
              0.6017003 = queryWeight, product of:
                5.268793 = boost
                6.3639297 = idf(docFreq=207, maxDocs=44421)
                0.01794501 = queryNorm
              0.3977456 = fieldWeight in 4754, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3639297 = idf(docFreq=207, maxDocs=44421)
                0.0625 = fieldNorm(doc=4754)
        0.28 = coord(7/25)
    
  2. Ponte, J.M.: Language models for relevance feedback (2000) 0.21
    0.21167617 = sum of:
      0.21167617 = product of:
        0.58798933 = sum of:
          0.008213779 = weight(abstract_txt:information in 1035) [ClassicSimilarity], result of:
            0.008213779 = score(doc=1035,freq=1.0), product of:
              0.043464545 = queryWeight, product of:
                1.0013217 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01794501 = queryNorm
              0.18897653 = fieldWeight in 1035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.078125 = fieldNorm(doc=1035)
          0.022278331 = weight(abstract_txt:will in 1035) [ClassicSimilarity], result of:
            0.022278331 = score(doc=1035,freq=1.0), product of:
              0.07384671 = queryWeight, product of:
                1.065678 = boost
                3.8615482 = idf(docFreq=2539, maxDocs=44421)
                0.01794501 = queryNorm
              0.30168346 = fieldWeight in 1035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8615482 = idf(docFreq=2539, maxDocs=44421)
                0.078125 = fieldNorm(doc=1035)
          0.02701258 = weight(abstract_txt:each in 1035) [ClassicSimilarity], result of:
            0.02701258 = score(doc=1035,freq=1.0), product of:
              0.08396921 = queryWeight, product of:
                1.1363717 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.01794501 = queryNorm
              0.32169622 = fieldWeight in 1035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.078125 = fieldNorm(doc=1035)
          0.056149375 = weight(abstract_txt:language in 1035) [ClassicSimilarity], result of:
            0.056149375 = score(doc=1035,freq=4.0), product of:
              0.08615609 = queryWeight, product of:
                1.1510744 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.01794501 = queryNorm
              0.6517168 = fieldWeight in 1035, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=1035)
          0.028205315 = weight(abstract_txt:need in 1035) [ClassicSimilarity], result of:
            0.028205315 = score(doc=1035,freq=1.0), product of:
              0.08642314 = queryWeight, product of:
                1.152857 = boost
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.01794501 = queryNorm
              0.326363 = fieldWeight in 1035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.078125 = fieldNorm(doc=1035)
          0.04332563 = weight(abstract_txt:document in 1035) [ClassicSimilarity], result of:
            0.04332563 = score(doc=1035,freq=2.0), product of:
              0.091319315 = queryWeight, product of:
                1.1850637 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.01794501 = queryNorm
              0.47444102 = fieldWeight in 1035, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=1035)
          0.051207054 = weight(abstract_txt:effectiveness in 1035) [ClassicSimilarity], result of:
            0.051207054 = score(doc=1035,freq=1.0), product of:
              0.12861627 = queryWeight, product of:
                1.4063984 = boost
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.01794501 = queryNorm
              0.39813823 = fieldWeight in 1035, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.078125 = fieldNorm(doc=1035)
          0.06350528 = weight(abstract_txt:model in 1035) [ClassicSimilarity], result of:
            0.06350528 = score(doc=1035,freq=3.0), product of:
              0.11783454 = queryWeight, product of:
                1.6487026 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.01794501 = queryNorm
              0.538936 = fieldWeight in 1035, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.078125 = fieldNorm(doc=1035)
          0.28809202 = weight(abstract_txt:query in 1035) [ClassicSimilarity], result of:
            0.28809202 = score(doc=1035,freq=3.0), product of:
              0.44779208 = queryWeight, product of:
                5.2484202 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.01794501 = queryNorm
              0.6433611 = fieldWeight in 1035, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=1035)
        0.36 = coord(9/25)
    
  3. Shvartzshnaider, Y.; Sanfilippo, M.R.; Apthorpe, N.: GKC-CI : a unifying framework for contextual norms and information governance (2022) 0.20
    0.19881706 = sum of:
      0.19881706 = product of:
        0.8284044 = sum of:
          0.017072016 = weight(abstract_txt:information in 1652) [ClassicSimilarity], result of:
            0.017072016 = score(doc=1652,freq=3.0), product of:
              0.043464545 = queryWeight, product of:
                1.0013217 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01794501 = queryNorm
              0.3927803 = fieldWeight in 1652, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.09375 = fieldNorm(doc=1652)
          0.03384638 = weight(abstract_txt:need in 1652) [ClassicSimilarity], result of:
            0.03384638 = score(doc=1652,freq=1.0), product of:
              0.08642314 = queryWeight, product of:
                1.152857 = boost
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.01794501 = queryNorm
              0.3916356 = fieldWeight in 1652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.09375 = fieldNorm(doc=1652)
          0.06638434 = weight(abstract_txt:influence in 1652) [ClassicSimilarity], result of:
            0.06638434 = score(doc=1652,freq=1.0), product of:
              0.13541465 = queryWeight, product of:
                1.4430894 = boost
                5.229121 = idf(docFreq=646, maxDocs=44421)
                0.01794501 = queryNorm
              0.4902301 = fieldWeight in 1652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.229121 = idf(docFreq=646, maxDocs=44421)
                0.09375 = fieldNorm(doc=1652)
          0.09053189 = weight(abstract_txt:factor in 1652) [ClassicSimilarity], result of:
            0.09053189 = score(doc=1652,freq=1.0), product of:
              0.1665289 = queryWeight, product of:
                1.600314 = boost
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.01794501 = queryNorm
              0.54364073 = fieldWeight in 1652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.09375 = fieldNorm(doc=1652)
          0.11288761 = weight(abstract_txt:factors in 1652) [ClassicSimilarity], result of:
            0.11288761 = score(doc=1652,freq=1.0), product of:
              0.24306892 = queryWeight, product of:
                2.734262 = boost
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.01794501 = queryNorm
              0.46442637 = fieldWeight in 1652, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.09375 = fieldNorm(doc=1652)
          0.50768214 = weight(abstract_txt:contextual in 1652) [ClassicSimilarity], result of:
            0.50768214 = score(doc=1652,freq=2.0), product of:
              0.6017003 = queryWeight, product of:
                5.268793 = boost
                6.3639297 = idf(docFreq=207, maxDocs=44421)
                0.01794501 = queryNorm
              0.8437458 = fieldWeight in 1652, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.3639297 = idf(docFreq=207, maxDocs=44421)
                0.09375 = fieldNorm(doc=1652)
        0.24 = coord(6/25)
    
  4. Hofmann, K.; Balog, K.; Bogers, T.; Rijke, M. de: Contextual factors for finding similar experts (2010) 0.19
    0.186895 = sum of:
      0.186895 = product of:
        0.7787292 = sum of:
          0.021445092 = weight(abstract_txt:document in 443) [ClassicSimilarity], result of:
            0.021445092 = score(doc=443,freq=1.0), product of:
              0.091319315 = queryWeight, product of:
                1.1850637 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.01794501 = queryNorm
              0.23483633 = fieldWeight in 443, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0546875 = fieldNorm(doc=443)
          0.028880186 = weight(abstract_txt:same in 443) [ClassicSimilarity], result of:
            0.028880186 = score(doc=443,freq=1.0), product of:
              0.1113638 = queryWeight, product of:
                1.3086768 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.01794501 = queryNorm
              0.2593319 = fieldWeight in 443, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0546875 = fieldNorm(doc=443)
          0.054764282 = weight(abstract_txt:influence in 443) [ClassicSimilarity], result of:
            0.054764282 = score(doc=443,freq=2.0), product of:
              0.13541465 = queryWeight, product of:
                1.4430894 = boost
                5.229121 = idf(docFreq=646, maxDocs=44421)
                0.01794501 = queryNorm
              0.40441918 = fieldWeight in 443, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.229121 = idf(docFreq=646, maxDocs=44421)
                0.0546875 = fieldNorm(doc=443)
          0.06856821 = weight(abstract_txt:integrate in 443) [ClassicSimilarity], result of:
            0.06856821 = score(doc=443,freq=1.0), product of:
              0.19819494 = queryWeight, product of:
                1.745849 = boost
                6.326189 = idf(docFreq=215, maxDocs=44421)
                0.01794501 = queryNorm
              0.34596348 = fieldWeight in 443, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.326189 = idf(docFreq=215, maxDocs=44421)
                0.0546875 = fieldNorm(doc=443)
          0.18625507 = weight(abstract_txt:factors in 443) [ClassicSimilarity], result of:
            0.18625507 = score(doc=443,freq=8.0), product of:
              0.24306892 = queryWeight, product of:
                2.734262 = boost
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.01794501 = queryNorm
              0.76626444 = fieldWeight in 443, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.0546875 = fieldNorm(doc=443)
          0.4188164 = weight(abstract_txt:contextual in 443) [ClassicSimilarity], result of:
            0.4188164 = score(doc=443,freq=4.0), product of:
              0.6017003 = queryWeight, product of:
                5.268793 = boost
                6.3639297 = idf(docFreq=207, maxDocs=44421)
                0.01794501 = queryNorm
              0.6960548 = fieldWeight in 443, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.3639297 = idf(docFreq=207, maxDocs=44421)
                0.0546875 = fieldNorm(doc=443)
        0.24 = coord(6/25)
    
  5. Liu, J.; Belkin, N.J.: Personalizing information retrieval for multi-session tasks : examining the roles of task stage, task type, and topic knowledge on the interpretation of dwell time as an indicator of document usefulness (2015) 0.18
    0.18317641 = sum of:
      0.18317641 = product of:
        0.7632351 = sum of:
          0.011381345 = weight(abstract_txt:information in 2608) [ClassicSimilarity], result of:
            0.011381345 = score(doc=2608,freq=3.0), product of:
              0.043464545 = queryWeight, product of:
                1.0013217 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01794501 = queryNorm
              0.26185355 = fieldWeight in 2608, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=2608)
          0.021610063 = weight(abstract_txt:each in 2608) [ClassicSimilarity], result of:
            0.021610063 = score(doc=2608,freq=1.0), product of:
              0.08396921 = queryWeight, product of:
                1.1363717 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.01794501 = queryNorm
              0.25735697 = fieldWeight in 2608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.0625 = fieldNorm(doc=2608)
          0.04245027 = weight(abstract_txt:document in 2608) [ClassicSimilarity], result of:
            0.04245027 = score(doc=2608,freq=3.0), product of:
              0.091319315 = queryWeight, product of:
                1.1850637 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.01794501 = queryNorm
              0.46485534 = fieldWeight in 2608, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=2608)
          0.05862926 = weight(abstract_txt:contexts in 2608) [ClassicSimilarity], result of:
            0.05862926 = score(doc=2608,freq=1.0), product of:
              0.16333991 = queryWeight, product of:
                1.5849172 = boost
                5.743043 = idf(docFreq=386, maxDocs=44421)
                0.01794501 = queryNorm
              0.35894018 = fieldWeight in 2608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.743043 = idf(docFreq=386, maxDocs=44421)
                0.0625 = fieldNorm(doc=2608)
          0.15051682 = weight(abstract_txt:factors in 2608) [ClassicSimilarity], result of:
            0.15051682 = score(doc=2608,freq=4.0), product of:
              0.24306892 = queryWeight, product of:
                2.734262 = boost
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.01794501 = queryNorm
              0.61923516 = fieldWeight in 2608, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.0625 = fieldNorm(doc=2608)
          0.47864732 = weight(abstract_txt:contextual in 2608) [ClassicSimilarity], result of:
            0.47864732 = score(doc=2608,freq=4.0), product of:
              0.6017003 = queryWeight, product of:
                5.268793 = boost
                6.3639297 = idf(docFreq=207, maxDocs=44421)
                0.01794501 = queryNorm
              0.7954912 = fieldWeight in 2608, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.3639297 = idf(docFreq=207, maxDocs=44421)
                0.0625 = fieldNorm(doc=2608)
        0.24 = coord(6/25)
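
How the scores above are composed can be checked with a short calculation. The sketch below is not part of the record; it assumes the classic Lucene TF-IDF formulas that the listing appears to follow (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), term score = queryWeight * fieldWeight, total = coord * sum of term scores) and recomputes the score of the first similar document (doc=4754). The function and variable names are illustrative only; boost, queryNorm and fieldNorm are copied from the listing rather than derived.

```python
import math

# Constants copied from the explain listing above (not derived here).
MAX_DOCS = 44421
QUERY_NORM = 0.01794501
FIELD_NORM = 0.0625  # fieldNorm(doc=4754)


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency, assuming the classic TF-IDF form."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    """Term frequency factor: square root of the raw term count."""
    return math.sqrt(freq)


def term_score(boost, doc_freq, freq, field_norm=FIELD_NORM):
    """Score of one matching term: queryWeight * fieldWeight."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = tf(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


# (term, boost, docFreq, termFreq) for doc 4754, as shown in the listing.
TERMS = [
    ("each", 1.1363717, 1965, 1.0),
    ("same", 1.3086768, 1052, 1.0),
    ("influence", 1.4430894, 646, 2.0),
    ("model", 1.6487026, 2249, 2.0),
    ("factors", 2.734262, 851, 3.0),
    ("query", 5.2484202, 1039, 9.0),
    ("contextual", 5.268793, 207, 1.0),
]


def document_score(terms, matched, total):
    """Sum of term scores scaled by coord = matched / total query terms."""
    coord = matched / total
    return coord * sum(term_score(b, df, f) for _, b, df, f in terms)


score = document_score(TERMS, matched=7, total=25)
print(round(score, 2))  # 0.26, the rounded score shown for entry 1
```

The listing's values come from single-precision arithmetic, so a recomputation in double precision agrees with them only to about six significant digits.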