Document (#42762)

Editor
Belkin, N.J.
Author
Liu, J.
Liu, C.
Title
Personalization in text information retrieval : a survey
Source
Journal of the Association for Information Science and Technology. 71(2020) no.3, S.349-369
Year
2020
Series
AIS review
Abstract
Personalization of information retrieval (PIR) is aimed at tailoring a search toward individual users and user groups by taking account of additional information about users besides their queries. In the past two decades or so, PIR has received extensive attention in both academia and industry. This article surveys the literature of personalization in text retrieval, following a framework for aspects or factors that can be used for personalization. The framework consists of additional information about users that can be explicitly obtained by asking users for their preferences, or implicitly inferred from users' search behaviors. Users' characteristics and contextual factors such as tasks, time, location, etc., can be helpful for personalization. This article also addresses various issues including when to personalize, the evaluation of PIR, privacy, usability, etc. Based on the extensive review, challenges are discussed and directions for future effort are suggested.
Content
https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24234.
Theme
Retrievalalgorithmen

Similar documents (content)

  1. Liu, J.; Belkin, N.J.: Personalizing information retrieval for multi-session tasks : examining the roles of task stage, task type, and topic knowledge on the interpretation of dwell time as an indicator of document usefulness (2015) 0.27
    0.26873615 = sum of:
      0.26873615 = product of:
        0.83980054 = sum of:
          0.080245785 = weight(abstract_txt:contextual in 2608) [ClassicSimilarity], result of:
            0.080245785 = score(doc=2608,freq=4.0), product of:
              0.100875765 = queryWeight, product of:
                6.3639297 = idf(docFreq=207, maxDocs=44421)
                0.015851175 = queryNorm
              0.7954912 = fieldWeight in 2608, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.3639297 = idf(docFreq=207, maxDocs=44421)
                0.0625 = fieldNorm(doc=2608)
          0.015197248 = weight(abstract_txt:search in 2608) [ClassicSimilarity], result of:
            0.015197248 = score(doc=2608,freq=1.0), product of:
              0.06653426 = queryWeight, product of:
                1.148535 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.015851175 = queryNorm
              0.22841237 = fieldWeight in 2608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=2608)
          0.018693468 = weight(abstract_txt:about in 2608) [ClassicSimilarity], result of:
            0.018693468 = score(doc=2608,freq=1.0), product of:
              0.07638275 = queryWeight, product of:
                1.2306066 = boost
                3.9157467 = idf(docFreq=2405, maxDocs=44421)
                0.015851175 = queryNorm
              0.24473417 = fieldWeight in 2608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9157467 = idf(docFreq=2405, maxDocs=44421)
                0.0625 = fieldNorm(doc=2608)
          0.015264768 = weight(abstract_txt:information in 2608) [ClassicSimilarity], result of:
            0.015264768 = score(doc=2608,freq=3.0), product of:
              0.05829506 = queryWeight, product of:
                1.5203811 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.015851175 = queryNorm
              0.26185355 = fieldWeight in 2608, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=2608)
          0.075702965 = weight(abstract_txt:factors in 2608) [ClassicSimilarity], result of:
            0.075702965 = score(doc=2608,freq=4.0), product of:
              0.12225237 = queryWeight, product of:
                1.5568624 = boost
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.015851175 = queryNorm
              0.61923516 = fieldWeight in 2608, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.9538813 = idf(docFreq=851, maxDocs=44421)
                0.0625 = fieldNorm(doc=2608)
          0.01962294 = weight(abstract_txt:retrieval in 2608) [ClassicSimilarity], result of:
            0.01962294 = score(doc=2608,freq=1.0), product of:
              0.090311244 = queryWeight, product of:
                1.6388459 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015851175 = queryNorm
              0.21728125 = fieldWeight in 2608, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=2608)
          0.09481322 = weight(abstract_txt:users in 2608) [ClassicSimilarity], result of:
            0.09481322 = score(doc=2608,freq=5.0), product of:
              0.19018008 = queryWeight, product of:
                3.3632932 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.015851175 = queryNorm
              0.49854442 = fieldWeight in 2608, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.0625 = fieldNorm(doc=2608)
          0.52026016 = weight(abstract_txt:personalization in 2608) [ClassicSimilarity], result of:
            0.52026016 = score(doc=2608,freq=2.0), product of:
              0.7556472 = queryWeight, product of:
                6.119996 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.015851175 = queryNorm
              0.6884961 = fieldWeight in 2608, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.0625 = fieldNorm(doc=2608)
        0.32 = coord(8/25)
    
  2. Eskens, S.: ¬The personal information sphere : an integral approach to privacy and related information and communication rights (2020) 0.22
    0.2163405 = sum of:
      0.2163405 = product of:
        1.0817025 = sum of:
          0.096172854 = weight(abstract_txt:privacy in 941) [ClassicSimilarity], result of:
            0.096172854 = score(doc=941,freq=4.0), product of:
              0.11381697 = queryWeight, product of:
                1.0622092 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.015851175 = queryNorm
              0.84497815 = fieldWeight in 941, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.0625 = fieldNorm(doc=941)
          0.018693468 = weight(abstract_txt:about in 941) [ClassicSimilarity], result of:
            0.018693468 = score(doc=941,freq=1.0), product of:
              0.07638275 = queryWeight, product of:
                1.2306066 = boost
                3.9157467 = idf(docFreq=2405, maxDocs=44421)
                0.015851175 = queryNorm
              0.24473417 = fieldWeight in 941, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9157467 = idf(docFreq=2405, maxDocs=44421)
                0.0625 = fieldNorm(doc=941)
          0.023317318 = weight(abstract_txt:information in 941) [ClassicSimilarity], result of:
            0.023317318 = score(doc=941,freq=7.0), product of:
              0.05829506 = queryWeight, product of:
                1.5203811 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.015851175 = queryNorm
              0.3999879 = fieldWeight in 941, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=941)
          0.042401757 = weight(abstract_txt:users in 941) [ClassicSimilarity], result of:
            0.042401757 = score(doc=941,freq=1.0), product of:
              0.19018008 = queryWeight, product of:
                3.3632932 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.015851175 = queryNorm
              0.22295584 = fieldWeight in 941, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.0625 = fieldNorm(doc=941)
          0.9011171 = weight(abstract_txt:personalization in 941) [ClassicSimilarity], result of:
            0.9011171 = score(doc=941,freq=6.0), product of:
              0.7556472 = queryWeight, product of:
                6.119996 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.015851175 = queryNorm
              1.1925104 = fieldWeight in 941, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.0625 = fieldNorm(doc=941)
        0.2 = coord(5/25)
    
  3. Wang, J.; Clements, M.; Yang, J.; Vries, A.P. de; Reinders, M.J.T.: Personalization of tagging systems (2010) 0.21
    0.20675404 = sum of:
      0.20675404 = product of:
        0.8614752 = sum of:
          0.056141056 = weight(abstract_txt:preferences in 229) [ClassicSimilarity], result of:
            0.056141056 = score(doc=229,freq=1.0), product of:
              0.10875245 = queryWeight, product of:
                1.0383078 = boost
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.015851175 = queryNorm
              0.51622796 = fieldWeight in 229, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.078125 = fieldNorm(doc=229)
          0.01899656 = weight(abstract_txt:search in 229) [ClassicSimilarity], result of:
            0.01899656 = score(doc=229,freq=1.0), product of:
              0.06653426 = queryWeight, product of:
                1.148535 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.015851175 = queryNorm
              0.28551546 = fieldWeight in 229, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.078125 = fieldNorm(doc=229)
          0.036527313 = weight(abstract_txt:framework in 229) [ClassicSimilarity], result of:
            0.036527313 = score(doc=229,freq=1.0), product of:
              0.10288226 = queryWeight, product of:
                1.4282092 = boost
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.015851175 = queryNorm
              0.35503995 = fieldWeight in 229, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.078125 = fieldNorm(doc=229)
          0.024528675 = weight(abstract_txt:retrieval in 229) [ClassicSimilarity], result of:
            0.024528675 = score(doc=229,freq=1.0), product of:
              0.090311244 = queryWeight, product of:
                1.6388459 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015851175 = queryNorm
              0.27160156 = fieldWeight in 229, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=229)
          0.074956425 = weight(abstract_txt:users in 229) [ClassicSimilarity], result of:
            0.074956425 = score(doc=229,freq=2.0), product of:
              0.19018008 = queryWeight, product of:
                3.3632932 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.015851175 = queryNorm
              0.39413396 = fieldWeight in 229, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.078125 = fieldNorm(doc=229)
          0.6503252 = weight(abstract_txt:personalization in 229) [ClassicSimilarity], result of:
            0.6503252 = score(doc=229,freq=2.0), product of:
              0.7556472 = queryWeight, product of:
                6.119996 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.015851175 = queryNorm
              0.86062014 = fieldWeight in 229, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.078125 = fieldNorm(doc=229)
        0.24 = coord(6/25)
    
  4. Leginus, M.; Zhai, C.X.; Dolog, P.: Personalized generation of word clouds from tweets (2016) 0.20
    0.20123433 = sum of:
      0.20123433 = product of:
        1.0061716 = sum of:
          0.056141056 = weight(abstract_txt:preferences in 3886) [ClassicSimilarity], result of:
            0.056141056 = score(doc=3886,freq=1.0), product of:
              0.10875245 = queryWeight, product of:
                1.0383078 = boost
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.015851175 = queryNorm
              0.51622796 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.078125 = fieldNorm(doc=3886)
          0.036527313 = weight(abstract_txt:framework in 3886) [ClassicSimilarity], result of:
            0.036527313 = score(doc=3886,freq=1.0), product of:
              0.10288226 = queryWeight, product of:
                1.4282092 = boost
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.015851175 = queryNorm
              0.35503995 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.078125 = fieldNorm(doc=3886)
          0.011016398 = weight(abstract_txt:information in 3886) [ClassicSimilarity], result of:
            0.011016398 = score(doc=3886,freq=1.0), product of:
              0.05829506 = queryWeight, product of:
                1.5203811 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.015851175 = queryNorm
              0.18897653 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.078125 = fieldNorm(doc=3886)
          0.1060044 = weight(abstract_txt:users in 3886) [ClassicSimilarity], result of:
            0.1060044 = score(doc=3886,freq=4.0), product of:
              0.19018008 = queryWeight, product of:
                3.3632932 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.015851175 = queryNorm
              0.5573896 = fieldWeight in 3886, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.078125 = fieldNorm(doc=3886)
          0.7964824 = weight(abstract_txt:personalization in 3886) [ClassicSimilarity], result of:
            0.7964824 = score(doc=3886,freq=3.0), product of:
              0.7556472 = queryWeight, product of:
                6.119996 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.015851175 = queryNorm
              1.0540401 = fieldWeight in 3886, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.078125 = fieldNorm(doc=3886)
        0.2 = coord(5/25)
    
  5. Kelly, D.: Implicit feedback : using behavior to infer relevance (2005) 0.20
    0.20018515 = sum of:
      0.20018515 = product of:
        0.55606985 = sum of:
          0.034261458 = weight(abstract_txt:helpful in 770) [ClassicSimilarity], result of:
            0.034261458 = score(doc=770,freq=1.0), product of:
              0.10999047 = queryWeight, product of:
                1.044201 = boost
                6.6452217 = idf(docFreq=156, maxDocs=44421)
                0.015851175 = queryNorm
              0.31149477 = fieldWeight in 770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6452217 = idf(docFreq=156, maxDocs=44421)
                0.046875 = fieldNorm(doc=770)
          0.038729057 = weight(abstract_txt:explicitly in 770) [ClassicSimilarity], result of:
            0.038729057 = score(doc=770,freq=1.0), product of:
              0.11935551 = queryWeight, product of:
                1.0877467 = boost
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.015851175 = queryNorm
              0.32448488 = fieldWeight in 770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.046875 = fieldNorm(doc=770)
          0.011397935 = weight(abstract_txt:search in 770) [ClassicSimilarity], result of:
            0.011397935 = score(doc=770,freq=1.0), product of:
              0.06653426 = queryWeight, product of:
                1.148535 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.015851175 = queryNorm
              0.17130928 = fieldWeight in 770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.046875 = fieldNorm(doc=770)
          0.014020101 = weight(abstract_txt:about in 770) [ClassicSimilarity], result of:
            0.014020101 = score(doc=770,freq=1.0), product of:
              0.07638275 = queryWeight, product of:
                1.2306066 = boost
                3.9157467 = idf(docFreq=2405, maxDocs=44421)
                0.015851175 = queryNorm
              0.18355063 = fieldWeight in 770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9157467 = idf(docFreq=2405, maxDocs=44421)
                0.046875 = fieldNorm(doc=770)
          0.011448576 = weight(abstract_txt:information in 770) [ClassicSimilarity], result of:
            0.011448576 = score(doc=770,freq=3.0), product of:
              0.05829506 = queryWeight, product of:
                1.5203811 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.015851175 = queryNorm
              0.19639015 = fieldWeight in 770, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.046875 = fieldNorm(doc=770)
          0.020813271 = weight(abstract_txt:retrieval in 770) [ClassicSimilarity], result of:
            0.020813271 = score(doc=770,freq=2.0), product of:
              0.090311244 = queryWeight, product of:
                1.6388459 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.015851175 = queryNorm
              0.23046157 = fieldWeight in 770, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.046875 = fieldNorm(doc=770)
          0.059542123 = weight(abstract_txt:additional in 770) [ClassicSimilarity], result of:
            0.059542123 = score(doc=770,freq=2.0), product of:
              0.15898912 = queryWeight, product of:
                1.775437 = boost
                5.6493783 = idf(docFreq=424, maxDocs=44421)
                0.015851175 = queryNorm
              0.3745044 = fieldWeight in 770, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6493783 = idf(docFreq=424, maxDocs=44421)
                0.046875 = fieldNorm(doc=770)
          0.089947715 = weight(abstract_txt:users in 770) [ClassicSimilarity], result of:
            0.089947715 = score(doc=770,freq=8.0), product of:
              0.19018008 = queryWeight, product of:
                3.3632932 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.015851175 = queryNorm
              0.47296077 = fieldWeight in 770, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.046875 = fieldNorm(doc=770)
          0.27590963 = weight(abstract_txt:personalization in 770) [ClassicSimilarity], result of:
            0.27590963 = score(doc=770,freq=1.0), product of:
              0.7556472 = queryWeight, product of:
                6.119996 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.015851175 = queryNorm
              0.36513022 = fieldWeight in 770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.046875 = fieldNorm(doc=770)
        0.36 = coord(9/25)