Document (#30186)

Author
Widyantoro, D.H.
Ioerger, T.R.
Yen, J.
Title
Learning user Interest dynamics with a three-descriptor representation
Source
Journal of the American Society for Information Science and technology. 52(2001) no.3, S.212-225
Year
2001
Abstract
The use of documents ranked high by user feedback to profile user interests is commonly done with Rocchio's `s algorithm which uses a single list of attribute value pairs called a descriptor to carry term value weights for an individual. Negative feed back on old preferences or positive feedback on new preferences adjusts the descriptor at a fixed, predetermined, and often slow pace. Widyantoro, et alia, suggest a three descriptor model which adds two short term interest descriptors, one each for positive and negative feedback. User short term interest in a particular document is computed by subtracting the similarity measure with the negative descriptor from the similarity measure with the positive descriptor. Using a constant to represent the desired impact of long and short term interests these values may be summed for a single interest value. Using the Reuters 21578 1.0 test collection split into training and test sets, topics with at least 100 documents in a tight cluster were chosen. The TDR handles change well showing better recovery speed and accuracy than the single descriptor model. The nearest neighbor update strategy appears to keep the category concept relatively consistent when multiple TDRs are used.
Theme
Retrievalalgorithmen
Object
Rocchio-Algorithmus

Similar documents (content)

  1. Díaz, A.; Gervás, P.: User-model based personalized summarization (2007) 0.16
    0.16021028 = sum of:
      0.16021028 = product of:
        0.44502854 = sum of:
          0.033943728 = weight(abstract_txt:measure in 1952) [ClassicSimilarity], result of:
            0.033943728 = score(doc=1952,freq=1.0), product of:
              0.099904895 = queryWeight, product of:
                1.2346443 = boost
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.014885114 = queryNorm
              0.3397604 = fieldWeight in 1952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.0625 = fieldNorm(doc=1952)
          0.008215386 = weight(abstract_txt:with in 1952) [ClassicSimilarity], result of:
            0.008215386 = score(doc=1952,freq=1.0), product of:
              0.052659784 = queryWeight, product of:
                1.4172877 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.014885114 = queryNorm
              0.15600874 = fieldWeight in 1952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=1952)
          0.055212338 = weight(abstract_txt:interests in 1952) [ClassicSimilarity], result of:
            0.055212338 = score(doc=1952,freq=1.0), product of:
              0.13817766 = queryWeight, product of:
                1.4520026 = boost
                6.3932 = idf(docFreq=201, maxDocs=44421)
                0.014885114 = queryNorm
              0.399575 = fieldWeight in 1952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3932 = idf(docFreq=201, maxDocs=44421)
                0.0625 = fieldNorm(doc=1952)
          0.0609587 = weight(abstract_txt:preferences in 1952) [ClassicSimilarity], result of:
            0.0609587 = score(doc=1952,freq=1.0), product of:
              0.14760606 = queryWeight, product of:
                1.5007231 = boost
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.014885114 = queryNorm
              0.41298234 = fieldWeight in 1952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.0625 = fieldNorm(doc=1952)
          0.055758506 = weight(abstract_txt:user in 1952) [ClassicSimilarity], result of:
            0.055758506 = score(doc=1952,freq=7.0), product of:
              0.09160767 = queryWeight, product of:
                1.6719736 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.014885114 = queryNorm
              0.60866636 = fieldWeight in 1952, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.0625 = fieldNorm(doc=1952)
          0.061800886 = weight(abstract_txt:short in 1952) [ClassicSimilarity], result of:
            0.061800886 = score(doc=1952,freq=1.0), product of:
              0.17051947 = queryWeight, product of:
                1.9755185 = boost
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.014885114 = queryNorm
              0.36242715 = fieldWeight in 1952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.0625 = fieldNorm(doc=1952)
          0.06640547 = weight(abstract_txt:feedback in 1952) [ClassicSimilarity], result of:
            0.06640547 = score(doc=1952,freq=1.0), product of:
              0.17888752 = queryWeight, product of:
                2.023411 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.014885114 = queryNorm
              0.37121353 = fieldWeight in 1952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.0625 = fieldNorm(doc=1952)
          0.046579964 = weight(abstract_txt:term in 1952) [ClassicSimilarity], result of:
            0.046579964 = score(doc=1952,freq=1.0), product of:
              0.15543775 = queryWeight, product of:
                2.1779191 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.014885114 = queryNorm
              0.29966956 = fieldWeight in 1952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0625 = fieldNorm(doc=1952)
          0.05615358 = weight(abstract_txt:interest in 1952) [ClassicSimilarity], result of:
            0.05615358 = score(doc=1952,freq=1.0), product of:
              0.17606595 = queryWeight, product of:
                2.3179345 = boost
                5.1029587 = idf(docFreq=733, maxDocs=44421)
                0.014885114 = queryNorm
              0.31893492 = fieldWeight in 1952, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1029587 = idf(docFreq=733, maxDocs=44421)
                0.0625 = fieldNorm(doc=1952)
        0.36 = coord(9/25)
    
  2. Spiteri, L.F.: Word association testing and thesaurus construction : a pilot study (2005) 0.15
    0.14936362 = sum of:
      0.14936362 = product of:
        0.7468181 = sum of:
          0.057588577 = weight(abstract_txt:test in 216) [ClassicSimilarity], result of:
            0.057588577 = score(doc=216,freq=2.0), product of:
              0.08607965 = queryWeight, product of:
                1.1460372 = boost
                5.046027 = idf(docFreq=776, maxDocs=44421)
                0.014885114 = queryNorm
              0.669015 = fieldWeight in 216, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.046027 = idf(docFreq=776, maxDocs=44421)
                0.09375 = fieldNorm(doc=216)
          0.01232308 = weight(abstract_txt:with in 216) [ClassicSimilarity], result of:
            0.01232308 = score(doc=216,freq=1.0), product of:
              0.052659784 = queryWeight, product of:
                1.4172877 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.014885114 = queryNorm
              0.23401311 = fieldWeight in 216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.09375 = fieldNorm(doc=216)
          0.031612106 = weight(abstract_txt:user in 216) [ClassicSimilarity], result of:
            0.031612106 = score(doc=216,freq=1.0), product of:
              0.09160767 = queryWeight, product of:
                1.6719736 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.014885114 = queryNorm
              0.34508142 = fieldWeight in 216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.09375 = fieldNorm(doc=216)
          0.12101829 = weight(abstract_txt:term in 216) [ClassicSimilarity], result of:
            0.12101829 = score(doc=216,freq=3.0), product of:
              0.15543775 = queryWeight, product of:
                2.1779191 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.014885114 = queryNorm
              0.77856433 = fieldWeight in 216, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.09375 = fieldNorm(doc=216)
          0.5242761 = weight(abstract_txt:descriptor in 216) [ClassicSimilarity], result of:
            0.5242761 = score(doc=216,freq=1.0), product of:
              0.7179303 = queryWeight, product of:
                6.191896 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.014885114 = queryNorm
              0.73026043 = fieldWeight in 216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.09375 = fieldNorm(doc=216)
        0.2 = coord(5/25)
    
  3. Chen, Z.; Meng, X.; Fowler, R.H.; Zhu, B.: Real-time adaptive feature and document learning for Web search (2001) 0.15
    0.1491074 = sum of:
      0.1491074 = product of:
        0.46596062 = sum of:
          0.07368274 = weight(abstract_txt:alia in 209) [ClassicSimilarity], result of:
            0.07368274 = score(doc=209,freq=1.0), product of:
              0.13293752 = queryWeight, product of:
                1.0070645 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.014885114 = queryNorm
              0.5542659 = fieldWeight in 209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.0625 = fieldNorm(doc=209)
          0.102809645 = weight(abstract_txt:summed in 209) [ClassicSimilarity], result of:
            0.102809645 = score(doc=209,freq=1.0), product of:
              0.16599423 = queryWeight, product of:
                1.1253302 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.014885114 = queryNorm
              0.61935675 = fieldWeight in 209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0625 = fieldNorm(doc=209)
          0.027147517 = weight(abstract_txt:test in 209) [ClassicSimilarity], result of:
            0.027147517 = score(doc=209,freq=1.0), product of:
              0.08607965 = queryWeight, product of:
                1.1460372 = boost
                5.046027 = idf(docFreq=776, maxDocs=44421)
                0.014885114 = queryNorm
              0.3153767 = fieldWeight in 209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.046027 = idf(docFreq=776, maxDocs=44421)
                0.0625 = fieldNorm(doc=209)
          0.008215386 = weight(abstract_txt:with in 209) [ClassicSimilarity], result of:
            0.008215386 = score(doc=209,freq=1.0), product of:
              0.052659784 = queryWeight, product of:
                1.4172877 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.014885114 = queryNorm
              0.15600874 = fieldWeight in 209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=209)
          0.04214947 = weight(abstract_txt:user in 209) [ClassicSimilarity], result of:
            0.04214947 = score(doc=209,freq=4.0), product of:
              0.09160767 = queryWeight, product of:
                1.6719736 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.014885114 = queryNorm
              0.46010855 = fieldWeight in 209, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.0625 = fieldNorm(doc=209)
          0.064871475 = weight(abstract_txt:positive in 209) [ClassicSimilarity], result of:
            0.064871475 = score(doc=209,freq=1.0), product of:
              0.17612189 = queryWeight, product of:
                2.007709 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.014885114 = queryNorm
              0.36833283 = fieldWeight in 209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.0625 = fieldNorm(doc=209)
          0.06640547 = weight(abstract_txt:feedback in 209) [ClassicSimilarity], result of:
            0.06640547 = score(doc=209,freq=1.0), product of:
              0.17888752 = queryWeight, product of:
                2.023411 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.014885114 = queryNorm
              0.37121353 = fieldWeight in 209, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.0625 = fieldNorm(doc=209)
          0.080678865 = weight(abstract_txt:term in 209) [ClassicSimilarity], result of:
            0.080678865 = score(doc=209,freq=3.0), product of:
              0.15543775 = queryWeight, product of:
                2.1779191 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.014885114 = queryNorm
              0.5190429 = fieldWeight in 209, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0625 = fieldNorm(doc=209)
        0.32 = coord(8/25)
    
  4. Fagni, T.; Sebastiani, F.: Selecting negative examples for hierarchical text classification: An experimental comparison (2010) 0.10
    0.10064791 = sum of:
      0.10064791 = product of:
        0.4193663 = sum of:
          0.025506387 = weight(abstract_txt:three in 101) [ClassicSimilarity], result of:
            0.025506387 = score(doc=101,freq=2.0), product of:
              0.06553949 = queryWeight, product of:
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.014885114 = queryNorm
              0.38917586 = fieldWeight in 101, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.0625 = fieldNorm(doc=101)
          0.102809645 = weight(abstract_txt:21578 in 101) [ClassicSimilarity], result of:
            0.102809645 = score(doc=101,freq=1.0), product of:
              0.16599423 = queryWeight, product of:
                1.1253302 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.014885114 = queryNorm
              0.61935675 = fieldWeight in 101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0625 = fieldNorm(doc=101)
          0.011618311 = weight(abstract_txt:with in 101) [ClassicSimilarity], result of:
            0.011618311 = score(doc=101,freq=2.0), product of:
              0.052659784 = queryWeight, product of:
                1.4172877 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.014885114 = queryNorm
              0.22062966 = fieldWeight in 101, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=101)
          0.064871475 = weight(abstract_txt:positive in 101) [ClassicSimilarity], result of:
            0.064871475 = score(doc=101,freq=1.0), product of:
              0.17612189 = queryWeight, product of:
                2.007709 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.014885114 = queryNorm
              0.36833283 = fieldWeight in 101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.0625 = fieldNorm(doc=101)
          0.15840688 = weight(abstract_txt:negative in 101) [ClassicSimilarity], result of:
            0.15840688 = score(doc=101,freq=4.0), product of:
              0.20119023 = queryWeight, product of:
                2.1458411 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.014885114 = queryNorm
              0.7873488 = fieldWeight in 101, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0625 = fieldNorm(doc=101)
          0.05615358 = weight(abstract_txt:interest in 101) [ClassicSimilarity], result of:
            0.05615358 = score(doc=101,freq=1.0), product of:
              0.17606595 = queryWeight, product of:
                2.3179345 = boost
                5.1029587 = idf(docFreq=733, maxDocs=44421)
                0.014885114 = queryNorm
              0.31893492 = fieldWeight in 101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1029587 = idf(docFreq=733, maxDocs=44421)
                0.0625 = fieldNorm(doc=101)
        0.24 = coord(6/25)
    
  5. Cho, H.; Donovan, A.; Lee, J.H.: Art in an algorithm : a taxonomy for describing video game visual styles (2018) 0.10
    0.096224606 = sum of:
      0.096224606 = product of:
        0.48112303 = sum of:
          0.008215386 = weight(abstract_txt:with in 218) [ClassicSimilarity], result of:
            0.008215386 = score(doc=218,freq=1.0), product of:
              0.052659784 = queryWeight, product of:
                1.4172877 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.014885114 = queryNorm
              0.15600874 = fieldWeight in 218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.037432473 = weight(abstract_txt:value in 218) [ClassicSimilarity], result of:
            0.037432473 = score(doc=218,freq=2.0), product of:
              0.09688722 = queryWeight, product of:
                1.489112 = boost
                4.3710623 = idf(docFreq=1525, maxDocs=44421)
                0.014885114 = queryNorm
              0.38635096 = fieldWeight in 218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3710623 = idf(docFreq=1525, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.029804176 = weight(abstract_txt:user in 218) [ClassicSimilarity], result of:
            0.029804176 = score(doc=218,freq=2.0), product of:
              0.09160767 = queryWeight, product of:
                1.6719736 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.014885114 = queryNorm
              0.32534587 = fieldWeight in 218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.05615358 = weight(abstract_txt:interest in 218) [ClassicSimilarity], result of:
            0.05615358 = score(doc=218,freq=1.0), product of:
              0.17606595 = queryWeight, product of:
                2.3179345 = boost
                5.1029587 = idf(docFreq=733, maxDocs=44421)
                0.014885114 = queryNorm
              0.31893492 = fieldWeight in 218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1029587 = idf(docFreq=733, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.3495174 = weight(abstract_txt:descriptor in 218) [ClassicSimilarity], result of:
            0.3495174 = score(doc=218,freq=1.0), product of:
              0.7179303 = queryWeight, product of:
                6.191896 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.014885114 = queryNorm
              0.48684028 = fieldWeight in 218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
        0.2 = coord(5/25)