Document (#22857)

Author
Watters, C.
Wang, H.
Title
Rating new documents for similarity
Source
Journal of the American Society for Information Science. 51(2000) no.9, S.793-804
Year
2000
Abstract
Electronic news has long held the promise of personalized and dynamic delivery of current event new items, particularly for Web users. Although wlwctronic versions of print news are now widely available, the personalization of that delivery has not yet been accomplished. In this paper, we present a methodology of associating news documents based on the extraction of feature phrases, where feature phrases identify dates, locations, people and organizations. A news representation is created from these feature phrases to define news objects that can then be compared and ranked to find related news items. Unlike tradtional information retrieval, we are much more interested in precision than recall. That is, the user would like to see one or more specifically related articles, rather than all somewhat related articles. The algorithm is designed to work interactively the the user using regular web browsers as the interface
Theme
Internet
Form
Zeitungen

Similar documents (author)

  1. Watters, C.: Extending the multimedia class hierarchy for hypermedia applications (1996) 2.48
    2.484996 = sum of:
      2.484996 = product of:
        4.969992 = sum of:
          4.969992 = weight(author_txt:watters in 605) [ClassicSimilarity], result of:
            4.969992 = score(doc=605,freq=1.0), product of:
              0.8842009 = queryWeight, product of:
                1.3758384 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0714593 = queryNorm
              5.620886 = fieldWeight in 605, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.625 = fieldNorm(doc=605)
        0.5 = coord(1/2)
    
  2. Watters, C.: Information retrieval and the virtual document (1999) 2.48
    2.484996 = sum of:
      2.484996 = product of:
        4.969992 = sum of:
          4.969992 = weight(author_txt:watters in 5319) [ClassicSimilarity], result of:
            4.969992 = score(doc=5319,freq=1.0), product of:
              0.8842009 = queryWeight, product of:
                1.3758384 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0714593 = queryNorm
              5.620886 = fieldWeight in 5319, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.625 = fieldNorm(doc=5319)
        0.5 = coord(1/2)
    
  3. Watters, C.; Shepherd, M.A.: Shifting the information paradigm from data-centered to user-centered (1994) 1.99
    1.9879969 = sum of:
      1.9879969 = product of:
        3.9759939 = sum of:
          3.9759939 = weight(author_txt:watters in 7289) [ClassicSimilarity], result of:
            3.9759939 = score(doc=7289,freq=1.0), product of:
              0.8842009 = queryWeight, product of:
                1.3758384 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0714593 = queryNorm
              4.496709 = fieldWeight in 7289, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.5 = fieldNorm(doc=7289)
        0.5 = coord(1/2)
    
  4. Carrick, C.; Watters, C.: Automatic association of news items (1997) 1.99
    1.9879969 = sum of:
      1.9879969 = product of:
        3.9759939 = sum of:
          3.9759939 = weight(author_txt:watters in 2549) [ClassicSimilarity], result of:
            3.9759939 = score(doc=2549,freq=1.0), product of:
              0.8842009 = queryWeight, product of:
                1.3758384 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0714593 = queryNorm
              4.496709 = fieldWeight in 2549, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.5 = fieldNorm(doc=2549)
        0.5 = coord(1/2)
    
  5. Watters, C.; Amoudi, A.: Geosearcher : location-based ranking of search engine results (2003) 1.99
    1.9879969 = sum of:
      1.9879969 = product of:
        3.9759939 = sum of:
          3.9759939 = weight(author_txt:watters in 152) [ClassicSimilarity], result of:
            3.9759939 = score(doc=152,freq=1.0), product of:
              0.8842009 = queryWeight, product of:
                1.3758384 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0714593 = queryNorm
              4.496709 = fieldWeight in 152, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.5 = fieldNorm(doc=152)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Sela, M.; Lavie, T.; Inbar, O.; Oppenheim, I.; Meyer, J.: Personalizing news content : an experimental study (2015) 0.33
    0.32903466 = sum of:
      0.32903466 = product of:
        1.1751238 = sum of:
          0.00933464 = weight(abstract_txt:that in 2604) [ClassicSimilarity], result of:
            0.00933464 = score(doc=2604,freq=2.0), product of:
              0.04465636 = queryWeight, product of:
                1.0147123 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.018608918 = queryNorm
              0.20903271 = fieldWeight in 2604, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2604)
          0.15079753 = weight(abstract_txt:personalized in 2604) [ClassicSimilarity], result of:
            0.15079753 = score(doc=2604,freq=6.0), product of:
              0.13719352 = queryWeight, product of:
                1.026851 = boost
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.018608918 = queryNorm
              1.0991594 = fieldWeight in 2604, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.0625 = fieldNorm(doc=2604)
          0.03318291 = weight(abstract_txt:user in 2604) [ClassicSimilarity], result of:
            0.03318291 = score(doc=2604,freq=4.0), product of:
              0.07211974 = queryWeight, product of:
                1.0528893 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.018608918 = queryNorm
              0.46010855 = fieldWeight in 2604, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.0625 = fieldNorm(doc=2604)
          0.07861819 = weight(abstract_txt:personalization in 2604) [ClassicSimilarity], result of:
            0.07861819 = score(doc=2604,freq=1.0), product of:
              0.16148663 = queryWeight, product of:
                1.1140609 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.018608918 = queryNorm
              0.48684028 = fieldWeight in 2604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.0625 = fieldNorm(doc=2604)
          0.12917599 = weight(abstract_txt:items in 2604) [ClassicSimilarity], result of:
            0.12917599 = score(doc=2604,freq=5.0), product of:
              0.16567706 = queryWeight, product of:
                1.5958308 = boost
                5.5789747 = idf(docFreq=455, maxDocs=44421)
                0.018608918 = queryNorm
              0.77968544 = fieldWeight in 2604, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.5789747 = idf(docFreq=455, maxDocs=44421)
                0.0625 = fieldNorm(doc=2604)
          0.07376267 = weight(abstract_txt:delivery in 2604) [ClassicSimilarity], result of:
            0.07376267 = score(doc=2604,freq=1.0), product of:
              0.19499446 = queryWeight, product of:
                1.731278 = boost
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.018608918 = queryNorm
              0.37828085 = fieldWeight in 2604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.0625 = fieldNorm(doc=2604)
          0.7002519 = weight(abstract_txt:news in 2604) [ClassicSimilarity], result of:
            0.7002519 = score(doc=2604,freq=11.0), product of:
              0.5669484 = queryWeight, product of:
                5.1131444 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.018608918 = queryNorm
              1.2351245 = fieldWeight in 2604, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.0625 = fieldNorm(doc=2604)
        0.28 = coord(7/25)
    
  2. Watters, C.R.; Shepherd, M.A.; Burkowski, F.J.: Electronic news delivery project (1998) 0.19
    0.19284743 = sum of:
      0.19284743 = product of:
        0.96423715 = sum of:
          0.006600587 = weight(abstract_txt:that in 1444) [ClassicSimilarity], result of:
            0.006600587 = score(doc=1444,freq=1.0), product of:
              0.04465636 = queryWeight, product of:
                1.0147123 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.018608918 = queryNorm
              0.14780845 = fieldWeight in 1444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
          0.061562836 = weight(abstract_txt:personalized in 1444) [ClassicSimilarity], result of:
            0.061562836 = score(doc=1444,freq=1.0), product of:
              0.13719352 = queryWeight, product of:
                1.026851 = boost
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.018608918 = queryNorm
              0.44872993 = fieldWeight in 1444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
          0.1277607 = weight(abstract_txt:delivery in 1444) [ClassicSimilarity], result of:
            0.1277607 = score(doc=1444,freq=3.0), product of:
              0.19499446 = queryWeight, product of:
                1.731278 = boost
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.018608918 = queryNorm
              0.6552016 = fieldWeight in 1444, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
          0.036923885 = weight(abstract_txt:related in 1444) [ClassicSimilarity], result of:
            0.036923885 = score(doc=1444,freq=1.0), product of:
              0.14072347 = queryWeight, product of:
                1.8012938 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.018608918 = queryNorm
              0.2623861 = fieldWeight in 1444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
          0.73138916 = weight(abstract_txt:news in 1444) [ClassicSimilarity], result of:
            0.73138916 = score(doc=1444,freq=12.0), product of:
              0.5669484 = queryWeight, product of:
                5.1131444 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.018608918 = queryNorm
              1.2900454 = fieldWeight in 1444, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
        0.2 = coord(5/25)
    
  3. Shapira, B.; Shoval, P.; Tractinsky, N.; Meyer, J.: ePaper : a personalized mobile newspaper (2009) 0.18
    0.18395346 = sum of:
      0.18395346 = product of:
        0.9197673 = sum of:
          0.076953545 = weight(abstract_txt:personalized in 155) [ClassicSimilarity], result of:
            0.076953545 = score(doc=155,freq=1.0), product of:
              0.13719352 = queryWeight, product of:
                1.026851 = boost
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.018608918 = queryNorm
              0.56091243 = fieldWeight in 155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.078125 = fieldNorm(doc=155)
          0.029329825 = weight(abstract_txt:user in 155) [ClassicSimilarity], result of:
            0.029329825 = score(doc=155,freq=2.0), product of:
              0.07211974 = queryWeight, product of:
                1.0528893 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.018608918 = queryNorm
              0.40668234 = fieldWeight in 155, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.078125 = fieldNorm(doc=155)
          0.09827275 = weight(abstract_txt:personalization in 155) [ClassicSimilarity], result of:
            0.09827275 = score(doc=155,freq=1.0), product of:
              0.16148663 = queryWeight, product of:
                1.1140609 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.018608918 = queryNorm
              0.60855037 = fieldWeight in 155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.078125 = fieldNorm(doc=155)
          0.1250741 = weight(abstract_txt:items in 155) [ClassicSimilarity], result of:
            0.1250741 = score(doc=155,freq=3.0), product of:
              0.16567706 = queryWeight, product of:
                1.5958308 = boost
                5.5789747 = idf(docFreq=455, maxDocs=44421)
                0.018608918 = queryNorm
              0.75492716 = fieldWeight in 155, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5789747 = idf(docFreq=455, maxDocs=44421)
                0.078125 = fieldNorm(doc=155)
          0.5901371 = weight(abstract_txt:news in 155) [ClassicSimilarity], result of:
            0.5901371 = score(doc=155,freq=5.0), product of:
              0.5669484 = queryWeight, product of:
                5.1131444 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.018608918 = queryNorm
              1.040901 = fieldWeight in 155, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.078125 = fieldNorm(doc=155)
        0.2 = coord(5/25)
    
  4. Carrick, C.; Watters, C.: Automatic association of news items (1997) 0.18
    0.18292731 = sum of:
      0.18292731 = product of:
        0.91463655 = sum of:
          0.100512765 = weight(abstract_txt:event in 2549) [ClassicSimilarity], result of:
            0.100512765 = score(doc=2549,freq=2.0), product of:
              0.1301124 = queryWeight, product of:
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.018608918 = queryNorm
              0.77250725 = fieldWeight in 2549, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.078125 = fieldNorm(doc=2549)
          0.0142906895 = weight(abstract_txt:that in 2549) [ClassicSimilarity], result of:
            0.0142906895 = score(doc=2549,freq=3.0), product of:
              0.04465636 = queryWeight, product of:
                1.0147123 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.018608918 = queryNorm
              0.32001466 = fieldWeight in 2549, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=2549)
          0.14442314 = weight(abstract_txt:items in 2549) [ClassicSimilarity], result of:
            0.14442314 = score(doc=2549,freq=4.0), product of:
              0.16567706 = queryWeight, product of:
                1.5958308 = boost
                5.5789747 = idf(docFreq=455, maxDocs=44421)
                0.018608918 = queryNorm
              0.87171483 = fieldWeight in 2549, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5789747 = idf(docFreq=455, maxDocs=44421)
                0.078125 = fieldNorm(doc=2549)
          0.06527282 = weight(abstract_txt:related in 2549) [ClassicSimilarity], result of:
            0.06527282 = score(doc=2549,freq=2.0), product of:
              0.14072347 = queryWeight, product of:
                1.8012938 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.018608918 = queryNorm
              0.4638375 = fieldWeight in 2549, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.078125 = fieldNorm(doc=2549)
          0.5901371 = weight(abstract_txt:news in 2549) [ClassicSimilarity], result of:
            0.5901371 = score(doc=2549,freq=5.0), product of:
              0.5669484 = queryWeight, product of:
                5.1131444 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.018608918 = queryNorm
              1.040901 = fieldWeight in 2549, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.078125 = fieldNorm(doc=2549)
        0.2 = coord(5/25)
    
  5. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.18
    0.17530583 = sum of:
      0.17530583 = product of:
        0.87652916 = sum of:
          0.15043373 = weight(abstract_txt:event in 782) [ClassicSimilarity], result of:
            0.15043373 = score(doc=782,freq=7.0), product of:
              0.1301124 = queryWeight, product of:
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.018608918 = queryNorm
              1.156183 = fieldWeight in 782, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.00933464 = weight(abstract_txt:that in 782) [ClassicSimilarity], result of:
            0.00933464 = score(doc=782,freq=2.0), product of:
              0.04465636 = queryWeight, product of:
                1.0147123 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.018608918 = queryNorm
              0.20903271 = fieldWeight in 782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.02346386 = weight(abstract_txt:user in 782) [ClassicSimilarity], result of:
            0.02346386 = score(doc=782,freq=2.0), product of:
              0.07211974 = queryWeight, product of:
                1.0528893 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.018608918 = queryNorm
              0.32534587 = fieldWeight in 782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.096120134 = weight(abstract_txt:articles in 782) [ClassicSimilarity], result of:
            0.096120134 = score(doc=782,freq=7.0), product of:
              0.121611536 = queryWeight, product of:
                1.3672346 = boost
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.018608918 = queryNorm
              0.7903867 = fieldWeight in 782, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.5971768 = weight(abstract_txt:news in 782) [ClassicSimilarity], result of:
            0.5971768 = score(doc=782,freq=8.0), product of:
              0.5669484 = queryWeight, product of:
                5.1131444 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.018608918 = queryNorm
              1.0533177 = fieldWeight in 782, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
        0.2 = coord(5/25)