Document (#38186)

Author
Arapakis, I.
Lalmas, M.
Ceylan, H.
Donmez, P.
Title
Automatically embedding newsworthy links to articles : from implementation to evaluation
Source
Journal of the Association for Information Science and Technology. 65(2014) no.1, pp. 129-145
Year
2014
Abstract
News portals are a popular destination for web users. News providers are therefore interested in attaining higher visitor rates and promoting greater engagement with their content. One aspect of engagement deals with keeping users on site longer by allowing them to have enhanced click-through experiences. News portals have invested in ways to embed links within news stories but so far these links have been curated by news editors. Given the manual effort involved, the use of such links is limited to a small scale. In this article, we evaluate a system-based approach that detects newsworthy events in a news article and locates other articles related to these events. Our system does not rely on resources like Wikipedia to identify events, and it was designed to be domain independent. A rigorous evaluation, using Amazon's Mechanical Turk, was performed to assess the system-embedded links against the manually-curated ones. Our findings reveal that our system's performance is comparable with that of professional editors, and that users find the automatically generated highlights interesting and the associated articles worthy of reading. Our evaluation also provides quantitative and qualitative insights into the curation of links, from the perspective of users and professional editors.
Content
Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.22959/abstract.

Similar documents (author)

  1. Lalmas, M.: Logical models in information retrieval : introduction and overview (1998) 5.30
    5.3016195 = sum of:
      5.3016195 = weight(author_txt:lalmas in 3668) [ClassicSimilarity], result of:
        5.3016195 = fieldWeight in 3668, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.482592 = idf(docFreq=24, maxDocs=44421)
          0.625 = fieldNorm(doc=3668)
    
  2. Lalmas, M.: XML information retrieval (2009) 5.30
    5.3016195 = sum of:
      5.3016195 = weight(author_txt:lalmas in 867) [ClassicSimilarity], result of:
        5.3016195 = fieldWeight in 867, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.482592 = idf(docFreq=24, maxDocs=44421)
          0.625 = fieldNorm(doc=867)
    
  3. Lalmas, M.: XML retrieval (2009) 5.30
    5.3016195 = sum of:
      5.3016195 = weight(author_txt:lalmas in 998) [ClassicSimilarity], result of:
        5.3016195 = fieldWeight in 998, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.482592 = idf(docFreq=24, maxDocs=44421)
          0.625 = fieldNorm(doc=998)
    
  4. Lalmas, M.; Ruthven, I.: A model for structured document retrieval : empirical investigations (1997) 4.24
    4.241296 = sum of:
      4.241296 = weight(author_txt:lalmas in 1727) [ClassicSimilarity], result of:
        4.241296 = fieldWeight in 1727, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.482592 = idf(docFreq=24, maxDocs=44421)
          0.5 = fieldNorm(doc=1727)
    
  5. Lalmas, M.; Ruthven, I.: Representing and retrieving structured documents using the Dempster-Shafer theory of evidence : modelling and evaluation (1998) 4.24
    4.241296 = sum of:
      4.241296 = weight(author_txt:lalmas in 2076) [ClassicSimilarity], result of:
        4.241296 = fieldWeight in 2076, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.482592 = idf(docFreq=24, maxDocs=44421)
          0.5 = fieldNorm(doc=2076)
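
The author-match scores above can be reproduced by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the values in the explain trees. The function names are illustrative and not part of any library API.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term count."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as it appears in the explain output."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the trees above."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Entry 1 (doc 3668): single author, fieldNorm 0.625
single_author = field_weight(1.0, 24, 44421, 0.625)
# Entry 4 (doc 1727): two authors, fieldNorm 0.5
two_authors = field_weight(1.0, 24, 44421, 0.5)

print(single_author, two_authors)
```

The two results should land close to the 5.3016195 and 4.241296 listed above, up to floating-point rounding. The only difference between the entries is fieldNorm, which is lower for records with more authors in the field.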
    

Similar documents (content)

  1. Lehmann, J.; Castillo, C.; Lalmas, M.; Baeza-Yates, R.: Story-focused reading in online news and its potential for user engagement (2017) 0.21
    0.21094638 = sum of:
      0.21094638 = product of:
        0.87894326 = sum of:
          0.016447185 = weight(abstract_txt:that in 4529) [ClassicSimilarity], result of:
            0.016447185 = score(doc=4529,freq=6.0), product of:
              0.045427274 = queryWeight, product of:
                1.1806098 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01627013 = queryNorm
              0.3620553 = fieldWeight in 4529, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=4529)
          0.12317456 = weight(abstract_txt:engagement in 4529) [ClassicSimilarity], result of:
            0.12317456 = score(doc=4529,freq=2.0), product of:
              0.1990521 = queryWeight, product of:
                1.747497 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.01627013 = queryNorm
              0.61880565 = fieldWeight in 4529, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0625 = fieldNorm(doc=4529)
          0.05152995 = weight(abstract_txt:users in 4529) [ClassicSimilarity], result of:
            0.05152995 = score(doc=4529,freq=5.0), product of:
              0.1033608 = queryWeight, product of:
                1.7808444 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.01627013 = queryNorm
              0.49854442 = fieldWeight in 4529, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.0625 = fieldNorm(doc=4529)
          0.072013155 = weight(abstract_txt:articles in 4529) [ClassicSimilarity], result of:
            0.072013155 = score(doc=4529,freq=3.0), product of:
              0.1391748 = queryWeight, product of:
                1.7896124 = boost
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.01627013 = queryNorm
              0.51742953 = fieldWeight in 4529, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.0625 = fieldNorm(doc=4529)
          0.18959007 = weight(abstract_txt:links in 4529) [ClassicSimilarity], result of:
            0.18959007 = score(doc=4529,freq=3.0), product of:
              0.33432826 = queryWeight, product of:
                3.922656 = boost
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.01627013 = queryNorm
              0.5670776 = fieldWeight in 4529, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.0625 = fieldNorm(doc=4529)
          0.42618832 = weight(abstract_txt:news in 4529) [ClassicSimilarity], result of:
            0.42618832 = score(doc=4529,freq=7.0), product of:
              0.43255183 = queryWeight, product of:
                4.4618273 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.01627013 = queryNorm
              0.98528844 = fieldWeight in 4529, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.0625 = fieldNorm(doc=4529)
        0.24 = coord(6/25)
    
  2. O'Brien, H.L.; Lebow, M.: Mixed-methods approach to measuring user experience in online news interactions (2013) 0.19
    0.18759151 = sum of:
      0.18759151 = product of:
        0.6699697 = sum of:
          0.01342907 = weight(abstract_txt:that in 2001) [ClassicSimilarity], result of:
            0.01342907 = score(doc=2001,freq=4.0), product of:
              0.045427274 = queryWeight, product of:
                1.1806098 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01627013 = queryNorm
              0.2956169 = fieldWeight in 2001, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2001)
          0.014607692 = weight(abstract_txt:system in 2001) [ClassicSimilarity], result of:
            0.014607692 = score(doc=2001,freq=1.0), product of:
              0.06929696 = queryWeight, product of:
                1.2628036 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.01627013 = queryNorm
              0.21079844 = fieldWeight in 2001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0625 = fieldNorm(doc=2001)
          0.03422524 = weight(abstract_txt:evaluation in 2001) [ClassicSimilarity], result of:
            0.03422524 = score(doc=2001,freq=1.0), product of:
              0.12224304 = queryWeight, product of:
                1.6772228 = boost
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.01627013 = queryNorm
              0.279977 = fieldWeight in 2001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.0625 = fieldNorm(doc=2001)
          0.15085742 = weight(abstract_txt:engagement in 2001) [ClassicSimilarity], result of:
            0.15085742 = score(doc=2001,freq=3.0), product of:
              0.1990521 = queryWeight, product of:
                1.747497 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.01627013 = queryNorm
              0.7578791 = fieldWeight in 2001, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0625 = fieldNorm(doc=2001)
          0.023044894 = weight(abstract_txt:users in 2001) [ClassicSimilarity], result of:
            0.023044894 = score(doc=2001,freq=1.0), product of:
              0.1033608 = queryWeight, product of:
                1.7808444 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.01627013 = queryNorm
              0.22295584 = fieldWeight in 2001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.0625 = fieldNorm(doc=2001)
          0.15479964 = weight(abstract_txt:links in 2001) [ClassicSimilarity], result of:
            0.15479964 = score(doc=2001,freq=2.0), product of:
              0.33432826 = queryWeight, product of:
                3.922656 = boost
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.01627013 = queryNorm
              0.4630169 = fieldWeight in 2001, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.0625 = fieldNorm(doc=2001)
          0.27900574 = weight(abstract_txt:news in 2001) [ClassicSimilarity], result of:
            0.27900574 = score(doc=2001,freq=3.0), product of:
              0.43255183 = queryWeight, product of:
                4.4618273 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.01627013 = queryNorm
              0.6450227 = fieldWeight in 2001, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.0625 = fieldNorm(doc=2001)
        0.28 = coord(7/25)
    
  3. Arapakis, I.; Lalmas, M.; Cambazoglu, B.B.; Marcos, M.-C.; Jose, J.M.: User engagement in online news : under the scope of sentiment, interest, affect, and gaze (2014) 0.19
    0.18652731 = sum of:
      0.18652731 = product of:
        0.7771971 = sum of:
          0.015014157 = weight(abstract_txt:that in 2497) [ClassicSimilarity], result of:
            0.015014157 = score(doc=2497,freq=5.0), product of:
              0.045427274 = queryWeight, product of:
                1.1806098 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01627013 = queryNorm
              0.33050975 = fieldWeight in 2497, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2497)
          0.15085742 = weight(abstract_txt:engagement in 2497) [ClassicSimilarity], result of:
            0.15085742 = score(doc=2497,freq=3.0), product of:
              0.1990521 = queryWeight, product of:
                1.747497 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.01627013 = queryNorm
              0.7578791 = fieldWeight in 2497, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0625 = fieldNorm(doc=2497)
          0.091089346 = weight(abstract_txt:portals in 2497) [ClassicSimilarity], result of:
            0.091089346 = score(doc=2497,freq=1.0), product of:
              0.2050884 = queryWeight, product of:
                1.7737957 = boost
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.01627013 = queryNorm
              0.44414672 = fieldWeight in 2497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.0625 = fieldNorm(doc=2497)
          0.023044894 = weight(abstract_txt:users in 2497) [ClassicSimilarity], result of:
            0.023044894 = score(doc=2497,freq=1.0), product of:
              0.1033608 = queryWeight, product of:
                1.7808444 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.01627013 = queryNorm
              0.22295584 = fieldWeight in 2497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.0625 = fieldNorm(doc=2497)
          0.041576814 = weight(abstract_txt:articles in 2497) [ClassicSimilarity], result of:
            0.041576814 = score(doc=2497,freq=1.0), product of:
              0.1391748 = queryWeight, product of:
                1.7896124 = boost
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.01627013 = queryNorm
              0.2987381 = fieldWeight in 2497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.0625 = fieldNorm(doc=2497)
          0.45561448 = weight(abstract_txt:news in 2497) [ClassicSimilarity], result of:
            0.45561448 = score(doc=2497,freq=8.0), product of:
              0.43255183 = queryWeight, product of:
                4.4618273 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.01627013 = queryNorm
              1.0533177 = fieldWeight in 2497, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.0625 = fieldNorm(doc=2497)
        0.24 = coord(6/25)
    
  4. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.18
    0.17569545 = sum of:
      0.17569545 = product of:
        0.73206437 = sum of:
          0.009495786 = weight(abstract_txt:that in 782) [ClassicSimilarity], result of:
            0.009495786 = score(doc=782,freq=2.0), product of:
              0.045427274 = queryWeight, product of:
                1.1806098 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01627013 = queryNorm
              0.20903271 = fieldWeight in 782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.014607692 = weight(abstract_txt:system in 782) [ClassicSimilarity], result of:
            0.014607692 = score(doc=782,freq=1.0), product of:
              0.06929696 = queryWeight, product of:
                1.2628036 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.01627013 = queryNorm
              0.21079844 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.048401795 = weight(abstract_txt:evaluation in 782) [ClassicSimilarity], result of:
            0.048401795 = score(doc=782,freq=2.0), product of:
              0.12224304 = queryWeight, product of:
                1.6772228 = boost
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.01627013 = queryNorm
              0.39594725 = fieldWeight in 782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.110001914 = weight(abstract_txt:articles in 782) [ClassicSimilarity], result of:
            0.110001914 = score(doc=782,freq=7.0), product of:
              0.1391748 = queryWeight, product of:
                1.7896124 = boost
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.01627013 = queryNorm
              0.7903867 = fieldWeight in 782, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.09394269 = weight(abstract_txt:events in 782) [ClassicSimilarity], result of:
            0.09394269 = score(doc=782,freq=1.0), product of:
              0.23964505 = queryWeight, product of:
                2.3483505 = boost
                6.272122 = idf(docFreq=227, maxDocs=44421)
                0.01627013 = queryNorm
              0.39200762 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.272122 = idf(docFreq=227, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.45561448 = weight(abstract_txt:news in 782) [ClassicSimilarity], result of:
            0.45561448 = score(doc=782,freq=8.0), product of:
              0.43255183 = queryWeight, product of:
                4.4618273 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.01627013 = queryNorm
              1.0533177 = fieldWeight in 782, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
        0.24 = coord(6/25)
    
  5. Watters, C.R.; Shepherd, M.A.; Burkowski, F.J.: Electronic news delivery project (1998) 0.17
    0.16855615 = sum of:
      0.16855615 = product of:
        0.84278077 = sum of:
          0.006714535 = weight(abstract_txt:that in 1444) [ClassicSimilarity], result of:
            0.006714535 = score(doc=1444,freq=1.0), product of:
              0.045427274 = queryWeight, product of:
                1.1806098 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01627013 = queryNorm
              0.14780845 = fieldWeight in 1444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
          0.012468671 = weight(abstract_txt:have in 1444) [ClassicSimilarity], result of:
            0.012468671 = score(doc=1444,freq=1.0), product of:
              0.062355284 = queryWeight, product of:
                1.1978855 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.01627013 = queryNorm
              0.19996175 = fieldWeight in 1444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
          0.13285503 = weight(abstract_txt:events in 1444) [ClassicSimilarity], result of:
            0.13285503 = score(doc=1444,freq=2.0), product of:
              0.23964505 = queryWeight, product of:
                2.3483505 = boost
                6.272122 = idf(docFreq=227, maxDocs=44421)
                0.01627013 = queryNorm
              0.5543825 = fieldWeight in 1444, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.272122 = idf(docFreq=227, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
          0.13273105 = weight(abstract_txt:editors in 1444) [ClassicSimilarity], result of:
            0.13273105 = score(doc=1444,freq=1.0), product of:
              0.30174598 = queryWeight, product of:
                2.6351142 = boost
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.01627013 = queryNorm
              0.4398768 = fieldWeight in 1444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
          0.5580115 = weight(abstract_txt:news in 1444) [ClassicSimilarity], result of:
            0.5580115 = score(doc=1444,freq=12.0), product of:
              0.43255183 = queryWeight, product of:
                4.4618273 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.01627013 = queryNorm
              1.2900454 = fieldWeight in 1444, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
        0.2 = coord(5/25)
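
The content-similarity scores combine a query weight, a field weight, and a coordination factor. As a rough sketch under the same ClassicSimilarity assumptions (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), the score of entry 5 (doc 1444) can be recomputed from the boosts, frequencies, and document frequencies listed in its tree. The helper names and the layout of the `terms` list are illustrative only.

```python
import math

MAX_DOCS = 44421          # maxDocs from the explain output
QUERY_NORM = 0.01627013   # queryNorm from the explain output
QUERY_TERMS = 25          # denominator of coord(5/25)


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, freq: float, doc_freq: int, field_norm: float) -> float:
    """score = queryWeight * fieldWeight for one matching term."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (boost, termFreq, docFreq) for the five terms matched in doc 1444
terms = [
    (1.1806098, 1.0, 11344),   # that
    (1.1978855, 1.0, 4924),    # have
    (2.3483505, 2.0, 227),     # events
    (2.6351142, 1.0, 105),     # editors
    (4.4618273, 12.0, 311),    # news
]

matched = sum(term_score(b, f, df, 0.0625) for b, f, df in terms)
coord = len(terms) / QUERY_TERMS   # fraction of query terms that matched
score = matched * coord

print(matched, score)
```

The sum over matching terms should come out near 0.84278077 and the final score near 0.16855615, matching the tree above up to rounding. The coord factor explains why a document with strong matches on only a few of the 25 query terms still receives a modest overall score.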