Document (#42247)

Author
Chin, J.Y.
Bhowmick, S.S.
Jatowt, A.
Title
On-demand recent personal tweets summarization on mobile devices
Source
Journal of the Association for Information Science and Technology. 70(2019) no.6, S.547-562
Year
2019
Abstract
Tweets summarization aims to find a group of representative tweets for a specific set of input tweets or a given topic. In recent times, there have been several research efforts toward devising a variety of techniques to summarize tweets in Twitter. However, these techniques are either not personal (that is, consider only tweets in the timeline of a specific user) or are too expensive to be realized on a mobile device. Given that 80% of active Twitter users access the site on mobile devices, in this article we present a lightweight, personal, on-demand, topic modeling-based tweets summarization engine called TOTEM, designed for such devices. Specifically, TOTEM first preprocesses recent tweets in a user's timeline and exploits Latent Dirichlet Allocation-based topic modeling to assign each preprocessed tweet to a topic. Then it generates a ranked list of relevant tweets, a topic label, and a topic summary for each of the topics. Our experimental study with real-world data sets demonstrates the superiority of TOTEM.
Content
Vgl.: https://onlinelibrary.wiley.com/doi/10.1002/asi.24137.

Similar documents (author)

  1. Jatowt, A.; Yeung, C.M.A.; Tanaka, K.: Generic method for detecting focus time of documents (2015) 3.72
    3.7161405 = sum of:
      3.7161405 = weight(author_txt:jatowt in 3668) [ClassicSimilarity], result of:
        3.7161405 = fieldWeight in 3668, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.375 = fieldNorm(doc=3668)
    
  2. Joho, H.; Jatowt, A.; Blanco, R.: Temporal information searching behaviour and strategies (2015) 3.72
    3.7161405 = sum of:
      3.7161405 = weight(author_txt:jatowt in 3674) [ClassicSimilarity], result of:
        3.7161405 = fieldWeight in 3674, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.375 = fieldNorm(doc=3674)
    
  3. Lee, J.; Jatowt, A.; Kim, K.-S..: Discovering underlying sensations of human emotions based on social media (2021) 3.72
    3.7161405 = sum of:
      3.7161405 = weight(author_txt:jatowt in 1164) [ClassicSimilarity], result of:
        3.7161405 = fieldWeight in 1164, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.375 = fieldNorm(doc=1164)
    
  4. Zielinski, K.; Nielek, R.; Wierzbicki, A.; Jatowt, A.: Computing controversy : formal model and algorithms for detecting controversy on Wikipedia and in search queries (2018) 3.10
    3.0967836 = sum of:
      3.0967836 = weight(author_txt:jatowt in 93) [ClassicSimilarity], result of:
        3.0967836 = fieldWeight in 93, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.3125 = fieldNorm(doc=93)
    

Similar documents (content)

  1. Luo, Z.; Yu, Y.; Osborne, M.; Wang, T.: Structuring tweets for improving Twitter search (2015) 0.35
    0.35489577 = sum of:
      0.35489577 = product of:
        1.2674849 = sum of:
          0.012852332 = weight(abstract_txt:each in 3335) [ClassicSimilarity], result of:
            0.012852332 = score(doc=3335,freq=1.0), product of:
              0.049939707 = queryWeight, product of:
                1.107092 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.010954848 = queryNorm
              0.25735697 = fieldWeight in 3335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
          0.09871553 = weight(abstract_txt:tweet in 3335) [ClassicSimilarity], result of:
            0.09871553 = score(doc=3335,freq=3.0), product of:
              0.10698707 = queryWeight, product of:
                1.1458068 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.010954848 = queryNorm
              0.9226866 = fieldWeight in 3335, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
          0.014694657 = weight(abstract_txt:specific in 3335) [ClassicSimilarity], result of:
            0.014694657 = score(doc=3335,freq=1.0), product of:
              0.054604825 = queryWeight, product of:
                1.1576473 = boost
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.010954848 = queryNorm
              0.26910913 = fieldWeight in 3335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
          0.039715968 = weight(abstract_txt:modeling in 3335) [ClassicSimilarity], result of:
            0.039715968 = score(doc=3335,freq=1.0), product of:
              0.105950125 = queryWeight, product of:
                1.6125436 = boost
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.010954848 = queryNorm
              0.3748553 = fieldWeight in 3335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
          0.14849389 = weight(abstract_txt:twitter in 3335) [ClassicSimilarity], result of:
            0.14849389 = score(doc=3335,freq=6.0), product of:
              0.14045806 = queryWeight, product of:
                1.8566672 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.010954848 = queryNorm
              1.0572115 = fieldWeight in 3335, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
          0.0712827 = weight(abstract_txt:topic in 3335) [ClassicSimilarity], result of:
            0.0712827 = score(doc=3335,freq=1.0), product of:
              0.2256773 = queryWeight, product of:
                4.076292 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.010954848 = queryNorm
              0.3158612 = fieldWeight in 3335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
          0.8817298 = weight(abstract_txt:tweets in 3335) [ClassicSimilarity], result of:
            0.8817298 = score(doc=3335,freq=6.0), product of:
              0.7603884 = queryWeight, product of:
                9.163993 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.010954848 = queryNorm
              1.1595782 = fieldWeight in 3335, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
        0.28 = coord(7/25)
    
  2. Zheng, H.; Goh, D.H.-L.; Lee, E.W.J.; Lee, C.S.; Theng, Y.-L.: Understanding the effects of message cues on COVID-19 information sharing on Twitter (2022) 0.26
    0.26359656 = sum of:
      0.26359656 = product of:
        1.098319 = sum of:
          0.04025422 = weight(abstract_txt:allocation in 1565) [ClassicSimilarity], result of:
            0.04025422 = score(doc=1565,freq=1.0), product of:
              0.08485074 = queryWeight, product of:
                1.0204073 = boost
                7.590594 = idf(docFreq=60, maxDocs=44421)
                0.010954848 = queryNorm
              0.4744121 = fieldWeight in 1565, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.590594 = idf(docFreq=60, maxDocs=44421)
                0.0625 = fieldNorm(doc=1565)
          0.052012023 = weight(abstract_txt:dirichlet in 1565) [ClassicSimilarity], result of:
            0.052012023 = score(doc=1565,freq=1.0), product of:
              0.10065851 = queryWeight, product of:
                1.1114016 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.010954848 = queryNorm
              0.51671755 = fieldWeight in 1565, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0625 = fieldNorm(doc=1565)
          0.039715968 = weight(abstract_txt:modeling in 1565) [ClassicSimilarity], result of:
            0.039715968 = score(doc=1565,freq=1.0), product of:
              0.105950125 = queryWeight, product of:
                1.6125436 = boost
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.010954848 = queryNorm
              0.3748553 = fieldWeight in 1565, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.0625 = fieldNorm(doc=1565)
          0.060622375 = weight(abstract_txt:twitter in 1565) [ClassicSimilarity], result of:
            0.060622375 = score(doc=1565,freq=1.0), product of:
              0.14045806 = queryWeight, product of:
                1.8566672 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.010954848 = queryNorm
              0.4316048 = fieldWeight in 1565, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.0625 = fieldNorm(doc=1565)
          0.10080896 = weight(abstract_txt:topic in 1565) [ClassicSimilarity], result of:
            0.10080896 = score(doc=1565,freq=2.0), product of:
              0.2256773 = queryWeight, product of:
                4.076292 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.010954848 = queryNorm
              0.44669518 = fieldWeight in 1565, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=1565)
          0.8049055 = weight(abstract_txt:tweets in 1565) [ClassicSimilarity], result of:
            0.8049055 = score(doc=1565,freq=5.0), product of:
              0.7603884 = queryWeight, product of:
                9.163993 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.010954848 = queryNorm
              1.0585452 = fieldWeight in 1565, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0625 = fieldNorm(doc=1565)
        0.24 = coord(6/25)
    
  3. Sedhai, S.; Sun, A.: ¬An analysis of 14 Million tweets on hashtag-oriented spamming* (2017) 0.20
    0.19624308 = sum of:
      0.19624308 = product of:
        1.2265192 = sum of:
          0.09871553 = weight(abstract_txt:tweet in 4683) [ClassicSimilarity], result of:
            0.09871553 = score(doc=4683,freq=3.0), product of:
              0.10698707 = queryWeight, product of:
                1.1458068 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.010954848 = queryNorm
              0.9226866 = fieldWeight in 4683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=4683)
          0.13555574 = weight(abstract_txt:twitter in 4683) [ClassicSimilarity], result of:
            0.13555574 = score(doc=4683,freq=5.0), product of:
              0.14045806 = queryWeight, product of:
                1.8566672 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.010954848 = queryNorm
              0.96509767 = fieldWeight in 4683, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.0625 = fieldNorm(doc=4683)
          0.039871003 = weight(abstract_txt:personal in 4683) [ClassicSimilarity], result of:
            0.039871003 = score(doc=4683,freq=1.0), product of:
              0.12159804 = queryWeight, product of:
                2.1157758 = boost
                5.246269 = idf(docFreq=635, maxDocs=44421)
                0.010954848 = queryNorm
              0.32789183 = fieldWeight in 4683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.246269 = idf(docFreq=635, maxDocs=44421)
                0.0625 = fieldNorm(doc=4683)
          0.95237696 = weight(abstract_txt:tweets in 4683) [ClassicSimilarity], result of:
            0.95237696 = score(doc=4683,freq=7.0), product of:
              0.7603884 = queryWeight, product of:
                9.163993 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.010954848 = queryNorm
              1.2524875 = fieldWeight in 4683, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0625 = fieldNorm(doc=4683)
        0.16 = coord(4/25)
    
  4. Gonçalo Oliveira, H.: Automatic generation of poetry inspired by Twitter trends (2016) 0.19
    0.19071595 = sum of:
      0.19071595 = product of:
        0.9535798 = sum of:
          0.02385189 = weight(abstract_txt:given in 3388) [ClassicSimilarity], result of:
            0.02385189 = score(doc=3388,freq=1.0), product of:
              0.064992994 = queryWeight, product of:
                1.2629728 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.010954848 = queryNorm
              0.3669917 = fieldWeight in 3388, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.078125 = fieldNorm(doc=3388)
          0.07577797 = weight(abstract_txt:twitter in 3388) [ClassicSimilarity], result of:
            0.07577797 = score(doc=3388,freq=1.0), product of:
              0.14045806 = queryWeight, product of:
                1.8566672 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.010954848 = queryNorm
              0.539506 = fieldWeight in 3388, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.078125 = fieldNorm(doc=3388)
          0.0394095 = weight(abstract_txt:recent in 3388) [ClassicSimilarity], result of:
            0.0394095 = score(doc=3388,freq=1.0), product of:
              0.10397982 = queryWeight, product of:
                1.9565047 = boost
                4.8513412 = idf(docFreq=943, maxDocs=44421)
                0.010954848 = queryNorm
              0.37901103 = fieldWeight in 3388, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8513412 = idf(docFreq=943, maxDocs=44421)
                0.078125 = fieldNorm(doc=3388)
          0.17820676 = weight(abstract_txt:topic in 3388) [ClassicSimilarity], result of:
            0.17820676 = score(doc=3388,freq=4.0), product of:
              0.2256773 = queryWeight, product of:
                4.076292 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.010954848 = queryNorm
              0.789653 = fieldWeight in 3388, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.078125 = fieldNorm(doc=3388)
          0.63633364 = weight(abstract_txt:tweets in 3388) [ClassicSimilarity], result of:
            0.63633364 = score(doc=3388,freq=2.0), product of:
              0.7603884 = queryWeight, product of:
                9.163993 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.010954848 = queryNorm
              0.83685344 = fieldWeight in 3388, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.078125 = fieldNorm(doc=3388)
        0.2 = coord(5/25)
    
  5. Zheng, X.; Sun, A.: Collecting event-related tweets from twitter stream (2019) 0.19
    0.18985036 = sum of:
      0.18985036 = product of:
        0.9492518 = sum of:
          0.052633327 = weight(abstract_txt:superiority in 672) [ClassicSimilarity], result of:
            0.052633327 = score(doc=672,freq=1.0), product of:
              0.101458535 = queryWeight, product of:
                1.1158094 = boost
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.010954848 = queryNorm
              0.5187669 = fieldWeight in 672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.0625 = fieldNorm(doc=672)
          0.05699344 = weight(abstract_txt:tweet in 672) [ClassicSimilarity], result of:
            0.05699344 = score(doc=672,freq=1.0), product of:
              0.10698707 = queryWeight, product of:
                1.1458068 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.010954848 = queryNorm
              0.53271335 = fieldWeight in 672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=672)
          0.014694657 = weight(abstract_txt:specific in 672) [ClassicSimilarity], result of:
            0.014694657 = score(doc=672,freq=1.0), product of:
              0.054604825 = queryWeight, product of:
                1.1576473 = boost
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.010954848 = queryNorm
              0.26910913 = fieldWeight in 672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.0625 = fieldNorm(doc=672)
          0.10500103 = weight(abstract_txt:twitter in 672) [ClassicSimilarity], result of:
            0.10500103 = score(doc=672,freq=3.0), product of:
              0.14045806 = queryWeight, product of:
                1.8566672 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.010954848 = queryNorm
              0.74756145 = fieldWeight in 672, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.0625 = fieldNorm(doc=672)
          0.71992934 = weight(abstract_txt:tweets in 672) [ClassicSimilarity], result of:
            0.71992934 = score(doc=672,freq=4.0), product of:
              0.7603884 = queryWeight, product of:
                9.163993 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.010954848 = queryNorm
              0.94679165 = fieldWeight in 672, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0625 = fieldNorm(doc=672)
        0.2 = coord(5/25)