Document (#38218)

Author
Cotelo, J.M.
Cruz, F.L.
Troyano, J.A.
Title
Dynamic topic-related tweet retrieval
Source
Journal of the Association for Information Science and Technology. 65(2014) no.3, S.513-523
Year
2014
Abstract
Twitter is a social network in which people publish publicly accessible brief, instant messages. With its exponential growth and the public nature and transversality of its contents, more researchers are using Twitter as a source of data for multiple purposes. In this context, the ability to retrieve those messages (tweets) related to a certain topic becomes critical. In this work, we define the topic-related tweet retrieval task and propose a dynamic, graph-based method with which to address it. We have applied our method to capture a data set containing tweets related to the participation of the Spanish team in the Euro 2012 soccer competition, measuring the precision and recall against other simple but commonly used approaches. The results demonstrate the effectiveness of our method, which significantly increases coverage of the chosen topic and is able to capture related but unknown à priori subtopics.
Object
Twitter

Similar documents (author)

  1. Díaz, N.P. Cruz -> Cruz Díaz, N.P.: 4.98
    4.9845104 = sum of:
      4.9845104 = weight(author_txt:cruz in 1233) [ClassicSimilarity], result of:
        4.9845104 = fieldWeight in 1233, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.375 = fieldNorm(doc=1233)
    
  2. Cruz, T. Trindade => Trindade Cruz, T.: 4.98
    4.9845104 = sum of:
      4.9845104 = weight(author_txt:cruz in 843) [ClassicSimilarity], result of:
        4.9845104 = fieldWeight in 843, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.375 = fieldNorm(doc=843)
    
  3. Trindade Cruz, T.: Digital heritage : challenges and opportunities in the access and organisation of digital knowledge in contemporary societies (2018) 4.70
    4.6994414 = sum of:
      4.6994414 = weight(author_txt:cruz in 846) [ClassicSimilarity], result of:
        4.6994414 = fieldWeight in 846, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.5 = fieldNorm(doc=846)
    
  4. Barrueco Cruz, J.M.; Krichel, T.: Subject description in the Academic Metadata Format (2003) 4.11
    4.1120114 = sum of:
      4.1120114 = weight(author_txt:cruz in 4548) [ClassicSimilarity], result of:
        4.1120114 = fieldWeight in 4548, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.4375 = fieldNorm(doc=4548)
    
  5. Cruz, J.M.B.; Garcia, J.A.C.; Lopez, R.F.: Preprints: communication through electronic nets : an example of bibliographic control (1996) 3.52
    3.524581 = sum of:
      3.524581 = weight(author_txt:cruz in 4791) [ClassicSimilarity], result of:
        3.524581 = fieldWeight in 4791, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.375 = fieldNorm(doc=4791)
    

Similar documents (content)

  1. Zheng, X.; Sun, A.: Collecting event-related tweets from twitter stream (2019) 0.37
    0.36547986 = sum of:
      0.36547986 = product of:
        1.1421245 = sum of:
          0.099964134 = weight(abstract_txt:instant in 672) [ClassicSimilarity], result of:
            0.099964134 = score(doc=672,freq=1.0), product of:
              0.19191182 = queryWeight, product of:
                1.2659233 = boost
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.018189965 = queryNorm
              0.52088577 = fieldWeight in 672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.0625 = fieldNorm(doc=672)
          0.012815997 = weight(abstract_txt:which in 672) [ClassicSimilarity], result of:
            0.012815997 = score(doc=672,freq=1.0), product of:
              0.070374325 = queryWeight, product of:
                1.3277743 = boost
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.018189965 = queryNorm
              0.18211183 = fieldWeight in 672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.0625 = fieldNorm(doc=672)
          0.0943409 = weight(abstract_txt:method in 672) [ClassicSimilarity], result of:
            0.0943409 = score(doc=672,freq=4.0), product of:
              0.16776165 = queryWeight, product of:
                2.0500455 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.018189965 = queryNorm
              0.5623508 = fieldWeight in 672, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=672)
          0.11333182 = weight(abstract_txt:messages in 672) [ClassicSimilarity], result of:
            0.11333182 = score(doc=672,freq=1.0), product of:
              0.2628957 = queryWeight, product of:
                2.0953822 = boost
                6.8974466 = idf(docFreq=121, maxDocs=44421)
                0.018189965 = queryNorm
              0.4310904 = fieldWeight in 672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8974466 = idf(docFreq=121, maxDocs=44421)
                0.0625 = fieldNorm(doc=672)
          0.19699998 = weight(abstract_txt:twitter in 672) [ClassicSimilarity], result of:
            0.19699998 = score(doc=672,freq=3.0), product of:
              0.26352346 = queryWeight, product of:
                2.0978825 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.018189965 = queryNorm
              0.74756145 = fieldWeight in 672, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.0625 = fieldNorm(doc=672)
          0.30015805 = weight(abstract_txt:tweets in 672) [ClassicSimilarity], result of:
            0.30015805 = score(doc=672,freq=4.0), product of:
              0.3170265 = queryWeight, product of:
                2.3010142 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.018189965 = queryNorm
              0.94679165 = fieldWeight in 672, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0625 = fieldNorm(doc=672)
          0.21385898 = weight(abstract_txt:tweet in 672) [ClassicSimilarity], result of:
            0.21385898 = score(doc=672,freq=1.0), product of:
              0.40145224 = queryWeight, product of:
                2.5893364 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.018189965 = queryNorm
              0.53271335 = fieldWeight in 672, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=672)
          0.110654704 = weight(abstract_txt:related in 672) [ClassicSimilarity], result of:
            0.110654704 = score(doc=672,freq=3.0), product of:
              0.24348287 = queryWeight, product of:
                3.1884215 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.018189965 = queryNorm
              0.45446607 = fieldWeight in 672, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.0625 = fieldNorm(doc=672)
        0.32 = coord(8/25)
    
  2. Luo, Z.; Yu, Y.; Osborne, M.; Wang, T.: Structuring tweets for improving Twitter search (2015) 0.35
    0.35146767 = sum of:
      0.35146767 = product of:
        1.2552416 = sum of:
          0.03839392 = weight(abstract_txt:retrieval in 3335) [ClassicSimilarity], result of:
            0.03839392 = score(doc=3335,freq=7.0), product of:
              0.066786885 = queryWeight, product of:
                1.0561293 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.018189965 = queryNorm
              0.57487214 = fieldWeight in 3335, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
          0.04717045 = weight(abstract_txt:method in 3335) [ClassicSimilarity], result of:
            0.04717045 = score(doc=3335,freq=1.0), product of:
              0.16776165 = queryWeight, product of:
                2.0500455 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.018189965 = queryNorm
              0.2811754 = fieldWeight in 3335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
          0.27860004 = weight(abstract_txt:twitter in 3335) [ClassicSimilarity], result of:
            0.27860004 = score(doc=3335,freq=6.0), product of:
              0.26352346 = queryWeight, product of:
                2.0978825 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.018189965 = queryNorm
              1.0572115 = fieldWeight in 3335, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
          0.367617 = weight(abstract_txt:tweets in 3335) [ClassicSimilarity], result of:
            0.367617 = score(doc=3335,freq=6.0), product of:
              0.3170265 = queryWeight, product of:
                2.3010142 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.018189965 = queryNorm
              1.1595782 = fieldWeight in 3335, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
          0.37041458 = weight(abstract_txt:tweet in 3335) [ClassicSimilarity], result of:
            0.37041458 = score(doc=3335,freq=3.0), product of:
              0.40145224 = queryWeight, product of:
                2.5893364 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.018189965 = queryNorm
              0.9226866 = fieldWeight in 3335, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
          0.08915908 = weight(abstract_txt:topic in 3335) [ClassicSimilarity], result of:
            0.08915908 = score(doc=3335,freq=1.0), product of:
              0.28227296 = queryWeight, product of:
                3.0705853 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.018189965 = queryNorm
              0.3158612 = fieldWeight in 3335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
          0.06388652 = weight(abstract_txt:related in 3335) [ClassicSimilarity], result of:
            0.06388652 = score(doc=3335,freq=1.0), product of:
              0.24348287 = queryWeight, product of:
                3.1884215 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.018189965 = queryNorm
              0.2623861 = fieldWeight in 3335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
        0.28 = coord(7/25)
    
  3. Bae, Y.; Lee, H.: Sentiment analysis of twitter audiences : measuring the positive or negative influence of popular twitterers (2012) 0.23
    0.23386486 = sum of:
      0.23386486 = product of:
        0.97443694 = sum of:
          0.16027538 = weight(abstract_txt:messages in 1520) [ClassicSimilarity], result of:
            0.16027538 = score(doc=1520,freq=2.0), product of:
              0.2628957 = queryWeight, product of:
                2.0953822 = boost
                6.8974466 = idf(docFreq=121, maxDocs=44421)
                0.018189965 = queryNorm
              0.6096539 = fieldWeight in 1520, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8974466 = idf(docFreq=121, maxDocs=44421)
                0.0625 = fieldNorm(doc=1520)
          0.16084981 = weight(abstract_txt:twitter in 1520) [ClassicSimilarity], result of:
            0.16084981 = score(doc=1520,freq=2.0), product of:
              0.26352346 = queryWeight, product of:
                2.0978825 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.018189965 = queryNorm
              0.61038136 = fieldWeight in 1520, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.0625 = fieldNorm(doc=1520)
          0.2599445 = weight(abstract_txt:tweets in 1520) [ClassicSimilarity], result of:
            0.2599445 = score(doc=1520,freq=3.0), product of:
              0.3170265 = queryWeight, product of:
                2.3010142 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.018189965 = queryNorm
              0.81994563 = fieldWeight in 1520, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0625 = fieldNorm(doc=1520)
          0.21385898 = weight(abstract_txt:tweet in 1520) [ClassicSimilarity], result of:
            0.21385898 = score(doc=1520,freq=1.0), product of:
              0.40145224 = queryWeight, product of:
                2.5893364 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.018189965 = queryNorm
              0.53271335 = fieldWeight in 1520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=1520)
          0.08915908 = weight(abstract_txt:topic in 1520) [ClassicSimilarity], result of:
            0.08915908 = score(doc=1520,freq=1.0), product of:
              0.28227296 = queryWeight, product of:
                3.0705853 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.018189965 = queryNorm
              0.3158612 = fieldWeight in 1520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=1520)
          0.09034919 = weight(abstract_txt:related in 1520) [ClassicSimilarity], result of:
            0.09034919 = score(doc=1520,freq=2.0), product of:
              0.24348287 = queryWeight, product of:
                3.1884215 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.018189965 = queryNorm
              0.37107 = fieldWeight in 1520, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.0625 = fieldNorm(doc=1520)
        0.24 = coord(6/25)
    
  4. Fang, Z.; Dudek, J.; Costas, R.: Facing the volatility of tweets in altmetric research (2022) 0.23
    0.2304452 = sum of:
      0.2304452 = product of:
        0.9601884 = sum of:
          0.035653543 = weight(abstract_txt:data in 1606) [ClassicSimilarity], result of:
            0.035653543 = score(doc=1606,freq=5.0), product of:
              0.061284978 = queryWeight, product of:
                1.0116925 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.018189965 = queryNorm
              0.5817664 = fieldWeight in 1606, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=1606)
          0.018139422 = weight(abstract_txt:retrieval in 1606) [ClassicSimilarity], result of:
            0.018139422 = score(doc=1606,freq=1.0), product of:
              0.066786885 = queryWeight, product of:
                1.0561293 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.018189965 = queryNorm
              0.27160156 = fieldWeight in 1606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=1606)
          0.082671 = weight(abstract_txt:dynamic in 1606) [ClassicSimilarity], result of:
            0.082671 = score(doc=1606,freq=1.0), product of:
              0.18358803 = queryWeight, product of:
                1.7510304 = boost
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.018189965 = queryNorm
              0.45030713 = fieldWeight in 1606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7639313 = idf(docFreq=378, maxDocs=44421)
                0.078125 = fieldNorm(doc=1606)
          0.28434497 = weight(abstract_txt:twitter in 1606) [ClassicSimilarity], result of:
            0.28434497 = score(doc=1606,freq=4.0), product of:
              0.26352346 = queryWeight, product of:
                2.0978825 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.018189965 = queryNorm
              1.079012 = fieldWeight in 1606, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.078125 = fieldNorm(doc=1606)
          0.45952126 = weight(abstract_txt:tweets in 1606) [ClassicSimilarity], result of:
            0.45952126 = score(doc=1606,freq=6.0), product of:
              0.3170265 = queryWeight, product of:
                2.3010142 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.018189965 = queryNorm
              1.4494728 = fieldWeight in 1606, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.078125 = fieldNorm(doc=1606)
          0.079858154 = weight(abstract_txt:related in 1606) [ClassicSimilarity], result of:
            0.079858154 = score(doc=1606,freq=1.0), product of:
              0.24348287 = queryWeight, product of:
                3.1884215 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.018189965 = queryNorm
              0.32798263 = fieldWeight in 1606, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.078125 = fieldNorm(doc=1606)
        0.24 = coord(6/25)
    
  5. Yi, K.; Choi, N.; Kim, Y.S.: ¬A content analysis of Twitter hyperlinks and their application in web resource indexing (2016) 0.22
    0.21725343 = sum of:
      0.21725343 = product of:
        1.0862671 = sum of:
          0.19629647 = weight(abstract_txt:messages in 4075) [ClassicSimilarity], result of:
            0.19629647 = score(doc=4075,freq=3.0), product of:
              0.2628957 = queryWeight, product of:
                2.0953822 = boost
                6.8974466 = idf(docFreq=121, maxDocs=44421)
                0.018189965 = queryNorm
              0.7466705 = fieldWeight in 4075, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8974466 = idf(docFreq=121, maxDocs=44421)
                0.0625 = fieldNorm(doc=4075)
          0.16084981 = weight(abstract_txt:twitter in 4075) [ClassicSimilarity], result of:
            0.16084981 = score(doc=4075,freq=2.0), product of:
              0.26352346 = queryWeight, product of:
                2.0978825 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.018189965 = queryNorm
              0.61038136 = fieldWeight in 4075, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.0625 = fieldNorm(doc=4075)
          0.21224378 = weight(abstract_txt:tweets in 4075) [ClassicSimilarity], result of:
            0.21224378 = score(doc=4075,freq=2.0), product of:
              0.3170265 = queryWeight, product of:
                2.3010142 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.018189965 = queryNorm
              0.66948277 = fieldWeight in 4075, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0625 = fieldNorm(doc=4075)
          0.42771795 = weight(abstract_txt:tweet in 4075) [ClassicSimilarity], result of:
            0.42771795 = score(doc=4075,freq=4.0), product of:
              0.40145224 = queryWeight, product of:
                2.5893364 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.018189965 = queryNorm
              1.0654267 = fieldWeight in 4075, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=4075)
          0.08915908 = weight(abstract_txt:topic in 4075) [ClassicSimilarity], result of:
            0.08915908 = score(doc=4075,freq=1.0), product of:
              0.28227296 = queryWeight, product of:
                3.0705853 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.018189965 = queryNorm
              0.3158612 = fieldWeight in 4075, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=4075)
        0.2 = coord(5/25)