Document (#39339)

Author
Kong, S.
Ye, F.
Feng, L.
Zhao, Z.
Title
Towards the prediction problems of bursting hashtags on Twitter
Source
Journal of the Association for Information Science and Technology. 66(2015) no.12, S.2566-2579
Year
2015
Abstract
Hundreds of thousands of hashtags are generated every day on Twitter. Only a few will burst and become trending topics. In this article, we provide the definition of a bursting hashtag and conduct a systematic study of a series of challenging prediction problems that span the entire life cycles of bursting hashtags. Around the problem of "how to build a system to predict bursting hashtags," we explore different types of features and present machine learning solutions. On real data sets from Twitter, experiments are conducted to evaluate the effectiveness of the proposed solutions and the contributions of features.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23342/abstract.
Theme
Internet
Data Mining
Object
Twitter

Similar documents (author)

  1. Feng, S.: ¬A comparative study of indexing languages in single and multidatabase searching (1989) 2.58
    2.5784209 = sum of:
      2.5784209 = product of:
        5.1568418 = sum of:
          5.1568418 = weight(author_txt:feng in 2562) [ClassicSimilarity], result of:
            5.1568418 = score(doc=2562,freq=1.0), product of:
              0.85750616 = queryWeight, product of:
                1.2910321 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.06902933 = queryNorm
              6.0137663 = fieldWeight in 2562, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.625 = fieldNorm(doc=2562)
        0.5 = coord(1/2)
    
  2. Feng, Y.; Agosto, D.E.: Revisiting personal information management through information practices with activity tracking technology (2019) 2.06
    2.0627367 = sum of:
      2.0627367 = product of:
        4.1254735 = sum of:
          4.1254735 = weight(author_txt:feng in 438) [ClassicSimilarity], result of:
            4.1254735 = score(doc=438,freq=1.0), product of:
              0.85750616 = queryWeight, product of:
                1.2910321 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.06902933 = queryNorm
              4.811013 = fieldWeight in 438, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.5 = fieldNorm(doc=438)
        0.5 = coord(1/2)
    
  3. Feng, L.; Jeusfeld, M.A.; Hoppenbrouwers, J.: Beyond information searching and browsing : acquiring knowledge from digital libraries (2005) 1.55
    1.5470525 = sum of:
      1.5470525 = product of:
        3.094105 = sum of:
          3.094105 = weight(author_txt:feng in 2000) [ClassicSimilarity], result of:
            3.094105 = score(doc=2000,freq=1.0), product of:
              0.85750616 = queryWeight, product of:
                1.2910321 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.06902933 = queryNorm
              3.60826 = fieldWeight in 2000, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.375 = fieldNorm(doc=2000)
        0.5 = coord(1/2)
    
  4. Xu, G.; Cao, Y.; Ren, Y.; Li, X.; Feng, Z.: Network security situation awareness based on semantic ontology and user-defined rules for Internet of Things (2017) 1.29
    1.2892104 = sum of:
      1.2892104 = product of:
        2.5784209 = sum of:
          2.5784209 = weight(author_txt:feng in 1307) [ClassicSimilarity], result of:
            2.5784209 = score(doc=1307,freq=1.0), product of:
              0.85750616 = queryWeight, product of:
                1.2910321 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.06902933 = queryNorm
              3.0068831 = fieldWeight in 1307, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.3125 = fieldNorm(doc=1307)
        0.5 = coord(1/2)
    
  5. Zhao, L.: Save space for "newcomers" : analyzing problems in book number assignment under the LCC system (2004) 1.20
    1.198237 = sum of:
      1.198237 = product of:
        2.396474 = sum of:
          2.396474 = weight(author_txt:zhao in 4081) [ClassicSimilarity], result of:
            2.396474 = score(doc=4081,freq=1.0), product of:
              0.5144737 = queryWeight, product of:
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.06902933 = queryNorm
              4.6581078 = fieldWeight in 4081, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.625 = fieldNorm(doc=4081)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Çelebi, A.; Özgür, A.: Segmenting hashtags and analyzing their grammatical structure (2018) 0.43
    0.42702013 = sum of:
      0.42702013 = product of:
        1.7792506 = sum of:
          0.041501496 = weight(abstract_txt:challenging in 221) [ClassicSimilarity], result of:
            0.041501496 = score(doc=221,freq=1.0), product of:
              0.09914473 = queryWeight, product of:
                1.2157185 = boost
                6.697521 = idf(docFreq=148, maxDocs=44421)
                0.012176501 = queryNorm
              0.41859508 = fieldWeight in 221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.697521 = idf(docFreq=148, maxDocs=44421)
                0.0625 = fieldNorm(doc=221)
          0.036577363 = weight(abstract_txt:features in 221) [ClassicSimilarity], result of:
            0.036577363 = score(doc=221,freq=2.0), product of:
              0.09113854 = queryWeight, product of:
                1.6484063 = boost
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.012176501 = queryNorm
              0.40133804 = fieldWeight in 221, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.0625 = fieldNorm(doc=221)
          0.11469649 = weight(abstract_txt:trending in 221) [ClassicSimilarity], result of:
            0.11469649 = score(doc=221,freq=1.0), product of:
              0.19525127 = queryWeight, product of:
                1.7060634 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.012176501 = queryNorm
              0.5874302 = fieldWeight in 221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=221)
          0.28678912 = weight(abstract_txt:hashtag in 221) [ClassicSimilarity], result of:
            0.28678912 = score(doc=221,freq=5.0), product of:
              0.21035148 = queryWeight, product of:
                1.7708061 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.012176501 = queryNorm
              1.3633806 = fieldWeight in 221, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0625 = fieldNorm(doc=221)
          0.13647762 = weight(abstract_txt:twitter in 221) [ClassicSimilarity], result of:
            0.13647762 = score(doc=221,freq=1.0), product of:
              0.31620967 = queryWeight, product of:
                3.760507 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.012176501 = queryNorm
              0.4316048 = fieldWeight in 221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.0625 = fieldNorm(doc=221)
          1.1632085 = weight(abstract_txt:hashtags in 221) [ClassicSimilarity], result of:
            1.1632085 = score(doc=221,freq=8.0), product of:
              0.72608733 = queryWeight, product of:
                6.579951 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.012176501 = queryNorm
              1.6020229 = fieldWeight in 221, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.0625 = fieldNorm(doc=221)
        0.24 = coord(6/25)
    
  2. Ma, Z.; Sun, A.; Cong, G.: On predicting the popularity of newly emerging hashtags in Twitter (2013) 0.35
    0.34886178 = sum of:
      0.34886178 = product of:
        1.4535908 = sum of:
          0.042806398 = weight(abstract_txt:predict in 1967) [ClassicSimilarity], result of:
            0.042806398 = score(doc=1967,freq=1.0), product of:
              0.10121221 = queryWeight, product of:
                1.228329 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.012176501 = queryNorm
              0.4229371 = fieldWeight in 1967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.0625 = fieldNorm(doc=1967)
          0.06842998 = weight(abstract_txt:features in 1967) [ClassicSimilarity], result of:
            0.06842998 = score(doc=1967,freq=7.0), product of:
              0.09113854 = queryWeight, product of:
                1.6484063 = boost
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.012176501 = queryNorm
              0.7508347 = fieldWeight in 1967, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.0625 = fieldNorm(doc=1967)
          0.22214589 = weight(abstract_txt:hashtag in 1967) [ClassicSimilarity], result of:
            0.22214589 = score(doc=1967,freq=3.0), product of:
              0.21035148 = queryWeight, product of:
                1.7708061 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.012176501 = queryNorm
              1.05607 = fieldWeight in 1967, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0625 = fieldNorm(doc=1967)
          0.10271841 = weight(abstract_txt:prediction in 1967) [ClassicSimilarity], result of:
            0.10271841 = score(doc=1967,freq=1.0), product of:
              0.22856128 = queryWeight, product of:
                2.610445 = boost
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.012176501 = queryNorm
              0.449413 = fieldWeight in 1967, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.0625 = fieldNorm(doc=1967)
          0.30517322 = weight(abstract_txt:twitter in 1967) [ClassicSimilarity], result of:
            0.30517322 = score(doc=1967,freq=5.0), product of:
              0.31620967 = queryWeight, product of:
                3.760507 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.012176501 = queryNorm
              0.96509767 = fieldWeight in 1967, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.0625 = fieldNorm(doc=1967)
          0.7123169 = weight(abstract_txt:hashtags in 1967) [ClassicSimilarity], result of:
            0.7123169 = score(doc=1967,freq=3.0), product of:
              0.72608733 = queryWeight, product of:
                6.579951 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.012176501 = queryNorm
              0.9810347 = fieldWeight in 1967, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.0625 = fieldNorm(doc=1967)
        0.24 = coord(6/25)
    
  3. Chang, H.-C.; Iyer, I.: Trends in Twitter hashtag applications : design features for value-added dimensions to future library catalogues (2012) 0.24
    0.23965618 = sum of:
      0.23965618 = product of:
        1.4978511 = sum of:
          0.03233013 = weight(abstract_txt:features in 574) [ClassicSimilarity], result of:
            0.03233013 = score(doc=574,freq=1.0), product of:
              0.09113854 = queryWeight, product of:
                1.6484063 = boost
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.012176501 = queryNorm
              0.3547361 = fieldWeight in 574, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.078125 = fieldNorm(doc=574)
          0.32064 = weight(abstract_txt:hashtag in 574) [ClassicSimilarity], result of:
            0.32064 = score(doc=574,freq=4.0), product of:
              0.21035148 = queryWeight, product of:
                1.7708061 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.012176501 = queryNorm
              1.5243058 = fieldWeight in 574, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.078125 = fieldNorm(doc=574)
          0.41787562 = weight(abstract_txt:twitter in 574) [ClassicSimilarity], result of:
            0.41787562 = score(doc=574,freq=6.0), product of:
              0.31620967 = queryWeight, product of:
                3.760507 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.012176501 = queryNorm
              1.3215144 = fieldWeight in 574, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.078125 = fieldNorm(doc=574)
          0.72700536 = weight(abstract_txt:hashtags in 574) [ClassicSimilarity], result of:
            0.72700536 = score(doc=574,freq=2.0), product of:
              0.72608733 = queryWeight, product of:
                6.579951 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.012176501 = queryNorm
              1.0012643 = fieldWeight in 574, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.078125 = fieldNorm(doc=574)
        0.16 = coord(4/25)
    
  4. Zhang, M.; Zhang, Y.: Professional organizations in Twittersphere : an empirical study of U.S. library and information science professional organizations-related Tweets (2020) 0.18
    0.17912765 = sum of:
      0.17912765 = product of:
        1.1195478 = sum of:
          0.04110582 = weight(abstract_txt:systematic in 775) [ClassicSimilarity], result of:
            0.04110582 = score(doc=775,freq=1.0), product of:
              0.07517992 = queryWeight, product of:
                1.058642 = boost
                5.8321705 = idf(docFreq=353, maxDocs=44421)
                0.012176501 = queryNorm
              0.546766 = fieldWeight in 775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8321705 = idf(docFreq=353, maxDocs=44421)
                0.09375 = fieldNorm(doc=775)
          0.17204472 = weight(abstract_txt:trending in 775) [ClassicSimilarity], result of:
            0.17204472 = score(doc=775,freq=1.0), product of:
              0.19525127 = queryWeight, product of:
                1.7060634 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.012176501 = queryNorm
              0.88114524 = fieldWeight in 775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.09375 = fieldNorm(doc=775)
          0.28951272 = weight(abstract_txt:twitter in 775) [ClassicSimilarity], result of:
            0.28951272 = score(doc=775,freq=2.0), product of:
              0.31620967 = queryWeight, product of:
                3.760507 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.012176501 = queryNorm
              0.91557205 = fieldWeight in 775, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.09375 = fieldNorm(doc=775)
          0.6168845 = weight(abstract_txt:hashtags in 775) [ClassicSimilarity], result of:
            0.6168845 = score(doc=775,freq=1.0), product of:
              0.72608733 = queryWeight, product of:
                6.579951 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.012176501 = queryNorm
              0.849601 = fieldWeight in 775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.09375 = fieldNorm(doc=775)
        0.16 = coord(4/25)
    
  5. Luo, Z.; Yu, Y.; Osborne, M.; Wang, T.: Structuring tweets for improving Twitter search (2015) 0.13
    0.13178171 = sum of:
      0.13178171 = product of:
        0.8236357 = sum of:
          0.041501496 = weight(abstract_txt:challenging in 3335) [ClassicSimilarity], result of:
            0.041501496 = score(doc=3335,freq=1.0), product of:
              0.09914473 = queryWeight, product of:
                1.2157185 = boost
                6.697521 = idf(docFreq=148, maxDocs=44421)
                0.012176501 = queryNorm
              0.41859508 = fieldWeight in 3335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.697521 = idf(docFreq=148, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
          0.036577363 = weight(abstract_txt:features in 3335) [ClassicSimilarity], result of:
            0.036577363 = score(doc=3335,freq=2.0), product of:
              0.09113854 = queryWeight, product of:
                1.6484063 = boost
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.012176501 = queryNorm
              0.40133804 = fieldWeight in 3335, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
          0.33430052 = weight(abstract_txt:twitter in 3335) [ClassicSimilarity], result of:
            0.33430052 = score(doc=3335,freq=6.0), product of:
              0.31620967 = queryWeight, product of:
                3.760507 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.012176501 = queryNorm
              1.0572115 = fieldWeight in 3335, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
          0.41125634 = weight(abstract_txt:hashtags in 3335) [ClassicSimilarity], result of:
            0.41125634 = score(doc=3335,freq=1.0), product of:
              0.72608733 = queryWeight, product of:
                6.579951 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.012176501 = queryNorm
              0.56640065 = fieldWeight in 3335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.0625 = fieldNorm(doc=3335)
        0.16 = coord(4/25)