Document (#41253)

Author
Graus, D.
Odijk, D.
Rijke, M. de
Title
¬The birth of collective memories : analyzing emerging entities in text streams
Source
Journal of the Association for Information Science and Technology. 69(2018) no.6, S.773-786
Year
2018
Abstract
We study how collective memories are formed online. We do so by tracking entities that emerge in public discourse, that is, in online text streams such as social media and news streams, before they are incorporated into Wikipedia, which, we argue, can be viewed as an online place for collective memory. By tracking how entities emerge in public discourse, that is, the temporal patterns between their first mention in online text streams and subsequent incorporation into collective memory, we gain insights into how the collective remembrance process happens online. Specifically, we analyze nearly 80,000 entities as they emerge in online text streams before they are incorporated into Wikipedia. The online text streams we use for our analysis comprise of social media and news streams, and span over 579 million documents in a time span of 18 months. We discover two main emergence patterns: entities that emerge in a "bursty" fashion, that is, that appear in public discourse without a precedent, blast into activity and transition into collective memory. Other entities display a "delayed" pattern, where they appear in public discourse, experience a period of inactivity, and then resurface before transitioning into our cultural collective memory.
Content
Vgl.: https://onlinelibrary.wiley.com/doi/abs/10.1002/asi.24004.

Similar documents (author)

  1. De Rijke, M. -> Rijke, M. de: 4.59
    4.5886087 = sum of:
      4.5886087 = weight(author_txt:rijke in 4116) [ClassicSimilarity], result of:
        4.5886087 = score(doc=4116,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          4.588609 = fieldWeight in 4116, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.375 = fieldNorm(doc=4116)
    
  2. Li, X.; Rijke, M.de: Characterizing and predicting downloads in academic search (2019) 4.33
    4.326182 = sum of:
      4.326182 = weight(author_txt:rijke in 5103) [ClassicSimilarity], result of:
        4.326182 = score(doc=5103,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          4.3261824 = fieldWeight in 5103, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.5 = fieldNorm(doc=5103)
    
  3. DeRijke, M. -> Rijke, M. de: 3.79
    3.7854092 = sum of:
      3.7854092 = weight(author_txt:rijke in 117) [ClassicSimilarity], result of:
        3.7854092 = score(doc=117,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          3.7854095 = fieldWeight in 117, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.4375 = fieldNorm(doc=117)
    
  4. Meij, E.; Rijke, M. de: Thesaurus-based feedback to support mixed search and browsing environments (2007) 3.79
    3.7854092 = sum of:
      3.7854092 = weight(author_txt:rijke in 2432) [ClassicSimilarity], result of:
        3.7854092 = score(doc=2432,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          3.7854095 = fieldWeight in 2432, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.4375 = fieldNorm(doc=2432)
    
  5. Cai, F.; Rijke, M. de: Learning from homologous queries and semantically related terms for query auto completion (2016) 3.79
    3.7854092 = sum of:
      3.7854092 = weight(author_txt:rijke in 2971) [ClassicSimilarity], result of:
        3.7854092 = score(doc=2971,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          3.7854095 = fieldWeight in 2971, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.4375 = fieldNorm(doc=2971)
    

Similar documents (content)

  1. Rohman, A.: ¬The emergence, peak, and abeyance of an online information ground : the lifecycle of a Facebook group for verifying information during violence (2021) 0.17
    0.16752043 = sum of:
      0.16752043 = product of:
        0.6980018 = sum of:
          0.13381913 = weight(abstract_txt:memories in 153) [ClassicSimilarity], result of:
            0.13381913 = score(doc=153,freq=1.0), product of:
              0.1957035 = queryWeight, product of:
                1.8426862 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.0121343825 = queryNorm
              0.683785 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.078125 = fieldNorm(doc=153)
          0.039068997 = weight(abstract_txt:public in 153) [ClassicSimilarity], result of:
            0.039068997 = score(doc=153,freq=1.0), product of:
              0.10851372 = queryWeight, product of:
                1.940481 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.0121343825 = queryNorm
              0.3600374 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.078125 = fieldNorm(doc=153)
          0.06884288 = weight(abstract_txt:online in 153) [ClassicSimilarity], result of:
            0.06884288 = score(doc=153,freq=4.0), product of:
              0.12017898 = queryWeight, product of:
                2.7014713 = boost
                3.6661522 = idf(docFreq=3073, maxDocs=44218)
                0.0121343825 = queryNorm
              0.5728363 = fieldWeight in 153, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6661522 = idf(docFreq=3073, maxDocs=44218)
                0.078125 = fieldNorm(doc=153)
          0.035467777 = weight(abstract_txt:into in 153) [ClassicSimilarity], result of:
            0.035467777 = score(doc=153,freq=1.0), product of:
              0.12260226 = queryWeight, product of:
                2.7285714 = boost
                3.7029297 = idf(docFreq=2962, maxDocs=44218)
                0.0121343825 = queryNorm
              0.28929138 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7029297 = idf(docFreq=2962, maxDocs=44218)
                0.078125 = fieldNorm(doc=153)
          0.19038428 = weight(abstract_txt:emerge in 153) [ClassicSimilarity], result of:
            0.19038428 = score(doc=153,freq=2.0), product of:
              0.24755625 = queryWeight, product of:
                2.9309204 = boost
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.0121343825 = queryNorm
              0.76905465 = fieldWeight in 153, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9606886 = idf(docFreq=113, maxDocs=44218)
                0.078125 = fieldNorm(doc=153)
          0.23041873 = weight(abstract_txt:collective in 153) [ClassicSimilarity], result of:
            0.23041873 = score(doc=153,freq=1.0), product of:
              0.42686218 = queryWeight, product of:
                5.0913143 = boost
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0121343825 = queryNorm
              0.53979653 = fieldWeight in 153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.078125 = fieldNorm(doc=153)
        0.24 = coord(6/25)
    
  2. Henninger, M.; Scifleet, P.: How are the new documents of social networks shaping our cultural memory (2016) 0.15
    0.1538466 = sum of:
      0.1538466 = product of:
        0.5494522 = sum of:
          0.04969515 = weight(abstract_txt:incorporated in 2656) [ClassicSimilarity], result of:
            0.04969515 = score(doc=2656,freq=1.0), product of:
              0.11732822 = queryWeight, product of:
                1.4267678 = boost
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.0121343825 = queryNorm
              0.42355666 = fieldWeight in 2656, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7769065 = idf(docFreq=136, maxDocs=44218)
                0.0625 = fieldNorm(doc=2656)
          0.011037172 = weight(abstract_txt:that in 2656) [ClassicSimilarity], result of:
            0.011037172 = score(doc=2656,freq=3.0), product of:
              0.043029375 = queryWeight, product of:
                1.4965638 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0121343825 = queryNorm
              0.2565032 = fieldWeight in 2656, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=2656)
          0.1070553 = weight(abstract_txt:memories in 2656) [ClassicSimilarity], result of:
            0.1070553 = score(doc=2656,freq=1.0), product of:
              0.1957035 = queryWeight, product of:
                1.8426862 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.0121343825 = queryNorm
              0.547028 = fieldWeight in 2656, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.0625 = fieldNorm(doc=2656)
          0.031255197 = weight(abstract_txt:public in 2656) [ClassicSimilarity], result of:
            0.031255197 = score(doc=2656,freq=1.0), product of:
              0.10851372 = queryWeight, product of:
                1.940481 = boost
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.0121343825 = queryNorm
              0.2880299 = fieldWeight in 2656, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6084785 = idf(docFreq=1197, maxDocs=44218)
                0.0625 = fieldNorm(doc=2656)
          0.078787014 = weight(abstract_txt:discourse in 2656) [ClassicSimilarity], result of:
            0.078787014 = score(doc=2656,freq=1.0), product of:
              0.20098929 = queryWeight, product of:
                2.6409097 = boost
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.0121343825 = queryNorm
              0.3919961 = fieldWeight in 2656, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2719374 = idf(docFreq=226, maxDocs=44218)
                0.0625 = fieldNorm(doc=2656)
          0.028374223 = weight(abstract_txt:into in 2656) [ClassicSimilarity], result of:
            0.028374223 = score(doc=2656,freq=1.0), product of:
              0.12260226 = queryWeight, product of:
                2.7285714 = boost
                3.7029297 = idf(docFreq=2962, maxDocs=44218)
                0.0121343825 = queryNorm
              0.23143311 = fieldWeight in 2656, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7029297 = idf(docFreq=2962, maxDocs=44218)
                0.0625 = fieldNorm(doc=2656)
          0.2432481 = weight(abstract_txt:memory in 2656) [ClassicSimilarity], result of:
            0.2432481 = score(doc=2656,freq=7.0), product of:
              0.22277687 = queryWeight, product of:
                2.7803671 = boost
                6.603137 = idf(docFreq=162, maxDocs=44218)
                0.0121343825 = queryNorm
              1.0918912 = fieldWeight in 2656, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.603137 = idf(docFreq=162, maxDocs=44218)
                0.0625 = fieldNorm(doc=2656)
        0.28 = coord(7/25)
    
  3. Luyt, B.: Wikipedia, collective memory, and the Vietnam war (2016) 0.14
    0.14277497 = sum of:
      0.14277497 = product of:
        0.7138748 = sum of:
          0.08339065 = weight(abstract_txt:wikipedia in 3054) [ClassicSimilarity], result of:
            0.08339065 = score(doc=3054,freq=2.0), product of:
              0.10035382 = queryWeight, product of:
                1.3195293 = boost
                6.2675414 = idf(docFreq=227, maxDocs=44218)
                0.0121343825 = queryNorm
              0.8309664 = fieldWeight in 3054, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2675414 = idf(docFreq=227, maxDocs=44218)
                0.09375 = fieldNorm(doc=3054)
          0.019116944 = weight(abstract_txt:that in 3054) [ClassicSimilarity], result of:
            0.019116944 = score(doc=3054,freq=4.0), product of:
              0.043029375 = queryWeight, product of:
                1.4965638 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0121343825 = queryNorm
              0.44427657 = fieldWeight in 3054, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.09375 = fieldNorm(doc=3054)
          0.02530133 = weight(abstract_txt:they in 3054) [ClassicSimilarity], result of:
            0.02530133 = score(doc=3054,freq=1.0), product of:
              0.07192909 = queryWeight, product of:
                1.5798627 = boost
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.0121343825 = queryNorm
              0.3517538 = fieldWeight in 3054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.09375 = fieldNorm(doc=3054)
          0.19503236 = weight(abstract_txt:memory in 3054) [ClassicSimilarity], result of:
            0.19503236 = score(doc=3054,freq=2.0), product of:
              0.22277687 = queryWeight, product of:
                2.7803671 = boost
                6.603137 = idf(docFreq=162, maxDocs=44218)
                0.0121343825 = queryNorm
              0.8754605 = fieldWeight in 3054, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.603137 = idf(docFreq=162, maxDocs=44218)
                0.09375 = fieldNorm(doc=3054)
          0.39103353 = weight(abstract_txt:collective in 3054) [ClassicSimilarity], result of:
            0.39103353 = score(doc=3054,freq=2.0), product of:
              0.42686218 = queryWeight, product of:
                5.0913143 = boost
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.0121343825 = queryNorm
              0.9160651 = fieldWeight in 3054, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9093957 = idf(docFreq=119, maxDocs=44218)
                0.09375 = fieldNorm(doc=3054)
        0.2 = coord(5/25)
    
  4. Multi-source, multilingual information extraction and summarization (2013) 0.12
    0.1198073 = sum of:
      0.1198073 = product of:
        0.5990365 = sum of:
          0.03375382 = weight(abstract_txt:news in 978) [ClassicSimilarity], result of:
            0.03375382 = score(doc=978,freq=1.0), product of:
              0.090658486 = queryWeight, product of:
                1.2541697 = boost
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.0121343825 = queryNorm
              0.3723184 = fieldWeight in 978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.0625 = fieldNorm(doc=978)
          0.006372315 = weight(abstract_txt:that in 978) [ClassicSimilarity], result of:
            0.006372315 = score(doc=978,freq=1.0), product of:
              0.043029375 = queryWeight, product of:
                1.4965638 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0121343825 = queryNorm
              0.1480922 = fieldWeight in 978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=978)
          0.037330605 = weight(abstract_txt:text in 978) [ClassicSimilarity], result of:
            0.037330605 = score(doc=978,freq=2.0), product of:
              0.104441516 = queryWeight, product of:
                2.1284266 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0121343825 = queryNorm
              0.3574307 = fieldWeight in 978, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=978)
          0.095076665 = weight(abstract_txt:entities in 978) [ClassicSimilarity], result of:
            0.095076665 = score(doc=978,freq=1.0), product of:
              0.26078516 = queryWeight, product of:
                3.6842928 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0121343825 = queryNorm
              0.36457852 = fieldWeight in 978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=978)
          0.4265031 = weight(abstract_txt:streams in 978) [ClassicSimilarity], result of:
            0.4265031 = score(doc=978,freq=2.0), product of:
              0.59268045 = queryWeight, product of:
                5.9992423 = boost
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.0121343825 = queryNorm
              0.71961725 = fieldWeight in 978, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.14154 = idf(docFreq=34, maxDocs=44218)
                0.0625 = fieldNorm(doc=978)
        0.2 = coord(5/25)
    
  5. Hills, T.; Segev, E.: ¬The news is American but our memories are - Chinese? (2014) 0.12
    0.11851853 = sum of:
      0.11851853 = product of:
        0.49382722 = sum of:
          0.033210214 = weight(abstract_txt:patterns in 1342) [ClassicSimilarity], result of:
            0.033210214 = score(doc=1342,freq=2.0), product of:
              0.07118103 = queryWeight, product of:
                1.1113074 = boost
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0121343825 = queryNorm
              0.4665599 = fieldWeight in 1342, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2785225 = idf(docFreq=612, maxDocs=44218)
                0.0625 = fieldNorm(doc=1342)
          0.121701136 = weight(abstract_txt:news in 1342) [ClassicSimilarity], result of:
            0.121701136 = score(doc=1342,freq=13.0), product of:
              0.090658486 = queryWeight, product of:
                1.2541697 = boost
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.0121343825 = queryNorm
              1.3424131 = fieldWeight in 1342, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.0625 = fieldNorm(doc=1342)
          0.009011813 = weight(abstract_txt:that in 1342) [ClassicSimilarity], result of:
            0.009011813 = score(doc=1342,freq=2.0), product of:
              0.043029375 = queryWeight, product of:
                1.4965638 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0121343825 = queryNorm
              0.20943399 = fieldWeight in 1342, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1342)
          0.023854323 = weight(abstract_txt:they in 1342) [ClassicSimilarity], result of:
            0.023854323 = score(doc=1342,freq=2.0), product of:
              0.07192909 = queryWeight, product of:
                1.5798627 = boost
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.0121343825 = queryNorm
              0.33163667 = fieldWeight in 1342, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7520406 = idf(docFreq=2820, maxDocs=44218)
                0.0625 = fieldNorm(doc=1342)
          0.2141106 = weight(abstract_txt:memories in 1342) [ClassicSimilarity], result of:
            0.2141106 = score(doc=1342,freq=4.0), product of:
              0.1957035 = queryWeight, product of:
                1.8426862 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.0121343825 = queryNorm
              1.094056 = fieldWeight in 1342, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.0625 = fieldNorm(doc=1342)
          0.09193914 = weight(abstract_txt:memory in 1342) [ClassicSimilarity], result of:
            0.09193914 = score(doc=1342,freq=1.0), product of:
              0.22277687 = queryWeight, product of:
                2.7803671 = boost
                6.603137 = idf(docFreq=162, maxDocs=44218)
                0.0121343825 = queryNorm
              0.41269606 = fieldWeight in 1342, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.603137 = idf(docFreq=162, maxDocs=44218)
                0.0625 = fieldNorm(doc=1342)
        0.24 = coord(6/25)