Document (#36456)

Author
Efron, M.
Title
Information search and retrieval in microblogs
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.6, S.996-1008
Year
2011
Series
Advances in information science
Abstract
Modern information retrieval (IR) has come to terms with numerous new media in efforts to help people find information in increasingly diverse settings. Among these new media are so-called microblogs. A microblog is a stream of text that is written by an author over time. It comprises many very brief updates that are presented to the microblog's readers in reverse-chronological order. Today, the service called Twitter is the most popular microblogging platform. Although microblogging is increasingly popular, methods for organizing and providing access to microblog data are still new. This review offers an introduction to the problems that face researchers and developers of IR systems in microblog settings. After an overview of microblogs and the behavior surrounding them, the review describes established problems in microblog retrieval, such as entity search and sentiment analysis, and modeling abstractions, such as authority and quality. The review also treats user-created metadata that often appear in microblogs. Because the problem of microblog search is so new, the review concludes with a discussion of particularly pressing research issues yet to be studied in the field.

Similar documents (author)

  1. Efron, M.: Eigenvalue-based model selection during Latent Semantic Indexing (2005) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:efron in 4685) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 4685, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=4685)
    
  2. Efron, M.: Shannon meets Shortz : a probabilistic model of crossword puzzle difficulty (2008) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:efron in 2620) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 2620, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=2620)
    
  3. Efron, M.: Query expansion and dimensionality reduction : Notions of optimality in Rocchio relevance feedback and latent semantic indexing (2008) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:efron in 3020) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 3020, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=3020)
    
  4. Efron, M.: Linear time series models for term weighting in information retrieval (2010) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:efron in 675) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 675, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=675)
    
  5. Efron, M.; Winget, M.: Query polyrepresentation for ranking retrieval systems without relevance judgments (2010) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:efron in 456) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 456, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=456)
    

Similar documents (content)

  1. Jansen, B.J.; Zhang, M.; Sobel, K.; Chowdury, A.: Twitter power : tweets as electronic word of mouth (2009) 0.30
    0.29649028 = sum of:
      0.29649028 = product of:
        1.4824514 = sum of:
          0.059196316 = weight(abstract_txt:sentiment in 144) [ClassicSimilarity], result of:
            0.059196316 = score(doc=144,freq=2.0), product of:
              0.08897605 = queryWeight, product of:
                1.1001254 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.010744948 = queryNorm
              0.6653062 = fieldWeight in 144, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=144)
          0.007344 = weight(abstract_txt:that in 144) [ClassicSimilarity], result of:
            0.007344 = score(doc=144,freq=2.0), product of:
              0.035133258 = queryWeight, product of:
                1.3825947 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.010744948 = queryNorm
              0.20903271 = fieldWeight in 144, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=144)
          0.22058938 = weight(abstract_txt:microblogging in 144) [ClassicSimilarity], result of:
            0.22058938 = score(doc=144,freq=3.0), product of:
              0.23538528 = queryWeight, product of:
                2.5305233 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.010744948 = queryNorm
              0.9371418 = fieldWeight in 144, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.0625 = fieldNorm(doc=144)
          0.6519566 = weight(abstract_txt:microblogs in 144) [ClassicSimilarity], result of:
            0.6519566 = score(doc=144,freq=4.0), product of:
              0.55492264 = queryWeight, product of:
                5.4948 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.010744948 = queryNorm
              1.1748604 = fieldWeight in 144, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=144)
          0.54336506 = weight(abstract_txt:microblog in 144) [ClassicSimilarity], result of:
            0.54336506 = score(doc=144,freq=2.0), product of:
              0.66700304 = queryWeight, product of:
                6.7352633 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.010744948 = queryNorm
              0.8146366 = fieldWeight in 144, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=144)
        0.2 = coord(5/25)
    
  2. Bandaragoda, T.R.; Silva, D. de; Alahakoon, D.: Automatic event detection in microblogs using incremental machine learning (2017) 0.27
    0.27118197 = sum of:
      0.27118197 = product of:
        1.3559098 = sum of:
          0.011114962 = weight(abstract_txt:such in 4826) [ClassicSimilarity], result of:
            0.011114962 = score(doc=4826,freq=2.0), product of:
              0.036758576 = queryWeight, product of:
                3.42101 = idf(docFreq=3945, maxDocs=44421)
                0.010744948 = queryNorm
              0.3023774 = fieldWeight in 4826, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.42101 = idf(docFreq=3945, maxDocs=44421)
                0.0625 = fieldNorm(doc=4826)
          0.032323528 = weight(abstract_txt:twitter in 4826) [ClassicSimilarity], result of:
            0.032323528 = score(doc=4826,freq=1.0), product of:
              0.07489149 = queryWeight, product of:
                1.0093038 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.010744948 = queryNorm
              0.4316048 = fieldWeight in 4826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.0625 = fieldNorm(doc=4826)
          0.12735735 = weight(abstract_txt:microblogging in 4826) [ClassicSimilarity], result of:
            0.12735735 = score(doc=4826,freq=1.0), product of:
              0.23538528 = queryWeight, product of:
                2.5305233 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.010744948 = queryNorm
              0.5410591 = fieldWeight in 4826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.0625 = fieldNorm(doc=4826)
          0.3259783 = weight(abstract_txt:microblogs in 4826) [ClassicSimilarity], result of:
            0.3259783 = score(doc=4826,freq=1.0), product of:
              0.55492264 = queryWeight, product of:
                5.4948 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.010744948 = queryNorm
              0.5874302 = fieldWeight in 4826, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=4826)
          0.8591357 = weight(abstract_txt:microblog in 4826) [ClassicSimilarity], result of:
            0.8591357 = score(doc=4826,freq=5.0), product of:
              0.66700304 = queryWeight, product of:
                6.7352633 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.010744948 = queryNorm
              1.2880536 = fieldWeight in 4826, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=4826)
        0.2 = coord(5/25)
    
  3. Sin, S.-C.J.: Social media and problematic everyday life information-seeking outcomes : differences across use frequency, gender, and problem-solving styles (2016) 0.22
    0.22468448 = sum of:
      0.22468448 = product of:
        0.93618536 = sum of:
          0.007859466 = weight(abstract_txt:such in 4043) [ClassicSimilarity], result of:
            0.007859466 = score(doc=4043,freq=1.0), product of:
              0.036758576 = queryWeight, product of:
                3.42101 = idf(docFreq=3945, maxDocs=44421)
                0.010744948 = queryNorm
              0.21381313 = fieldWeight in 4043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.42101 = idf(docFreq=3945, maxDocs=44421)
                0.0625 = fieldNorm(doc=4043)
          0.009318795 = weight(abstract_txt:information in 4043) [ClassicSimilarity], result of:
            0.009318795 = score(doc=4043,freq=5.0), product of:
              0.0275662 = queryWeight, product of:
                1.0606077 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.010744948 = queryNorm
              0.3380515 = fieldWeight in 4043, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=4043)
          0.0051929923 = weight(abstract_txt:that in 4043) [ClassicSimilarity], result of:
            0.0051929923 = score(doc=4043,freq=1.0), product of:
              0.035133258 = queryWeight, product of:
                1.3825947 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.010744948 = queryNorm
              0.14780845 = fieldWeight in 4043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=4043)
          0.044470776 = weight(abstract_txt:media in 4043) [ClassicSimilarity], result of:
            0.044470776 = score(doc=4043,freq=3.0), product of:
              0.08092935 = queryWeight, product of:
                1.4837943 = boost
                5.076075 = idf(docFreq=753, maxDocs=44421)
                0.010744948 = queryNorm
              0.54950124 = fieldWeight in 4043, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.076075 = idf(docFreq=753, maxDocs=44421)
                0.0625 = fieldNorm(doc=4043)
          0.3259783 = weight(abstract_txt:microblogs in 4043) [ClassicSimilarity], result of:
            0.3259783 = score(doc=4043,freq=1.0), product of:
              0.55492264 = queryWeight, product of:
                5.4948 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.010744948 = queryNorm
              0.5874302 = fieldWeight in 4043, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=4043)
          0.54336506 = weight(abstract_txt:microblog in 4043) [ClassicSimilarity], result of:
            0.54336506 = score(doc=4043,freq=2.0), product of:
              0.66700304 = queryWeight, product of:
                6.7352633 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.010744948 = queryNorm
              0.8146366 = fieldWeight in 4043, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=4043)
        0.24 = coord(6/25)
    
  4. Xu, L.; Qiu, J.: Unsupervised multi-class sentiment classification approach (2019) 0.14
    0.13839802 = sum of:
      0.13839802 = product of:
        0.8649876 = sum of:
          0.12557435 = weight(abstract_txt:sentiment in 3) [ClassicSimilarity], result of:
            0.12557435 = score(doc=3,freq=9.0), product of:
              0.08897605 = queryWeight, product of:
                1.1001254 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.010744948 = queryNorm
              1.4113276 = fieldWeight in 3, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=3)
          0.029217774 = weight(abstract_txt:called in 3) [ClassicSimilarity], result of:
            0.029217774 = score(doc=3,freq=1.0), product of:
              0.08821208 = queryWeight, product of:
                1.5491186 = boost
                5.2995505 = idf(docFreq=602, maxDocs=44421)
                0.010744948 = queryNorm
              0.3312219 = fieldWeight in 3, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2995505 = idf(docFreq=602, maxDocs=44421)
                0.0625 = fieldNorm(doc=3)
          0.3259783 = weight(abstract_txt:microblogs in 3) [ClassicSimilarity], result of:
            0.3259783 = score(doc=3,freq=1.0), product of:
              0.55492264 = queryWeight, product of:
                5.4948 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.010744948 = queryNorm
              0.5874302 = fieldWeight in 3, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=3)
          0.38421714 = weight(abstract_txt:microblog in 3) [ClassicSimilarity], result of:
            0.38421714 = score(doc=3,freq=1.0), product of:
              0.66700304 = queryWeight, product of:
                6.7352633 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.010744948 = queryNorm
              0.5760351 = fieldWeight in 3, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=3)
        0.16 = coord(4/25)
    
  5. Moulahi, B.; Tamine, L.; Yahia, S.B.: iAggregator: multidimensional relevance aggregation based on a fuzzy operator (2014) 0.13
    0.12950689 = sum of:
      0.12950689 = product of:
        0.4625246 = sum of:
          0.007859466 = weight(abstract_txt:such in 2501) [ClassicSimilarity], result of:
            0.007859466 = score(doc=2501,freq=1.0), product of:
              0.036758576 = queryWeight, product of:
                3.42101 = idf(docFreq=3945, maxDocs=44421)
                0.010744948 = queryNorm
              0.21381313 = fieldWeight in 2501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.42101 = idf(docFreq=3945, maxDocs=44421)
                0.0625 = fieldNorm(doc=2501)
          0.0041674916 = weight(abstract_txt:information in 2501) [ClassicSimilarity], result of:
            0.0041674916 = score(doc=2501,freq=1.0), product of:
              0.0275662 = queryWeight, product of:
                1.0606077 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.010744948 = queryNorm
              0.15118122 = fieldWeight in 2501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=2501)
          0.0051929923 = weight(abstract_txt:that in 2501) [ClassicSimilarity], result of:
            0.0051929923 = score(doc=2501,freq=1.0), product of:
              0.035133258 = queryWeight, product of:
                1.3825947 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.010744948 = queryNorm
              0.14780845 = fieldWeight in 2501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2501)
          0.017496975 = weight(abstract_txt:retrieval in 2501) [ClassicSimilarity], result of:
            0.017496975 = score(doc=2501,freq=2.0), product of:
              0.056941085 = queryWeight, product of:
                1.5243306 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.010744948 = queryNorm
              0.3072821 = fieldWeight in 2501, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=2501)
          0.029217774 = weight(abstract_txt:called in 2501) [ClassicSimilarity], result of:
            0.029217774 = score(doc=2501,freq=1.0), product of:
              0.08821208 = queryWeight, product of:
                1.5491186 = boost
                5.2995505 = idf(docFreq=602, maxDocs=44421)
                0.010744948 = queryNorm
              0.3312219 = fieldWeight in 2501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2995505 = idf(docFreq=602, maxDocs=44421)
                0.0625 = fieldNorm(doc=2501)
          0.014372756 = weight(abstract_txt:search in 2501) [ClassicSimilarity], result of:
            0.014372756 = score(doc=2501,freq=1.0), product of:
              0.06292459 = queryWeight, product of:
                1.6024206 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.010744948 = queryNorm
              0.22841237 = fieldWeight in 2501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=2501)
          0.38421714 = weight(abstract_txt:microblog in 2501) [ClassicSimilarity], result of:
            0.38421714 = score(doc=2501,freq=1.0), product of:
              0.66700304 = queryWeight, product of:
                6.7352633 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.010744948 = queryNorm
              0.5760351 = fieldWeight in 2501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=2501)
        0.28 = coord(7/25)