Document (#44298)

Wang, Z.
He, H.
Xu, G.
Ren, M.
Will sentiment analysis need subculture? : a new data augmentation approach
Journal of the Association for Information Science and Technology. 75(2024) no.6, pp. 655-670
Nowadays, the omnipresence of the Internet has fostered a subculture that congregates around the contemporary milieu. The subculture artfully articulates the intricacies of human feelings by ardently pursuing the allure of novelty, a fact that cannot be disregarded in sentiment analysis. This paper aims to enrich data through the lens of subculture, to address the shortage of training data faced by sentiment analysis. To this end, a new approach of subculture-based data augmentation (SCDA) is proposed, which generates enhanced texts for each training text by creating specific subcultural expression generators. Extensive experiments attest to the effectiveness and potential of SCDA. The results also shed light on the phenomenon that disparate subcultural expressions elicit varying degrees of sentiment stimulation. Moreover, an intriguing conjecture arises, suggesting the linear reversibility of certain subcultural expressions.

Similar documents (author)

  1. Wang, H.; Wang, C.: Ontologies for universal information systems (1995) 4.62
    4.6221313 = sum of:
      4.6221313 = weight(author_txt:wang in 3262) [ClassicSimilarity], result of:
        4.6221313 = score(doc=3262,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.15298282 = queryNorm
          4.622132 = fieldWeight in 3262, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.5 = fieldNorm(doc=3262)
  2. Wang, F.; Wang, X.: Tracing theory diffusion : a text mining and citation-based analysis of TAM (2020) 4.62
    4.6221313 = sum of:
      4.6221313 = weight(author_txt:wang in 980) [ClassicSimilarity], result of:
        4.6221313 = score(doc=980,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.15298282 = queryNorm
          4.622132 = fieldWeight in 980, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.5 = fieldNorm(doc=980)
  3. Wang, C.: The online catalogue, subject access and user reactions : a review (1985) 4.09
    4.0854254 = sum of:
      4.0854254 = weight(author_txt:wang in 985) [ClassicSimilarity], result of:
        4.0854254 = score(doc=985,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.15298282 = queryNorm
          4.085426 = fieldWeight in 985, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.625 = fieldNorm(doc=985)
  4. Wang, C.: Bibliometrics : a textbook (1990) 4.09
    4.0854254 = sum of:
      4.0854254 = weight(author_txt:wang in 5108) [ClassicSimilarity], result of:
        4.0854254 = score(doc=5108,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.15298282 = queryNorm
          4.085426 = fieldWeight in 5108, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.625 = fieldNorm(doc=5108)
  5. Wang, P.: Users' information needs at different stages of a research project : a cognitive view (1997) 4.09
    4.0854254 = sum of:
      4.0854254 = weight(author_txt:wang in 1320) [ClassicSimilarity], result of:
        4.0854254 = score(doc=1320,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.15298282 = queryNorm
          4.085426 = fieldWeight in 1320, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.625 = fieldNorm(doc=1320)
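
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming the standard ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), and a single unboosted query term so that queryNorm = 1 / idf. The function names are illustrative and not part of Lucene.

```python
import math


def classic_tf(freq: float) -> float:
    # Term-frequency factor: square root of the raw term frequency.
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency as it appears in the explain output.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def single_term_score(freq: float, doc_freq: int, max_docs: int,
                      field_norm: float) -> float:
    # Assumption: one query term, boost 1.0, so queryNorm = 1 / sqrt(idf^2).
    idf = classic_idf(doc_freq, max_docs)
    query_norm = 1.0 / idf
    query_weight = idf * query_norm                      # close to 1.0
    field_weight = classic_tf(freq) * idf * field_norm   # tf * idf * fieldNorm
    return query_weight * field_weight


# Values taken from the entries for doc 3262 and doc 985 above.
score_3262 = single_term_score(freq=2.0, doc_freq=174, max_docs=44421, field_norm=0.5)
score_985 = single_term_score(freq=1.0, doc_freq=174, max_docs=44421, field_norm=0.625)
print(score_3262, score_985)
```

The results should agree with the listed 4.6221313 and 4.0854254 up to the single-precision rounding Lucene applies. The difference between the two groups comes from tf (the name "Wang" occurs twice in the author field of entries 1 and 2) and from fieldNorm, which is larger for shorter fields.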

Similar documents (content)

  1. Xiang, R.; Chersoni, E.; Lu, Q.; Huang, C.-R.; Li, W.; Long, Y.: Lexical data augmentation for sentiment analysis (2021) 0.19
    0.19442613 = sum of:
      0.19442613 = product of:
        0.97213066 = sum of:
          0.031938825 = weight(abstract_txt:training in 1393) [ClassicSimilarity], result of:
            0.031938825 = score(doc=1393,freq=1.0), product of:
              0.11441759 = queryWeight, product of:
                1.3360307 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.016777927 = queryNorm
              0.27914262 = fieldWeight in 1393, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1393)
          0.03491081 = weight(abstract_txt:analysis in 1393) [ClassicSimilarity], result of:
            0.03491081 = score(doc=1393,freq=4.0), product of:
              0.08755156 = queryWeight, product of:
                1.4313558 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.016777927 = queryNorm
              0.39874572 = fieldWeight in 1393, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1393)
          0.046935853 = weight(abstract_txt:data in 1393) [ClassicSimilarity], result of:
            0.046935853 = score(doc=1393,freq=7.0), product of:
              0.097407855 = queryWeight, product of:
                1.7433398 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.016777927 = queryNorm
              0.48184875 = fieldWeight in 1393, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1393)
          0.4486654 = weight(abstract_txt:augmentation in 1393) [ClassicSimilarity], result of:
            0.4486654 = score(doc=1393,freq=6.0), product of:
              0.3665879 = queryWeight, product of:
                2.391438 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.016777927 = queryNorm
              1.2238958 = fieldWeight in 1393, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1393)
          0.40967974 = weight(abstract_txt:sentiment in 1393) [ClassicSimilarity], result of:
            0.40967974 = score(doc=1393,freq=4.0), product of:
              0.4976223 = queryWeight, product of:
                3.9403515 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.016777927 = queryNorm
              0.82327443 = fieldWeight in 1393, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1393)
        0.2 = coord(5/25)
  2. Yu, N.: Exploring co-training strategies for opinion detection (2014) 0.13
    0.13338757 = sum of:
      0.13338757 = product of:
        0.83367234 = sum of:
          0.073003024 = weight(abstract_txt:training in 2503) [ClassicSimilarity], result of:
            0.073003024 = score(doc=2503,freq=4.0), product of:
              0.11441759 = queryWeight, product of:
                1.3360307 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.016777927 = queryNorm
              0.63804024 = fieldWeight in 2503, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
          0.048864953 = weight(abstract_txt:analysis in 2503) [ClassicSimilarity], result of:
            0.048864953 = score(doc=2503,freq=6.0), product of:
              0.08755156 = queryWeight, product of:
                1.4313558 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.016777927 = queryNorm
              0.55812776 = fieldWeight in 2503, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
          0.049661893 = weight(abstract_txt:data in 2503) [ClassicSimilarity], result of:
            0.049661893 = score(doc=2503,freq=6.0), product of:
              0.097407855 = queryWeight, product of:
                1.7433398 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.016777927 = queryNorm
              0.5098346 = fieldWeight in 2503, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
          0.66214246 = weight(abstract_txt:sentiment in 2503) [ClassicSimilarity], result of:
            0.66214246 = score(doc=2503,freq=8.0), product of:
              0.4976223 = queryWeight, product of:
                3.9403515 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.016777927 = queryNorm
              1.3306124 = fieldWeight in 2503, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=2503)
        0.16 = coord(4/25)
  3. Chung, W.; Zeng, D.: Social-media-based public policy informatics : sentiment and network analyses of U.S. immigration and border security (2016) 0.11
    0.111880824 = sum of:
      0.111880824 = product of:
        0.69925517 = sum of:
          0.019949034 = weight(abstract_txt:analysis in 4969) [ClassicSimilarity], result of:
            0.019949034 = score(doc=4969,freq=1.0), product of:
              0.08755156 = queryWeight, product of:
                1.4313558 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.016777927 = queryNorm
              0.2278547 = fieldWeight in 4969, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=4969)
          0.020274382 = weight(abstract_txt:data in 4969) [ClassicSimilarity], result of:
            0.020274382 = score(doc=4969,freq=1.0), product of:
              0.097407855 = queryWeight, product of:
                1.7433398 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.016777927 = queryNorm
              0.20813909 = fieldWeight in 4969, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=4969)
          0.08559955 = weight(abstract_txt:expressions in 4969) [ClassicSimilarity], result of:
            0.08559955 = score(doc=4969,freq=1.0), product of:
              0.20196055 = queryWeight, product of:
                1.7750202 = boost
                6.7814865 = idf(docFreq=136, maxDocs=44421)
                0.016777927 = queryNorm
              0.4238429 = fieldWeight in 4969, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7814865 = idf(docFreq=136, maxDocs=44421)
                0.0625 = fieldNorm(doc=4969)
          0.5734322 = weight(abstract_txt:sentiment in 4969) [ClassicSimilarity], result of:
            0.5734322 = score(doc=4969,freq=6.0), product of:
              0.4976223 = queryWeight, product of:
                3.9403515 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.016777927 = queryNorm
              1.1523442 = fieldWeight in 4969, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=4969)
        0.16 = coord(4/25)
  4. Wei, W.; Liu, Y.-P.; Wei, L-R.: Feature-level sentiment analysis based on rules and fine-grained domain ontology (2020) 0.11
    0.11092947 = sum of:
      0.11092947 = product of:
        0.92441225 = sum of:
          0.043190926 = weight(abstract_txt:analysis in 876) [ClassicSimilarity], result of:
            0.043190926 = score(doc=876,freq=3.0), product of:
              0.08755156 = queryWeight, product of:
                1.4313558 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.016777927 = queryNorm
              0.4933199 = fieldWeight in 876, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.078125 = fieldNorm(doc=876)
          0.106999435 = weight(abstract_txt:expressions in 876) [ClassicSimilarity], result of:
            0.106999435 = score(doc=876,freq=1.0), product of:
              0.20196055 = queryWeight, product of:
                1.7750202 = boost
                6.7814865 = idf(docFreq=136, maxDocs=44421)
                0.016777927 = queryNorm
              0.52980363 = fieldWeight in 876, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7814865 = idf(docFreq=136, maxDocs=44421)
                0.078125 = fieldNorm(doc=876)
          0.7742219 = weight(abstract_txt:sentiment in 876) [ClassicSimilarity], result of:
            0.7742219 = score(doc=876,freq=7.0), product of:
              0.4976223 = queryWeight, product of:
                3.9403515 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.016777927 = queryNorm
              1.5558424 = fieldWeight in 876, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.078125 = fieldNorm(doc=876)
        0.12 = coord(3/25)
  5. Thelwall, M.; Buckley, K.; Paltoglou, G.: Sentiment strength detection for the social web (2012) 0.10
    0.09642273 = sum of:
      0.09642273 = product of:
        0.80352277 = sum of:
          0.034552738 = weight(abstract_txt:analysis in 972) [ClassicSimilarity], result of:
            0.034552738 = score(doc=972,freq=3.0), product of:
              0.08755156 = queryWeight, product of:
                1.4313558 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.016777927 = queryNorm
              0.3946559 = fieldWeight in 972, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=972)
          0.028672306 = weight(abstract_txt:data in 972) [ClassicSimilarity], result of:
            0.028672306 = score(doc=972,freq=2.0), product of:
              0.097407855 = queryWeight, product of:
                1.7433398 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.016777927 = queryNorm
              0.29435313 = fieldWeight in 972, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=972)
          0.74029773 = weight(abstract_txt:sentiment in 972) [ClassicSimilarity], result of:
            0.74029773 = score(doc=972,freq=10.0), product of:
              0.4976223 = queryWeight, product of:
                3.9403515 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.016777927 = queryNorm
              1.48767 = fieldWeight in 972, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=972)
        0.12 = coord(3/25)
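
The content scores combine several boosted terms and a coordination factor. The sketch below recomputes the score of entry 1 (doc 1393) under the same assumed ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); boosts, queryNorm, fieldNorm and the 5/25 coord are copied from the explain output, and the function names are illustrative only.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.016777927   # copied from the explain output
FIELD_NORM = 0.0546875     # fieldNorm(doc=1393)


def term_score(freq: float, doc_freq: int, boost: float) -> float:
    # queryWeight = boost * idf * queryNorm; fieldWeight = tf * idf * fieldNorm
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


def document_score(terms, matched: int, total_clauses: int) -> float:
    # Sum the per-term scores, then scale by coord = matched / total clauses.
    return sum(term_score(*t) for t in terms) * (matched / total_clauses)


# (freq, docFreq, boost) for each query term that matched doc 1393
terms_1393 = [
    (1.0, 732, 1.3360307),    # training
    (4.0, 3151, 1.4313558),   # analysis
    (7.0, 4320, 1.7433398),   # data
    (6.0, 12, 2.391438),      # augmentation
    (4.0, 64, 3.9403515),     # sentiment
]

score_1393 = document_score(terms_1393, matched=5, total_clauses=25)
print(score_1393)  # compare with the listed 0.19442613
```

The coord factor explains why entries that match fewer of the 25 query terms rank lower even when a single term such as "sentiment" contributes a large weight.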