Document (#42092)

Author
Malik, M.S.I.
Hussain, A.
Title
¬An analysis of review content and reviewer variables that contribute to review helpfulness
Source
Information processing and management. 54(2018) no.1, pp. 88-104
Year
2018
Abstract
Review helpfulness is attracting increasing attention from practitioners and academics. It helps reduce the risk and uncertainty that users face in online shopping. This study examines previously uninvestigated variables, looking not only at review characteristics but also at important indicators of reviewers. Several significant review content variables and two reviewer variables are proposed, and an effective review helpfulness prediction model is built using the stochastic gradient boosting learning method. The study derives a mechanism to extract novel review content variables from review text. Six popular machine learning models and three real-life Amazon review data sets are used for analysis. The results are robust across several product categories and across the three Amazon review data sets. They show that, as a standalone model, the review content variables deliver the best performance compared with the reviewer variables and the state-of-the-art baseline. The study finds that reviewer helpfulness per day and syllables in review text strongly relate to review helpfulness. Moreover, the number of spaces, auxiliary verbs, and drives words in review text and the productivity score of a reviewer are also effective predictors of review helpfulness. The findings will help customers write better reviews, help retailers manage their websites intelligently, and aid customers in their product purchasing decisions.
Content
Cf.: https://doi.org/10.1016/j.ipm.2017.09.004.

Similar documents (author)

  1. Partridge, D.; Hussain, K.M.: Knowledge-based information systems (1994) 4.95
    4.954854 = sum of:
      4.954854 = weight(author_txt:hussain in 1728) [ClassicSimilarity], result of:
        4.954854 = fieldWeight in 1728, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.5 = fieldNorm(doc=1728)
    
  2. Hussain, K.H.; Rajeev, J.S.: ¬The changing language technology and CDS/ISIS : UNICODE and the emergence of OTF (2006) 4.95
    4.954854 = sum of:
      4.954854 = weight(author_txt:hussain in 2496) [ClassicSimilarity], result of:
        4.954854 = fieldWeight in 2496, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.5 = fieldNorm(doc=2496)
    
  3. Pinfield, S.; Salter, J.; Bath, P.A.; Hubbard, B.; Millington, P.; Anders, J.H.S.; Hussain, A.: Open-access repositories worldwide, 2005-2012 : past growth, current characteristics, and future possibilities (2014) 2.48
    2.477427 = sum of:
      2.477427 = weight(author_txt:hussain in 2542) [ClassicSimilarity], result of:
        2.477427 = fieldWeight in 2542, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.25 = fieldNorm(doc=2542)
    
  4. Sturges, P.; Bamkin, M.; Anders, J.H.S.; Hubbard, B.; Hussain, A.; Heeley, M.: Research data sharing : developing a stakeholder-driven model for journal policies (2015) 2.48
    2.477427 = sum of:
      2.477427 = weight(author_txt:hussain in 3330) [ClassicSimilarity], result of:
        2.477427 = fieldWeight in 3330, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.909708 = idf(docFreq=5, maxDocs=44421)
          0.25 = fieldNorm(doc=3330)
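
The author-match scores above can be reproduced by hand. The following is a minimal sketch, assuming the explain output follows the standard Lucene ClassicSimilarity definitions (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))). The function names are illustrative and are not part of the catalog software.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as defined by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int,
                 field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the explain output for author_txt:hussain.
idf = classic_idf(doc_freq=5, max_docs=44421)
w_half = field_weight(1.0, 5, 44421, field_norm=0.5)      # entries 1 and 2
w_quarter = field_weight(1.0, 5, 44421, field_norm=0.25)  # entries 3 and 4

print(f"idf={idf:.6f} weight(0.5)={w_half:.6f} weight(0.25)={w_quarter:.6f}")
```

Under these assumptions, only fieldNorm differs between the four entries, which is why records with longer author lists (entries 3 and 4) score half as high as entries 1 and 2.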
    

Similar documents (content)

  1. Chua, A.Y.K.; Banerjee, S.: Understanding review helpfulness as a function of reviewer reputation, review rating, and review depth (2015) 0.31
    0.31416178 = sum of:
      0.31416178 = product of:
        1.9635112 = sum of:
          0.11254096 = weight(abstract_txt:amazon in 2641) [ClassicSimilarity], result of:
            0.11254096 = score(doc=2641,freq=1.0), product of:
              0.14981887 = queryWeight, product of:
                1.9064811 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.009807563 = queryNorm
              0.7511802 = fieldWeight in 2641, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.09375 = fieldNorm(doc=2641)
          0.56262803 = weight(abstract_txt:reviewer in 2641) [ClassicSimilarity], result of:
            0.56262803 = score(doc=2641,freq=2.0), product of:
              0.47185692 = queryWeight, product of:
                5.3496385 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.009807563 = queryNorm
              1.1923699 = fieldWeight in 2641, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.09375 = fieldNorm(doc=2641)
          0.796858 = weight(abstract_txt:helpfulness in 2641) [ClassicSimilarity], result of:
            0.796858 = score(doc=2641,freq=2.0), product of:
              0.6323786 = queryWeight, product of:
                6.7841973 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.009807563 = queryNorm
              1.2600964 = fieldWeight in 2641, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.09375 = fieldNorm(doc=2641)
          0.49148414 = weight(abstract_txt:review in 2641) [ClassicSimilarity], result of:
            0.49148414 = score(doc=2641,freq=7.0), product of:
              0.4095939 = queryWeight, product of:
                8.632899 = boost
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.009807563 = queryNorm
              1.1999303 = fieldWeight in 2641, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.09375 = fieldNorm(doc=2641)
        0.16 = coord(4/25)
    
  2. Tay, W.; Zhang, X.; Karimi, S.: Beyond mean rating : probabilistic aggregation of star ratings based on helpfulness (2020) 0.12
    0.1194593 = sum of:
      0.1194593 = product of:
        0.74662066 = sum of:
          0.016512314 = weight(abstract_txt:effective in 917) [ClassicSimilarity], result of:
            0.016512314 = score(doc=917,freq=1.0), product of:
              0.05461252 = queryWeight, product of:
                1.1510532 = boost
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.009807563 = queryNorm
              0.302354 = fieldWeight in 917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.0625 = fieldNorm(doc=917)
          0.07502731 = weight(abstract_txt:amazon in 917) [ClassicSimilarity], result of:
            0.07502731 = score(doc=917,freq=1.0), product of:
              0.14981887 = queryWeight, product of:
                1.9064811 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.009807563 = queryNorm
              0.5007868 = fieldWeight in 917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0625 = fieldNorm(doc=917)
          0.5312387 = weight(abstract_txt:helpfulness in 917) [ClassicSimilarity], result of:
            0.5312387 = score(doc=917,freq=2.0), product of:
              0.6323786 = queryWeight, product of:
                6.7841973 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.009807563 = queryNorm
              0.8400643 = fieldWeight in 917, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.0625 = fieldNorm(doc=917)
          0.12384236 = weight(abstract_txt:review in 917) [ClassicSimilarity], result of:
            0.12384236 = score(doc=917,freq=1.0), product of:
              0.4095939 = queryWeight, product of:
                8.632899 = boost
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.009807563 = queryNorm
              0.302354 = fieldWeight in 917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.0625 = fieldNorm(doc=917)
        0.16 = coord(4/25)
    
  3. García, J.A.; Rodriguez-Sánchez, R.; Fdez-Valdivia, J.: Adverse selection of reviewers (2015) 0.11
    0.105113894 = sum of:
      0.105113894 = product of:
        0.65696186 = sum of:
          0.013724022 = weight(abstract_txt:several in 2859) [ClassicSimilarity], result of:
            0.013724022 = score(doc=2859,freq=1.0), product of:
              0.048277102 = queryWeight, product of:
                1.0822307 = boost
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.009807563 = queryNorm
              0.284276 = fieldWeight in 2859, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.0625 = fieldNorm(doc=2859)
          0.008714446 = weight(abstract_txt:study in 2859) [ClassicSimilarity], result of:
            0.008714446 = score(doc=2859,freq=1.0), product of:
              0.04082666 = queryWeight, product of:
                1.2188965 = boost
                3.415198 = idf(docFreq=3968, maxDocs=44421)
                0.009807563 = queryNorm
              0.21344988 = fieldWeight in 2859, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.415198 = idf(docFreq=3968, maxDocs=44421)
                0.0625 = fieldNorm(doc=2859)
          0.45938385 = weight(abstract_txt:reviewer in 2859) [ClassicSimilarity], result of:
            0.45938385 = score(doc=2859,freq=3.0), product of:
              0.47185692 = queryWeight, product of:
                5.3496385 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.009807563 = queryNorm
              0.973566 = fieldWeight in 2859, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0625 = fieldNorm(doc=2859)
          0.17513955 = weight(abstract_txt:review in 2859) [ClassicSimilarity], result of:
            0.17513955 = score(doc=2859,freq=2.0), product of:
              0.4095939 = queryWeight, product of:
                8.632899 = boost
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.009807563 = queryNorm
              0.42759314 = fieldWeight in 2859, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.0625 = fieldNorm(doc=2859)
        0.16 = coord(4/25)
    
  4. García, J.A.; Rodriguez-Sánchez, R.; Fdez-Valdivia, J.: ¬The principal-agent problem in peer review : an interactionist perspective on everyday use of biomedical information (2015) 0.09
    0.08858428 = sum of:
      0.08858428 = product of:
        0.73820233 = sum of:
          0.02129794 = weight(abstract_txt:content in 2638) [ClassicSimilarity], result of:
            0.02129794 = score(doc=2638,freq=1.0), product of:
              0.081530854 = queryWeight, product of:
                1.9889563 = boost
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.009807563 = queryNorm
              0.26122552 = fieldWeight in 2638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.0625 = fieldNorm(doc=2638)
          0.59306204 = weight(abstract_txt:reviewer in 2638) [ClassicSimilarity], result of:
            0.59306204 = score(doc=2638,freq=5.0), product of:
              0.47185692 = queryWeight, product of:
                5.3496385 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.009807563 = queryNorm
              1.2568684 = fieldWeight in 2638, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0625 = fieldNorm(doc=2638)
          0.12384236 = weight(abstract_txt:review in 2638) [ClassicSimilarity], result of:
            0.12384236 = score(doc=2638,freq=1.0), product of:
              0.4095939 = queryWeight, product of:
                8.632899 = boost
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.009807563 = queryNorm
              0.302354 = fieldWeight in 2638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.0625 = fieldNorm(doc=2638)
        0.12 = coord(3/25)
    
  5. Mizzaro, S.: Quality control in scholarly publishing : a new proposal (2003) 0.08
    0.08466625 = sum of:
      0.08466625 = product of:
        0.5291641 = sum of:
          0.013724022 = weight(abstract_txt:several in 2810) [ClassicSimilarity], result of:
            0.013724022 = score(doc=2810,freq=1.0), product of:
              0.048277102 = queryWeight, product of:
                1.0822307 = boost
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.009807563 = queryNorm
              0.284276 = fieldWeight in 2810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.0625 = fieldNorm(doc=2810)
          0.016512314 = weight(abstract_txt:effective in 2810) [ClassicSimilarity], result of:
            0.016512314 = score(doc=2810,freq=1.0), product of:
              0.05461252 = queryWeight, product of:
                1.1510532 = boost
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.009807563 = queryNorm
              0.302354 = fieldWeight in 2810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.0625 = fieldNorm(doc=2810)
          0.37508535 = weight(abstract_txt:reviewer in 2810) [ClassicSimilarity], result of:
            0.37508535 = score(doc=2810,freq=2.0), product of:
              0.47185692 = queryWeight, product of:
                5.3496385 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.009807563 = queryNorm
              0.7949133 = fieldWeight in 2810, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0625 = fieldNorm(doc=2810)
          0.12384236 = weight(abstract_txt:review in 2810) [ClassicSimilarity], result of:
            0.12384236 = score(doc=2810,freq=1.0), product of:
              0.4095939 = queryWeight, product of:
                8.632899 = boost
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.009807563 = queryNorm
              0.302354 = fieldWeight in 2810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.837664 = idf(docFreq=956, maxDocs=44421)
                0.0625 = fieldNorm(doc=2810)
        0.16 = coord(4/25)
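
The content-similarity scores combine per-term weights with a coordination factor. As a worked check, the sketch below recomputes the score of the first entry (doc 2641) from the numbers printed in its explain tree, assuming the usual ClassicSimilarity composition: score = coord * sum(queryWeight * fieldWeight), with queryWeight = boost * idf * queryNorm and fieldWeight = sqrt(freq) * idf * fieldNorm. The helper name `term_score` is hypothetical.

```python
import math


def term_score(freq: float, idf: float, boost: float,
               query_norm: float, field_norm: float) -> float:
    """Score contribution of one matching term."""
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


QUERY_NORM = 0.009807563   # queryNorm from the explain output
FIELD_NORM = 0.09375       # fieldNorm(doc=2641)
QUERY_TERMS = 25           # denominator of coord(4/25)

# (term, freq, idf, boost) copied from the explain tree for doc 2641.
terms = [
    ("amazon", 1.0, 8.0125885, 1.9064811),
    ("reviewer", 2.0, 8.993418, 5.3496385),
    ("helpfulness", 2.0, 9.504243, 6.7841973),
    ("review", 7.0, 4.837664, 8.632899),
]

matched_sum = sum(
    term_score(freq, idf, boost, QUERY_NORM, FIELD_NORM)
    for _, freq, idf, boost in terms
)
coord = len(terms) / QUERY_TERMS   # fraction of query terms that matched
score = coord * matched_sum

print(f"sum={matched_sum:.6f} coord={coord:.2f} score={score:.6f}")
```

The coord factor explains why entry 4, which matches only 3 of the 25 query terms, is scaled by 0.12 rather than 0.16.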