Document (#42092)

Author
Malik, M.S.I.
Hussain, A.
Title
¬An analysis of review content and reviewer variables that contribute to review helpfulness
Source
Information processing and management. 54(2018) no.1, S.88-104
Year
2018
Abstract
Review helpfulness is attracting increasing attention of practitioners and academics. It helps in reducing risks and uncertainty faced by users in online shopping. This study examines uninvestigated variables by looking at not only the review characteristics but also important indicators of reviewers. Several significant review content and two reviewer variables are proposed and an effective review helpfulness prediction model is built using stochastic gradient boosting learning method. This study derived a mechanism to extract novel review content variables from review text. Six popular machine learning models and three real-life Amazon review data sets are used for analysis. Our results are robust to several product categories and along three Amazon review data sets. The results show that review content variables deliver the best performance as compared to the reviewer and state-of-the-art baseline as a standalone model. This study finds that reviewer helpfulness per day and syllables in review text strongly relates to review helpfulness. Moreover, the number of space, aux verb, drives words in review text and productivity score of a reviewer are also effective predictors of review helpfulness. The findings will help customers to write better reviews, help retailers to manage their websites intelligently and aid customers in their product purchasing decisions.
Content
Vgl.: https://doi.org/10.1016/j.ipm.2017.09.004.

Similar documents (author)

  1. Partridge, D.; Hussain, K.M.: Knowledge-based information systems (1994) 4.95
    4.952564 = sum of:
      4.952564 = weight(author_txt:hussain in 1660) [ClassicSimilarity], result of:
        4.952564 = fieldWeight in 1660, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.5 = fieldNorm(doc=1660)
    
  2. Hussain, K.H.; Rajeev, J.S.: ¬The changing language technology and CDS/ ISIS : UNICODE and the emergence of OTF (2006) 4.95
    4.952564 = sum of:
      4.952564 = weight(author_txt:hussain in 1496) [ClassicSimilarity], result of:
        4.952564 = fieldWeight in 1496, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.5 = fieldNorm(doc=1496)
    
  3. Pinfield, S.; Salter, J.; Bath, P.A.; Hubbard, B.; Millington, P.; Anders, J.H.S.; Hussain, A.: Open-access repositories worldwide, 2005-2012 : past growth, current characteristics, and future possibilities (2014) 2.48
    2.476282 = sum of:
      2.476282 = weight(author_txt:hussain in 1542) [ClassicSimilarity], result of:
        2.476282 = fieldWeight in 1542, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.25 = fieldNorm(doc=1542)
    
  4. Sturges, P.; Bamkin, M.; Anders, J.H.S.; Hubbard, B.; Hussain, A.; Heeley, M.: Research data sharing : developing a stakeholder-driven model for journal policies (2015) 2.48
    2.476282 = sum of:
      2.476282 = weight(author_txt:hussain in 2330) [ClassicSimilarity], result of:
        2.476282 = fieldWeight in 2330, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.905128 = idf(docFreq=5, maxDocs=44218)
          0.25 = fieldNorm(doc=2330)
    

Similar documents (content)

  1. Chua, A.Y.K.; Banerjee, S.: Understanding review helpfulness as a function of reviewer reputation, review rating, and review depth (2015) 0.31
    0.31459293 = sum of:
      0.31459293 = product of:
        1.9662058 = sum of:
          0.11226391 = weight(abstract_txt:amazon in 1641) [ClassicSimilarity], result of:
            0.11226391 = score(doc=1641,freq=1.0), product of:
              0.14953552 = queryWeight, product of:
                1.9064301 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.009794877 = queryNorm
              0.7507508 = fieldWeight in 1641, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.09375 = fieldNorm(doc=1641)
          0.5613481 = weight(abstract_txt:reviewer in 1641) [ClassicSimilarity], result of:
            0.5613481 = score(doc=1641,freq=2.0), product of:
              0.47102335 = queryWeight, product of:
                5.349829 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.009794877 = queryNorm
              1.1917627 = fieldWeight in 1641, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.09375 = fieldNorm(doc=1641)
          0.7951104 = weight(abstract_txt:helpfulness in 1641) [ClassicSimilarity], result of:
            0.7951104 = score(doc=1641,freq=2.0), product of:
              0.6312959 = queryWeight, product of:
                6.7846246 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.009794877 = queryNorm
              1.2594892 = fieldWeight in 1641, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.09375 = fieldNorm(doc=1641)
          0.4974835 = weight(abstract_txt:review in 1641) [ClassicSimilarity], result of:
            0.4974835 = score(doc=1641,freq=7.0), product of:
              0.4128172 = queryWeight, product of:
                8.674775 = boost
                4.858482 = idf(docFreq=932, maxDocs=44218)
                0.009794877 = queryNorm
              1.2050939 = fieldWeight in 1641, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.858482 = idf(docFreq=932, maxDocs=44218)
                0.09375 = fieldNorm(doc=1641)
        0.16 = coord(4/25)
    
  2. Tay, W.; Zhang, X.; Karimi , S.: Beyond mean rating : probabilistic aggregation of star ratings based on helpfulness (2020) 0.12
    0.11948087 = sum of:
      0.11948087 = product of:
        0.7467555 = sum of:
          0.016485212 = weight(abstract_txt:effective in 5917) [ClassicSimilarity], result of:
            0.016485212 = score(doc=5917,freq=1.0), product of:
              0.05453912 = queryWeight, product of:
                1.1513379 = boost
                4.8362236 = idf(docFreq=953, maxDocs=44218)
                0.009794877 = queryNorm
              0.30226398 = fieldWeight in 5917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8362236 = idf(docFreq=953, maxDocs=44218)
                0.0625 = fieldNorm(doc=5917)
          0.0748426 = weight(abstract_txt:amazon in 5917) [ClassicSimilarity], result of:
            0.0748426 = score(doc=5917,freq=1.0), product of:
              0.14953552 = queryWeight, product of:
                1.9064301 = boost
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.009794877 = queryNorm
              0.5005005 = fieldWeight in 5917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.008008 = idf(docFreq=39, maxDocs=44218)
                0.0625 = fieldNorm(doc=5917)
          0.5300736 = weight(abstract_txt:helpfulness in 5917) [ClassicSimilarity], result of:
            0.5300736 = score(doc=5917,freq=2.0), product of:
              0.6312959 = queryWeight, product of:
                6.7846246 = boost
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.009794877 = queryNorm
              0.83965945 = fieldWeight in 5917, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.0625 = fieldNorm(doc=5917)
          0.12535405 = weight(abstract_txt:review in 5917) [ClassicSimilarity], result of:
            0.12535405 = score(doc=5917,freq=1.0), product of:
              0.4128172 = queryWeight, product of:
                8.674775 = boost
                4.858482 = idf(docFreq=932, maxDocs=44218)
                0.009794877 = queryNorm
              0.30365512 = fieldWeight in 5917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.858482 = idf(docFreq=932, maxDocs=44218)
                0.0625 = fieldNorm(doc=5917)
        0.16 = coord(4/25)
    
  3. García, J.A.; Rodriguez-Sánchez, R.; Fdez-Valdivia, J.: Adverse selection of reviewers (2015) 0.11
    0.10529566 = sum of:
      0.10529566 = product of:
        0.65809786 = sum of:
          0.013707768 = weight(abstract_txt:several in 1859) [ClassicSimilarity], result of:
            0.013707768 = score(doc=1859,freq=1.0), product of:
              0.04822693 = queryWeight, product of:
                1.0826635 = boost
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.009794877 = queryNorm
              0.28423473 = fieldWeight in 1859, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.0625 = fieldNorm(doc=1859)
          0.0087739285 = weight(abstract_txt:study in 1859) [ClassicSimilarity], result of:
            0.0087739285 = score(doc=1859,freq=1.0), product of:
              0.04100199 = queryWeight, product of:
                1.2226349 = boost
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.009794877 = queryNorm
              0.21398787 = fieldWeight in 1859, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.423806 = idf(docFreq=3916, maxDocs=44218)
                0.0625 = fieldNorm(doc=1859)
          0.45833877 = weight(abstract_txt:reviewer in 1859) [ClassicSimilarity], result of:
            0.45833877 = score(doc=1859,freq=3.0), product of:
              0.47102335 = queryWeight, product of:
                5.349829 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.009794877 = queryNorm
              0.97307014 = fieldWeight in 1859, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=1859)
          0.1772774 = weight(abstract_txt:review in 1859) [ClassicSimilarity], result of:
            0.1772774 = score(doc=1859,freq=2.0), product of:
              0.4128172 = queryWeight, product of:
                8.674775 = boost
                4.858482 = idf(docFreq=932, maxDocs=44218)
                0.009794877 = queryNorm
              0.42943317 = fieldWeight in 1859, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.858482 = idf(docFreq=932, maxDocs=44218)
                0.0625 = fieldNorm(doc=1859)
        0.16 = coord(4/25)
    
  4. García, J.A.; Rodriguez-Sánchez, R.; Fdez-Valdivia, J.: ¬The principal-agent problem in peer review : an interactionist perspective on everyday use of biomedical information (2015) 0.09
    0.088602416 = sum of:
      0.088602416 = product of:
        0.7383535 = sum of:
          0.021286596 = weight(abstract_txt:content in 1638) [ClassicSimilarity], result of:
            0.021286596 = score(doc=1638,freq=1.0), product of:
              0.081481546 = queryWeight, product of:
                1.9901845 = boost
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.009794877 = queryNorm
              0.2612444 = fieldWeight in 1638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.17991 = idf(docFreq=1838, maxDocs=44218)
                0.0625 = fieldNorm(doc=1638)
          0.59171283 = weight(abstract_txt:reviewer in 1638) [ClassicSimilarity], result of:
            0.59171283 = score(doc=1638,freq=5.0), product of:
              0.47102335 = queryWeight, product of:
                5.349829 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.009794877 = queryNorm
              1.2562282 = fieldWeight in 1638, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=1638)
          0.12535405 = weight(abstract_txt:review in 1638) [ClassicSimilarity], result of:
            0.12535405 = score(doc=1638,freq=1.0), product of:
              0.4128172 = queryWeight, product of:
                8.674775 = boost
                4.858482 = idf(docFreq=932, maxDocs=44218)
                0.009794877 = queryNorm
              0.30365512 = fieldWeight in 1638, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.858482 = idf(docFreq=932, maxDocs=44218)
                0.0625 = fieldNorm(doc=1638)
        0.12 = coord(3/25)
    
  5. Mizzaro, S.: Quality control in scholarly publishing : a new proposal (2003) 0.08
    0.08476465 = sum of:
      0.08476465 = product of:
        0.5297791 = sum of:
          0.013707768 = weight(abstract_txt:several in 1810) [ClassicSimilarity], result of:
            0.013707768 = score(doc=1810,freq=1.0), product of:
              0.04822693 = queryWeight, product of:
                1.0826635 = boost
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.009794877 = queryNorm
              0.28423473 = fieldWeight in 1810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.0625 = fieldNorm(doc=1810)
          0.016485212 = weight(abstract_txt:effective in 1810) [ClassicSimilarity], result of:
            0.016485212 = score(doc=1810,freq=1.0), product of:
              0.05453912 = queryWeight, product of:
                1.1513379 = boost
                4.8362236 = idf(docFreq=953, maxDocs=44218)
                0.009794877 = queryNorm
              0.30226398 = fieldWeight in 1810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8362236 = idf(docFreq=953, maxDocs=44218)
                0.0625 = fieldNorm(doc=1810)
          0.37423202 = weight(abstract_txt:reviewer in 1810) [ClassicSimilarity], result of:
            0.37423202 = score(doc=1810,freq=2.0), product of:
              0.47102335 = queryWeight, product of:
                5.349829 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.009794877 = queryNorm
              0.79450846 = fieldWeight in 1810, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0625 = fieldNorm(doc=1810)
          0.12535405 = weight(abstract_txt:review in 1810) [ClassicSimilarity], result of:
            0.12535405 = score(doc=1810,freq=1.0), product of:
              0.4128172 = queryWeight, product of:
                8.674775 = boost
                4.858482 = idf(docFreq=932, maxDocs=44218)
                0.009794877 = queryNorm
              0.30365512 = fieldWeight in 1810, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.858482 = idf(docFreq=932, maxDocs=44218)
                0.0625 = fieldNorm(doc=1810)
        0.16 = coord(4/25)