Document (#38817)

Author
Rubin, V.L.
Lukoianova, T.
Title
Truth and deception at the rhetorical structure level
Source
Journal of the Association for Information Science and Technology. 66(2015) no.5, S.905-917
Year
2015
Abstract
This paper furthers the development of methods to distinguish truth from deception in textual data. We use rhetorical structure theory (RST) as the analytic framework to identify systematic differences between deceptive and truthful stories in terms of their coherence and structure. A sample of 36 elicited personal stories, self-ranked as truthful or deceptive, is manually analyzed by assigning RST discourse relations among each story's constituent parts. A vector space model (VSM) assesses each story's position in multidimensional RST space with respect to its distance from truthful and deceptive centers as measures of the story's level of deception and truthfulness. Ten human judges evaluate independently whether each story is deceptive and assign their confidence levels (360 evaluations total), producing measures of the expected human ability to recognize deception. As a robustness check, a test sample of 18 truthful stories (with 180 additional evaluations) is used to determine the reliability of our RST-VSM method in determining deception. The contribution is in demonstration of the discourse structure analysis as a significant method for automated deception detection and an effective complement to lexicosemantic analysis. The potential is in developing novel discourse-based tools to alert information users to potential deception in computer-mediated texts.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23216/abstract.

Similar documents (author)

  1. Rubin, V.L.: Epistemic modality : from uncertainty to certainty in the context of information seeking as interactions with texts (2010) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:rubin in 241) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 241, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=241)
    
  2. Rubin, R.: Foundations of library and information science (2010) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:rubin in 781) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 781, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=781)
    
  3. Rubin, V.L.: Disinformation and misinformation triangle (2019) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:rubin in 462) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 462, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=462)
    
  4. Kwasnik, B.H.; Rubin, V.L.: Stretching conceptual structures in classifications across languages and cultures (2003) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:rubin in 517) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 517, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=517)
    
  5. Rubin, R.; Froehlich, T.J.: Ethical aspects of library and information science (2009) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:rubin in 765) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 765, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=765)
    

Similar documents (content)

  1. Frohnsdorff, G.: Facts? of publication : cataloging problems posed by deceptive information (1999) 0.14
    0.14404759 = sum of:
      0.14404759 = product of:
        0.9002974 = sum of:
          0.016186766 = weight(abstract_txt:potential in 233) [ClassicSimilarity], result of:
            0.016186766 = score(doc=233,freq=1.0), product of:
              0.044897668 = queryWeight, product of:
                1.2210972 = boost
                4.61473 = idf(docFreq=1195, maxDocs=44421)
                0.007967595 = queryNorm
              0.3605258 = fieldWeight in 233, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.61473 = idf(docFreq=1195, maxDocs=44421)
                0.078125 = fieldNorm(doc=233)
          0.01724965 = weight(abstract_txt:each in 233) [ClassicSimilarity], result of:
            0.01724965 = score(doc=233,freq=1.0), product of:
              0.053620927 = queryWeight, product of:
                1.6343728 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.007967595 = queryNorm
              0.32169622 = fieldWeight in 233, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.078125 = fieldNorm(doc=233)
          0.30584928 = weight(abstract_txt:deceptive in 233) [ClassicSimilarity], result of:
            0.30584928 = score(doc=233,freq=1.0), product of:
              0.4012965 = queryWeight, product of:
                5.162809 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.007967595 = queryNorm
              0.7621529 = fieldWeight in 233, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.078125 = fieldNorm(doc=233)
          0.5610117 = weight(abstract_txt:deception in 233) [ClassicSimilarity], result of:
            0.5610117 = score(doc=233,freq=1.0), product of:
              0.72463787 = queryWeight, product of:
                9.17768 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.007967595 = queryNorm
              0.7741959 = fieldWeight in 233, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.078125 = fieldNorm(doc=233)
        0.16 = coord(4/25)
    
  2. Rubin, V.L.: Disinformation and misinformation triangle (2019) 0.13
    0.1274461 = sum of:
      0.1274461 = product of:
        0.6372305 = sum of:
          0.011968137 = weight(abstract_txt:level in 462) [ClassicSimilarity], result of:
            0.011968137 = score(doc=462,freq=1.0), product of:
              0.042599853 = queryWeight, product of:
                1.1894397 = boost
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.007967595 = queryNorm
              0.28094316 = fieldWeight in 462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.0625 = fieldNorm(doc=462)
          0.013517743 = weight(abstract_txt:human in 462) [ClassicSimilarity], result of:
            0.013517743 = score(doc=462,freq=1.0), product of:
              0.0462019 = queryWeight, product of:
                1.2387061 = boost
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.007967595 = queryNorm
              0.2925798 = fieldWeight in 462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.0625 = fieldNorm(doc=462)
          0.021036247 = weight(abstract_txt:measures in 462) [ClassicSimilarity], result of:
            0.021036247 = score(doc=462,freq=1.0), product of:
              0.062044397 = queryWeight, product of:
                1.4354552 = boost
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.007967595 = queryNorm
              0.3390515 = fieldWeight in 462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.424824 = idf(docFreq=531, maxDocs=44421)
                0.0625 = fieldNorm(doc=462)
          0.34602895 = weight(abstract_txt:deceptive in 462) [ClassicSimilarity], result of:
            0.34602895 = score(doc=462,freq=2.0), product of:
              0.4012965 = queryWeight, product of:
                5.162809 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.007967595 = queryNorm
              0.86227757 = fieldWeight in 462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0625 = fieldNorm(doc=462)
          0.24467944 = weight(abstract_txt:truthful in 462) [ClassicSimilarity], result of:
            0.24467944 = score(doc=462,freq=1.0), product of:
              0.4012965 = queryWeight, product of:
                5.162809 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.007967595 = queryNorm
              0.6097223 = fieldWeight in 462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0625 = fieldNorm(doc=462)
        0.2 = coord(5/25)
    
  3. Ho, S.M.; Hancock, J.T.; Booth, C.: Ethical dilemma : deception dynamics in computer-mediated group communication (2017) 0.04
    0.04367466 = sum of:
      0.04367466 = product of:
        0.54593325 = sum of:
          0.016186766 = weight(abstract_txt:potential in 4821) [ClassicSimilarity], result of:
            0.016186766 = score(doc=4821,freq=1.0), product of:
              0.044897668 = queryWeight, product of:
                1.2210972 = boost
                4.61473 = idf(docFreq=1195, maxDocs=44421)
                0.007967595 = queryNorm
              0.3605258 = fieldWeight in 4821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.61473 = idf(docFreq=1195, maxDocs=44421)
                0.078125 = fieldNorm(doc=4821)
          0.5297465 = weight(abstract_txt:deceptive in 4821) [ClassicSimilarity], result of:
            0.5297465 = score(doc=4821,freq=3.0), product of:
              0.4012965 = queryWeight, product of:
                5.162809 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.007967595 = queryNorm
              1.3200874 = fieldWeight in 4821, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.078125 = fieldNorm(doc=4821)
        0.08 = coord(2/25)
    
  4. Wang, X.; Song, N.; Zhou, H.; Cheng, H.: ¬The representation of argumentation in scientific papers : a comparative analysis of two research areas (2022) 0.04
    0.039093897 = sum of:
      0.039093897 = product of:
        0.19546948 = sum of:
          0.011997842 = weight(abstract_txt:method in 1568) [ClassicSimilarity], result of:
            0.011997842 = score(doc=1568,freq=1.0), product of:
              0.042670313 = queryWeight, product of:
                1.1904229 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.007967595 = queryNorm
              0.2811754 = fieldWeight in 1568, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=1568)
          0.0137997195 = weight(abstract_txt:each in 1568) [ClassicSimilarity], result of:
            0.0137997195 = score(doc=1568,freq=1.0), product of:
              0.053620927 = queryWeight, product of:
                1.6343728 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.007967595 = queryNorm
              0.25735697 = fieldWeight in 1568, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.0625 = fieldNorm(doc=1568)
          0.099693105 = weight(abstract_txt:rhetorical in 1568) [ClassicSimilarity], result of:
            0.099693105 = score(doc=1568,freq=2.0), product of:
              0.13893889 = queryWeight, product of:
                2.1480792 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.007967595 = queryNorm
              0.71753204 = fieldWeight in 1568, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.0625 = fieldNorm(doc=1568)
          0.021812992 = weight(abstract_txt:structure in 1568) [ClassicSimilarity], result of:
            0.021812992 = score(doc=1568,freq=1.0), product of:
              0.08008365 = queryWeight, product of:
                2.3063505 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.007967595 = queryNorm
              0.27237758 = fieldWeight in 1568, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.0625 = fieldNorm(doc=1568)
          0.04816583 = weight(abstract_txt:discourse in 1568) [ClassicSimilarity], result of:
            0.04816583 = score(doc=1568,freq=1.0), product of:
              0.123380594 = queryWeight, product of:
                2.4791763 = boost
                6.2461467 = idf(docFreq=233, maxDocs=44421)
                0.007967595 = queryNorm
              0.39038417 = fieldWeight in 1568, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2461467 = idf(docFreq=233, maxDocs=44421)
                0.0625 = fieldNorm(doc=1568)
        0.2 = coord(5/25)
    
  5. Fox, M.J.: Medical discourse's epistemic influence on gender classification in three editions of the Dewey Decimal Classification (2014) 0.04
    0.037678298 = sum of:
      0.037678298 = product of:
        0.23548937 = sum of:
          0.025840547 = weight(abstract_txt:space in 2427) [ClassicSimilarity], result of:
            0.025840547 = score(doc=2427,freq=1.0), product of:
              0.06132697 = queryWeight, product of:
                1.4271319 = boost
                5.393369 = idf(docFreq=548, maxDocs=44421)
                0.007967595 = queryNorm
              0.42135698 = fieldWeight in 2427, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.393369 = idf(docFreq=548, maxDocs=44421)
                0.078125 = fieldNorm(doc=2427)
          0.01724965 = weight(abstract_txt:each in 2427) [ClassicSimilarity], result of:
            0.01724965 = score(doc=2427,freq=1.0), product of:
              0.053620927 = queryWeight, product of:
                1.6343728 = boost
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.007967595 = queryNorm
              0.32169622 = fieldWeight in 2427, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1177115 = idf(docFreq=1965, maxDocs=44421)
                0.078125 = fieldNorm(doc=2427)
          0.08811709 = weight(abstract_txt:rhetorical in 2427) [ClassicSimilarity], result of:
            0.08811709 = score(doc=2427,freq=1.0), product of:
              0.13893889 = queryWeight, product of:
                2.1480792 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.007967595 = queryNorm
              0.63421476 = fieldWeight in 2427, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.078125 = fieldNorm(doc=2427)
          0.10428208 = weight(abstract_txt:discourse in 2427) [ClassicSimilarity], result of:
            0.10428208 = score(doc=2427,freq=3.0), product of:
              0.123380594 = queryWeight, product of:
                2.4791763 = boost
                6.2461467 = idf(docFreq=233, maxDocs=44421)
                0.007967595 = queryNorm
              0.8452065 = fieldWeight in 2427, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2461467 = idf(docFreq=233, maxDocs=44421)
                0.078125 = fieldNorm(doc=2427)
        0.16 = coord(4/25)