Document (#33873)

Author
Fu, T.
Abbasi, A.
Chen, H.
Title
¬A hybrid approach to Web forum interactional coherence analysis
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.8, S.1195-1209
Year
2008
Abstract
Despite the rapid growth of text-based computer-mediated communication (CMC), its limitations have rendered the media highly incoherent. This poses problems for content analysis of online discourse archives. Interactional coherence analysis (ICA) attempts to accurately identify and construct CMC interaction networks. In this study, we propose the Hybrid Interactional Coherence (HIC) algorithm for identification of web forum interaction. HIC utilizes a bevy of system and linguistic features, including message header information, quotations, direct address, and lexical relations. Furthermore, several similarity-based methods including a Lexical Match Algorithm (LMA) and a sliding window method are utilized to account for interactional idiosyncrasies. Experiments results on two web forums revealed that the proposed HIC algorithm significantly outperformed comparison techniques in terms of precision, recall, and F-measure at both the forum and thread levels. Additionally, an example was used to illustrate how the improved ICA results can facilitate enhanced social network and role analysis capabilities.
Theme
Internet

Similar documents (author)

  1. Chen, Y.N.; Chen, S.J.: ¬A metadata practice of the OFLA FRBR model : a case study for the National Palace Museum in Taipai (2004) 4.34
    4.3394766 = sum of:
      4.3394766 = weight(author_txt:chen in 4384) [ClassicSimilarity], result of:
        4.3394766 = score(doc=4384,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.16294746 = queryNorm
          4.339477 = fieldWeight in 4384, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.5 = fieldNorm(doc=4384)
    
  2. Chen, C.C.; Chen, H.H.; Chen, K.H.: ¬The design of the XML/Metadata management system (2000) 3.99
    3.9860637 = sum of:
      3.9860637 = weight(author_txt:chen in 5633) [ClassicSimilarity], result of:
        3.9860637 = score(doc=5633,freq=3.0), product of:
          0.99999994 = queryWeight, product of:
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.16294746 = queryNorm
          3.986064 = fieldWeight in 5633, product of:
            1.7320508 = tf(freq=3.0), with freq of:
              3.0 = termFreq=3.0
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.375 = fieldNorm(doc=5633)
    
  3. Chen, W.Y.: Observations on cataloguing and classification (1991) 3.84
    3.8355918 = sum of:
      3.8355918 = weight(author_txt:chen in 4183) [ClassicSimilarity], result of:
        3.8355918 = score(doc=4183,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.16294746 = queryNorm
          3.835592 = fieldWeight in 4183, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.625 = fieldNorm(doc=4183)
    
  4. Chen, H.: Knowledge-based document retrieval : framework and design (1992) 3.84
    3.8355918 = sum of:
      3.8355918 = weight(author_txt:chen in 5282) [ClassicSimilarity], result of:
        3.8355918 = score(doc=5282,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.16294746 = queryNorm
          3.835592 = fieldWeight in 5282, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.625 = fieldNorm(doc=5282)
    
  5. Chen, P.S.: On inference rules of logic-based information retrieval systems (1994) 3.84
    3.8355918 = sum of:
      3.8355918 = weight(author_txt:chen in 6730) [ClassicSimilarity], result of:
        3.8355918 = score(doc=6730,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.16294746 = queryNorm
          3.835592 = fieldWeight in 6730, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.136947 = idf(docFreq=260, maxDocs=44421)
            0.625 = fieldNorm(doc=6730)
    

Similar documents (content)

  1. Fu, T.; Abbasi, A.; Chen, H.: ¬A focused crawler for Dark Web forums (2010) 0.08
    0.078386 = sum of:
      0.078386 = product of:
        0.39192998 = sum of:
          0.009935556 = weight(abstract_txt:results in 458) [ClassicSimilarity], result of:
            0.009935556 = score(doc=458,freq=1.0), product of:
              0.045698505 = queryWeight, product of:
                1.0086763 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.013023868 = queryNorm
              0.21741535 = fieldWeight in 458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.0625 = fieldNorm(doc=458)
          0.12771581 = weight(abstract_txt:forums in 458) [ClassicSimilarity], result of:
            0.12771581 = score(doc=458,freq=5.0), product of:
              0.116395704 = queryWeight, product of:
                1.1382936 = boost
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.013023868 = queryNorm
              1.0972553 = fieldWeight in 458, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.0625 = fieldNorm(doc=458)
          0.06592321 = weight(abstract_txt:outperformed in 458) [ClassicSimilarity], result of:
            0.06592321 = score(doc=458,freq=1.0), product of:
              0.12807256 = queryWeight, product of:
                1.1940262 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.013023868 = queryNorm
              0.51473325 = fieldWeight in 458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.0625 = fieldNorm(doc=458)
          0.022873128 = weight(abstract_txt:analysis in 458) [ClassicSimilarity], result of:
            0.022873128 = score(doc=458,freq=1.0), product of:
              0.10038471 = queryWeight, product of:
                2.1142173 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.013023868 = queryNorm
              0.2278547 = fieldWeight in 458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=458)
          0.16548227 = weight(abstract_txt:forum in 458) [ClassicSimilarity], result of:
            0.16548227 = score(doc=458,freq=2.0), product of:
              0.2707875 = queryWeight, product of:
                3.0071893 = boost
                6.9139757 = idf(docFreq=119, maxDocs=44421)
                0.013023868 = queryNorm
              0.61111486 = fieldWeight in 458, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.9139757 = idf(docFreq=119, maxDocs=44421)
                0.0625 = fieldNorm(doc=458)
        0.2 = coord(5/25)
    
  2. Cohan, A.; Young, S.; Yates, A.; Goharian, N.: Triaging content severity in online mental health forums (2017) 0.07
    0.06772315 = sum of:
      0.06772315 = product of:
        0.33861572 = sum of:
          0.098928235 = weight(abstract_txt:forums in 4930) [ClassicSimilarity], result of:
            0.098928235 = score(doc=4930,freq=3.0), product of:
              0.116395704 = queryWeight, product of:
                1.1382936 = boost
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.013023868 = queryNorm
              0.8499303 = fieldWeight in 4930, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.0625 = fieldNorm(doc=4930)
          0.034050472 = weight(abstract_txt:interaction in 4930) [ClassicSimilarity], result of:
            0.034050472 = score(doc=4930,freq=1.0), product of:
              0.10387777 = queryWeight, product of:
                1.5207651 = boost
                5.244698 = idf(docFreq=636, maxDocs=44421)
                0.013023868 = queryNorm
              0.32779363 = fieldWeight in 4930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.244698 = idf(docFreq=636, maxDocs=44421)
                0.0625 = fieldNorm(doc=4930)
          0.06575025 = weight(abstract_txt:lexical in 4930) [ClassicSimilarity], result of:
            0.06575025 = score(doc=4930,freq=1.0), product of:
              0.16107896 = queryWeight, product of:
                1.8937395 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.013023868 = queryNorm
              0.40818647 = fieldWeight in 4930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0625 = fieldNorm(doc=4930)
          0.022873128 = weight(abstract_txt:analysis in 4930) [ClassicSimilarity], result of:
            0.022873128 = score(doc=4930,freq=1.0), product of:
              0.10038471 = queryWeight, product of:
                2.1142173 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.013023868 = queryNorm
              0.2278547 = fieldWeight in 4930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=4930)
          0.11701364 = weight(abstract_txt:forum in 4930) [ClassicSimilarity], result of:
            0.11701364 = score(doc=4930,freq=1.0), product of:
              0.2707875 = queryWeight, product of:
                3.0071893 = boost
                6.9139757 = idf(docFreq=119, maxDocs=44421)
                0.013023868 = queryNorm
              0.43212348 = fieldWeight in 4930, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9139757 = idf(docFreq=119, maxDocs=44421)
                0.0625 = fieldNorm(doc=4930)
        0.2 = coord(5/25)
    
  3. Bhatia, S.; Biyani, P.; Mitra, P.: Identifying the role of individual user messages in an online discussion and its use in thread retrieval (2016) 0.07
    0.06708291 = sum of:
      0.06708291 = product of:
        0.4192682 = sum of:
          0.057116244 = weight(abstract_txt:forums in 3650) [ClassicSimilarity], result of:
            0.057116244 = score(doc=3650,freq=1.0), product of:
              0.116395704 = queryWeight, product of:
                1.1382936 = boost
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.013023868 = queryNorm
              0.4907075 = fieldWeight in 3650, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.0625 = fieldNorm(doc=3650)
          0.22226518 = weight(abstract_txt:thread in 3650) [ClassicSimilarity], result of:
            0.22226518 = score(doc=3650,freq=7.0), product of:
              0.15053777 = queryWeight, product of:
                1.2945194 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.013023868 = queryNorm
              1.4764745 = fieldWeight in 3650, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.0625 = fieldNorm(doc=3650)
          0.022873128 = weight(abstract_txt:analysis in 3650) [ClassicSimilarity], result of:
            0.022873128 = score(doc=3650,freq=1.0), product of:
              0.10038471 = queryWeight, product of:
                2.1142173 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.013023868 = queryNorm
              0.2278547 = fieldWeight in 3650, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=3650)
          0.11701364 = weight(abstract_txt:forum in 3650) [ClassicSimilarity], result of:
            0.11701364 = score(doc=3650,freq=1.0), product of:
              0.2707875 = queryWeight, product of:
                3.0071893 = boost
                6.9139757 = idf(docFreq=119, maxDocs=44421)
                0.013023868 = queryNorm
              0.43212348 = fieldWeight in 3650, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9139757 = idf(docFreq=119, maxDocs=44421)
                0.0625 = fieldNorm(doc=3650)
        0.16 = coord(4/25)
    
  4. Landauer, T.K.; Foltz, P.W.; Laham, D.: ¬An introduction to Latent Semantic Analysis (1998) 0.06
    0.062032003 = sum of:
      0.062032003 = product of:
        0.38770002 = sum of:
          0.05339707 = weight(abstract_txt:accurately in 2162) [ClassicSimilarity], result of:
            0.05339707 = score(doc=2162,freq=1.0), product of:
              0.09590373 = queryWeight, product of:
                1.0332457 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.013023868 = queryNorm
              0.55677783 = fieldWeight in 2162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.078125 = fieldNorm(doc=2162)
          0.08218782 = weight(abstract_txt:lexical in 2162) [ClassicSimilarity], result of:
            0.08218782 = score(doc=2162,freq=1.0), product of:
              0.16107896 = queryWeight, product of:
                1.8937395 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.013023868 = queryNorm
              0.5102331 = fieldWeight in 2162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.078125 = fieldNorm(doc=2162)
          0.028591411 = weight(abstract_txt:analysis in 2162) [ClassicSimilarity], result of:
            0.028591411 = score(doc=2162,freq=1.0), product of:
              0.10038471 = queryWeight, product of:
                2.1142173 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.013023868 = queryNorm
              0.28481838 = fieldWeight in 2162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.078125 = fieldNorm(doc=2162)
          0.22352372 = weight(abstract_txt:coherence in 2162) [ClassicSimilarity], result of:
            0.22352372 = score(doc=2162,freq=1.0), product of:
              0.35926372 = queryWeight, product of:
                3.4638026 = boost
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.013023868 = queryNorm
              0.6221717 = fieldWeight in 2162, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.078125 = fieldNorm(doc=2162)
        0.16 = coord(4/25)
    
  5. Chuang, K.Y.; Yang, C.C.: Informational support exchanges using different computer-mediated communication formats in a social media alcoholism community (2014) 0.06
    0.058226757 = sum of:
      0.058226757 = product of:
        0.2426115 = sum of:
          0.039434604 = weight(abstract_txt:mediated in 2179) [ClassicSimilarity], result of:
            0.039434604 = score(doc=2179,freq=1.0), product of:
              0.09092476 = queryWeight, product of:
                1.006067 = boost
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.013023868 = queryNorm
              0.43370587 = fieldWeight in 2179, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.0625 = fieldNorm(doc=2179)
          0.009935556 = weight(abstract_txt:results in 2179) [ClassicSimilarity], result of:
            0.009935556 = score(doc=2179,freq=1.0), product of:
              0.045698505 = queryWeight, product of:
                1.0086763 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.013023868 = queryNorm
              0.21741535 = fieldWeight in 2179, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.0625 = fieldNorm(doc=2179)
          0.019304087 = weight(abstract_txt:including in 2179) [ClassicSimilarity], result of:
            0.019304087 = score(doc=2179,freq=1.0), product of:
              0.07115521 = queryWeight, product of:
                1.2586477 = boost
                4.340728 = idf(docFreq=1572, maxDocs=44421)
                0.013023868 = queryNorm
              0.2712955 = fieldWeight in 2179, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.340728 = idf(docFreq=1572, maxDocs=44421)
                0.0625 = fieldNorm(doc=2179)
          0.034050472 = weight(abstract_txt:interaction in 2179) [ClassicSimilarity], result of:
            0.034050472 = score(doc=2179,freq=1.0), product of:
              0.10387777 = queryWeight, product of:
                1.5207651 = boost
                5.244698 = idf(docFreq=636, maxDocs=44421)
                0.013023868 = queryNorm
              0.32779363 = fieldWeight in 2179, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.244698 = idf(docFreq=636, maxDocs=44421)
                0.0625 = fieldNorm(doc=2179)
          0.022873128 = weight(abstract_txt:analysis in 2179) [ClassicSimilarity], result of:
            0.022873128 = score(doc=2179,freq=1.0), product of:
              0.10038471 = queryWeight, product of:
                2.1142173 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.013023868 = queryNorm
              0.2278547 = fieldWeight in 2179, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=2179)
          0.11701364 = weight(abstract_txt:forum in 2179) [ClassicSimilarity], result of:
            0.11701364 = score(doc=2179,freq=1.0), product of:
              0.2707875 = queryWeight, product of:
                3.0071893 = boost
                6.9139757 = idf(docFreq=119, maxDocs=44421)
                0.013023868 = queryNorm
              0.43212348 = fieldWeight in 2179, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9139757 = idf(docFreq=119, maxDocs=44421)
                0.0625 = fieldNorm(doc=2179)
        0.24 = coord(6/25)