Document (#37493)

Author
Guo, L.
Wan, X.
Title
Exploiting syntactic and semantic relationships between terms for opinion retrieval
Source
Journal of the American Society for Information Science and Technology. 63(2012) no.11, pp. 2269-2282
Year
2012
Abstract
Opinion retrieval is the task of finding documents that express an opinion about a given query. A key challenge in opinion retrieval is to capture the query-related opinion score of a document. Existing methods rely mainly on the proximity information between the opinion terms and the query terms to address the key challenge. In this study, we propose to incorporate the syntactic and semantic information of terms into a probabilistic model to capture the query-related opinion score more accurately. The syntactic tree structure of a sentence is used to evaluate the modifying probability between an opinion term and a noun within the sentence with a tree kernel method. Moreover, WordNet and the probabilistic topic model are used to evaluate the semantic relatedness between any noun and the given query. The experimental results over standard TREC baselines on the benchmark BLOG06 collection demonstrate the effectiveness of our proposed method, in comparison with the proximity-based method and other baselines.
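
The abstract names two ingredients: a modifying probability between an opinion term and a noun, and a semantic relatedness between that noun and the query. It does not give the formula that combines them. The sketch below is therefore only an assumed aggregation (a sum of products), not the authors' model. The names Pair, query_opinion_score and relatedness, and all numeric values, are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Pair:
    opinion_term: str
    noun: str
    # Hypothetical P(opinion term modifies noun); in the paper this would
    # come from the tree kernel over the sentence's syntactic tree.
    modifying_prob: float


def query_opinion_score(pairs, relatedness):
    """Assumed aggregation: weight each modifying probability by how
    related the modified noun is to the query, then sum."""
    return sum(p.modifying_prob * relatedness.get(p.noun, 0.0) for p in pairs)


# Made-up pairs for a made-up query about a phone.
pairs = [Pair("great", "battery", 0.9), Pair("poor", "weather", 0.8)]
# Hypothetical noun-to-query relatedness (WordNet or topic model in the paper).
relatedness = {"battery": 0.7, "weather": 0.05}

score = query_opinion_score(pairs, relatedness)
print(score)
```

Under this assumption, an opinion term attached to a query-related noun ("battery") contributes far more than one attached to an unrelated noun ("weather"), even when both modifying probabilities are high. A pure proximity measure cannot make that distinction.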

Similar documents (content)

  1. Fang, L.; Tuan, L.A.; Hui, S.C.; Wu, L.: Syntactic based approach for grammar question retrieval (2018) 0.27
    0.2659759 = sum of:
      0.2659759 = product of:
        0.7388219 = sum of:
          0.016785057 = weight(abstract_txt:related in 86) [ClassicSimilarity], result of:
            0.016785057 = score(doc=86,freq=1.0), product of:
              0.06397083 = queryWeight, product of:
                1.0806619 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.014100395 = queryNorm
              0.2623861 = fieldWeight in 86, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.0625 = fieldNorm(doc=86)
          0.11991033 = weight(abstract_txt:kernel in 86) [ClassicSimilarity], result of:
            0.11991033 = score(doc=86,freq=3.0), product of:
              0.13058323 = queryWeight, product of:
                1.091761 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.014100395 = queryNorm
              0.9182675 = fieldWeight in 86, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.0625 = fieldNorm(doc=86)
          0.020219635 = weight(abstract_txt:retrieval in 86) [ClassicSimilarity], result of:
            0.020219635 = score(doc=86,freq=2.0), product of:
              0.06580154 = queryWeight, product of:
                1.34234 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.014100395 = queryNorm
              0.3072821 = fieldWeight in 86, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=86)
          0.05807394 = weight(abstract_txt:capture in 86) [ClassicSimilarity], result of:
            0.05807394 = score(doc=86,freq=1.0), product of:
              0.14633705 = queryWeight, product of:
                1.6344664 = boost
                6.3496094 = idf(docFreq=210, maxDocs=44421)
                0.014100395 = queryNorm
              0.3968506 = fieldWeight in 86, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3496094 = idf(docFreq=210, maxDocs=44421)
                0.0625 = fieldNorm(doc=86)
          0.07289621 = weight(abstract_txt:sentence in 86) [ClassicSimilarity], result of:
            0.07289621 = score(doc=86,freq=1.0), product of:
              0.17028251 = queryWeight, product of:
                1.7631282 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.014100395 = queryNorm
              0.42808983 = fieldWeight in 86, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.0625 = fieldNorm(doc=86)
          0.12625994 = weight(abstract_txt:tree in 86) [ClassicSimilarity], result of:
            0.12625994 = score(doc=86,freq=3.0), product of:
              0.17028251 = queryWeight, product of:
                1.7631282 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.014100395 = queryNorm
              0.7414733 = fieldWeight in 86, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.0625 = fieldNorm(doc=86)
          0.026517512 = weight(abstract_txt:between in 86) [ClassicSimilarity], result of:
            0.026517512 = score(doc=86,freq=2.0), product of:
              0.0867738 = queryWeight, product of:
                1.7799515 = boost
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.014100395 = queryNorm
              0.30559355 = fieldWeight in 86, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.0625 = fieldNorm(doc=86)
          0.21195923 = weight(abstract_txt:syntactic in 86) [ClassicSimilarity], result of:
            0.21195923 = score(doc=86,freq=5.0), product of:
              0.23222487 = queryWeight, product of:
                2.5217316 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.014100395 = queryNorm
              0.9127327 = fieldWeight in 86, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0625 = fieldNorm(doc=86)
          0.08620006 = weight(abstract_txt:query in 86) [ClassicSimilarity], result of:
            0.08620006 = score(doc=86,freq=2.0), product of:
              0.20512022 = queryWeight, product of:
                3.0596595 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.014100395 = queryNorm
              0.42024165 = fieldWeight in 86, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=86)
        0.36 = coord(9/25)
    
  2. Belbachir, F.; Boughanem, M.: Using language models to improve opinion detection (2018) 0.26
    0.2607574 = sum of:
      0.2607574 = product of:
        1.0864891 = sum of:
          0.030643757 = weight(abstract_txt:retrieval in 44) [ClassicSimilarity], result of:
            0.030643757 = score(doc=44,freq=6.0), product of:
              0.06580154 = queryWeight, product of:
                1.34234 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.014100395 = queryNorm
              0.4656997 = fieldWeight in 44, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.016406875 = weight(abstract_txt:between in 44) [ClassicSimilarity], result of:
            0.016406875 = score(doc=44,freq=1.0), product of:
              0.0867738 = queryWeight, product of:
                1.7799515 = boost
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.014100395 = queryNorm
              0.18907636 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.066823214 = weight(abstract_txt:score in 44) [ClassicSimilarity], result of:
            0.066823214 = score(doc=44,freq=1.0), product of:
              0.17564924 = queryWeight, product of:
                1.7906965 = boost
                6.9565353 = idf(docFreq=114, maxDocs=44421)
                0.014100395 = queryNorm
              0.38043553 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9565353 = idf(docFreq=114, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.026249375 = weight(abstract_txt:terms in 44) [ClassicSimilarity], result of:
            0.026249375 = score(doc=44,freq=1.0), product of:
              0.11869999 = queryWeight, product of:
                2.081801 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.014100395 = queryNorm
              0.2211405 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.053333566 = weight(abstract_txt:query in 44) [ClassicSimilarity], result of:
            0.053333566 = score(doc=44,freq=1.0), product of:
              0.20512022 = queryWeight, product of:
                3.0596595 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.014100395 = queryNorm
              0.26001126 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.89303225 = weight(abstract_txt:opinion in 44) [ClassicSimilarity], result of:
            0.89303225 = score(doc=44,freq=12.0), product of:
              0.6858551 = queryWeight, product of:
                7.0769324 = boost
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.014100395 = queryNorm
              1.3020713 = fieldWeight in 44, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
        0.24 = coord(6/25)
    
  3. Fernández, R.T.; Losada, D.E.: Effective sentence retrieval based on query-independent evidence (2012) 0.23
    0.22966334 = sum of:
      0.22966334 = product of:
        0.95693064 = sum of:
          0.016785057 = weight(abstract_txt:related in 3728) [ClassicSimilarity], result of:
            0.016785057 = score(doc=3728,freq=1.0), product of:
              0.06397083 = queryWeight, product of:
                1.0806619 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.014100395 = queryNorm
              0.2623861 = fieldWeight in 3728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.0625 = fieldNorm(doc=3728)
          0.03782747 = weight(abstract_txt:retrieval in 3728) [ClassicSimilarity], result of:
            0.03782747 = score(doc=3728,freq=7.0), product of:
              0.06580154 = queryWeight, product of:
                1.34234 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.014100395 = queryNorm
              0.57487214 = fieldWeight in 3728, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=3728)
          0.030983001 = weight(abstract_txt:method in 3728) [ClassicSimilarity], result of:
            0.030983001 = score(doc=3728,freq=1.0), product of:
              0.11019101 = queryWeight, product of:
                1.737071 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.014100395 = queryNorm
              0.2811754 = fieldWeight in 3728, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=3728)
          0.14579242 = weight(abstract_txt:sentence in 3728) [ClassicSimilarity], result of:
            0.14579242 = score(doc=3728,freq=4.0), product of:
              0.17028251 = queryWeight, product of:
                1.7631282 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.014100395 = queryNorm
              0.85617965 = fieldWeight in 3728, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.0625 = fieldNorm(doc=3728)
          0.13629428 = weight(abstract_txt:query in 3728) [ClassicSimilarity], result of:
            0.13629428 = score(doc=3728,freq=5.0), product of:
              0.20512022 = queryWeight, product of:
                3.0596595 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.014100395 = queryNorm
              0.6644604 = fieldWeight in 3728, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=3728)
          0.5892484 = weight(abstract_txt:opinion in 3728) [ClassicSimilarity], result of:
            0.5892484 = score(doc=3728,freq=4.0), product of:
              0.6858551 = queryWeight, product of:
                7.0769324 = boost
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.014100395 = queryNorm
              0.8591442 = fieldWeight in 3728, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.0625 = fieldNorm(doc=3728)
        0.24 = coord(6/25)
    
  4. Zhang, M.; Zhou, G.D.; Aw, A.: Exploring syntactic structured features over parse trees for relation extraction using kernel methods (2008) 0.23
    0.22575504 = sum of:
      0.22575504 = product of:
        0.7054845 = sum of:
          0.014331773 = weight(abstract_txt:model in 3055) [ClassicSimilarity], result of:
            0.014331773 = score(doc=3055,freq=1.0), product of:
              0.05757492 = queryWeight, product of:
                1.0252163 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.014100395 = queryNorm
              0.24892388 = fieldWeight in 3055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=3055)
          0.19581275 = weight(abstract_txt:kernel in 3055) [ClassicSimilarity], result of:
            0.19581275 = score(doc=3055,freq=8.0), product of:
              0.13058323 = queryWeight, product of:
                1.091761 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.014100395 = queryNorm
              1.4995245 = fieldWeight in 3055, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.0625 = fieldNorm(doc=3055)
          0.05807394 = weight(abstract_txt:capture in 3055) [ClassicSimilarity], result of:
            0.05807394 = score(doc=3055,freq=1.0), product of:
              0.14633705 = queryWeight, product of:
                1.6344664 = boost
                6.3496094 = idf(docFreq=210, maxDocs=44421)
                0.014100395 = queryNorm
              0.3968506 = fieldWeight in 3055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3496094 = idf(docFreq=210, maxDocs=44421)
                0.0625 = fieldNorm(doc=3055)
          0.030484168 = weight(abstract_txt:semantic in 3055) [ClassicSimilarity], result of:
            0.030484168 = score(doc=3055,freq=1.0), product of:
              0.10900508 = queryWeight, product of:
                1.7276981 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.014100395 = queryNorm
              0.27965823 = fieldWeight in 3055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=3055)
          0.030983001 = weight(abstract_txt:method in 3055) [ClassicSimilarity], result of:
            0.030983001 = score(doc=3055,freq=1.0), product of:
              0.11019101 = queryWeight, product of:
                1.737071 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.014100395 = queryNorm
              0.2811754 = fieldWeight in 3055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=3055)
          0.19286524 = weight(abstract_txt:tree in 3055) [ClassicSimilarity], result of:
            0.19286524 = score(doc=3055,freq=7.0), product of:
              0.17028251 = queryWeight, product of:
                1.7631282 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.014100395 = queryNorm
              1.1326191 = fieldWeight in 3055, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.0625 = fieldNorm(doc=3055)
          0.018750712 = weight(abstract_txt:between in 3055) [ClassicSimilarity], result of:
            0.018750712 = score(doc=3055,freq=1.0), product of:
              0.0867738 = queryWeight, product of:
                1.7799515 = boost
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.014100395 = queryNorm
              0.21608727 = fieldWeight in 3055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.0625 = fieldNorm(doc=3055)
          0.1641829 = weight(abstract_txt:syntactic in 3055) [ClassicSimilarity], result of:
            0.1641829 = score(doc=3055,freq=3.0), product of:
              0.23222487 = queryWeight, product of:
                2.5217316 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.014100395 = queryNorm
              0.70699966 = fieldWeight in 3055, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0625 = fieldNorm(doc=3055)
        0.32 = coord(8/25)
    
  5. Li, D.; Tang, J.; Ding, Y.; Shuai, X.; Chambers, T.; Sun, G.; Luo, Z.; Zhang, J.: Topic-level opinion influence model (TOIM) : an investigation using tencent microblogging (2015) 0.21
    0.20731822 = sum of:
      0.20731822 = product of:
        1.036591 = sum of:
          0.024823358 = weight(abstract_txt:model in 3345) [ClassicSimilarity], result of:
            0.024823358 = score(doc=3345,freq=3.0), product of:
              0.05757492 = queryWeight, product of:
                1.0252163 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.014100395 = queryNorm
              0.4311488 = fieldWeight in 3345, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=3345)
          0.023514668 = weight(abstract_txt:given in 3345) [ClassicSimilarity], result of:
            0.023514668 = score(doc=3345,freq=1.0), product of:
              0.080092646 = queryWeight, product of:
                1.2091918 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.014100395 = queryNorm
              0.29359335 = fieldWeight in 3345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.0625 = fieldNorm(doc=3345)
          0.034084912 = weight(abstract_txt:evaluate in 3345) [ClassicSimilarity], result of:
            0.034084912 = score(doc=3345,freq=1.0), product of:
              0.10258288 = queryWeight, product of:
                1.3684732 = boost
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.014100395 = queryNorm
              0.33226708 = fieldWeight in 3345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.0625 = fieldNorm(doc=3345)
          0.07029551 = weight(abstract_txt:probabilistic in 3345) [ClassicSimilarity], result of:
            0.07029551 = score(doc=3345,freq=1.0), product of:
              0.16620795 = queryWeight, product of:
                1.7419062 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.014100395 = queryNorm
              0.4229371 = fieldWeight in 3345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.0625 = fieldNorm(doc=3345)
          0.8838726 = weight(abstract_txt:opinion in 3345) [ClassicSimilarity], result of:
            0.8838726 = score(doc=3345,freq=9.0), product of:
              0.6858551 = queryWeight, product of:
                7.0769324 = boost
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.014100395 = queryNorm
              1.2887163 = fieldWeight in 3345, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.0625 = fieldNorm(doc=3345)
        0.2 = coord(5/25)
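
The score trees above all follow the same arithmetic, which can be rechecked with a short sketch. It assumes tf = sqrt(freq), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and idf = 1 + ln(maxDocs / (docFreq + 1)), which is consistent with the idf values in the listing. The function names are illustrative, and the input numbers are copied from the first entry (doc=86).

```python
import math


def idf(doc_freq, max_docs):
    # Consistent with the listing: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    # The listing shows tf as the square root of the term frequency.
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


# "kernel" clause of entry 1 (doc=86), values copied from the listing.
kernel = term_score(freq=3.0, doc_freq=24, max_docs=44421,
                    boost=1.091761, query_norm=0.014100395,
                    field_norm=0.0625)

# The nine per-term scores listed for entry 1.
term_scores = [0.016785057, 0.11991033, 0.020219635, 0.05807394,
               0.07289621, 0.12625994, 0.026517512, 0.21195923,
               0.08620006]

# coord(9/25): 9 of the 25 query terms matched this document.
coord = 9 / 25
total = sum(term_scores) * coord

print(kernel, total)
```

The results agree with the listed 0.11991033 and 0.2659759 up to single-precision rounding. The coord factor explains why entry 1 ranks first despite a smaller raw sum than entries 2, 3 and 5: it matches more of the 25 query terms.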