Document (#36361)

Author
Rushdi-Saleh, M.
Martín-Valdivia, M.T.
Ureña-López, L.A.
Perea-Ortega, J.M.
Title
OCA: Opinion corpus for Arabic
Source
Journal of the American Society for Information Science and Technology. 62(2011) no.10, S.2045-2054
Year
2011
Abstract
Sentiment analysis is a challenging new task related to text mining and natural language processing. Although there are, at present, several studies related to this theme, most of these focus mainly on English texts. The resources available for opinion mining (OM) in other languages are still limited. In this article, we present a new Arabic corpus for the OM task that has been made available to the scientific community for research purposes. The corpus contains 500 movie reviews collected from different web pages and blogs in Arabic, 250 of them considered as positive reviews, and the other 250 as negative opinions. Furthermore, different experiments have been carried out on this corpus, using machine learning algorithms such as support vector machines and Nave Bayes. The results obtained are very promising and we are encouraged to continue this line of research.

Similar documents (author)

  1. Perea-Ortega, J.M.; Martín-Valdivia, M.T.; Ureña-López, L.A.; Martínez-Cámara, E.: Improving polarity classification of bilingual parallel corpora combining machine learning and semantic orientation approaches (2013) 4.96
    4.95559 = sum of:
      4.95559 = sum of:
        0.6380822 = weight(author_txt:lópez in 2045) [ClassicSimilarity], result of:
          0.6380822 = score(doc=2045,freq=1.0), product of:
            0.331724 = queryWeight, product of:
              7.694134 = idf(docFreq=54, maxDocs=44421)
              0.04311388 = queryNorm
            1.9235336 = fieldWeight in 2045, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              7.694134 = idf(docFreq=54, maxDocs=44421)
              0.25 = fieldNorm(doc=2045)
        0.9770388 = weight(author_txt:martín in 2045) [ClassicSimilarity], result of:
          0.9770388 = score(doc=2045,freq=1.0), product of:
            0.44069046 = queryWeight, product of:
              1.1525995 = boost
              8.868255 = idf(docFreq=16, maxDocs=44421)
              0.04311388 = queryNorm
            2.2170637 = fieldWeight in 2045, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.868255 = idf(docFreq=16, maxDocs=44421)
              0.25 = fieldNorm(doc=2045)
        0.9972134 = weight(author_txt:ortega in 2045) [ClassicSimilarity], result of:
          0.9972134 = score(doc=2045,freq=1.0), product of:
            0.44673625 = queryWeight, product of:
              1.1604787 = boost
              8.928879 = idf(docFreq=15, maxDocs=44421)
              0.04311388 = queryNorm
            2.2322197 = fieldWeight in 2045, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.928879 = idf(docFreq=15, maxDocs=44421)
              0.25 = fieldNorm(doc=2045)
        1.0426259 = weight(author_txt:valdivia in 2045) [ClassicSimilarity], result of:
          1.0426259 = score(doc=2045,freq=1.0), product of:
            0.46019807 = queryWeight, product of:
              1.1778337 = boost
              9.06241 = idf(docFreq=13, maxDocs=44421)
              0.04311388 = queryNorm
            2.2656026 = fieldWeight in 2045, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.06241 = idf(docFreq=13, maxDocs=44421)
              0.25 = fieldNorm(doc=2045)
        1.3006294 = weight(author_txt:ureña in 2045) [ClassicSimilarity], result of:
          1.3006294 = score(doc=2045,freq=1.0), product of:
            0.5332876 = queryWeight, product of:
              1.2679213 = boost
              9.755557 = idf(docFreq=6, maxDocs=44421)
              0.04311388 = queryNorm
            2.4388893 = fieldWeight in 2045, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.755557 = idf(docFreq=6, maxDocs=44421)
              0.25 = fieldNorm(doc=2045)
    
  2. Martín-Valdivia, M.T.; Díaz-Galiano, M.C.; Montejo-Raez, A.; Ureña-López, L.A.: Using information gain to improve multi-modal information retrieval systems (2008) 3.17
    3.166701 = sum of:
      3.166701 = product of:
        3.9583762 = sum of:
          0.6380822 = weight(author_txt:lópez in 3086) [ClassicSimilarity], result of:
            0.6380822 = score(doc=3086,freq=1.0), product of:
              0.331724 = queryWeight, product of:
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.04311388 = queryNorm
              1.9235336 = fieldWeight in 3086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.25 = fieldNorm(doc=3086)
          0.9770388 = weight(author_txt:martín in 3086) [ClassicSimilarity], result of:
            0.9770388 = score(doc=3086,freq=1.0), product of:
              0.44069046 = queryWeight, product of:
                1.1525995 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.04311388 = queryNorm
              2.2170637 = fieldWeight in 3086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.25 = fieldNorm(doc=3086)
          1.0426259 = weight(author_txt:valdivia in 3086) [ClassicSimilarity], result of:
            1.0426259 = score(doc=3086,freq=1.0), product of:
              0.46019807 = queryWeight, product of:
                1.1778337 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.04311388 = queryNorm
              2.2656026 = fieldWeight in 3086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.25 = fieldNorm(doc=3086)
          1.3006294 = weight(author_txt:ureña in 3086) [ClassicSimilarity], result of:
            1.3006294 = score(doc=3086,freq=1.0), product of:
              0.5332876 = queryWeight, product of:
                1.2679213 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.04311388 = queryNorm
              2.4388893 = fieldWeight in 3086, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.25 = fieldNorm(doc=3086)
        0.8 = coord(4/5)
    
  3. Montejo-Ráez, A.; Martínez-Cámara, E.; Martín-Valdivia, M.T.; Ureña-López, L.A.: ¬A knowledge-based approach for polarity classification in Twitter (2014) 3.17
    3.166701 = sum of:
      3.166701 = product of:
        3.9583762 = sum of:
          0.6380822 = weight(author_txt:lópez in 2204) [ClassicSimilarity], result of:
            0.6380822 = score(doc=2204,freq=1.0), product of:
              0.331724 = queryWeight, product of:
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.04311388 = queryNorm
              1.9235336 = fieldWeight in 2204, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.25 = fieldNorm(doc=2204)
          0.9770388 = weight(author_txt:martín in 2204) [ClassicSimilarity], result of:
            0.9770388 = score(doc=2204,freq=1.0), product of:
              0.44069046 = queryWeight, product of:
                1.1525995 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.04311388 = queryNorm
              2.2170637 = fieldWeight in 2204, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.25 = fieldNorm(doc=2204)
          1.0426259 = weight(author_txt:valdivia in 2204) [ClassicSimilarity], result of:
            1.0426259 = score(doc=2204,freq=1.0), product of:
              0.46019807 = queryWeight, product of:
                1.1778337 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.04311388 = queryNorm
              2.2656026 = fieldWeight in 2204, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.25 = fieldNorm(doc=2204)
          1.3006294 = weight(author_txt:ureña in 2204) [ClassicSimilarity], result of:
            1.3006294 = score(doc=2204,freq=1.0), product of:
              0.5332876 = queryWeight, product of:
                1.2679213 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.04311388 = queryNorm
              2.4388893 = fieldWeight in 2204, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.25 = fieldNorm(doc=2204)
        0.8 = coord(4/5)
    
  4. Delgado-Quirós, L.; Aguillo, I.F.; Martín-Martín, A.; López-Cózar, E.D.; Orduña-Malea, E.; Ortega, J.L.: Why are these publications missing? : uncovering the reasons behind the exclusion of documents in free-access scholarly databases (2024) 1.81
    1.8102224 = sum of:
      1.8102224 = product of:
        3.0170372 = sum of:
          0.6380822 = weight(author_txt:lópez in 2203) [ClassicSimilarity], result of:
            0.6380822 = score(doc=2203,freq=1.0), product of:
              0.331724 = queryWeight, product of:
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.04311388 = queryNorm
              1.9235336 = fieldWeight in 2203, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.25 = fieldNorm(doc=2203)
          1.3817415 = weight(author_txt:martín in 2203) [ClassicSimilarity], result of:
            1.3817415 = score(doc=2203,freq=2.0), product of:
              0.44069046 = queryWeight, product of:
                1.1525995 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.04311388 = queryNorm
              3.1354015 = fieldWeight in 2203, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.25 = fieldNorm(doc=2203)
          0.9972134 = weight(author_txt:ortega in 2203) [ClassicSimilarity], result of:
            0.9972134 = score(doc=2203,freq=1.0), product of:
              0.44673625 = queryWeight, product of:
                1.1604787 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.04311388 = queryNorm
              2.2322197 = fieldWeight in 2203, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.25 = fieldNorm(doc=2203)
        0.6 = coord(3/5)
    
  5. García Cumbreras, M.A.; Perea-Ortega, J.M.; García Vega, M.; Ureña López, L.A.: Information retrieval with geographical references : relevant documents filtering vs. query expansion (2009) 1.76
    1.7615551 = sum of:
      1.7615551 = product of:
        2.935925 = sum of:
          0.6380822 = weight(author_txt:lópez in 222) [ClassicSimilarity], result of:
            0.6380822 = score(doc=222,freq=1.0), product of:
              0.331724 = queryWeight, product of:
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.04311388 = queryNorm
              1.9235336 = fieldWeight in 222, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.694134 = idf(docFreq=54, maxDocs=44421)
                0.25 = fieldNorm(doc=222)
          0.9972134 = weight(author_txt:ortega in 222) [ClassicSimilarity], result of:
            0.9972134 = score(doc=222,freq=1.0), product of:
              0.44673625 = queryWeight, product of:
                1.1604787 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.04311388 = queryNorm
              2.2322197 = fieldWeight in 222, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.25 = fieldNorm(doc=222)
          1.3006294 = weight(author_txt:ureña in 222) [ClassicSimilarity], result of:
            1.3006294 = score(doc=222,freq=1.0), product of:
              0.5332876 = queryWeight, product of:
                1.2679213 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.04311388 = queryNorm
              2.4388893 = fieldWeight in 222, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.25 = fieldNorm(doc=222)
        0.6 = coord(3/5)
    

Similar documents (content)

  1. Kanaan, G.; Al-Shalabi, R.; Ghwanmeh, S.; Al-Ma'adeed, H.: ¬A comparison of text-classification techniques applied to Arabic text (2009) 0.49
    0.490234 = sum of:
      0.490234 = product of:
        1.5319812 = sum of:
          0.025192864 = weight(abstract_txt:research in 83) [ClassicSimilarity], result of:
            0.025192864 = score(doc=83,freq=2.0), product of:
              0.060139753 = queryWeight, product of:
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.019034086 = queryNorm
              0.41890535 = fieldWeight in 83, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.09375 = fieldNorm(doc=83)
          0.08483714 = weight(abstract_txt:challenging in 83) [ClassicSimilarity], result of:
            0.08483714 = score(doc=83,freq=1.0), product of:
              0.13511409 = queryWeight, product of:
                1.0598747 = boost
                6.697521 = idf(docFreq=148, maxDocs=44421)
                0.019034086 = queryNorm
              0.6278926 = fieldWeight in 83, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.697521 = idf(docFreq=148, maxDocs=44421)
                0.09375 = fieldNorm(doc=83)
          0.046191085 = weight(abstract_txt:been in 83) [ClassicSimilarity], result of:
            0.046191085 = score(doc=83,freq=3.0), product of:
              0.078701854 = queryWeight, product of:
                1.1439621 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.019034086 = queryNorm
              0.5869123 = fieldWeight in 83, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.09375 = fieldNorm(doc=83)
          0.027683552 = weight(abstract_txt:different in 83) [ClassicSimilarity], result of:
            0.027683552 = score(doc=83,freq=1.0), product of:
              0.080686554 = queryWeight, product of:
                1.1582966 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.019034086 = queryNorm
              0.34309995 = fieldWeight in 83, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.09375 = fieldNorm(doc=83)
          0.24728592 = weight(abstract_txt:bayes in 83) [ClassicSimilarity], result of:
            0.24728592 = score(doc=83,freq=2.0), product of:
              0.21882631 = queryWeight, product of:
                1.3488199 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.019034086 = queryNorm
              1.1300557 = fieldWeight in 83, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.09375 = fieldNorm(doc=83)
          0.02225526 = weight(abstract_txt:this in 83) [ClassicSimilarity], result of:
            0.02225526 = score(doc=83,freq=2.0), product of:
              0.06976031 = queryWeight, product of:
                1.523135 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.019034086 = queryNorm
              0.31902468 = fieldWeight in 83, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.09375 = fieldNorm(doc=83)
          0.82315797 = weight(abstract_txt:arabic in 83) [ClassicSimilarity], result of:
            0.82315797 = score(doc=83,freq=5.0), product of:
              0.5184209 = queryWeight, product of:
                3.5958872 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.019034086 = queryNorm
              1.5878179 = fieldWeight in 83, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.09375 = fieldNorm(doc=83)
          0.2553774 = weight(abstract_txt:corpus in 83) [ClassicSimilarity], result of:
            0.2553774 = score(doc=83,freq=1.0), product of:
              0.44714832 = queryWeight, product of:
                3.8562038 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.019034086 = queryNorm
              0.5711246 = fieldWeight in 83, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.09375 = fieldNorm(doc=83)
        0.32 = coord(8/25)
    
  2. Pang, B.; Lee, L.: Opinion mining and sentiment analysis (2008) 0.42
    0.42217058 = sum of:
      0.42217058 = product of:
        0.8795221 = sum of:
          0.010391526 = weight(abstract_txt:research in 2171) [ClassicSimilarity], result of:
            0.010391526 = score(doc=2171,freq=1.0), product of:
              0.060139753 = queryWeight, product of:
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.019034086 = queryNorm
              0.17278963 = fieldWeight in 2171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2171)
          0.07872073 = weight(abstract_txt:opinions in 2171) [ClassicSimilarity], result of:
            0.07872073 = score(doc=2171,freq=2.0), product of:
              0.14613296 = queryWeight, product of:
                1.1022453 = boost
                6.965269 = idf(docFreq=113, maxDocs=44421)
                0.019034086 = queryNorm
              0.53869253 = fieldWeight in 2171, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.965269 = idf(docFreq=113, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2171)
          0.014351947 = weight(abstract_txt:other in 2171) [ClassicSimilarity], result of:
            0.014351947 = score(doc=2171,freq=1.0), product of:
              0.074584626 = queryWeight, product of:
                1.1136374 = boost
                3.5186288 = idf(docFreq=3578, maxDocs=44421)
                0.019034086 = queryNorm
              0.19242501 = fieldWeight in 2171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5186288 = idf(docFreq=3578, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2171)
          0.015556586 = weight(abstract_txt:been in 2171) [ClassicSimilarity], result of:
            0.015556586 = score(doc=2171,freq=1.0), product of:
              0.078701854 = queryWeight, product of:
                1.1439621 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.019034086 = queryNorm
              0.1976648 = fieldWeight in 2171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2171)
          0.06426597 = weight(abstract_txt:blogs in 2171) [ClassicSimilarity], result of:
            0.06426597 = score(doc=2171,freq=1.0), product of:
              0.16082472 = queryWeight, product of:
                1.1563268 = boost
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.019034086 = queryNorm
              0.39960256 = fieldWeight in 2171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2171)
          0.15708157 = weight(abstract_txt:sentiment in 2171) [ClassicSimilarity], result of:
            0.15708157 = score(doc=2171,freq=5.0), product of:
              0.17065758 = queryWeight, product of:
                1.1911514 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.019034086 = queryNorm
              0.92044884 = fieldWeight in 2171, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2171)
          0.026276313 = weight(abstract_txt:available in 2171) [ClassicSimilarity], result of:
            0.026276313 = score(doc=2171,freq=1.0), product of:
              0.11162249 = queryWeight, product of:
                1.3623699 = boost
                4.304519 = idf(docFreq=1630, maxDocs=44421)
                0.019034086 = queryNorm
              0.23540339 = fieldWeight in 2171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.304519 = idf(docFreq=1630, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2171)
          0.02704006 = weight(abstract_txt:present in 2171) [ClassicSimilarity], result of:
            0.02704006 = score(doc=2171,freq=1.0), product of:
              0.1137751 = queryWeight, product of:
                1.3754436 = boost
                4.3458266 = idf(docFreq=1564, maxDocs=44421)
                0.019034086 = queryNorm
              0.23766239 = fieldWeight in 2171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3458266 = idf(docFreq=1564, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2171)
          0.009179826 = weight(abstract_txt:this in 2171) [ClassicSimilarity], result of:
            0.009179826 = score(doc=2171,freq=1.0), product of:
              0.06976031 = queryWeight, product of:
                1.523135 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.019034086 = queryNorm
              0.13159096 = fieldWeight in 2171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2171)
          0.03993878 = weight(abstract_txt:reviews in 2171) [ClassicSimilarity], result of:
            0.03993878 = score(doc=2171,freq=1.0), product of:
              0.14756112 = queryWeight, product of:
                1.5664089 = boost
                4.9491973 = idf(docFreq=855, maxDocs=44421)
                0.019034086 = queryNorm
              0.27065924 = fieldWeight in 2171, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9491973 = idf(docFreq=855, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2171)
          0.13416429 = weight(abstract_txt:mining in 2171) [ClassicSimilarity], result of:
            0.13416429 = score(doc=2171,freq=3.0), product of:
              0.22948782 = queryWeight, product of:
                1.9534352 = boost
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.019034086 = queryNorm
              0.5846249 = fieldWeight in 2171, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2171)
          0.3025545 = weight(abstract_txt:opinion in 2171) [ClassicSimilarity], result of:
            0.3025545 = score(doc=2171,freq=8.0), product of:
              0.28458664 = queryWeight, product of:
                2.1753364 = boost
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.019034086 = queryNorm
              1.0631367 = fieldWeight in 2171, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2171)
        0.48 = coord(12/25)
    
  3. Perea-Ortega, J.M.; Martín-Valdivia, M.T.; Ureña-López, L.A.; Martínez-Cámara, E.: Improving polarity classification of bilingual parallel corpora combining machine learning and semantic orientation approaches (2013) 0.37
    0.37169862 = sum of:
      0.37169862 = product of:
        0.92924654 = sum of:
          0.06361596 = weight(abstract_txt:opinions in 2045) [ClassicSimilarity], result of:
            0.06361596 = score(doc=2045,freq=1.0), product of:
              0.14613296 = queryWeight, product of:
                1.1022453 = boost
                6.965269 = idf(docFreq=113, maxDocs=44421)
                0.019034086 = queryNorm
              0.43532932 = fieldWeight in 2045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.965269 = idf(docFreq=113, maxDocs=44421)
                0.0625 = fieldNorm(doc=2045)
          0.017778955 = weight(abstract_txt:been in 2045) [ClassicSimilarity], result of:
            0.017778955 = score(doc=2045,freq=1.0), product of:
              0.078701854 = queryWeight, product of:
                1.1439621 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.019034086 = queryNorm
              0.22590263 = fieldWeight in 2045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.0625 = fieldNorm(doc=2045)
          0.08028458 = weight(abstract_txt:sentiment in 2045) [ClassicSimilarity], result of:
            0.08028458 = score(doc=2045,freq=1.0), product of:
              0.17065758 = queryWeight, product of:
                1.1911514 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.019034086 = queryNorm
              0.47044253 = fieldWeight in 2045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=2045)
          0.02785896 = weight(abstract_txt:related in 2045) [ClassicSimilarity], result of:
            0.02785896 = score(doc=2045,freq=1.0), product of:
              0.10617544 = queryWeight, product of:
                1.328713 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.019034086 = queryNorm
              0.2623861 = fieldWeight in 2045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.0625 = fieldNorm(doc=2045)
          0.018171342 = weight(abstract_txt:this in 2045) [ClassicSimilarity], result of:
            0.018171342 = score(doc=2045,freq=3.0), product of:
              0.06976031 = queryWeight, product of:
                1.523135 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.019034086 = queryNorm
              0.26048255 = fieldWeight in 2045, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.0625 = fieldNorm(doc=2045)
          0.044453073 = weight(abstract_txt:task in 2045) [ClassicSimilarity], result of:
            0.044453073 = score(doc=2045,freq=1.0), product of:
              0.14498241 = queryWeight, product of:
                1.5526617 = boost
                4.9057617 = idf(docFreq=893, maxDocs=44421)
                0.019034086 = queryNorm
              0.3066101 = fieldWeight in 2045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9057617 = idf(docFreq=893, maxDocs=44421)
                0.0625 = fieldNorm(doc=2045)
          0.08852548 = weight(abstract_txt:mining in 2045) [ClassicSimilarity], result of:
            0.08852548 = score(doc=2045,freq=1.0), product of:
              0.22948782 = queryWeight, product of:
                1.9534352 = boost
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.019034086 = queryNorm
              0.3857524 = fieldWeight in 2045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.0625 = fieldNorm(doc=2045)
          0.1728883 = weight(abstract_txt:opinion in 2045) [ClassicSimilarity], result of:
            0.1728883 = score(doc=2045,freq=2.0), product of:
              0.28458664 = queryWeight, product of:
                2.1753364 = boost
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.019034086 = queryNorm
              0.6075067 = fieldWeight in 2045, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.0625 = fieldNorm(doc=2045)
          0.24541828 = weight(abstract_txt:arabic in 2045) [ClassicSimilarity], result of:
            0.24541828 = score(doc=2045,freq=1.0), product of:
              0.5184209 = queryWeight, product of:
                3.5958872 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.019034086 = queryNorm
              0.47339582 = fieldWeight in 2045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0625 = fieldNorm(doc=2045)
          0.17025161 = weight(abstract_txt:corpus in 2045) [ClassicSimilarity], result of:
            0.17025161 = score(doc=2045,freq=1.0), product of:
              0.44714832 = queryWeight, product of:
                3.8562038 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.019034086 = queryNorm
              0.38074973 = fieldWeight in 2045, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.0625 = fieldNorm(doc=2045)
        0.4 = coord(10/25)
    
  4. Belbachir, F.; Boughanem, M.: Using language models to improve opinion detection (2018) 0.26
    0.25953332 = sum of:
      0.25953332 = product of:
        0.7209258 = sum of:
          0.014695838 = weight(abstract_txt:research in 44) [ClassicSimilarity], result of:
            0.014695838 = score(doc=44,freq=2.0), product of:
              0.060139753 = queryWeight, product of:
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.019034086 = queryNorm
              0.24436146 = fieldWeight in 44, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.06426597 = weight(abstract_txt:blogs in 44) [ClassicSimilarity], result of:
            0.06426597 = score(doc=44,freq=1.0), product of:
              0.16082472 = queryWeight, product of:
                1.1563268 = boost
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.019034086 = queryNorm
              0.39960256 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.032297477 = weight(abstract_txt:different in 44) [ClassicSimilarity], result of:
            0.032297477 = score(doc=44,freq=4.0), product of:
              0.080686554 = queryWeight, product of:
                1.1582966 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.019034086 = queryNorm
              0.40028328 = fieldWeight in 44, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.09535598 = weight(abstract_txt:movie in 44) [ClassicSimilarity], result of:
            0.09535598 = score(doc=44,freq=1.0), product of:
              0.2092172 = queryWeight, product of:
                1.3188727 = boost
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.019034086 = queryNorm
              0.45577505 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.026276313 = weight(abstract_txt:available in 44) [ClassicSimilarity], result of:
            0.026276313 = score(doc=44,freq=1.0), product of:
              0.11162249 = queryWeight, product of:
                1.3623699 = boost
                4.304519 = idf(docFreq=1630, maxDocs=44421)
                0.019034086 = queryNorm
              0.23540339 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.304519 = idf(docFreq=1630, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.02704006 = weight(abstract_txt:present in 44) [ClassicSimilarity], result of:
            0.02704006 = score(doc=44,freq=1.0), product of:
              0.1137751 = queryWeight, product of:
                1.3754436 = boost
                4.3458266 = idf(docFreq=1564, maxDocs=44421)
                0.019034086 = queryNorm
              0.23766239 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3458266 = idf(docFreq=1564, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.012982234 = weight(abstract_txt:this in 44) [ClassicSimilarity], result of:
            0.012982234 = score(doc=44,freq=2.0), product of:
              0.06976031 = queryWeight, product of:
                1.523135 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.019034086 = queryNorm
              0.18609773 = fieldWeight in 44, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.0774598 = weight(abstract_txt:mining in 44) [ClassicSimilarity], result of:
            0.0774598 = score(doc=44,freq=1.0), product of:
              0.22948782 = queryWeight, product of:
                1.9534352 = boost
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.019034086 = queryNorm
              0.33753335 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.3705521 = weight(abstract_txt:opinion in 44) [ClassicSimilarity], result of:
            0.3705521 = score(doc=44,freq=12.0), product of:
              0.28458664 = queryWeight, product of:
                2.1753364 = boost
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.019034086 = queryNorm
              1.3020713 = fieldWeight in 44, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
        0.36 = coord(9/25)
    
  5. Abdelali, A.: Localization in modern standard Arabic (2004) 0.25
    0.24554054 = sum of:
      0.24554054 = product of:
        1.0230856 = sum of:
          0.03142905 = weight(abstract_txt:been in 3066) [ClassicSimilarity], result of:
            0.03142905 = score(doc=3066,freq=2.0), product of:
              0.078701854 = queryWeight, product of:
                1.1439621 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.019034086 = queryNorm
              0.3993432 = fieldWeight in 3066, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.078125 = fieldNorm(doc=3066)
          0.03262538 = weight(abstract_txt:different in 3066) [ClassicSimilarity], result of:
            0.03262538 = score(doc=3066,freq=2.0), product of:
              0.080686554 = queryWeight, product of:
                1.1582966 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.019034086 = queryNorm
              0.40434718 = fieldWeight in 3066, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.078125 = fieldNorm(doc=3066)
          0.03753759 = weight(abstract_txt:available in 3066) [ClassicSimilarity], result of:
            0.03753759 = score(doc=3066,freq=1.0), product of:
              0.11162249 = queryWeight, product of:
                1.3623699 = boost
                4.304519 = idf(docFreq=1630, maxDocs=44421)
                0.019034086 = queryNorm
              0.33629057 = fieldWeight in 3066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.304519 = idf(docFreq=1630, maxDocs=44421)
                0.078125 = fieldNorm(doc=3066)
          0.022714179 = weight(abstract_txt:this in 3066) [ClassicSimilarity], result of:
            0.022714179 = score(doc=3066,freq=3.0), product of:
              0.06976031 = queryWeight, product of:
                1.523135 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.019034086 = queryNorm
              0.3256032 = fieldWeight in 3066, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.078125 = fieldNorm(doc=3066)
          0.68596494 = weight(abstract_txt:arabic in 3066) [ClassicSimilarity], result of:
            0.68596494 = score(doc=3066,freq=5.0), product of:
              0.5184209 = queryWeight, product of:
                3.5958872 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.019034086 = queryNorm
              1.3231815 = fieldWeight in 3066, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.078125 = fieldNorm(doc=3066)
          0.2128145 = weight(abstract_txt:corpus in 3066) [ClassicSimilarity], result of:
            0.2128145 = score(doc=3066,freq=1.0), product of:
              0.44714832 = queryWeight, product of:
                3.8562038 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.019034086 = queryNorm
              0.47593716 = fieldWeight in 3066, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.078125 = fieldNorm(doc=3066)
        0.24 = coord(6/25)