Document (#36739)

Author
Anizi, M.
Dichy, J.
Title
Improving information retrieval in Arabic through a multi-agent approach and a rich lexical resource
Source
Knowledge organization. 38(2011) no.5, pp.405-413
Year
2011
Abstract
This paper addresses the optimization of information retrieval in Arabic. The results derived from the expanding development of sites in Arabic are often spectacular. Nevertheless, several observations indicate that the responses remain disappointing, particularly upon comparing users' requests and quality of responses. One of the problems encountered by users is the loss of time when navigating between different URLs to find adequate responses. This, in many cases, is due to the absence of forms morphologically related to the research keyword. Such problems can be approached through a morphological analyzer drawing on the DIINAR.1 morpho-lexical resource. A second problem concerns the formulation of the query, which may prove ambiguous, as in everyday language. We then focus on contextual disambiguation based on a rich lexical resource that includes collocations and set expressions. The overall scheme of such a resource will only be hinted at here. Our approach leads to the elaboration of a multi-agent system, motivated by a need to solve problems encountered when using conventional methods of analysis, and to improve the results of queries thanks to a better collaboration between different levels of analysis. We suggest resorting to four agents: morphological, morpho-lexical, contextualization, and an interface agent. These agents 'negotiate' and 'cooperate' throughout the analysis process, starting from the submission of the initial query, and going on until an adequate query is obtained.
Content
Contribution within a special section: Knowledge Organization, Competitive Intelligence, and Information Systems - Papers from the 4th International Conference on "Information Systems & Economic Intelligence," February 17-19, 2011, Marrakech, Morocco.
Footnote
Cf.: http://www.ergon-verlag.de/isko_ko/downloads/ko_38_2011_5d.pdf.
Theme
Computational linguistics

Similar documents (content)

  1. Fautsch, C.; Savoy, J.: Algorithmic stemmers or morphological analysis? : an evaluation (2009) 0.11
    0.11307497 = sum of:
      0.11307497 = product of:
        0.56537485 = sum of:
          0.021575518 = weight(abstract_txt:when in 3950) [ClassicSimilarity], result of:
            0.021575518 = score(doc=3950,freq=1.0), product of:
              0.066609 = queryWeight, product of:
                1.0029864 = boost
                4.1460857 = idf(docFreq=1910, maxDocs=44421)
                0.01601768 = queryNorm
              0.32391295 = fieldWeight in 3950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1460857 = idf(docFreq=1910, maxDocs=44421)
                0.078125 = fieldNorm(doc=3950)
          0.118501306 = weight(abstract_txt:analyzer in 3950) [ClassicSimilarity], result of:
            0.118501306 = score(doc=3950,freq=1.0), product of:
              0.16457513 = queryWeight, product of:
                1.1147968 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.01601768 = queryNorm
              0.72004384 = fieldWeight in 3950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.078125 = fieldNorm(doc=3950)
          0.12188933 = weight(abstract_txt:morphologically in 3950) [ClassicSimilarity], result of:
            0.12188933 = score(doc=3950,freq=1.0), product of:
              0.16769724 = queryWeight, product of:
                1.1253214 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.01601768 = queryNorm
              0.7268416 = fieldWeight in 3950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.078125 = fieldNorm(doc=3950)
          0.0311162 = weight(abstract_txt:analysis in 3950) [ClassicSimilarity], result of:
            0.0311162 = score(doc=3950,freq=2.0), product of:
              0.077250905 = queryWeight, product of:
                1.322897 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.01601768 = queryNorm
              0.402794 = fieldWeight in 3950, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.078125 = fieldNorm(doc=3950)
          0.27229255 = weight(abstract_txt:morphological in 3950) [ClassicSimilarity], result of:
            0.27229255 = score(doc=3950,freq=3.0), product of:
              0.2503469 = queryWeight, product of:
                1.9444631 = boost
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.01601768 = queryNorm
              1.087661 = fieldWeight in 3950, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.078125 = fieldNorm(doc=3950)
        0.2 = coord(5/25)
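
The score tree above can be reproduced by hand. The following is a minimal sketch, assuming the formulas that the listed numbers imply and that match Lucene's ClassicSimilarity as used with `coord` and `queryNorm`: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm. The function and variable names are illustrative only and are not part of the record; the input values are copied from the listing for doc 3950.

```python
import math

def idf(doc_freq, max_docs):
    # Inverse document frequency as implied by the listing:
    # 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # score = queryWeight * fieldWeight for one matching term
    tf = math.sqrt(freq)                          # tf(freq) = sqrt(freq)
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm  # query-side factor
    field_weight = tf * term_idf * field_norm     # document-side factor
    return query_weight * field_weight

# Constants copied from the listing for doc 3950.
MAX_DOCS = 44421
QUERY_NORM = 0.01601768
FIELD_NORM = 0.078125

# (freq, docFreq, boost) for each matching abstract_txt term.
terms = [
    (1.0, 1910, 1.0029864),  # when
    (1.0, 11, 1.1147968),    # analyzer
    (1.0, 10, 1.1253214),    # morphologically
    (2.0, 3151, 1.322897),   # analysis
    (3.0, 38, 1.9444631),    # morphological
]

scores = [
    term_score(freq, doc_freq, MAX_DOCS, boost, QUERY_NORM, FIELD_NORM)
    for freq, doc_freq, boost in terms
]

# coord(5/25): 5 of the 25 query terms matched this document.
coord = 5 / 25
total = coord * sum(scores)

print(scores)
print(total)
```

The result should agree with the listed 0.11307497 up to small rounding differences, since the listing was presumably computed in single precision while Python uses double precision. The `coord` factor explains why a document with a larger raw sum can still rank lower: it scales the sum by the fraction of query terms that matched.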
    
  2. Dumais, S.T.: Latent semantic analysis (2003) 0.10
    0.104584254 = sum of:
      0.104584254 = product of:
        0.3735152 = sum of:
          0.008630208 = weight(abstract_txt:when in 3462) [ClassicSimilarity], result of:
            0.008630208 = score(doc=3462,freq=1.0), product of:
              0.066609 = queryWeight, product of:
                1.0029864 = boost
                4.1460857 = idf(docFreq=1910, maxDocs=44421)
                0.01601768 = queryNorm
              0.12956518 = fieldWeight in 3462, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1460857 = idf(docFreq=1910, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.06895101 = weight(abstract_txt:morphologically in 3462) [ClassicSimilarity], result of:
            0.06895101 = score(doc=3462,freq=2.0), product of:
              0.16769724 = queryWeight, product of:
                1.1253214 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.01601768 = queryNorm
              0.4111637 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.017601982 = weight(abstract_txt:analysis in 3462) [ClassicSimilarity], result of:
            0.017601982 = score(doc=3462,freq=4.0), product of:
              0.077250905 = queryWeight, product of:
                1.322897 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.01601768 = queryNorm
              0.2278547 = fieldWeight in 3462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.028899457 = weight(abstract_txt:problems in 3462) [ClassicSimilarity], result of:
            0.028899457 = score(doc=3462,freq=4.0), product of:
              0.107511684 = queryWeight, product of:
                1.5606376 = boost
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.01601768 = queryNorm
              0.26880294 = fieldWeight in 3462, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.043651074 = weight(abstract_txt:query in 3462) [ClassicSimilarity], result of:
            0.043651074 = score(doc=3462,freq=5.0), product of:
              0.13138804 = queryWeight, product of:
                1.7252505 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.01601768 = queryNorm
              0.3322302 = fieldWeight in 3462, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.088930376 = weight(abstract_txt:morphological in 3462) [ClassicSimilarity], result of:
            0.088930376 = score(doc=3462,freq=2.0), product of:
              0.2503469 = queryWeight, product of:
                1.9444631 = boost
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.01601768 = queryNorm
              0.3552286 = fieldWeight in 3462, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
          0.116851084 = weight(abstract_txt:lexical in 3462) [ClassicSimilarity], result of:
            0.116851084 = score(doc=3462,freq=3.0), product of:
              0.33055484 = queryWeight, product of:
                3.159842 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.01601768 = queryNorm
              0.35349983 = fieldWeight in 3462, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.03125 = fieldNorm(doc=3462)
        0.28 = coord(7/25)
    
  3. Bicchieri, C.: The potential for the evolution of co-operation among web agents (1998) 0.09
    0.09357023 = sum of:
      0.09357023 = product of:
        0.46785116 = sum of:
          0.021575518 = weight(abstract_txt:when in 3297) [ClassicSimilarity], result of:
            0.021575518 = score(doc=3297,freq=1.0), product of:
              0.066609 = queryWeight, product of:
                1.0029864 = boost
                4.1460857 = idf(docFreq=1910, maxDocs=44421)
                0.01601768 = queryNorm
              0.32391295 = fieldWeight in 3297, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1460857 = idf(docFreq=1910, maxDocs=44421)
                0.078125 = fieldNorm(doc=3297)
          0.03810941 = weight(abstract_txt:analysis in 3297) [ClassicSimilarity], result of:
            0.03810941 = score(doc=3297,freq=3.0), product of:
              0.077250905 = queryWeight, product of:
                1.322897 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.01601768 = queryNorm
              0.4933199 = fieldWeight in 3297, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.078125 = fieldNorm(doc=3297)
          0.124912426 = weight(abstract_txt:agents in 3297) [ClassicSimilarity], result of:
            0.124912426 = score(doc=3297,freq=2.0), product of:
              0.17045872 = queryWeight, product of:
                1.6044945 = boost
                6.6325636 = idf(docFreq=158, maxDocs=44421)
                0.01601768 = queryNorm
              0.7328016 = fieldWeight in 3297, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6325636 = idf(docFreq=158, maxDocs=44421)
                0.078125 = fieldNorm(doc=3297)
          0.123005 = weight(abstract_txt:responses in 3297) [ClassicSimilarity], result of:
            0.123005 = score(doc=3297,freq=1.0), product of:
              0.24333487 = queryWeight, product of:
                2.3478827 = boost
                6.470359 = idf(docFreq=186, maxDocs=44421)
                0.01601768 = queryNorm
              0.5054968 = fieldWeight in 3297, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.470359 = idf(docFreq=186, maxDocs=44421)
                0.078125 = fieldNorm(doc=3297)
          0.16024882 = weight(abstract_txt:agent in 3297) [ClassicSimilarity], result of:
            0.16024882 = score(doc=3297,freq=1.0), product of:
              0.29025903 = queryWeight, product of:
                2.5642898 = boost
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.01601768 = queryNorm
              0.552089 = fieldWeight in 3297, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.078125 = fieldNorm(doc=3297)
        0.2 = coord(5/25)
    
  4. Al-Sughaiyer, I.A.; Al-Kharashi, I.A.: Arabic morphological analysis techniques : a comprehensive survey (2004) 0.09
    0.09290547 = sum of:
      0.09290547 = product of:
        0.77421224 = sum of:
          0.03733944 = weight(abstract_txt:analysis in 3206) [ClassicSimilarity], result of:
            0.03733944 = score(doc=3206,freq=2.0), product of:
              0.077250905 = queryWeight, product of:
                1.322897 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.01601768 = queryNorm
              0.48335278 = fieldWeight in 3206, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.09375 = fieldNorm(doc=3206)
          0.32675105 = weight(abstract_txt:morphological in 3206) [ClassicSimilarity], result of:
            0.32675105 = score(doc=3206,freq=3.0), product of:
              0.2503469 = queryWeight, product of:
                1.9444631 = boost
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.01601768 = queryNorm
              1.3051932 = fieldWeight in 3206, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.09375 = fieldNorm(doc=3206)
          0.41012174 = weight(abstract_txt:arabic in 3206) [ClassicSimilarity], result of:
            0.41012174 = score(doc=3206,freq=3.0), product of:
              0.3334544 = queryWeight, product of:
                2.7484794 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.01601768 = queryNorm
              1.2299185 = fieldWeight in 3206, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.09375 = fieldNorm(doc=3206)
        0.12 = coord(3/25)
    
  5. Galitsky, B.: Can many agents answer questions better than one? (2005) 0.09
    0.09165338 = sum of:
      0.09165338 = product of:
        0.45826688 = sum of:
          0.021575518 = weight(abstract_txt:when in 4094) [ClassicSimilarity], result of:
            0.021575518 = score(doc=4094,freq=1.0), product of:
              0.066609 = queryWeight, product of:
                1.0029864 = boost
                4.1460857 = idf(docFreq=1910, maxDocs=44421)
                0.01601768 = queryNorm
              0.32391295 = fieldWeight in 4094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1460857 = idf(docFreq=1910, maxDocs=44421)
                0.078125 = fieldNorm(doc=4094)
          0.022002477 = weight(abstract_txt:analysis in 4094) [ClassicSimilarity], result of:
            0.022002477 = score(doc=4094,freq=1.0), product of:
              0.077250905 = queryWeight, product of:
                1.322897 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.01601768 = queryNorm
              0.28481838 = fieldWeight in 4094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.078125 = fieldNorm(doc=4094)
          0.08832643 = weight(abstract_txt:agents in 4094) [ClassicSimilarity], result of:
            0.08832643 = score(doc=4094,freq=1.0), product of:
              0.17045872 = queryWeight, product of:
                1.6044945 = boost
                6.6325636 = idf(docFreq=158, maxDocs=44421)
                0.01601768 = queryNorm
              0.51816905 = fieldWeight in 4094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6325636 = idf(docFreq=158, maxDocs=44421)
                0.078125 = fieldNorm(doc=4094)
          0.048803385 = weight(abstract_txt:query in 4094) [ClassicSimilarity], result of:
            0.048803385 = score(doc=4094,freq=1.0), product of:
              0.13138804 = queryWeight, product of:
                1.7252505 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.01601768 = queryNorm
              0.37144467 = fieldWeight in 4094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=4094)
          0.27755907 = weight(abstract_txt:agent in 4094) [ClassicSimilarity], result of:
            0.27755907 = score(doc=4094,freq=3.0), product of:
              0.29025903 = queryWeight, product of:
                2.5642898 = boost
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.01601768 = queryNorm
              0.95624614 = fieldWeight in 4094, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.078125 = fieldNorm(doc=4094)
        0.2 = coord(5/25)