Document (#34373)

Author
Bellaachia, A.
Amor-Tijani, G.
Title
Proper nouns in English-Arabic cross language information retrieval
Source
Journal of the American Society for Information Science and Technology. 59(2008) no.12, p.1925-1932
Year
2008
Abstract
Out-of-vocabulary words, mostly proper nouns and technical terms, are a main source of performance degradation in Cross Language Information Retrieval (CLIR) systems. These are words not found in the dictionary. Bilingual dictionaries in general do not cover most proper nouns, which are usually primary keys in the query. Because proper nouns are spelling variants of each other in most languages, the common approach to finding the target language correspondents of the original query key is to apply an approximate string matching technique against the target database index. The n-gram technique has proved to be the most effective of the string matching techniques. A problem arises when the languages involved have different alphabets. Transliteration is then applied, based on phonetic similarities between the languages. In this study, transliteration and the n-gram technique are combined to generate possible transliterations in an English-Arabic CLIR system. We refer to this technique as Transliteration N-Gram (TNG). We further enhance TNG by applying Part Of Speech disambiguation to the set of transliterations so that words with a similar spelling, but a different meaning, are excluded. Experimental results show that TNG gives promising results, and enhanced TNG further improves performance.
Theme
Multilingual problems
Computational linguistics
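
The abstract describes matching a query key against the target index with an approximate n-gram string matching technique. The following is a minimal sketch of that general idea only, not the authors' TNG method: it assumes padded character bigrams and the Dice coefficient, neither of which the abstract specifies, and it omits the transliteration step entirely. The function names (char_ngrams, dice_similarity, best_matches) and the sample index terms are hypothetical.

```python
def char_ngrams(word, n=2):
    """Return the set of character n-grams of a word, padded at both ends.

    Padding with '_' (an assumption of this sketch) lets the first and
    last characters contribute n-grams of their own.
    """
    pad = "_" * (n - 1)
    padded = f"{pad}{word.lower()}{pad}"
    return {padded[i:i + n] for i in range(len(padded) - n + 1)}


def dice_similarity(a, b, n=2):
    """Dice coefficient over the two words' n-gram sets (0.0 to 1.0)."""
    grams_a, grams_b = char_ngrams(a, n), char_ngrams(b, n)
    if not grams_a or not grams_b:
        return 0.0
    return 2 * len(grams_a & grams_b) / (len(grams_a) + len(grams_b))


def best_matches(query_key, index_terms, n=2, top_k=3):
    """Rank index terms by n-gram similarity to the query key.

    Terms that share no n-gram with the query key are dropped.
    """
    scored = sorted(
        ((dice_similarity(query_key, term, n), term) for term in index_terms),
        reverse=True,
    )
    return [term for score, term in scored[:top_k] if score > 0]


# Hypothetical index terms: spelling variants of one name plus an unrelated word.
candidates = best_matches("mohammed", ["muhammad", "mohamed", "london"])
```

In a cross-alphabet setting such as English-Arabic, the query key would first have to be transliterated into the target script before a comparison like this could be applied to the target index.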

Similar documents (content)

  1. Fattah, M. Abdel; Ren, F.: English-Arabic proper-noun transliteration-pairs creation (2008) 0.69
    0.68936443 = sum of:
      0.68936443 = product of:
        1.5667374 = sum of:
          0.098246895 = weight(abstract_txt:alphabets in 2999) [ClassicSimilarity], result of:
            0.098246895 = score(doc=2999,freq=1.0), product of:
              0.16724864 = queryWeight, product of:
                1.1537865 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.015422718 = queryNorm
              0.5874302 = fieldWeight in 2999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.025435042 = weight(abstract_txt:query in 2999) [ClassicSimilarity], result of:
            0.025435042 = score(doc=2999,freq=1.0), product of:
              0.08559499 = queryWeight, product of:
                1.1673023 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.015422718 = queryNorm
              0.29715574 = fieldWeight in 2999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.081995666 = weight(abstract_txt:english in 2999) [ClassicSimilarity], result of:
            0.081995666 = score(doc=2999,freq=4.0), product of:
              0.11767042 = queryWeight, product of:
                1.3686513 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.015422718 = queryNorm
              0.6968248 = fieldWeight in 2999, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.04349946 = weight(abstract_txt:most in 2999) [ClassicSimilarity], result of:
            0.04349946 = score(doc=2999,freq=4.0), product of:
              0.08827269 = queryWeight, product of:
                1.4518374 = boost
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.015422718 = queryNorm
              0.492785 = fieldWeight in 2999, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.025758948 = weight(abstract_txt:language in 2999) [ClassicSimilarity], result of:
            0.025758948 = score(doc=2999,freq=1.0), product of:
              0.09881189 = queryWeight, product of:
                1.5360643 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.015422718 = queryNorm
              0.26068673 = fieldWeight in 2999, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.29086962 = weight(abstract_txt:arabic in 2999) [ClassicSimilarity], result of:
            0.29086962 = score(doc=2999,freq=8.0), product of:
              0.21723458 = queryWeight, product of:
                1.8596176 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.015422718 = queryNorm
              1.3389655 = fieldWeight in 2999, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.07017116 = weight(abstract_txt:languages in 2999) [ClassicSimilarity], result of:
            0.07017116 = score(doc=2999,freq=2.0), product of:
              0.1529747 = queryWeight, product of:
                1.9112372 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.015422718 = queryNorm
              0.45871094 = fieldWeight in 2999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.109074205 = weight(abstract_txt:words in 2999) [ClassicSimilarity], result of:
            0.109074205 = score(doc=2999,freq=4.0), product of:
              0.16292404 = queryWeight, product of:
                1.9724108 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.015422718 = queryNorm
              0.6694789 = fieldWeight in 2999, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.22170138 = weight(abstract_txt:proper in 2999) [ClassicSimilarity], result of:
            0.22170138 = score(doc=2999,freq=5.0), product of:
              0.24268673 = queryWeight, product of:
                2.4072866 = boost
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.015422718 = queryNorm
              0.91352904 = fieldWeight in 2999, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.31629372 = weight(abstract_txt:nouns in 2999) [ClassicSimilarity], result of:
            0.31629372 = score(doc=2999,freq=3.0), product of:
              0.36465073 = queryWeight, product of:
                2.9508243 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.015422718 = queryNorm
              0.8673881 = fieldWeight in 2999, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
          0.2836913 = weight(abstract_txt:transliteration in 2999) [ClassicSimilarity], result of:
            0.2836913 = score(doc=2999,freq=2.0), product of:
              0.38821992 = queryWeight, product of:
                3.0446944 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.015422718 = queryNorm
              0.73074895 = fieldWeight in 2999, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0625 = fieldNorm(doc=2999)
        0.44 = coord(11/25)
    
  2. Toivonen, J.; Pirkola, A.; Keskustalo, H.; Visala, K.; Järvelin, K.: Translating cross-lingual spelling variants using transformation rules (2005) 0.57
    0.5670493 = sum of:
      0.5670493 = product of:
        1.2887485 = sum of:
          0.05124729 = weight(abstract_txt:english in 2052) [ClassicSimilarity], result of:
            0.05124729 = score(doc=2052,freq=1.0), product of:
              0.11767042 = queryWeight, product of:
                1.3686513 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.015422718 = queryNorm
              0.4355155 = fieldWeight in 2052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
          0.07351509 = weight(abstract_txt:cross in 2052) [ClassicSimilarity], result of:
            0.07351509 = score(doc=2052,freq=2.0), product of:
              0.11879396 = queryWeight, product of:
                1.3751699 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.015422718 = queryNorm
              0.61884534 = fieldWeight in 2052, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
          0.09227504 = weight(abstract_txt:matching in 2052) [ClassicSimilarity], result of:
            0.09227504 = score(doc=2052,freq=2.0), product of:
              0.13822912 = queryWeight, product of:
                1.4834023 = boost
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.015422718 = queryNorm
              0.6675514 = fieldWeight in 2052, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
          0.06439737 = weight(abstract_txt:language in 2052) [ClassicSimilarity], result of:
            0.06439737 = score(doc=2052,freq=4.0), product of:
              0.09881189 = queryWeight, product of:
                1.5360643 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.015422718 = queryNorm
              0.6517168 = fieldWeight in 2052, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
          0.14541125 = weight(abstract_txt:target in 2052) [ClassicSimilarity], result of:
            0.14541125 = score(doc=2052,freq=3.0), product of:
              0.1635228 = queryWeight, product of:
                1.6134232 = boost
                6.571569 = idf(docFreq=168, maxDocs=44421)
                0.015422718 = queryNorm
              0.88924146 = fieldWeight in 2052, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.571569 = idf(docFreq=168, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
          0.18791527 = weight(abstract_txt:spelling in 2052) [ClassicSimilarity], result of:
            0.18791527 = score(doc=2052,freq=2.0), product of:
              0.22208442 = queryWeight, product of:
                1.8802613 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.015422718 = queryNorm
              0.8461434 = fieldWeight in 2052, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
          0.087713964 = weight(abstract_txt:languages in 2052) [ClassicSimilarity], result of:
            0.087713964 = score(doc=2052,freq=2.0), product of:
              0.1529747 = queryWeight, product of:
                1.9112372 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.015422718 = queryNorm
              0.5733887 = fieldWeight in 2052, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
          0.068171374 = weight(abstract_txt:words in 2052) [ClassicSimilarity], result of:
            0.068171374 = score(doc=2052,freq=1.0), product of:
              0.16292404 = queryWeight, product of:
                1.9724108 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.015422718 = queryNorm
              0.4184243 = fieldWeight in 2052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
          0.16340283 = weight(abstract_txt:clir in 2052) [ClassicSimilarity], result of:
            0.16340283 = score(doc=2052,freq=1.0), product of:
              0.25491363 = queryWeight, product of:
                2.0144463 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.015422718 = queryNorm
              0.6410125 = fieldWeight in 2052, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
          0.17527032 = weight(abstract_txt:proper in 2052) [ClassicSimilarity], result of:
            0.17527032 = score(doc=2052,freq=2.0), product of:
              0.24268673 = queryWeight, product of:
                2.4072866 = boost
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.015422718 = queryNorm
              0.7222081 = fieldWeight in 2052, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
          0.17942864 = weight(abstract_txt:technique in 2052) [ClassicSimilarity], result of:
            0.17942864 = score(doc=2052,freq=3.0), product of:
              0.23701954 = queryWeight, product of:
                2.747048 = boost
                5.5944448 = idf(docFreq=448, maxDocs=44421)
                0.015422718 = queryNorm
              0.7570205 = fieldWeight in 2052, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5944448 = idf(docFreq=448, maxDocs=44421)
                0.078125 = fieldNorm(doc=2052)
        0.44 = coord(11/25)
    
  3. Pirkola, A.; Puolamäki, D.; Järvelin, K.: Applying query structuring in cross-language retrieval (2003) 0.40
    0.4047093 = sum of:
      0.4047093 = product of:
        1.0117732 = sum of:
          0.079956256 = weight(abstract_txt:keys in 2074) [ClassicSimilarity], result of:
            0.079956256 = score(doc=2074,freq=1.0), product of:
              0.1256353 = queryWeight, product of:
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.015422718 = queryNorm
              0.63641554 = fieldWeight in 2074, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.078125 = fieldNorm(doc=2074)
          0.063587606 = weight(abstract_txt:query in 2074) [ClassicSimilarity], result of:
            0.063587606 = score(doc=2074,freq=4.0), product of:
              0.08559499 = queryWeight, product of:
                1.1673023 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.015422718 = queryNorm
              0.74288934 = fieldWeight in 2074, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.078125 = fieldNorm(doc=2074)
          0.072474614 = weight(abstract_txt:english in 2074) [ClassicSimilarity], result of:
            0.072474614 = score(doc=2074,freq=2.0), product of:
              0.11767042 = queryWeight, product of:
                1.3686513 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.015422718 = queryNorm
              0.6159119 = fieldWeight in 2074, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.078125 = fieldNorm(doc=2074)
          0.051983017 = weight(abstract_txt:cross in 2074) [ClassicSimilarity], result of:
            0.051983017 = score(doc=2074,freq=1.0), product of:
              0.11879396 = queryWeight, product of:
                1.3751699 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.015422718 = queryNorm
              0.43758973 = fieldWeight in 2074, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.078125 = fieldNorm(doc=2074)
          0.0652483 = weight(abstract_txt:matching in 2074) [ClassicSimilarity], result of:
            0.0652483 = score(doc=2074,freq=1.0), product of:
              0.13822912 = queryWeight, product of:
                1.4834023 = boost
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.015422718 = queryNorm
              0.4720301 = fieldWeight in 2074, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.078125 = fieldNorm(doc=2074)
          0.045535814 = weight(abstract_txt:language in 2074) [ClassicSimilarity], result of:
            0.045535814 = score(doc=2074,freq=2.0), product of:
              0.09881189 = queryWeight, product of:
                1.5360643 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.015422718 = queryNorm
              0.46083337 = fieldWeight in 2074, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=2074)
          0.13287616 = weight(abstract_txt:spelling in 2074) [ClassicSimilarity], result of:
            0.13287616 = score(doc=2074,freq=1.0), product of:
              0.22208442 = queryWeight, product of:
                1.8802613 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.015422718 = queryNorm
              0.59831375 = fieldWeight in 2074, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.078125 = fieldNorm(doc=2074)
          0.062023137 = weight(abstract_txt:languages in 2074) [ClassicSimilarity], result of:
            0.062023137 = score(doc=2074,freq=1.0), product of:
              0.1529747 = queryWeight, product of:
                1.9112372 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.015422718 = queryNorm
              0.40544704 = fieldWeight in 2074, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.078125 = fieldNorm(doc=2074)
          0.123934835 = weight(abstract_txt:proper in 2074) [ClassicSimilarity], result of:
            0.123934835 = score(doc=2074,freq=1.0), product of:
              0.24268673 = queryWeight, product of:
                2.4072866 = boost
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.015422718 = queryNorm
              0.51067823 = fieldWeight in 2074, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.078125 = fieldNorm(doc=2074)
          0.31415343 = weight(abstract_txt:gram in 2074) [ClassicSimilarity], result of:
            0.31415343 = score(doc=2074,freq=2.0), product of:
              0.35809782 = queryWeight, product of:
                2.9241903 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.015422718 = queryNorm
              0.8772839 = fieldWeight in 2074, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.078125 = fieldNorm(doc=2074)
        0.4 = coord(10/25)
    
  4. Li, Q.; Chen, Y.P.; Myaeng, S.-H.; Jin, Y.; Kang, B.-Y.: Concept unification of terms in different languages via web mining for Information Retrieval (2009) 0.36
    0.36172858 = sum of:
      0.36172858 = product of:
        0.90432143 = sum of:
          0.03299842 = weight(abstract_txt:performance in 215) [ClassicSimilarity], result of:
            0.03299842 = score(doc=215,freq=2.0), product of:
              0.08081255 = queryWeight, product of:
                1.1342233 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.015422718 = queryNorm
              0.40833285 = fieldWeight in 215, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.0625 = fieldNorm(doc=215)
          0.07101032 = weight(abstract_txt:english in 215) [ClassicSimilarity], result of:
            0.07101032 = score(doc=215,freq=3.0), product of:
              0.11767042 = queryWeight, product of:
                1.3686513 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.015422718 = queryNorm
              0.60346794 = fieldWeight in 215, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.0625 = fieldNorm(doc=215)
          0.05881207 = weight(abstract_txt:cross in 215) [ClassicSimilarity], result of:
            0.05881207 = score(doc=215,freq=2.0), product of:
              0.11879396 = queryWeight, product of:
                1.3751699 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.015422718 = queryNorm
              0.49507627 = fieldWeight in 215, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.0625 = fieldNorm(doc=215)
          0.051517896 = weight(abstract_txt:language in 215) [ClassicSimilarity], result of:
            0.051517896 = score(doc=215,freq=4.0), product of:
              0.09881189 = queryWeight, product of:
                1.5360643 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.015422718 = queryNorm
              0.52137345 = fieldWeight in 215, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=215)
          0.08594178 = weight(abstract_txt:languages in 215) [ClassicSimilarity], result of:
            0.08594178 = score(doc=215,freq=3.0), product of:
              0.1529747 = queryWeight, product of:
                1.9112372 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.015422718 = queryNorm
              0.5618039 = fieldWeight in 215, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.0625 = fieldNorm(doc=215)
          0.054537103 = weight(abstract_txt:words in 215) [ClassicSimilarity], result of:
            0.054537103 = score(doc=215,freq=1.0), product of:
              0.16292404 = queryWeight, product of:
                1.9724108 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.015422718 = queryNorm
              0.33473945 = fieldWeight in 215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0625 = fieldNorm(doc=215)
          0.18486919 = weight(abstract_txt:clir in 215) [ClassicSimilarity], result of:
            0.18486919 = score(doc=215,freq=2.0), product of:
              0.25491363 = queryWeight, product of:
                2.0144463 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.015422718 = queryNorm
              0.7252228 = fieldWeight in 215, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.0625 = fieldNorm(doc=215)
          0.09914787 = weight(abstract_txt:proper in 215) [ClassicSimilarity], result of:
            0.09914787 = score(doc=215,freq=1.0), product of:
              0.24268673 = queryWeight, product of:
                2.4072866 = boost
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.015422718 = queryNorm
              0.4085426 = fieldWeight in 215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.0625 = fieldNorm(doc=215)
          0.082874544 = weight(abstract_txt:technique in 215) [ClassicSimilarity], result of:
            0.082874544 = score(doc=215,freq=1.0), product of:
              0.23701954 = queryWeight, product of:
                2.747048 = boost
                5.5944448 = idf(docFreq=448, maxDocs=44421)
                0.015422718 = queryNorm
              0.3496528 = fieldWeight in 215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5944448 = idf(docFreq=448, maxDocs=44421)
                0.0625 = fieldNorm(doc=215)
          0.18261227 = weight(abstract_txt:nouns in 215) [ClassicSimilarity], result of:
            0.18261227 = score(doc=215,freq=1.0), product of:
              0.36465073 = queryWeight, product of:
                2.9508243 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.015422718 = queryNorm
              0.5007868 = fieldWeight in 215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0625 = fieldNorm(doc=215)
        0.4 = coord(10/25)
    
  5. Ahmed, F.; Nürnberger, A.: Evaluation of n-gram conflation approaches for Arabic text retrieval (2009) 0.26
    0.2576489 = sum of:
      0.2576489 = product of:
        0.8051528 = sum of:
          0.025435042 = weight(abstract_txt:query in 3941) [ClassicSimilarity], result of:
            0.025435042 = score(doc=3941,freq=1.0), product of:
              0.08559499 = queryWeight, product of:
                1.1673023 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.015422718 = queryNorm
              0.29715574 = fieldWeight in 3941, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=3941)
          0.040997833 = weight(abstract_txt:english in 3941) [ClassicSimilarity], result of:
            0.040997833 = score(doc=3941,freq=1.0), product of:
              0.11767042 = queryWeight, product of:
                1.3686513 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.015422718 = queryNorm
              0.3484124 = fieldWeight in 3941, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.0625 = fieldNorm(doc=3941)
          0.044615805 = weight(abstract_txt:language in 3941) [ClassicSimilarity], result of:
            0.044615805 = score(doc=3941,freq=3.0), product of:
              0.09881189 = queryWeight, product of:
                1.5360643 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.015422718 = queryNorm
              0.45152265 = fieldWeight in 3941, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=3941)
          0.09498224 = weight(abstract_txt:target in 3941) [ClassicSimilarity], result of:
            0.09498224 = score(doc=3941,freq=2.0), product of:
              0.1635228 = queryWeight, product of:
                1.6134232 = boost
                6.571569 = idf(docFreq=168, maxDocs=44421)
                0.015422718 = queryNorm
              0.5808501 = fieldWeight in 3941, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.571569 = idf(docFreq=168, maxDocs=44421)
                0.0625 = fieldNorm(doc=3941)
          0.08758611 = weight(abstract_txt:string in 3941) [ClassicSimilarity], result of:
            0.08758611 = score(doc=3941,freq=1.0), product of:
              0.19518669 = queryWeight, product of:
                1.7627238 = boost
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.015422718 = queryNorm
              0.44872993 = fieldWeight in 3941, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.0625 = fieldNorm(doc=3941)
          0.20567589 = weight(abstract_txt:arabic in 3941) [ClassicSimilarity], result of:
            0.20567589 = score(doc=3941,freq=4.0), product of:
              0.21723458 = queryWeight, product of:
                1.8596176 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.015422718 = queryNorm
              0.94679165 = fieldWeight in 3941, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0625 = fieldNorm(doc=3941)
          0.054537103 = weight(abstract_txt:words in 3941) [ClassicSimilarity], result of:
            0.054537103 = score(doc=3941,freq=1.0), product of:
              0.16292404 = queryWeight, product of:
                1.9724108 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.015422718 = queryNorm
              0.33473945 = fieldWeight in 3941, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0625 = fieldNorm(doc=3941)
          0.25132275 = weight(abstract_txt:gram in 3941) [ClassicSimilarity], result of:
            0.25132275 = score(doc=3941,freq=2.0), product of:
              0.35809782 = queryWeight, product of:
                2.9241903 = boost
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.015422718 = queryNorm
              0.7018271 = fieldWeight in 3941, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.9402676 = idf(docFreq=42, maxDocs=44421)
                0.0625 = fieldNorm(doc=3941)
        0.32 = coord(8/25)
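
The score breakdowns above follow the Lucene ClassicSimilarity (TF-IDF) formula: each term contributes queryWeight × fieldWeight, the contributions are summed, and the sum is multiplied by the coord factor. As a rough check, the sketch below recomputes the "alphabets" term and the final score of the first similar document from the figures listed in its breakdown. The function names are hypothetical; the boost, queryNorm and fieldNorm values are copied from the listing rather than derived.

```python
import math


def classic_idf(doc_freq, max_docs):
    """ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    """Score contribution of one term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm           # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


# Values taken from the "alphabets" entry of document 2999 above.
alphabets = term_score(
    freq=1.0,
    doc_freq=9,
    max_docs=44421,
    boost=1.1537865,
    query_norm=0.015422718,
    field_norm=0.0625,
)

# Final score of document 2999: sum of term scores times coord(11/25).
final_score = 1.5667374 * (11 / 25)
```

Up to floating-point rounding, alphabets agrees with the listed 0.098246895 and final_score with the listed 0.68936443. The coord factor explains why documents matching fewer of the 25 query terms rank lower even when individual term weights are high.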