Document (#30052)

Author
Li, K.W.
Yang, C.C.
Title
Conceptual analysis of parallel corpus collected from the Web
Source
Journal of the American Society for Information Science and Technology. 57(2006) no.5, pp. 632-644
Year
2006
Abstract
As illustrated by the World Wide Web, the volume of information in languages other than English has grown significantly in recent years. This highlights the importance of multilingual corpora. Much effort has been devoted to the compilation of multilingual corpora for the purpose of cross-lingual information retrieval and machine translation. Existing parallel corpora mostly involve European languages, such as English-French and English-Spanish. There is still a lack of parallel corpora between European languages and Asian languages. In the authors' previous work, an alignment method to identify one-to-one Chinese and English title pairs was developed to construct an English-Chinese parallel corpus that works automatically from the World Wide Web, and a 100% precision and 87% recall were obtained. Careful analysis of these results has helped the authors to understand how the alignment method can be improved. A conceptual analysis was conducted, which includes the analysis of conceptual equivalent and conceptual information alternation in the aligned and nonaligned English-Chinese title pairs that are obtained by the alignment method. The result of the analysis not only reflects the characteristics of parallel corpora, but also gives insight into the strengths and weaknesses of the alignment method. In particular, conceptual alternation, such as omission and addition, is found to have a significant impact on the performance of the alignment method.
Footnote
Contribution to a special topic section on multilingual information systems
Theme
Multilingual problems

Similar documents (author)

  1. Yang, S.C.: An interpretive and situated approach to an evaluation of Perseus digital libraries (2001) 4.48
    4.4805427 = sum of:
      4.4805427 = weight(author_txt:yang in 933) [ClassicSimilarity], result of:
        4.4805427 = fieldWeight in 933, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.168868 = idf(docFreq=92, maxDocs=44421)
          0.625 = fieldNorm(doc=933)
    
  2. Yang, K.: Information retrieval on the Web (2004) 4.48
    4.4805427 = sum of:
      4.4805427 = weight(author_txt:yang in 5278) [ClassicSimilarity], result of:
        4.4805427 = fieldWeight in 5278, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.168868 = idf(docFreq=92, maxDocs=44421)
          0.625 = fieldNorm(doc=5278)
    
  3. Yang, C.C.: Content-based image retrieval : a comparison between query by example and image browsing map approaches (2005) 4.48
    4.4805427 = sum of:
      4.4805427 = weight(author_txt:yang in 5649) [ClassicSimilarity], result of:
        4.4805427 = fieldWeight in 5649, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.168868 = idf(docFreq=92, maxDocs=44421)
          0.625 = fieldNorm(doc=5649)
    
  4. Salton, G.; Yang, C.S.: On the specification of term values in automatic indexing (1973) 3.58
    3.584434 = sum of:
      3.584434 = weight(author_txt:yang in 5475) [ClassicSimilarity], result of:
        3.584434 = fieldWeight in 5475, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.168868 = idf(docFreq=92, maxDocs=44421)
          0.5 = fieldNorm(doc=5475)
    
  5. Yang, Y.; Chute, C.G.A.: A schematic analysis of the Unified Medical Language System (1992) 3.58
    3.584434 = sum of:
      3.584434 = weight(author_txt:yang in 6444) [ClassicSimilarity], result of:
        3.584434 = fieldWeight in 6444, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.168868 = idf(docFreq=92, maxDocs=44421)
          0.5 = fieldNorm(doc=6444)
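
The fieldWeight values in the author list above can be reproduced with a short calculation. The following is a minimal sketch, assuming the classic Lucene TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which agree with the idf of 7.168868 shown for docFreq=92 and maxDocs=44421. The function names are hypothetical and are not part of any library; fieldNorm is taken as given from the listing rather than derived.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as assumed here: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)."""
    tf = math.sqrt(freq)
    return tf * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the listing: author term "yang", docFreq=92, maxDocs=44421.
print(field_weight(1.0, 92, 44421, 0.625))  # entries 1 to 3 (fieldNorm=0.625)
print(field_weight(1.0, 92, 44421, 0.5))    # entries 4 and 5 (fieldNorm=0.5)
```

The two printed values should agree with the listed 4.4805427 and 3.584434 up to floating-point rounding. The only difference between the two groups of entries is fieldNorm, which is smaller for the records with two authors.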
    

Similar documents (content)

  1. Yang, C.C.; Li, K.W.: Automatic construction of English/Chinese parallel corpora (2003) 1.13
    1.1269397 = sum of:
      1.1269397 = product of:
        1.7608433 = sum of:
          0.044832986 = weight(abstract_txt:asian in 2683) [ClassicSimilarity], result of:
            0.044832986 = score(doc=2683,freq=1.0), product of:
              0.1052454 = queryWeight, product of:
                1.0305771 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.013110406 = queryNorm
              0.42598525 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.0717682 = weight(abstract_txt:lingual in 2683) [ClassicSimilarity], result of:
            0.0717682 = score(doc=2683,freq=2.0), product of:
              0.1143096 = queryWeight, product of:
                1.0740396 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.013110406 = queryNorm
              0.6278405 = fieldWeight in 2683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.016460128 = weight(abstract_txt:world in 2683) [ClassicSimilarity], result of:
            0.016460128 = score(doc=2683,freq=1.0), product of:
              0.06798871 = queryWeight, product of:
                1.1714191 = boost
                4.426988 = idf(docFreq=1442, maxDocs=44421)
                0.013110406 = queryNorm
              0.24210091 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.426988 = idf(docFreq=1442, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.02192067 = weight(abstract_txt:wide in 2683) [ClassicSimilarity], result of:
            0.02192067 = score(doc=2683,freq=1.0), product of:
              0.082296975 = queryWeight, product of:
                1.288801 = boost
                4.8705935 = idf(docFreq=925, maxDocs=44421)
                0.013110406 = queryNorm
              0.26636058 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8705935 = idf(docFreq=925, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.04955992 = weight(abstract_txt:european in 2683) [ClassicSimilarity], result of:
            0.04955992 = score(doc=2683,freq=2.0), product of:
              0.11251879 = queryWeight, product of:
                1.5069764 = boost
                5.6951146 = idf(docFreq=405, maxDocs=44421)
                0.013110406 = queryNorm
              0.44045907 = fieldWeight in 2683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6951146 = idf(docFreq=405, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.06158073 = weight(abstract_txt:title in 2683) [ClassicSimilarity], result of:
            0.06158073 = score(doc=2683,freq=3.0), product of:
              0.11360675 = queryWeight, product of:
                1.5142444 = boost
                5.722582 = idf(docFreq=394, maxDocs=44421)
                0.013110406 = queryNorm
              0.5420517 = fieldWeight in 2683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.722582 = idf(docFreq=394, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.07429295 = weight(abstract_txt:corpus in 2683) [ClassicSimilarity], result of:
            0.07429295 = score(doc=2683,freq=3.0), product of:
              0.12874764 = queryWeight, product of:
                1.6119945 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.013110406 = queryNorm
              0.5770432 = fieldWeight in 2683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.06704936 = weight(abstract_txt:multilingual in 2683) [ClassicSimilarity], result of:
            0.06704936 = score(doc=2683,freq=2.0), product of:
              0.13763675 = queryWeight, product of:
                1.6667142 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.013110406 = queryNorm
              0.4871472 = fieldWeight in 2683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.08394689 = weight(abstract_txt:pairs in 2683) [ClassicSimilarity], result of:
            0.08394689 = score(doc=2683,freq=2.0), product of:
              0.159885 = queryWeight, product of:
                1.7963783 = boost
                6.7888126 = idf(docFreq=135, maxDocs=44421)
                0.013110406 = queryNorm
              0.52504545 = fieldWeight in 2683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7888126 = idf(docFreq=135, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.14223318 = weight(abstract_txt:chinese in 2683) [ClassicSimilarity], result of:
            0.14223318 = score(doc=2683,freq=4.0), product of:
              0.20645513 = queryWeight, product of:
                2.5000713 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.013110406 = queryNorm
              0.6889302 = fieldWeight in 2683, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.075004175 = weight(abstract_txt:languages in 2683) [ClassicSimilarity], result of:
            0.075004175 = score(doc=2683,freq=2.0), product of:
              0.18686944 = queryWeight, product of:
                2.7464902 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.013110406 = queryNorm
              0.40137208 = fieldWeight in 2683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.08637144 = weight(abstract_txt:method in 2683) [ClassicSimilarity], result of:
            0.08637144 = score(doc=2683,freq=4.0), product of:
              0.17553137 = queryWeight, product of:
                2.9760573 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.013110406 = queryNorm
              0.49205697 = fieldWeight in 2683, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.17692435 = weight(abstract_txt:parallel in 2683) [ClassicSimilarity], result of:
            0.17692435 = score(doc=2683,freq=2.0), product of:
              0.35670546 = queryWeight, product of:
                4.2424703 = boost
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.013110406 = queryNorm
              0.49599564 = fieldWeight in 2683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.24151592 = weight(abstract_txt:english in 2683) [ClassicSimilarity], result of:
            0.24151592 = score(doc=2683,freq=6.0), product of:
              0.3234212 = queryWeight, product of:
                4.42526 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.013110406 = queryNorm
              0.7467535 = fieldWeight in 2683, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.28299916 = weight(abstract_txt:corpora in 2683) [ClassicSimilarity], result of:
            0.28299916 = score(doc=2683,freq=3.0), product of:
              0.42619762 = queryWeight, product of:
                4.6373453 = boost
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.013110406 = queryNorm
              0.6640093 = fieldWeight in 2683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.26438332 = weight(abstract_txt:alignment in 2683) [ClassicSimilarity], result of:
            0.26438332 = score(doc=2683,freq=2.0), product of:
              0.46623766 = queryWeight, product of:
                4.850289 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.013110406 = queryNorm
              0.56705695 = fieldWeight in 2683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
        0.64 = coord(16/25)
    
  2. Talvensaari, T.; Laurikkala, J.; Järvelin, K.; Juhola, M.: A study on automatic creation of a comparable document collection in cross-language information retrieval (2006) 0.42
    0.4166548 = sum of:
      0.4166548 = product of:
        1.3020463 = sum of:
          0.09105339 = weight(abstract_txt:aligned in 601) [ClassicSimilarity], result of:
            0.09105339 = score(doc=601,freq=2.0), product of:
              0.12255493 = queryWeight, product of:
                1.1121012 = boost
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.013110406 = queryNorm
              0.74295986 = fieldWeight in 601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.11750117 = weight(abstract_txt:pairs in 601) [ClassicSimilarity], result of:
            0.11750117 = score(doc=601,freq=3.0), product of:
              0.159885 = queryWeight, product of:
                1.7963783 = boost
                6.7888126 = idf(docFreq=135, maxDocs=44421)
                0.013110406 = queryNorm
              0.7349105 = fieldWeight in 601, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7888126 = idf(docFreq=135, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.10498398 = weight(abstract_txt:languages in 601) [ClassicSimilarity], result of:
            0.10498398 = score(doc=601,freq=3.0), product of:
              0.18686944 = queryWeight, product of:
                2.7464902 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.013110406 = queryNorm
              0.5618039 = fieldWeight in 601, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.09871021 = weight(abstract_txt:method in 601) [ClassicSimilarity], result of:
            0.09871021 = score(doc=601,freq=4.0), product of:
              0.17553137 = queryWeight, product of:
                2.9760573 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.013110406 = queryNorm
              0.5623508 = fieldWeight in 601, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.14297648 = weight(abstract_txt:parallel in 601) [ClassicSimilarity], result of:
            0.14297648 = score(doc=601,freq=1.0), product of:
              0.35670546 = queryWeight, product of:
                4.2424703 = boost
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.013110406 = queryNorm
              0.40082502 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.11268396 = weight(abstract_txt:english in 601) [ClassicSimilarity], result of:
            0.11268396 = score(doc=601,freq=1.0), product of:
              0.3234212 = queryWeight, product of:
                4.42526 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.013110406 = queryNorm
              0.3484124 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.26407754 = weight(abstract_txt:corpora in 601) [ClassicSimilarity], result of:
            0.26407754 = score(doc=601,freq=2.0), product of:
              0.42619762 = queryWeight, product of:
                4.6373453 = boost
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.013110406 = queryNorm
              0.61961293 = fieldWeight in 601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.37005955 = weight(abstract_txt:alignment in 601) [ClassicSimilarity], result of:
            0.37005955 = score(doc=601,freq=3.0), product of:
              0.46623766 = queryWeight, product of:
                4.850289 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.013110406 = queryNorm
              0.7937144 = fieldWeight in 601, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
        0.32 = coord(8/25)
    
  3. Yang, C.C.; Luk, J.: Automatic generation of English/Chinese thesaurus based on a parallel corpus in laws (2003) 0.37
    0.36772516 = sum of:
      0.36772516 = product of:
        0.7660941 = sum of:
          0.032023564 = weight(abstract_txt:asian in 2616) [ClassicSimilarity], result of:
            0.032023564 = score(doc=2616,freq=1.0), product of:
              0.1052454 = queryWeight, product of:
                1.0305771 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.013110406 = queryNorm
              0.30427518 = fieldWeight in 2616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2616)
          0.08105392 = weight(abstract_txt:lingual in 2616) [ClassicSimilarity], result of:
            0.08105392 = score(doc=2616,freq=5.0), product of:
              0.1143096 = queryWeight, product of:
                1.0740396 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.013110406 = queryNorm
              0.7090736 = fieldWeight in 2616, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2616)
          0.011757235 = weight(abstract_txt:world in 2616) [ClassicSimilarity], result of:
            0.011757235 = score(doc=2616,freq=1.0), product of:
              0.06798871 = queryWeight, product of:
                1.1714191 = boost
                4.426988 = idf(docFreq=1442, maxDocs=44421)
                0.013110406 = queryNorm
              0.17292923 = fieldWeight in 2616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.426988 = idf(docFreq=1442, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2616)
          0.015657622 = weight(abstract_txt:wide in 2616) [ClassicSimilarity], result of:
            0.015657622 = score(doc=2616,freq=1.0), product of:
              0.082296975 = queryWeight, product of:
                1.288801 = boost
                4.8705935 = idf(docFreq=925, maxDocs=44421)
                0.013110406 = queryNorm
              0.19025756 = fieldWeight in 2616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8705935 = idf(docFreq=925, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2616)
          0.025031539 = weight(abstract_txt:european in 2616) [ClassicSimilarity], result of:
            0.025031539 = score(doc=2616,freq=1.0), product of:
              0.11251879 = queryWeight, product of:
                1.5069764 = boost
                5.6951146 = idf(docFreq=405, maxDocs=44421)
                0.013110406 = queryNorm
              0.22246541 = fieldWeight in 2616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6951146 = idf(docFreq=405, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2616)
          0.025808413 = weight(abstract_txt:obtained in 2616) [ClassicSimilarity], result of:
            0.025808413 = score(doc=2616,freq=1.0), product of:
              0.11483498 = queryWeight, product of:
                1.5224079 = boost
                5.7534328 = idf(docFreq=382, maxDocs=44421)
                0.013110406 = queryNorm
              0.22474347 = fieldWeight in 2616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7534328 = idf(docFreq=382, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2616)
          0.043328527 = weight(abstract_txt:corpus in 2616) [ClassicSimilarity], result of:
            0.043328527 = score(doc=2616,freq=2.0), product of:
              0.12874764 = queryWeight, product of:
                1.6119945 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.013110406 = queryNorm
              0.3365384 = fieldWeight in 2616, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2616)
          0.0164155 = weight(abstract_txt:analysis in 2616) [ClassicSimilarity], result of:
            0.0164155 = score(doc=2616,freq=1.0), product of:
              0.11526994 = queryWeight, product of:
                2.4116926 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.013110406 = queryNorm
              0.14240919 = fieldWeight in 2616, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2616)
          0.10159512 = weight(abstract_txt:chinese in 2616) [ClassicSimilarity], result of:
            0.10159512 = score(doc=2616,freq=4.0), product of:
              0.20645513 = queryWeight, product of:
                2.5000713 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.013110406 = queryNorm
              0.492093 = fieldWeight in 2616, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2616)
          0.07576566 = weight(abstract_txt:languages in 2616) [ClassicSimilarity], result of:
            0.07576566 = score(doc=2616,freq=4.0), product of:
              0.18686944 = queryWeight, product of:
                2.7464902 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.013110406 = queryNorm
              0.40544704 = fieldWeight in 2616, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2616)
          0.12637453 = weight(abstract_txt:parallel in 2616) [ClassicSimilarity], result of:
            0.12637453 = score(doc=2616,freq=2.0), product of:
              0.35670546 = queryWeight, product of:
                4.2424703 = boost
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.013110406 = queryNorm
              0.3542826 = fieldWeight in 2616, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2616)
          0.21128242 = weight(abstract_txt:english in 2616) [ClassicSimilarity], result of:
            0.21128242 = score(doc=2616,freq=9.0), product of:
              0.3234212 = queryWeight, product of:
                4.42526 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.013110406 = queryNorm
              0.6532732 = fieldWeight in 2616, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2616)
        0.48 = coord(12/25)
    
  4. Xu, J.; Weischedel, R.: Empirical studies on the impact of lexical resources on CLIR performance (2005) 0.32
    0.3235891 = sum of:
      0.3235891 = product of:
        1.3482879 = sum of:
          0.07249684 = weight(abstract_txt:lingual in 2020) [ClassicSimilarity], result of:
            0.07249684 = score(doc=2020,freq=1.0), product of:
              0.1143096 = queryWeight, product of:
                1.0740396 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.013110406 = queryNorm
              0.63421476 = fieldWeight in 2020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
          0.086657055 = weight(abstract_txt:corpus in 2020) [ClassicSimilarity], result of:
            0.086657055 = score(doc=2020,freq=2.0), product of:
              0.12874764 = queryWeight, product of:
                1.6119945 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.013110406 = queryNorm
              0.6730768 = fieldWeight in 2020, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
          0.1436772 = weight(abstract_txt:chinese in 2020) [ClassicSimilarity], result of:
            0.1436772 = score(doc=2020,freq=2.0), product of:
              0.20645513 = queryWeight, product of:
                2.5000713 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.013110406 = queryNorm
              0.69592464 = fieldWeight in 2020, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
          0.4377743 = weight(abstract_txt:parallel in 2020) [ClassicSimilarity], result of:
            0.4377743 = score(doc=2020,freq=6.0), product of:
              0.35670546 = queryWeight, product of:
                4.2424703 = boost
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.013110406 = queryNorm
              1.2272711 = fieldWeight in 2020, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
          0.14085495 = weight(abstract_txt:english in 2020) [ClassicSimilarity], result of:
            0.14085495 = score(doc=2020,freq=1.0), product of:
              0.3234212 = queryWeight, product of:
                4.42526 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.013110406 = queryNorm
              0.4355155 = fieldWeight in 2020, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
          0.46682754 = weight(abstract_txt:corpora in 2020) [ClassicSimilarity], result of:
            0.46682754 = score(doc=2020,freq=4.0), product of:
              0.42619762 = queryWeight, product of:
                4.6373453 = boost
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.013110406 = queryNorm
              1.0953312 = fieldWeight in 2020, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.078125 = fieldNorm(doc=2020)
        0.24 = coord(6/25)
    
  5. Li, K.W.; Yang, C.C.: Automatic crosslingual thesaurus generated from the Hong Kong SAR Police Department Web Corpus for Crime Analysis (2005) 0.27
    0.27308002 = sum of:
      0.27308002 = product of:
        0.8533751 = sum of:
          0.054345787 = weight(abstract_txt:asian in 4391) [ClassicSimilarity], result of:
            0.054345787 = score(doc=4391,freq=2.0), product of:
              0.1052454 = queryWeight, product of:
                1.0305771 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.013110406 = queryNorm
              0.5163721 = fieldWeight in 4391, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.046875 = fieldNorm(doc=4391)
          0.05199423 = weight(abstract_txt:corpus in 4391) [ClassicSimilarity], result of:
            0.05199423 = score(doc=4391,freq=2.0), product of:
              0.12874764 = queryWeight, product of:
                1.6119945 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.013110406 = queryNorm
              0.40384609 = fieldWeight in 4391, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.046875 = fieldNorm(doc=4391)
          0.027858023 = weight(abstract_txt:analysis in 4391) [ClassicSimilarity], result of:
            0.027858023 = score(doc=4391,freq=2.0), product of:
              0.11526994 = queryWeight, product of:
                2.4116926 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.013110406 = queryNorm
              0.24167639 = fieldWeight in 4391, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.046875 = fieldNorm(doc=4391)
          0.13630415 = weight(abstract_txt:chinese in 4391) [ClassicSimilarity], result of:
            0.13630415 = score(doc=4391,freq=5.0), product of:
              0.20645513 = queryWeight, product of:
                2.5000713 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.013110406 = queryNorm
              0.66021204 = fieldWeight in 4391, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.046875 = fieldNorm(doc=4391)
          0.12857859 = weight(abstract_txt:languages in 4391) [ClassicSimilarity], result of:
            0.12857859 = score(doc=4391,freq=8.0), product of:
              0.18686944 = queryWeight, product of:
                2.7464902 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.013110406 = queryNorm
              0.6880664 = fieldWeight in 4391, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.046875 = fieldNorm(doc=4391)
          0.107232355 = weight(abstract_txt:parallel in 4391) [ClassicSimilarity], result of:
            0.107232355 = score(doc=4391,freq=1.0), product of:
              0.35670546 = queryWeight, product of:
                4.2424703 = boost
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.013110406 = queryNorm
              0.30061877 = fieldWeight in 4391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4132004 = idf(docFreq=197, maxDocs=44421)
                0.046875 = fieldNorm(doc=4391)
          0.20701365 = weight(abstract_txt:english in 4391) [ClassicSimilarity], result of:
            0.20701365 = score(doc=4391,freq=6.0), product of:
              0.3234212 = queryWeight, product of:
                4.42526 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.013110406 = queryNorm
              0.64007443 = fieldWeight in 4391, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.046875 = fieldNorm(doc=4391)
          0.14004827 = weight(abstract_txt:corpora in 4391) [ClassicSimilarity], result of:
            0.14004827 = score(doc=4391,freq=1.0), product of:
              0.42619762 = queryWeight, product of:
                4.6373453 = boost
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.013110406 = queryNorm
              0.32859936 = fieldWeight in 4391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.046875 = fieldNorm(doc=4391)
        0.32 = coord(8/25)
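
The content-based scores above combine three factors per term (queryWeight, fieldWeight, and a final coord multiplier). The sketch below shows how one term clause and one document total can be recomputed from the listed numbers. It assumes the classic Lucene TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); boost, queryNorm, and fieldNorm are copied from the listing rather than derived, and the function names are hypothetical.

```python
import math


def term_score(freq, doc_freq, max_docs, field_norm, boost, query_norm):
    """Score of one term clause: queryWeight * fieldWeight."""
    idf = 1.0 + math.log(max_docs / (doc_freq + 1))
    query_weight = boost * idf * query_norm            # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * idf * field_norm  # tf * idf * fieldNorm
    return query_weight * field_weight


def document_score(term_scores, matched_terms, query_terms):
    """Sum of the term scores, scaled by coord = matched terms / query terms."""
    return sum(term_scores) * (matched_terms / query_terms)


# Term "asian" in doc 2683 (entry 1), values copied from the listing.
asian = term_score(1.0, 49, 44421, 0.0546875, 1.0305771, 0.013110406)
print(asian)

# Entry 1 total: the listed sum of 16 term scores, scaled by coord(16/25).
print(document_score([1.7608433], 16, 25))
```

The first value should agree with the listed 0.044832986 and the second with 1.1269397, up to floating-point rounding. The coord factor explains why entry 4 ranks below entry 2 despite a larger raw sum (1.3482879 against 1.3020463): it matches only 6 of the 25 query terms, while entry 2 matches 8.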