Document (#30062)

Author
Tsuji, K.
Kageura, K.
Title
Automatic generation of Japanese-English bilingual thesauri based on bilingual corpora
Source
Journal of the American Society for Information Science and Technology. 57(2006) no.7, S.891-906
Year
2006
Abstract
The authors propose a method for automatically generating Japanese-English bilingual thesauri based on bilingual corpora. The term bilingual thesaurus refers to a set of bilingual equivalent words and their synonyms. Most of the methods proposed so far for extracting bilingual equivalent word clusters from bilingual corpora depend heavily on word frequency and are not effective for dealing with low-frequency clusters. These low-frequency bilingual clusters are worth extracting because they contain many newly coined terms that are in demand but are not listed in existing bilingual thesauri. Assuming that single language-pair-independent methods such as frequency-based ones have reached their limitations and that a language-pair-dependent method used in combination with other methods shows promise, the authors propose the following approach: (a) Extract translation pairs based on transliteration patterns; (b) remove the pairs from among the candidate words; (c) extract translation pairs based on word frequency from the remaining candidate words; and (d) generate bilingual clusters based on the extracted pairs using a graph-theoretic method. The proposed method has been found to be significantly more effective than other methods.
Theme
Multilinguale Probleme

Similar documents (author)

  1. Kageura, K.: Terminological semantics : an examination of 'concept' and 'meaning' in the study of terms (1995) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:kageura in 4629) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 4629, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=4629)
    
  2. Kageura, K.: Theories of terminology : a quest for a framework for the study of term formation (1999) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:kageura in 290) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 290, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=290)
    
  3. Kageura, K.: ¬The dynamics of terminology : a descriptive theory of term formation and terminological growth (2002) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:kageura in 2787) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 2787, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=2787)
    
  4. Fukuda, M.; Kageura, K.: Research into 'see also' references in the dictionary of terminology : using semantic relations between entries (1993) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:kageura in 1118) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 1118, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=1118)
    
  5. Tsuji, K.; Kageura, K.: Analysis of word structure of medical synonyms (1996) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:kageura in 6406) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 6406, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=6406)
    

Similar documents (content)

  1. Yang, C.C.; Li, K.W.: Automatic construction of English/Chinese parallel corpora (2003) 0.39
    0.3874766 = sum of:
      0.3874766 = product of:
        0.88062865 = sum of:
          0.018544096 = weight(abstract_txt:proposed in 2683) [ClassicSimilarity], result of:
            0.018544096 = score(doc=2683,freq=2.0), product of:
              0.052033633 = queryWeight, product of:
                1.1011873 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.010254265 = queryNorm
              0.35638672 = fieldWeight in 2683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.056865882 = weight(abstract_txt:english in 2683) [ClassicSimilarity], result of:
            0.056865882 = score(doc=2683,freq=6.0), product of:
              0.076150805 = queryWeight, product of:
                1.33216 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.010254265 = queryNorm
              0.7467535 = fieldWeight in 2683, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.054679055 = weight(abstract_txt:translation in 2683) [ClassicSimilarity], result of:
            0.054679055 = score(doc=2683,freq=3.0), product of:
              0.09346823 = queryWeight, product of:
                1.4758803 = boost
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.010254265 = queryNorm
              0.5850015 = fieldWeight in 2683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.030882133 = weight(abstract_txt:words in 2683) [ClassicSimilarity], result of:
            0.030882133 = score(doc=2683,freq=1.0), product of:
              0.10543683 = queryWeight, product of:
                1.9198219 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.010254265 = queryNorm
              0.29289702 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.032292735 = weight(abstract_txt:thesauri in 2683) [ClassicSimilarity], result of:
            0.032292735 = score(doc=2683,freq=1.0), product of:
              0.10862356 = queryWeight, product of:
                1.9486183 = boost
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.010254265 = queryNorm
              0.29729035 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.055991426 = weight(abstract_txt:word in 2683) [ClassicSimilarity], result of:
            0.055991426 = score(doc=2683,freq=3.0), product of:
              0.10869964 = queryWeight, product of:
                1.9493006 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.010254265 = queryNorm
              0.5151022 = fieldWeight in 2683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.048807587 = weight(abstract_txt:method in 2683) [ClassicSimilarity], result of:
            0.048807587 = score(doc=2683,freq=4.0), product of:
              0.09919093 = queryWeight, product of:
                2.1501567 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.010254265 = queryNorm
              0.49205697 = fieldWeight in 2683, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.0129657425 = weight(abstract_txt:based in 2683) [ClassicSimilarity], result of:
            0.0129657425 = score(doc=2683,freq=1.0), product of:
              0.0744839 = queryWeight, product of:
                2.2819755 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.010254265 = queryNorm
              0.17407443 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.119939886 = weight(abstract_txt:corpora in 2683) [ClassicSimilarity], result of:
            0.119939886 = score(doc=2683,freq=3.0), product of:
              0.18062983 = queryWeight, product of:
                2.512809 = boost
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.010254265 = queryNorm
              0.6640093 = fieldWeight in 2683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.11859377 = weight(abstract_txt:pairs in 2683) [ClassicSimilarity], result of:
            0.11859377 = score(doc=2683,freq=2.0), product of:
              0.22587334 = queryWeight, product of:
                3.2446408 = boost
                6.7888126 = idf(docFreq=135, maxDocs=44421)
                0.010254265 = queryNorm
              0.52504545 = fieldWeight in 2683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7888126 = idf(docFreq=135, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
          0.3310663 = weight(abstract_txt:bilingual in 2683) [ClassicSimilarity], result of:
            0.3310663 = score(doc=2683,freq=1.0), product of:
              0.79047465 = queryWeight, product of:
                10.065711 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.010254265 = queryNorm
              0.41881964 = fieldWeight in 2683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2683)
        0.44 = coord(11/25)
    
  2. Lee, Y.-S.; Wu, Y.-C.; Yang, J.-C.: BVideoQA : Online English/Chinese bilingual video question answering (2009) 0.32
    0.32456347 = sum of:
      0.32456347 = product of:
        1.0142609 = sum of:
          0.014985892 = weight(abstract_txt:proposed in 3739) [ClassicSimilarity], result of:
            0.014985892 = score(doc=3739,freq=1.0), product of:
              0.052033633 = queryWeight, product of:
                1.1011873 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.010254265 = queryNorm
              0.28800395 = fieldWeight in 3739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0625 = fieldNorm(doc=3739)
          0.03752175 = weight(abstract_txt:english in 3739) [ClassicSimilarity], result of:
            0.03752175 = score(doc=3739,freq=2.0), product of:
              0.076150805 = queryWeight, product of:
                1.33216 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.010254265 = queryNorm
              0.4927295 = fieldWeight in 3739, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.0625 = fieldNorm(doc=3739)
          0.067867845 = weight(abstract_txt:japanese in 3739) [ClassicSimilarity], result of:
            0.067867845 = score(doc=3739,freq=1.0), product of:
              0.1424312 = queryWeight, product of:
                1.8218881 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.010254265 = queryNorm
              0.47649562 = fieldWeight in 3739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.0625 = fieldNorm(doc=3739)
          0.05224778 = weight(abstract_txt:word in 3739) [ClassicSimilarity], result of:
            0.05224778 = score(doc=3739,freq=2.0), product of:
              0.10869964 = queryWeight, product of:
                1.9493006 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.010254265 = queryNorm
              0.48066196 = fieldWeight in 3739, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0625 = fieldNorm(doc=3739)
          0.021789677 = weight(abstract_txt:methods in 3739) [ClassicSimilarity], result of:
            0.021789677 = score(doc=3739,freq=1.0), product of:
              0.08414073 = queryWeight, product of:
                1.9803287 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.010254265 = queryNorm
              0.25896704 = fieldWeight in 3739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=3739)
          0.048306983 = weight(abstract_txt:method in 3739) [ClassicSimilarity], result of:
            0.048306983 = score(doc=3739,freq=3.0), product of:
              0.09919093 = queryWeight, product of:
                2.1501567 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.010254265 = queryNorm
              0.4870101 = fieldWeight in 3739, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=3739)
          0.014817991 = weight(abstract_txt:based in 3739) [ClassicSimilarity], result of:
            0.014817991 = score(doc=3739,freq=1.0), product of:
              0.0744839 = queryWeight, product of:
                2.2819755 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.010254265 = queryNorm
              0.1989422 = fieldWeight in 3739, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=3739)
          0.756723 = weight(abstract_txt:bilingual in 3739) [ClassicSimilarity], result of:
            0.756723 = score(doc=3739,freq=4.0), product of:
              0.79047465 = queryWeight, product of:
                10.065711 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.010254265 = queryNorm
              0.95730203 = fieldWeight in 3739, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0625 = fieldNorm(doc=3739)
        0.32 = coord(8/25)
    
  3. Dadashkarimia, J.; Shakery, A.; Failia, H.; Zamani, H.: ¬An expectation-maximization algorithm for query translation based on pseudo-relevant documents (2017) 0.27
    0.2734346 = sum of:
      0.2734346 = product of:
        0.75954056 = sum of:
          0.02932079 = weight(abstract_txt:proposed in 4296) [ClassicSimilarity], result of:
            0.02932079 = score(doc=4296,freq=5.0), product of:
              0.052033633 = queryWeight, product of:
                1.1011873 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.010254265 = queryNorm
              0.5634969 = fieldWeight in 4296, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
          0.094706915 = weight(abstract_txt:translation in 4296) [ClassicSimilarity], result of:
            0.094706915 = score(doc=4296,freq=9.0), product of:
              0.09346823 = queryWeight, product of:
                1.4758803 = boost
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.010254265 = queryNorm
              1.0132525 = fieldWeight in 4296, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
          0.03232667 = weight(abstract_txt:word in 4296) [ClassicSimilarity], result of:
            0.03232667 = score(doc=4296,freq=1.0), product of:
              0.10869964 = queryWeight, product of:
                1.9493006 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.010254265 = queryNorm
              0.29739442 = fieldWeight in 4296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
          0.019065967 = weight(abstract_txt:methods in 4296) [ClassicSimilarity], result of:
            0.019065967 = score(doc=4296,freq=1.0), product of:
              0.08414073 = queryWeight, product of:
                1.9803287 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.010254265 = queryNorm
              0.22659616 = fieldWeight in 4296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
          0.048807587 = weight(abstract_txt:method in 4296) [ClassicSimilarity], result of:
            0.048807587 = score(doc=4296,freq=4.0), product of:
              0.09919093 = queryWeight, product of:
                2.1501567 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.010254265 = queryNorm
              0.49205697 = fieldWeight in 4296, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
          0.022457324 = weight(abstract_txt:based in 4296) [ClassicSimilarity], result of:
            0.022457324 = score(doc=4296,freq=3.0), product of:
              0.0744839 = queryWeight, product of:
                2.2819755 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.010254265 = queryNorm
              0.30150574 = fieldWeight in 4296, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
          0.09793051 = weight(abstract_txt:corpora in 4296) [ClassicSimilarity], result of:
            0.09793051 = score(doc=4296,freq=2.0), product of:
              0.18062983 = queryWeight, product of:
                2.512809 = boost
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.010254265 = queryNorm
              0.54216135 = fieldWeight in 4296, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
          0.08385846 = weight(abstract_txt:pairs in 4296) [ClassicSimilarity], result of:
            0.08385846 = score(doc=4296,freq=1.0), product of:
              0.22587334 = queryWeight, product of:
                3.2446408 = boost
                6.7888126 = idf(docFreq=135, maxDocs=44421)
                0.010254265 = queryNorm
              0.3712632 = fieldWeight in 4296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7888126 = idf(docFreq=135, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
          0.3310663 = weight(abstract_txt:bilingual in 4296) [ClassicSimilarity], result of:
            0.3310663 = score(doc=4296,freq=1.0), product of:
              0.79047465 = queryWeight, product of:
                10.065711 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.010254265 = queryNorm
              0.41881964 = fieldWeight in 4296, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4296)
        0.36 = coord(9/25)
    
  4. Alexiev, B.: Terminology structuring for learner's glossaries (2006) 0.22
    0.21520145 = sum of:
      0.21520145 = product of:
        0.67250454 = sum of:
          0.021193253 = weight(abstract_txt:proposed in 1094) [ClassicSimilarity], result of:
            0.021193253 = score(doc=1094,freq=2.0), product of:
              0.052033633 = queryWeight, product of:
                1.1011873 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.010254265 = queryNorm
              0.4072991 = fieldWeight in 1094, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.0625 = fieldNorm(doc=1094)
          0.072157644 = weight(abstract_txt:translation in 1094) [ClassicSimilarity], result of:
            0.072157644 = score(doc=1094,freq=4.0), product of:
              0.09346823 = queryWeight, product of:
                1.4758803 = boost
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.010254265 = queryNorm
              0.77200186 = fieldWeight in 1094, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.0625 = fieldNorm(doc=1094)
          0.043941494 = weight(abstract_txt:extract in 1094) [ClassicSimilarity], result of:
            0.043941494 = score(doc=1094,freq=1.0), product of:
              0.10659716 = queryWeight, product of:
                1.5761298 = boost
                6.595522 = idf(docFreq=164, maxDocs=44421)
                0.010254265 = queryNorm
              0.41222012 = fieldWeight in 1094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.595522 = idf(docFreq=164, maxDocs=44421)
                0.0625 = fieldNorm(doc=1094)
          0.054048594 = weight(abstract_txt:equivalent in 1094) [ClassicSimilarity], result of:
            0.054048594 = score(doc=1094,freq=1.0), product of:
              0.12237293 = queryWeight, product of:
                1.6887362 = boost
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.010254265 = queryNorm
              0.44167116 = fieldWeight in 1094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.0625 = fieldNorm(doc=1094)
          0.06005664 = weight(abstract_txt:candidate in 1094) [ClassicSimilarity], result of:
            0.06005664 = score(doc=1094,freq=1.0), product of:
              0.13128138 = queryWeight, product of:
                1.7491244 = boost
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.010254265 = queryNorm
              0.45746505 = fieldWeight in 1094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.319441 = idf(docFreq=79, maxDocs=44421)
                0.0625 = fieldNorm(doc=1094)
          0.021789677 = weight(abstract_txt:methods in 1094) [ClassicSimilarity], result of:
            0.021789677 = score(doc=1094,freq=1.0), product of:
              0.08414073 = queryWeight, product of:
                1.9803287 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.010254265 = queryNorm
              0.25896704 = fieldWeight in 1094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=1094)
          0.020955803 = weight(abstract_txt:based in 1094) [ClassicSimilarity], result of:
            0.020955803 = score(doc=1094,freq=2.0), product of:
              0.0744839 = queryWeight, product of:
                2.2819755 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.010254265 = queryNorm
              0.28134674 = fieldWeight in 1094, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=1094)
          0.3783615 = weight(abstract_txt:bilingual in 1094) [ClassicSimilarity], result of:
            0.3783615 = score(doc=1094,freq=1.0), product of:
              0.79047465 = queryWeight, product of:
                10.065711 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.010254265 = queryNorm
              0.47865102 = fieldWeight in 1094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0625 = fieldNorm(doc=1094)
        0.32 = coord(8/25)
    
  5. Li, Y.; Shawe-Taylor, J.: Advanced learning algorithms for cross-language patent retrieval and classification (2007) 0.21
    0.20819442 = sum of:
      0.20819442 = product of:
        0.86747676 = sum of:
          0.033164855 = weight(abstract_txt:english in 1931) [ClassicSimilarity], result of:
            0.033164855 = score(doc=1931,freq=1.0), product of:
              0.076150805 = queryWeight, product of:
                1.33216 = boost
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.010254265 = queryNorm
              0.4355155 = fieldWeight in 1931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5745983 = idf(docFreq=457, maxDocs=44421)
                0.078125 = fieldNorm(doc=1931)
          0.08483481 = weight(abstract_txt:japanese in 1931) [ClassicSimilarity], result of:
            0.08483481 = score(doc=1931,freq=1.0), product of:
              0.1424312 = queryWeight, product of:
                1.8218881 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.010254265 = queryNorm
              0.59561956 = fieldWeight in 1931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.078125 = fieldNorm(doc=1931)
          0.027237097 = weight(abstract_txt:methods in 1931) [ClassicSimilarity], result of:
            0.027237097 = score(doc=1931,freq=1.0), product of:
              0.08414073 = queryWeight, product of:
                1.9803287 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.010254265 = queryNorm
              0.3237088 = fieldWeight in 1931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.078125 = fieldNorm(doc=1931)
          0.03486256 = weight(abstract_txt:method in 1931) [ClassicSimilarity], result of:
            0.03486256 = score(doc=1931,freq=1.0), product of:
              0.09919093 = queryWeight, product of:
                2.1501567 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.010254265 = queryNorm
              0.35146925 = fieldWeight in 1931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.078125 = fieldNorm(doc=1931)
          0.018522488 = weight(abstract_txt:based in 1931) [ClassicSimilarity], result of:
            0.018522488 = score(doc=1931,freq=1.0), product of:
              0.0744839 = queryWeight, product of:
                2.2819755 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.010254265 = queryNorm
              0.24867775 = fieldWeight in 1931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.078125 = fieldNorm(doc=1931)
          0.66885495 = weight(abstract_txt:bilingual in 1931) [ClassicSimilarity], result of:
            0.66885495 = score(doc=1931,freq=2.0), product of:
              0.79047465 = queryWeight, product of:
                10.065711 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.010254265 = queryNorm
              0.8461434 = fieldWeight in 1931, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.078125 = fieldNorm(doc=1931)
        0.24 = coord(6/25)