Document (#16479)

Author
Lonsdale, D.
Mitamura, T.
Nyberg, E.
Title
Acquisition of large lexicons for practical knowledge-based MT
Source
Machine translation. 9(1994/95) nos.3/4, pp.251-283
Year
1994/95
Abstract
Although knowledge-based MT systems have the potential to achieve high translation accuracy, each successful application system requires a large amount of hand-coded lexical knowledge. Systems like KBMT-89 and its descendants have demonstrated how knowledge-based translation can produce good results in technical domains with tractable domain semantics. Nevertheless, the magnitude of the development task for large-scale applications with tens of thousands of domain concepts precludes a purely hand-crafted approach. The current challenge for the next generation of knowledge-based MT systems is to utilize online textual resources and corpus analysis software in order to automate the most laborious aspects of the knowledge acquisition process. This partial automation can in turn maximize the productivity of human knowledge engineers and help to make large-scale applications of knowledge-based MT a viable approach. Discusses the corpus-based knowledge acquisition methodology used in KANT, a knowledge-based translation system for multilingual document production. This methodology can be generalized beyond the KANT interlingua approach for use with any system that requires similar kinds of knowledge.
Theme
Computational linguistics
Multilingual problems
Object
KBMT-89

Similar documents (content)

  1. Knight, K.: Automatic knowledge acquisition for machine translation (1997) 0.25
    0.2519919 = sum of:
      0.2519919 = product of:
        1.2599595 = sum of:
          0.24134076 = weight(abstract_txt:automate in 4248) [ClassicSimilarity], result of:
            0.24134076 = score(doc=4248,freq=1.0), product of:
              0.16113766 = queryWeight, product of:
                1.0500548 = boost
                7.9878955 = idf(docFreq=40, maxDocs=44421)
                0.01921112 = queryNorm
              1.4977304 = fieldWeight in 4248, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9878955 = idf(docFreq=40, maxDocs=44421)
                0.1875 = fieldNorm(doc=4248)
          0.33464155 = weight(abstract_txt:translation in 4248) [ClassicSimilarity], result of:
            0.33464155 = score(doc=4248,freq=1.0), product of:
              0.28898162 = queryWeight, product of:
                2.435618 = boost
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.01921112 = queryNorm
              1.1580029 = fieldWeight in 4248, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.1875 = fieldNorm(doc=4248)
          0.34475833 = weight(abstract_txt:acquisition in 4248) [ClassicSimilarity], result of:
            0.34475833 = score(doc=4248,freq=1.0), product of:
              0.29477695 = queryWeight, product of:
                2.459919 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.01921112 = queryNorm
              1.1695566 = fieldWeight in 4248, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.1875 = fieldNorm(doc=4248)
          0.106898665 = weight(abstract_txt:based in 4248) [ClassicSimilarity], result of:
            0.106898665 = score(doc=4248,freq=1.0), product of:
              0.17911176 = queryWeight, product of:
                2.9290347 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.01921112 = queryNorm
              0.5968266 = fieldWeight in 4248, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.1875 = fieldNorm(doc=4248)
          0.23232007 = weight(abstract_txt:knowledge in 4248) [ClassicSimilarity], result of:
            0.23232007 = score(doc=4248,freq=1.0), product of:
              0.34938046 = queryWeight, product of:
                5.1281304 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.01921112 = queryNorm
              0.66494864 = fieldWeight in 4248, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.1875 = fieldNorm(doc=4248)
        0.2 = coord(5/25)
    
  2. Dorr, B.J.: Large-scale dictionary construction for foreign language tutoring and interlingual machine translation (1997) 0.23
    0.23009254 = sum of:
      0.23009254 = product of:
        0.7190392 = sum of:
          0.13496219 = weight(abstract_txt:lexicons in 4244) [ClassicSimilarity], result of:
            0.13496219 = score(doc=4244,freq=1.0), product of:
              0.19606142 = queryWeight, product of:
                1.1582692 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.01921112 = queryNorm
              0.6883669 = fieldWeight in 4244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.078125 = fieldNorm(doc=4244)
          0.023493927 = weight(abstract_txt:systems in 4244) [ClassicSimilarity], result of:
            0.023493927 = score(doc=4244,freq=1.0), product of:
              0.08815797 = queryWeight, product of:
                1.3452557 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.01921112 = queryNorm
              0.26649806 = fieldWeight in 4244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.078125 = fieldNorm(doc=4244)
          0.09128421 = weight(abstract_txt:scale in 4244) [ClassicSimilarity], result of:
            0.09128421 = score(doc=4244,freq=2.0), product of:
              0.15107103 = queryWeight, product of:
                1.437868 = boost
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.01921112 = queryNorm
              0.604247 = fieldWeight in 4244, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.078125 = fieldNorm(doc=4244)
          0.04383003 = weight(abstract_txt:approach in 4244) [ClassicSimilarity], result of:
            0.04383003 = score(doc=4244,freq=2.0), product of:
              0.106038205 = queryWeight, product of:
                1.4753846 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.01921112 = queryNorm
              0.41334188 = fieldWeight in 4244, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=4244)
          0.0978444 = weight(abstract_txt:large in 4244) [ClassicSimilarity], result of:
            0.0978444 = score(doc=4244,freq=2.0), product of:
              0.19935083 = queryWeight, product of:
                2.3358903 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.01921112 = queryNorm
              0.4908151 = fieldWeight in 4244, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.078125 = fieldNorm(doc=4244)
          0.13943397 = weight(abstract_txt:translation in 4244) [ClassicSimilarity], result of:
            0.13943397 = score(doc=4244,freq=1.0), product of:
              0.28898162 = queryWeight, product of:
                2.435618 = boost
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.01921112 = queryNorm
              0.48250115 = fieldWeight in 4244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.078125 = fieldNorm(doc=4244)
          0.14364931 = weight(abstract_txt:acquisition in 4244) [ClassicSimilarity], result of:
            0.14364931 = score(doc=4244,freq=1.0), product of:
              0.29477695 = queryWeight, product of:
                2.459919 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.01921112 = queryNorm
              0.4873153 = fieldWeight in 4244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.078125 = fieldNorm(doc=4244)
          0.04454111 = weight(abstract_txt:based in 4244) [ClassicSimilarity], result of:
            0.04454111 = score(doc=4244,freq=1.0), product of:
              0.17911176 = queryWeight, product of:
                2.9290347 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.01921112 = queryNorm
              0.24867775 = fieldWeight in 4244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.078125 = fieldNorm(doc=4244)
        0.32 = coord(8/25)
    
  3. Li, L.X.; Xu, L.D.: Knowledge-based problem solving (2002) 0.22
    0.21657714 = sum of:
      0.21657714 = product of:
        0.60160315 = sum of:
          0.06079693 = weight(abstract_txt:coded in 5259) [ClassicSimilarity], result of:
            0.06079693 = score(doc=5259,freq=1.0), product of:
              0.14614137 = queryWeight, product of:
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.01921112 = queryNorm
              0.41601452 = fieldWeight in 5259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5259)
          0.050591867 = weight(abstract_txt:domain in 5259) [ClassicSimilarity], result of:
            0.050591867 = score(doc=5259,freq=3.0), product of:
              0.11294719 = queryWeight, product of:
                1.243272 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.01921112 = queryNorm
              0.44792497 = fieldWeight in 5259, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5259)
          0.029348904 = weight(abstract_txt:applications in 5259) [ClassicSimilarity], result of:
            0.029348904 = score(doc=5259,freq=1.0), product of:
              0.11330698 = queryWeight, product of:
                1.2452506 = boost
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.01921112 = queryNorm
              0.25902116 = fieldWeight in 5259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5259)
          0.015896581 = weight(abstract_txt:system in 5259) [ClassicSimilarity], result of:
            0.015896581 = score(doc=5259,freq=1.0), product of:
              0.08618432 = queryWeight, product of:
                1.330112 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.01921112 = queryNorm
              0.18444863 = fieldWeight in 5259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5259)
          0.032891497 = weight(abstract_txt:systems in 5259) [ClassicSimilarity], result of:
            0.032891497 = score(doc=5259,freq=4.0), product of:
              0.08815797 = queryWeight, product of:
                1.3452557 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.01921112 = queryNorm
              0.37309727 = fieldWeight in 5259, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5259)
          0.02169476 = weight(abstract_txt:approach in 5259) [ClassicSimilarity], result of:
            0.02169476 = score(doc=5259,freq=1.0), product of:
              0.106038205 = queryWeight, product of:
                1.4753846 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.01921112 = queryNorm
              0.20459381 = fieldWeight in 5259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5259)
          0.052534036 = weight(abstract_txt:requires in 5259) [ClassicSimilarity], result of:
            0.052534036 = score(doc=5259,freq=1.0), product of:
              0.1670408 = queryWeight, product of:
                1.5119579 = boost
                5.750825 = idf(docFreq=383, maxDocs=44421)
                0.01921112 = queryNorm
              0.31449825 = fieldWeight in 5259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.750825 = idf(docFreq=383, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5259)
          0.09353633 = weight(abstract_txt:based in 5259) [ClassicSimilarity], result of:
            0.09353633 = score(doc=5259,freq=9.0), product of:
              0.17911176 = queryWeight, product of:
                2.9290347 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.01921112 = queryNorm
              0.5222233 = fieldWeight in 5259, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5259)
          0.24431221 = weight(abstract_txt:knowledge in 5259) [ClassicSimilarity], result of:
            0.24431221 = score(doc=5259,freq=13.0), product of:
              0.34938046 = queryWeight, product of:
                5.1281304 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.01921112 = queryNorm
              0.6992727 = fieldWeight in 5259, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5259)
        0.36 = coord(9/25)
    
  4. Xu, Y.; Li, G.; Mou, L.; Lu, Y.: Learning non-taxonomic relations on demand for ontology extension (2014) 0.20
    0.19693846 = sum of:
      0.19693846 = product of:
        0.6154327 = sum of:
          0.10796975 = weight(abstract_txt:crafted in 3961) [ClassicSimilarity], result of:
            0.10796975 = score(doc=3961,freq=1.0), product of:
              0.19606142 = queryWeight, product of:
                1.1582692 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.01921112 = queryNorm
              0.5506935 = fieldWeight in 3961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.0625 = fieldNorm(doc=3961)
          0.1310512 = weight(abstract_txt:laborious in 3961) [ClassicSimilarity], result of:
            0.1310512 = score(doc=3961,freq=1.0), product of:
              0.22309239 = queryWeight, product of:
                1.2355372 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.01921112 = queryNorm
              0.5874302 = fieldWeight in 3961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=3961)
          0.033381976 = weight(abstract_txt:domain in 3961) [ClassicSimilarity], result of:
            0.033381976 = score(doc=3961,freq=1.0), product of:
              0.11294719 = queryWeight, product of:
                1.243272 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.01921112 = queryNorm
              0.29555383 = fieldWeight in 3961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0625 = fieldNorm(doc=3961)
          0.035064027 = weight(abstract_txt:approach in 3961) [ClassicSimilarity], result of:
            0.035064027 = score(doc=3961,freq=2.0), product of:
              0.106038205 = queryWeight, product of:
                1.4753846 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.01921112 = queryNorm
              0.33067352 = fieldWeight in 3961, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=3961)
          0.059957486 = weight(abstract_txt:hand in 3961) [ClassicSimilarity], result of:
            0.059957486 = score(doc=3961,freq=1.0), product of:
              0.16688976 = queryWeight, product of:
                1.5112742 = boost
                5.7482243 = idf(docFreq=384, maxDocs=44421)
                0.01921112 = queryNorm
              0.35926402 = fieldWeight in 3961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7482243 = idf(docFreq=384, maxDocs=44421)
                0.0625 = fieldNorm(doc=3961)
          0.071370885 = weight(abstract_txt:corpus in 3961) [ClassicSimilarity], result of:
            0.071370885 = score(doc=3961,freq=1.0), product of:
              0.18744828 = queryWeight, product of:
                1.6016556 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.01921112 = queryNorm
              0.38074973 = fieldWeight in 3961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.0625 = fieldNorm(doc=3961)
          0.11491945 = weight(abstract_txt:acquisition in 3961) [ClassicSimilarity], result of:
            0.11491945 = score(doc=3961,freq=1.0), product of:
              0.29477695 = queryWeight, product of:
                2.459919 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.01921112 = queryNorm
              0.38985223 = fieldWeight in 3961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.0625 = fieldNorm(doc=3961)
          0.061717972 = weight(abstract_txt:based in 3961) [ClassicSimilarity], result of:
            0.061717972 = score(doc=3961,freq=3.0), product of:
              0.17911176 = queryWeight, product of:
                2.9290347 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.01921112 = queryNorm
              0.344578 = fieldWeight in 3961, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=3961)
        0.32 = coord(8/25)
    
  5. Basili, R.; Pazienza, M.T.; Velardi, P.: An empirical symbolic approach to natural language processing (1996) 0.19
    0.1894775 = sum of:
      0.1894775 = product of:
        0.67670536 = sum of:
          0.05869781 = weight(abstract_txt:applications in 6821) [ClassicSimilarity], result of:
            0.05869781 = score(doc=6821,freq=1.0), product of:
              0.11330698 = queryWeight, product of:
                1.2452506 = boost
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.01921112 = queryNorm
              0.5180423 = fieldWeight in 6821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.109375 = fieldNorm(doc=6821)
          0.031793162 = weight(abstract_txt:system in 6821) [ClassicSimilarity], result of:
            0.031793162 = score(doc=6821,freq=1.0), product of:
              0.08618432 = queryWeight, product of:
                1.330112 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.01921112 = queryNorm
              0.36889726 = fieldWeight in 6821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.109375 = fieldNorm(doc=6821)
          0.09036676 = weight(abstract_txt:scale in 6821) [ClassicSimilarity], result of:
            0.09036676 = score(doc=6821,freq=1.0), product of:
              0.15107103 = queryWeight, product of:
                1.437868 = boost
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.01921112 = queryNorm
              0.598174 = fieldWeight in 6821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.109375 = fieldNorm(doc=6821)
          0.09686101 = weight(abstract_txt:large in 6821) [ClassicSimilarity], result of:
            0.09686101 = score(doc=6821,freq=1.0), product of:
              0.19935083 = queryWeight, product of:
                2.3358903 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.01921112 = queryNorm
              0.48588216 = fieldWeight in 6821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.109375 = fieldNorm(doc=6821)
          0.20110904 = weight(abstract_txt:acquisition in 6821) [ClassicSimilarity], result of:
            0.20110904 = score(doc=6821,freq=1.0), product of:
              0.29477695 = queryWeight, product of:
                2.459919 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.01921112 = queryNorm
              0.6822414 = fieldWeight in 6821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.109375 = fieldNorm(doc=6821)
          0.062357556 = weight(abstract_txt:based in 6821) [ClassicSimilarity], result of:
            0.062357556 = score(doc=6821,freq=1.0), product of:
              0.17911176 = queryWeight, product of:
                2.9290347 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.01921112 = queryNorm
              0.34814885 = fieldWeight in 6821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.109375 = fieldNorm(doc=6821)
          0.13552004 = weight(abstract_txt:knowledge in 6821) [ClassicSimilarity], result of:
            0.13552004 = score(doc=6821,freq=1.0), product of:
              0.34938046 = queryWeight, product of:
                5.1281304 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.01921112 = queryNorm
              0.3878867 = fieldWeight in 6821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.109375 = fieldNorm(doc=6821)
        0.28 = coord(7/25)
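
The score trees above all follow the same arithmetic, which can be checked by hand. The sketch below recomputes the score of entry 1 (doc=4248) from the values printed in its tree. It assumes the relations that the listed numbers appear to satisfy: idf = 1 + ln(maxDocs / (docFreq + 1)), tf = sqrt(termFreq), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and a final multiplication by coord. The function and constant names are illustrative only and are not taken from the catalog software.

```python
import math

# Values copied from the explain tree of entry 1 (doc=4248).
MAX_DOCS = 44421
QUERY_NORM = 0.01921112
FIELD_NORM = 0.1875

# (term, boost, docFreq, termFreq) for each matching abstract term.
TERMS = [
    ("automate", 1.0500548, 40, 1.0),
    ("translation", 2.435618, 250, 1.0),
    ("acquisition", 2.459919, 235, 1.0),
    ("based", 2.9290347, 5005, 1.0),
    ("knowledge", 5.1281304, 3480, 1.0),
]


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency, as inferred from the listed idf values."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost, doc_freq, term_freq, field_norm=FIELD_NORM):
    """Score of one term: queryWeight * fieldWeight."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(term_freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


def document_score(terms, matched, total):
    """Sum of term scores, scaled by coord = matched / total query terms."""
    coord = matched / total
    return coord * sum(term_score(b, df, tf) for _, b, df, tf in terms)


score = document_score(TERMS, matched=5, total=25)
print(f"recomputed score for doc 4248: {score:.7f}")
```

The recomputed value should agree with the listed 0.2519919 up to floating-point rounding, since the catalog's figures are printed in single precision. The other entries can be checked the same way by substituting their own fieldNorm, term list, and coord fraction.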