Document (#15822)

Author
Basili, R.
Pazienza, M.T.
Velardi, P.
Title
¬An empirical symbolic approach to natural language processing
Source
Artificial intelligence. 85(1996) nos.1/2, S.59-99
Year
1996
Abstract
Describes and evaluates the results of a large scale lexical learning system, ARISTO-LEX, that uses a combination of probabilisitc and knowledge based methods for the acquisition of selectional restrictions of words in sublanguages. Presents experimental data obtained from different corpora in different doamins and languages, and shows that the acquired lexical data not only have practical applications in natural language processing, but they are useful for a comparative analysis of sublanguages
Theme
Computerlinguistik

Similar documents (content)

  1. Stede, M.: Lexicalization in natural language generation (2002) 0.18
    0.1845881 = sum of:
      0.1845881 = product of:
        0.57683784 = sum of:
          0.027201338 = weight(abstract_txt:languages in 5245) [ClassicSimilarity], result of:
            0.027201338 = score(doc=5245,freq=1.0), product of:
              0.09584249 = queryWeight, product of:
                1.0819322 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.01706923 = queryNorm
              0.28381294 = fieldWeight in 5245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5245)
          0.051784422 = weight(abstract_txt:words in 5245) [ClassicSimilarity], result of:
            0.051784422 = score(doc=5245,freq=3.0), product of:
              0.10207599 = queryWeight, product of:
                1.1165619 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.01706923 = queryNorm
              0.5073125 = fieldWeight in 5245, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5245)
          0.031833638 = weight(abstract_txt:scale in 5245) [ClassicSimilarity], result of:
            0.031833638 = score(doc=5245,freq=1.0), product of:
              0.10643605 = queryWeight, product of:
                1.1401589 = boost
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.01706923 = queryNorm
              0.299087 = fieldWeight in 5245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5245)
          0.019078141 = weight(abstract_txt:different in 5245) [ClassicSimilarity], result of:
            0.019078141 = score(doc=5245,freq=1.0), product of:
              0.0953232 = queryWeight, product of:
                1.5259324 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.01706923 = queryNorm
              0.20014164 = fieldWeight in 5245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5245)
          0.06315236 = weight(abstract_txt:language in 5245) [ClassicSimilarity], result of:
            0.06315236 = score(doc=5245,freq=5.0), product of:
              0.12381625 = queryWeight, product of:
                1.739101 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.01706923 = queryNorm
              0.51004905 = fieldWeight in 5245, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5245)
          0.046493925 = weight(abstract_txt:processing in 5245) [ClassicSimilarity], result of:
            0.046493925 = score(doc=5245,freq=1.0), product of:
              0.17262568 = queryWeight, product of:
                2.0534716 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.01706923 = queryNorm
              0.26933378 = fieldWeight in 5245, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5245)
          0.07171161 = weight(abstract_txt:natural in 5245) [ClassicSimilarity], result of:
            0.07171161 = score(doc=5245,freq=2.0), product of:
              0.18290444 = queryWeight, product of:
                2.1137233 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.01706923 = queryNorm
              0.39207146 = fieldWeight in 5245, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5245)
          0.2655824 = weight(abstract_txt:lexical in 5245) [ClassicSimilarity], result of:
            0.2655824 = score(doc=5245,freq=6.0), product of:
              0.3035687 = queryWeight, product of:
                2.7231057 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.01706923 = queryNorm
              0.87486756 = fieldWeight in 5245, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0546875 = fieldNorm(doc=5245)
        0.32 = coord(8/25)
    
  2. Markó, K.G.: Foundation, implementation and evaluation of the MorphoSaurus system (2008) 0.16
    0.15742931 = sum of:
      0.15742931 = product of:
        0.4373036 = sum of:
          0.043445744 = weight(abstract_txt:languages in 415) [ClassicSimilarity], result of:
            0.043445744 = score(doc=415,freq=5.0), product of:
              0.09584249 = queryWeight, product of:
                1.0819322 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.01706923 = queryNorm
              0.45330358 = fieldWeight in 415, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.0390625 = fieldNorm(doc=415)
          0.022738313 = weight(abstract_txt:scale in 415) [ClassicSimilarity], result of:
            0.022738313 = score(doc=415,freq=1.0), product of:
              0.10643605 = queryWeight, product of:
                1.1401589 = boost
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.01706923 = queryNorm
              0.21363357 = fieldWeight in 415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.0390625 = fieldNorm(doc=415)
          0.033735715 = weight(abstract_txt:acquisition in 415) [ClassicSimilarity], result of:
            0.033735715 = score(doc=415,freq=1.0), product of:
              0.13845539 = queryWeight, product of:
                1.3003969 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.01706923 = queryNorm
              0.24365765 = fieldWeight in 415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.0390625 = fieldNorm(doc=415)
          0.049884915 = weight(abstract_txt:acquired in 415) [ClassicSimilarity], result of:
            0.049884915 = score(doc=415,freq=1.0), product of:
              0.17970607 = queryWeight, product of:
                1.4815024 = boost
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.01706923 = queryNorm
              0.2775917 = fieldWeight in 415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.0390625 = fieldNorm(doc=415)
          0.019271834 = weight(abstract_txt:different in 415) [ClassicSimilarity], result of:
            0.019271834 = score(doc=415,freq=2.0), product of:
              0.0953232 = queryWeight, product of:
                1.5259324 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.01706923 = queryNorm
              0.20217359 = fieldWeight in 415, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0390625 = fieldNorm(doc=415)
          0.040346567 = weight(abstract_txt:language in 415) [ClassicSimilarity], result of:
            0.040346567 = score(doc=415,freq=4.0), product of:
              0.12381625 = queryWeight, product of:
                1.739101 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.01706923 = queryNorm
              0.3258584 = fieldWeight in 415, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0390625 = fieldNorm(doc=415)
          0.057521313 = weight(abstract_txt:processing in 415) [ClassicSimilarity], result of:
            0.057521313 = score(doc=415,freq=3.0), product of:
              0.17262568 = queryWeight, product of:
                2.0534716 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.01706923 = queryNorm
              0.33321413 = fieldWeight in 415, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.0390625 = fieldNorm(doc=415)
          0.036219835 = weight(abstract_txt:natural in 415) [ClassicSimilarity], result of:
            0.036219835 = score(doc=415,freq=1.0), product of:
              0.18290444 = queryWeight, product of:
                2.1137233 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.01706923 = queryNorm
              0.198026 = fieldWeight in 415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.0390625 = fieldNorm(doc=415)
          0.13413936 = weight(abstract_txt:lexical in 415) [ClassicSimilarity], result of:
            0.13413936 = score(doc=415,freq=3.0), product of:
              0.3035687 = queryWeight, product of:
                2.7231057 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.01706923 = queryNorm
              0.4418748 = fieldWeight in 415, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0390625 = fieldNorm(doc=415)
        0.36 = coord(9/25)
    
  3. Conlon, S.P.N.; Evens, M.; Ahlswede, T.: Developing a large lexical database for information retrieval, parsing, and text generation systems (1993) 0.15
    0.14525656 = sum of:
      0.14525656 = product of:
        0.6052357 = sum of:
          0.036972098 = weight(abstract_txt:shows in 5812) [ClassicSimilarity], result of:
            0.036972098 = score(doc=5812,freq=1.0), product of:
              0.092714146 = queryWeight, product of:
                1.0641283 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.01706923 = queryNorm
              0.39877516 = fieldWeight in 5812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.078125 = fieldNorm(doc=5812)
          0.042711075 = weight(abstract_txt:words in 5812) [ClassicSimilarity], result of:
            0.042711075 = score(doc=5812,freq=1.0), product of:
              0.10207599 = queryWeight, product of:
                1.1165619 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.01706923 = queryNorm
              0.4184243 = fieldWeight in 5812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.078125 = fieldNorm(doc=5812)
          0.040346567 = weight(abstract_txt:language in 5812) [ClassicSimilarity], result of:
            0.040346567 = score(doc=5812,freq=1.0), product of:
              0.12381625 = queryWeight, product of:
                1.739101 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.01706923 = queryNorm
              0.3258584 = fieldWeight in 5812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=5812)
          0.0664199 = weight(abstract_txt:processing in 5812) [ClassicSimilarity], result of:
            0.0664199 = score(doc=5812,freq=1.0), product of:
              0.17262568 = queryWeight, product of:
                2.0534716 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.01706923 = queryNorm
              0.38476256 = fieldWeight in 5812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.078125 = fieldNorm(doc=5812)
          0.07243967 = weight(abstract_txt:natural in 5812) [ClassicSimilarity], result of:
            0.07243967 = score(doc=5812,freq=1.0), product of:
              0.18290444 = queryWeight, product of:
                2.1137233 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.01706923 = queryNorm
              0.396052 = fieldWeight in 5812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.078125 = fieldNorm(doc=5812)
          0.34634635 = weight(abstract_txt:lexical in 5812) [ClassicSimilarity], result of:
            0.34634635 = score(doc=5812,freq=5.0), product of:
              0.3035687 = queryWeight, product of:
                2.7231057 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.01706923 = queryNorm
              1.1409159 = fieldWeight in 5812, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.078125 = fieldNorm(doc=5812)
        0.24 = coord(6/25)
    
  4. Sánchez-de-Madariaga, R.; Fernández-del-Castillo, J.R.: ¬The bootstrapping of the Yarowsky algorithm in real corpora (2009) 0.14
    0.14166856 = sum of:
      0.14166856 = product of:
        0.59028566 = sum of:
          0.044366516 = weight(abstract_txt:shows in 3451) [ClassicSimilarity], result of:
            0.044366516 = score(doc=3451,freq=1.0), product of:
              0.092714146 = queryWeight, product of:
                1.0641283 = boost
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.01706923 = queryNorm
              0.47853017 = fieldWeight in 3451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.104322 = idf(docFreq=732, maxDocs=44421)
                0.09375 = fieldNorm(doc=3451)
          0.08096571 = weight(abstract_txt:acquisition in 3451) [ClassicSimilarity], result of:
            0.08096571 = score(doc=3451,freq=1.0), product of:
              0.13845539 = queryWeight, product of:
                1.3003969 = boost
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.01706923 = queryNorm
              0.5847783 = fieldWeight in 3451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2376356 = idf(docFreq=235, maxDocs=44421)
                0.09375 = fieldNorm(doc=3451)
          0.22985156 = weight(abstract_txt:corpora in 3451) [ClassicSimilarity], result of:
            0.22985156 = score(doc=3451,freq=4.0), product of:
              0.17487219 = queryWeight, product of:
                1.4614413 = boost
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.01706923 = queryNorm
              1.3143975 = fieldWeight in 3451, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.09375 = fieldNorm(doc=3451)
          0.068470396 = weight(abstract_txt:language in 3451) [ClassicSimilarity], result of:
            0.068470396 = score(doc=3451,freq=2.0), product of:
              0.12381625 = queryWeight, product of:
                1.739101 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.01706923 = queryNorm
              0.5530001 = fieldWeight in 3451, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.09375 = fieldNorm(doc=3451)
          0.079703875 = weight(abstract_txt:processing in 3451) [ClassicSimilarity], result of:
            0.079703875 = score(doc=3451,freq=1.0), product of:
              0.17262568 = queryWeight, product of:
                2.0534716 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.01706923 = queryNorm
              0.46171504 = fieldWeight in 3451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.09375 = fieldNorm(doc=3451)
          0.0869276 = weight(abstract_txt:natural in 3451) [ClassicSimilarity], result of:
            0.0869276 = score(doc=3451,freq=1.0), product of:
              0.18290444 = queryWeight, product of:
                2.1137233 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.01706923 = queryNorm
              0.4752624 = fieldWeight in 3451, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.09375 = fieldNorm(doc=3451)
        0.24 = coord(6/25)
    
  5. Talvensaari, T.; Laurikkala, J.; Järvelin, K.; Juhola, M.: ¬A study on automatic creation of a comparable document collection in cross-language information retrieval (2006) 0.14
    0.13929199 = sum of:
      0.13929199 = product of:
        0.43528748 = sum of:
          0.024546077 = weight(abstract_txt:practical in 601) [ClassicSimilarity], result of:
            0.024546077 = score(doc=601,freq=1.0), product of:
              0.08187626 = queryWeight, product of:
                4.7967167 = idf(docFreq=996, maxDocs=44421)
                0.01706923 = queryNorm
              0.2997948 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7967167 = idf(docFreq=996, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.053844683 = weight(abstract_txt:languages in 601) [ClassicSimilarity], result of:
            0.053844683 = score(doc=601,freq=3.0), product of:
              0.09584249 = queryWeight, product of:
                1.0819322 = boost
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.01706923 = queryNorm
              0.5618039 = fieldWeight in 601, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.189722 = idf(docFreq=672, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.034168858 = weight(abstract_txt:words in 601) [ClassicSimilarity], result of:
            0.034168858 = score(doc=601,freq=1.0), product of:
              0.10207599 = queryWeight, product of:
                1.1165619 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.01706923 = queryNorm
              0.33473945 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.0363813 = weight(abstract_txt:scale in 601) [ClassicSimilarity], result of:
            0.0363813 = score(doc=601,freq=1.0), product of:
              0.10643605 = queryWeight, product of:
                1.1401589 = boost
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.01706923 = queryNorm
              0.3418137 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4690194 = idf(docFreq=508, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.10835307 = weight(abstract_txt:corpora in 601) [ClassicSimilarity], result of:
            0.10835307 = score(doc=601,freq=2.0), product of:
              0.17487219 = queryWeight, product of:
                1.4614413 = boost
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.01706923 = queryNorm
              0.61961293 = fieldWeight in 601, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.01012 = idf(docFreq=108, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.02180359 = weight(abstract_txt:different in 601) [ClassicSimilarity], result of:
            0.02180359 = score(doc=601,freq=1.0), product of:
              0.0953232 = queryWeight, product of:
                1.5259324 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.01706923 = queryNorm
              0.2287333 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.032277253 = weight(abstract_txt:language in 601) [ClassicSimilarity], result of:
            0.032277253 = score(doc=601,freq=1.0), product of:
              0.12381625 = queryWeight, product of:
                1.739101 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.01706923 = queryNorm
              0.26068673 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
          0.12391263 = weight(abstract_txt:lexical in 601) [ClassicSimilarity], result of:
            0.12391263 = score(doc=601,freq=1.0), product of:
              0.3035687 = queryWeight, product of:
                2.7231057 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.01706923 = queryNorm
              0.40818647 = fieldWeight in 601, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0625 = fieldNorm(doc=601)
        0.32 = coord(8/25)