Document (#16485)

Haas, S.W.
Natural language processing : toward large-scale, robust systems
Annual review of information science and technology. 31(1996), pp. 83-119
State-of-the-art review of natural language processing, updating an earlier review published in ARIST 22(1987). Discusses important developments that have allowed for significant advances in the field of natural language processing: materials and resources; knowledge based systems and statistical approaches; and a strong emphasis on evaluation. Reviews some natural language processing applications and common problems still awaiting solution. Considers closely related applications such as language generation and the generation phase of machine translation, which face the same problems as natural language processing. Covers natural language methodologies for information retrieval only briefly.

Similar documents (author)

  1. Haas, S.W.: ¬A feasibility study of the case hierarchy model for the construction and porting of natural language interfaces (1990) 5.50
    5.504072 = sum of:
      5.504072 = weight(author_txt:haas in 8071) [ClassicSimilarity], result of:
        5.504072 = fieldWeight in 8071, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.806516 = idf(docFreq=17, maxDocs=44218)
          0.625 = fieldNorm(doc=8071)
  2. Haas, S.W.: Disciplinary variation in automatic sublanguage term identification (1997) 5.50
    5.504072 = sum of:
      5.504072 = weight(author_txt:haas in 6500) [ClassicSimilarity], result of:
        5.504072 = fieldWeight in 6500, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.806516 = idf(docFreq=17, maxDocs=44218)
          0.625 = fieldNorm(doc=6500)
  3. Haas, S.W.: ¬A text filter for the automatic identification of empirical articles (1996) 5.50
    5.504072 = sum of:
      5.504072 = weight(author_txt:haas in 6798) [ClassicSimilarity], result of:
        5.504072 = fieldWeight in 6798, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.806516 = idf(docFreq=17, maxDocs=44218)
          0.625 = fieldNorm(doc=6798)
  4. Haas, S.: Metadata mania : an overview (1998) 5.50
    5.504072 = sum of:
      5.504072 = weight(author_txt:haas in 2222) [ClassicSimilarity], result of:
        5.504072 = fieldWeight in 2222, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.806516 = idf(docFreq=17, maxDocs=44218)
          0.625 = fieldNorm(doc=2222)
  5. Haas, S.W.: Improving the search environment : informed decision making in the search for statistical information (2003) 5.50
    5.504072 = sum of:
      5.504072 = weight(author_txt:haas in 1687) [ClassicSimilarity], result of:
        5.504072 = fieldWeight in 1687, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.806516 = idf(docFreq=17, maxDocs=44218)
          0.625 = fieldNorm(doc=1687)
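
The author-match figures above can be reproduced by hand. The following is a minimal sketch, not the search engine's own code: it assumes the classic TF-IDF formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which are the ones that match the numbers in this listing. The function names are illustrative, and fieldNorm is simply taken from the listing rather than derived.

```python
import math

def classic_tf(freq: float) -> float:
    # Term-frequency factor: square root of the raw term frequency.
    return math.sqrt(freq)

def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency in the form that reproduces the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as in the explain tree.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values copied from the author_txt:haas entries above.
idf = classic_idf(doc_freq=17, max_docs=44218)
w = field_weight(freq=1.0, doc_freq=17, max_docs=44218, field_norm=0.625)
print(f"idf={idf:.4f} fieldWeight={w:.4f}")
```

Because all five author matches share the same freq, docFreq and fieldNorm, they all receive the same weight, which is why every entry is listed with 5.50.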

Similar documents (content)

  1. Chowdhury, G.G.: Natural language processing (2002) 0.35
    0.34524542 = sum of:
      0.34524542 = product of:
        1.2330194 = sum of:
          0.05940157 = weight(abstract_txt:translation in 4284) [ClassicSimilarity], result of:
            0.05940157 = score(doc=4284,freq=1.0), product of:
              0.1231235 = queryWeight, product of:
                1.0591737 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.018823778 = queryNorm
              0.4824552 = fieldWeight in 4284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.078125 = fieldNorm(doc=4284)
          0.028334964 = weight(abstract_txt:systems in 4284) [ClassicSimilarity], result of:
            0.028334964 = score(doc=4284,freq=2.0), product of:
              0.07516646 = queryWeight, product of:
                1.1703715 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.018823778 = queryNorm
              0.37696287 = fieldWeight in 4284, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.078125 = fieldNorm(doc=4284)
          0.19208167 = weight(abstract_txt:arist in 4284) [ClassicSimilarity], result of:
            0.19208167 = score(doc=4284,freq=1.0), product of:
              0.26923588 = queryWeight, product of:
                1.5662576 = boost
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.018823778 = queryNorm
              0.71343267 = fieldWeight in 4284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.131938 = idf(docFreq=12, maxDocs=44218)
                0.078125 = fieldNorm(doc=4284)
          0.05379972 = weight(abstract_txt:applications in 4284) [ClassicSimilarity], result of:
            0.05379972 = score(doc=4284,freq=1.0), product of:
              0.14521305 = queryWeight, product of:
                1.6267265 = boost
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.018823778 = queryNorm
              0.37048817 = fieldWeight in 4284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.078125 = fieldNorm(doc=4284)
          0.21394944 = weight(abstract_txt:processing in 4284) [ClassicSimilarity], result of:
            0.21394944 = score(doc=4284,freq=2.0), product of:
              0.39264172 = queryWeight, product of:
                4.2294116 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.018823778 = queryNorm
              0.5448974 = fieldWeight in 4284, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.078125 = fieldNorm(doc=4284)
          0.28877488 = weight(abstract_txt:language in 4284) [ClassicSimilarity], result of:
            0.28877488 = score(doc=4284,freq=5.0), product of:
              0.39526764 = queryWeight, product of:
                5.0210133 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.018823778 = queryNorm
              0.7305806 = fieldWeight in 4284, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=4284)
          0.39667717 = weight(abstract_txt:natural in 4284) [ClassicSimilarity], result of:
            0.39667717 = score(doc=4284,freq=4.0), product of:
              0.4998015 = queryWeight, product of:
                5.227224 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.018823778 = queryNorm
              0.79366946 = fieldWeight in 4284, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.078125 = fieldNorm(doc=4284)
        0.28 = coord(7/25)
  2. Jurafsky, D.; Martin, J.H.: Speech and language processing : an introduction to natural language processing, computational linguistics and speech recognition (2009) 0.27
    0.2741378 = sum of:
      0.2741378 = product of:
        1.1422409 = sum of:
          0.052066367 = weight(abstract_txt:emphasis in 1081) [ClassicSimilarity], result of:
            0.052066367 = score(doc=1081,freq=1.0), product of:
              0.11276661 = queryWeight, product of:
                1.0136476 = boost
                5.90999 = idf(docFreq=325, maxDocs=44218)
                0.018823778 = queryNorm
              0.46171796 = fieldWeight in 1081, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.90999 = idf(docFreq=325, maxDocs=44218)
                0.078125 = fieldNorm(doc=1081)
          0.020035842 = weight(abstract_txt:systems in 1081) [ClassicSimilarity], result of:
            0.020035842 = score(doc=1081,freq=1.0), product of:
              0.07516646 = queryWeight, product of:
                1.1703715 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.018823778 = queryNorm
              0.26655298 = fieldWeight in 1081, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.078125 = fieldNorm(doc=1081)
          0.05379972 = weight(abstract_txt:applications in 1081) [ClassicSimilarity], result of:
            0.05379972 = score(doc=1081,freq=1.0), product of:
              0.14521305 = queryWeight, product of:
                1.6267265 = boost
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.018823778 = queryNorm
              0.37048817 = fieldWeight in 1081, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.078125 = fieldNorm(doc=1081)
          0.37057135 = weight(abstract_txt:processing in 1081) [ClassicSimilarity], result of:
            0.37057135 = score(doc=1081,freq=6.0), product of:
              0.39264172 = queryWeight, product of:
                4.2294116 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.018823778 = queryNorm
              0.94379 = fieldWeight in 1081, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.078125 = fieldNorm(doc=1081)
          0.36527452 = weight(abstract_txt:language in 1081) [ClassicSimilarity], result of:
            0.36527452 = score(doc=1081,freq=8.0), product of:
              0.39526764 = queryWeight, product of:
                5.0210133 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.018823778 = queryNorm
              0.9241195 = fieldWeight in 1081, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=1081)
          0.2804931 = weight(abstract_txt:natural in 1081) [ClassicSimilarity], result of:
            0.2804931 = score(doc=1081,freq=2.0), product of:
              0.4998015 = queryWeight, product of:
                5.227224 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.018823778 = queryNorm
              0.561209 = fieldWeight in 1081, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.078125 = fieldNorm(doc=1081)
        0.24 = coord(6/25)
  3. McCray, A.T.: Natural language research program (1992) 0.27
    0.27377716 = sum of:
      0.27377716 = product of:
        1.3688858 = sum of:
          0.09998305 = weight(abstract_txt:briefly in 7273) [ClassicSimilarity], result of:
            0.09998305 = score(doc=7273,freq=1.0), product of:
              0.10975052 = queryWeight, product of:
                5.830419 = idf(docFreq=352, maxDocs=44218)
                0.018823778 = queryNorm
              0.911003 = fieldWeight in 7273, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.830419 = idf(docFreq=352, maxDocs=44218)
                0.15625 = fieldNorm(doc=7273)
          0.040071685 = weight(abstract_txt:systems in 7273) [ClassicSimilarity], result of:
            0.040071685 = score(doc=7273,freq=1.0), product of:
              0.07516646 = queryWeight, product of:
                1.1703715 = boost
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.018823778 = queryNorm
              0.53310597 = fieldWeight in 7273, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4118783 = idf(docFreq=3963, maxDocs=44218)
                0.15625 = fieldNorm(doc=7273)
          0.30257022 = weight(abstract_txt:processing in 7273) [ClassicSimilarity], result of:
            0.30257022 = score(doc=7273,freq=1.0), product of:
              0.39264172 = queryWeight, product of:
                4.2294116 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.018823778 = queryNorm
              0.7706013 = fieldWeight in 7273, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.15625 = fieldNorm(doc=7273)
          0.36527452 = weight(abstract_txt:language in 7273) [ClassicSimilarity], result of:
            0.36527452 = score(doc=7273,freq=2.0), product of:
              0.39526764 = queryWeight, product of:
                5.0210133 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.018823778 = queryNorm
              0.9241195 = fieldWeight in 7273, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.15625 = fieldNorm(doc=7273)
          0.5609862 = weight(abstract_txt:natural in 7273) [ClassicSimilarity], result of:
            0.5609862 = score(doc=7273,freq=2.0), product of:
              0.4998015 = queryWeight, product of:
                5.227224 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.018823778 = queryNorm
              1.122418 = fieldWeight in 7273, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.15625 = fieldNorm(doc=7273)
        0.2 = coord(5/25)
  4. Chowdhury, G.G.: Natural language processing and information retrieval : pt.1: basic issues; pt.2: major applications (1991) 0.27
    0.26633048 = sum of:
      0.26633048 = product of:
        1.6645656 = sum of:
          0.1022336 = weight(abstract_txt:covers in 3313) [ClassicSimilarity], result of:
            0.1022336 = score(doc=3313,freq=1.0), product of:
              0.11139134 = queryWeight, product of:
                1.0074475 = boost
                5.8738413 = idf(docFreq=337, maxDocs=44218)
                0.018823778 = queryNorm
              0.9177877 = fieldWeight in 3313, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8738413 = idf(docFreq=337, maxDocs=44218)
                0.15625 = fieldNorm(doc=3313)
          0.42789888 = weight(abstract_txt:processing in 3313) [ClassicSimilarity], result of:
            0.42789888 = score(doc=3313,freq=2.0), product of:
              0.39264172 = queryWeight, product of:
                4.2294116 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.018823778 = queryNorm
              1.0897948 = fieldWeight in 3313, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.15625 = fieldNorm(doc=3313)
          0.44736812 = weight(abstract_txt:language in 3313) [ClassicSimilarity], result of:
            0.44736812 = score(doc=3313,freq=3.0), product of:
              0.39526764 = queryWeight, product of:
                5.0210133 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.018823778 = queryNorm
              1.1318107 = fieldWeight in 3313, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.15625 = fieldNorm(doc=3313)
          0.687065 = weight(abstract_txt:natural in 3313) [ClassicSimilarity], result of:
            0.687065 = score(doc=3313,freq=3.0), product of:
              0.4998015 = queryWeight, product of:
                5.227224 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.018823778 = queryNorm
              1.3746758 = fieldWeight in 3313, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.15625 = fieldNorm(doc=3313)
        0.16 = coord(4/25)
  5. Handbook of terminology management : Vol.2: Application-oriented terminology management (2001) 0.25
    0.2549909 = sum of:
      0.2549909 = product of:
        0.91068184 = sum of:
          0.061340164 = weight(abstract_txt:covers in 1750) [ClassicSimilarity], result of:
            0.061340164 = score(doc=1750,freq=1.0), product of:
              0.11139134 = queryWeight, product of:
                1.0074475 = boost
                5.8738413 = idf(docFreq=337, maxDocs=44218)
                0.018823778 = queryNorm
              0.55067265 = fieldWeight in 1750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8738413 = idf(docFreq=337, maxDocs=44218)
                0.09375 = fieldNorm(doc=1750)
          0.07128189 = weight(abstract_txt:translation in 1750) [ClassicSimilarity], result of:
            0.07128189 = score(doc=1750,freq=1.0), product of:
              0.1231235 = queryWeight, product of:
                1.0591737 = boost
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.018823778 = queryNorm
              0.57894623 = fieldWeight in 1750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1754265 = idf(docFreq=249, maxDocs=44218)
                0.09375 = fieldNorm(doc=1750)
          0.048045497 = weight(abstract_txt:problems in 1750) [ClassicSimilarity], result of:
            0.048045497 = score(doc=1750,freq=1.0), product of:
              0.119252264 = queryWeight, product of:
                1.4741614 = boost
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.018823778 = queryNorm
              0.4028896 = fieldWeight in 1750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.09375 = fieldNorm(doc=1750)
          0.09130114 = weight(abstract_txt:applications in 1750) [ClassicSimilarity], result of:
            0.09130114 = score(doc=1750,freq=2.0), product of:
              0.14521305 = queryWeight, product of:
                1.6267265 = boost
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.018823778 = queryNorm
              0.62873924 = fieldWeight in 1750, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7422485 = idf(docFreq=1047, maxDocs=44218)
                0.09375 = fieldNorm(doc=1750)
          0.18154211 = weight(abstract_txt:processing in 1750) [ClassicSimilarity], result of:
            0.18154211 = score(doc=1750,freq=1.0), product of:
              0.39264172 = queryWeight, product of:
                4.2294116 = boost
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.018823778 = queryNorm
              0.46236074 = fieldWeight in 1750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.931848 = idf(docFreq=866, maxDocs=44218)
                0.09375 = fieldNorm(doc=1750)
          0.21916473 = weight(abstract_txt:language in 1750) [ClassicSimilarity], result of:
            0.21916473 = score(doc=1750,freq=2.0), product of:
              0.39526764 = queryWeight, product of:
                5.0210133 = boost
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.018823778 = queryNorm
              0.55447173 = fieldWeight in 1750, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.09375 = fieldNorm(doc=1750)
          0.2380063 = weight(abstract_txt:natural in 1750) [ClassicSimilarity], result of:
            0.2380063 = score(doc=1750,freq=1.0), product of:
              0.4998015 = queryWeight, product of:
                5.227224 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.018823778 = queryNorm
              0.47620165 = fieldWeight in 1750, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.09375 = fieldNorm(doc=1750)
        0.28 = coord(7/25)
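
The content-match scores combine several per-term scores with a coordination factor. The sketch below recomputes the total for entry 4 (doc 3313) from the values shown in its explain tree. It assumes each term score is queryWeight * fieldWeight, with queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm, and that the sum is multiplied by coord = matched terms / query terms. The function and variable names are illustrative only.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.018823778  # taken from the listing

def idf(doc_freq: int) -> float:
    # Same idf form as in the listing: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))

def term_score(boost: float, freq: float, doc_freq: int, field_norm: float) -> float:
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight

# (term, boost, freq, docFreq, fieldNorm) copied from the doc 3313 entry.
terms_3313 = [
    ("covers",     1.0074475, 1.0, 337,  0.15625),
    ("processing", 4.2294116, 2.0, 866,  0.15625),
    ("language",   5.0210133, 3.0, 1834, 0.15625),
    ("natural",    5.227224,  3.0, 747,  0.15625),
]

raw_sum = sum(term_score(b, f, df, fn) for _, b, f, df, fn in terms_3313)
coord = len(terms_3313) / 25  # 4 of the 25 query terms matched
total = raw_sum * coord
print(f"sum={raw_sum:.4f} coord={coord:.2f} total={total:.4f}")
```

The coord factor explains why a document with a high raw sum can still rank below one with a lower sum: doc 3313 has the largest sum in this list but matches only 4 of the 25 query terms.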