Document (#13137)

Author
Losee, R.M.
Title
Learning syntactic rules and tags with genetic algorithms for information retrieval and filtering : an empirical basis for grammatical rules
Source
Information processing and management. 32(1996) no.2, p.185-197
Year
1996
Abstract
The grammars of natural languages may be learned by using genetic algorithms that reproduce and mutate grammatical rules and parts of speech tags, improving the quality of later generations of grammatical components. Syntactic rules are randomly generated and then evolve; those rules resulting in improved parsing and occasionally improved filtering performance are allowed to further propagate. The LUST system learns the characteristics of the language or sublanguage used in document abstracts by learning from the document rankings obtained from the parsed abstracts. Unlike the application of traditional linguistic rules to retrieval and filtering applications, LUST develops grammatical structures and tags without the prior imposition of some common grammatical assumptions (e.g. part of speech assumptions), producing grammars that are empirically based and are optimized for this particular application.
Theme
Computational linguistics

Similar documents (author)

  1. Losee, R.M.: ¬A Gray code based ordering for documents on shelves : classification for browsing and retrieval (1992) 5.19
    5.187669 = sum of:
      5.187669 = weight(author_txt:losee in 2334) [ClassicSimilarity], result of:
        5.187669 = fieldWeight in 2334, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.30027 = idf(docFreq=29, maxDocs=44421)
          0.625 = fieldNorm(doc=2334)
    
  2. Losee, R.M.: ¬The relative shelf location of circulated books : a study of classification, users, and browsing (1993) 5.19
    5.187669 = sum of:
      5.187669 = weight(author_txt:losee in 4484) [ClassicSimilarity], result of:
        5.187669 = fieldWeight in 4484, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.30027 = idf(docFreq=29, maxDocs=44421)
          0.625 = fieldNorm(doc=4484)
    
  3. Losee, R.M.: Seven fundamental questions for the science of library classification (1993) 5.19
    5.187669 = sum of:
      5.187669 = weight(author_txt:losee in 4507) [ClassicSimilarity], result of:
        5.187669 = fieldWeight in 4507, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.30027 = idf(docFreq=29, maxDocs=44421)
          0.625 = fieldNorm(doc=4507)
    
  4. Losee, R.M.: Term dependence : truncating the Bahadur Lazarsfeld expansion (1994) 5.19
    5.187669 = sum of:
      5.187669 = weight(author_txt:losee in 7389) [ClassicSimilarity], result of:
        5.187669 = fieldWeight in 7389, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.30027 = idf(docFreq=29, maxDocs=44421)
          0.625 = fieldNorm(doc=7389)
    
  5. Losee, R.M.: Upper bounds for retrieval performance and their use measuring performance and generating optimal queries : can it get any better than this? (1994) 5.19
    5.187669 = sum of:
      5.187669 = weight(author_txt:losee in 7417) [ClassicSimilarity], result of:
        5.187669 = fieldWeight in 7417, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.30027 = idf(docFreq=29, maxDocs=44421)
          0.625 = fieldNorm(doc=7417)
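
The author scores above all follow one pattern: fieldWeight is the product of tf, idf and fieldNorm. The sketch below recomputes that product from the raw counts in the listing. It assumes tf is the square root of the term frequency and idf is 1 + ln(maxDocs / (docFreq + 1)), which is the form that appears to reproduce the figures shown here. The function names are illustrative and are not part of any library.

```python
import math


def classic_idf(doc_freq, max_docs):
    # Assumed idf form: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, with tf assumed to be sqrt(freq).
    tf = math.sqrt(freq)
    return tf * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:losee entries above.
author_score = field_weight(freq=1.0, doc_freq=29, max_docs=44421, field_norm=0.625)
print(f"{author_score:.4f}")
```

Because every author match has the same freq, docFreq and fieldNorm, all five entries receive the same score, which is why the ranking among them carries no information.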
    

Similar documents (content)

  1. Fang, L.; Tuan, L.A.; Hui, S.C.; Wu, L.: Syntactic based approach for grammar question retrieval (2018) 0.13
    0.12777916 = sum of:
      0.12777916 = product of:
        0.79861975 = sum of:
          0.023484211 = weight(abstract_txt:learning in 86) [ClassicSimilarity], result of:
            0.023484211 = score(doc=86,freq=1.0), product of:
              0.07923701 = queryWeight, product of:
                1.2237244 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.013654524 = queryNorm
              0.29637933 = fieldWeight in 86, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=86)
          0.13718073 = weight(abstract_txt:syntactic in 86) [ClassicSimilarity], result of:
            0.13718073 = score(doc=86,freq=5.0), product of:
              0.15029672 = queryWeight, product of:
                1.6853664 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.013654524 = queryNorm
              0.9127327 = fieldWeight in 86, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0625 = fieldNorm(doc=86)
          0.07150567 = weight(abstract_txt:speech in 86) [ClassicSimilarity], result of:
            0.07150567 = score(doc=86,freq=1.0), product of:
              0.1664579 = queryWeight, product of:
                1.7736658 = boost
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.013654524 = queryNorm
              0.4295721 = fieldWeight in 86, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.0625 = fieldNorm(doc=86)
          0.5664491 = weight(abstract_txt:grammatical in 86) [ClassicSimilarity], result of:
            0.5664491 = score(doc=86,freq=4.0), product of:
              0.56555915 = queryWeight, product of:
                5.1692624 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.013654524 = queryNorm
              1.0015736 = fieldWeight in 86, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0625 = fieldNorm(doc=86)
        0.16 = coord(4/25)
    
  2. Marcu, D.: Automatic abstracting and summarization (2009) 0.09
    0.093146905 = sum of:
      0.093146905 = product of:
        0.58216816 = sum of:
          0.026157437 = weight(abstract_txt:document in 735) [ClassicSimilarity], result of:
            0.026157437 = score(doc=735,freq=1.0), product of:
              0.06497506 = queryWeight, product of:
                1.1081357 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.013654524 = queryNorm
              0.40257657 = fieldWeight in 735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.09375 = fieldNorm(doc=735)
          0.0611787 = weight(abstract_txt:algorithms in 735) [ClassicSimilarity], result of:
            0.0611787 = score(doc=735,freq=1.0), product of:
              0.11448539 = queryWeight, product of:
                1.4709388 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.013654524 = queryNorm
              0.53437996 = fieldWeight in 735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.09375 = fieldNorm(doc=735)
          0.069995224 = weight(abstract_txt:abstracts in 735) [ClassicSimilarity], result of:
            0.069995224 = score(doc=735,freq=1.0), product of:
              0.1252359 = queryWeight, product of:
                1.5384521 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.013654524 = queryNorm
              0.55890703 = fieldWeight in 735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.09375 = fieldNorm(doc=735)
          0.4248368 = weight(abstract_txt:grammatical in 735) [ClassicSimilarity], result of:
            0.4248368 = score(doc=735,freq=1.0), product of:
              0.56555915 = queryWeight, product of:
                5.1692624 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.013654524 = queryNorm
              0.7511802 = fieldWeight in 735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.09375 = fieldNorm(doc=735)
        0.16 = coord(4/25)
    
  3. Losee, R.M.: Text windows and phrases differing by discipline, location in document, and syntactic structure (1996) 0.09
    0.09184373 = sum of:
      0.09184373 = product of:
        0.76536447 = sum of:
          0.026157437 = weight(abstract_txt:document in 31) [ClassicSimilarity], result of:
            0.026157437 = score(doc=31,freq=1.0), product of:
              0.06497506 = queryWeight, product of:
                1.1081357 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.013654524 = queryNorm
              0.40257657 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.09375 = fieldNorm(doc=31)
          0.13839705 = weight(abstract_txt:filtering in 31) [ClassicSimilarity], result of:
            0.13839705 = score(doc=31,freq=1.0), product of:
              0.22583863 = queryWeight, product of:
                2.530255 = boost
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.013654524 = queryNorm
              0.6128139 = fieldWeight in 31, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.09375 = fieldNorm(doc=31)
          0.60081 = weight(abstract_txt:grammatical in 31) [ClassicSimilarity], result of:
            0.60081 = score(doc=31,freq=2.0), product of:
              0.56555915 = queryWeight, product of:
                5.1692624 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.013654524 = queryNorm
              1.0623292 = fieldWeight in 31, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.09375 = fieldNorm(doc=31)
        0.12 = coord(3/25)
    
  4. Svenonius, E.: Facets as semantic categories (1979) 0.08
    0.078569405 = sum of:
      0.078569405 = product of:
        0.65474504 = sum of:
          0.13282466 = weight(abstract_txt:syntactic in 1426) [ClassicSimilarity], result of:
            0.13282466 = score(doc=1426,freq=3.0), product of:
              0.15029672 = queryWeight, product of:
                1.6853664 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.013654524 = queryNorm
              0.8837496 = fieldWeight in 1426, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.078125 = fieldNorm(doc=1426)
          0.16788971 = weight(abstract_txt:rules in 1426) [ClassicSimilarity], result of:
            0.16788971 = score(doc=1426,freq=2.0), product of:
              0.29007962 = queryWeight, product of:
                4.055448 = boost
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.013654524 = queryNorm
              0.5787711 = fieldWeight in 1426, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.078125 = fieldNorm(doc=1426)
          0.35403067 = weight(abstract_txt:grammatical in 1426) [ClassicSimilarity], result of:
            0.35403067 = score(doc=1426,freq=1.0), product of:
              0.56555915 = queryWeight, product of:
                5.1692624 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.013654524 = queryNorm
              0.6259835 = fieldWeight in 1426, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.078125 = fieldNorm(doc=1426)
        0.12 = coord(3/25)
    
  5. Brill, E.: ¬An overview of empirical natural language processing (1997) 0.07
    0.06643191 = sum of:
      0.06643191 = product of:
        0.41519946 = sum of:
          0.10252154 = weight(abstract_txt:parsing in 4249) [ClassicSimilarity], result of:
            0.10252154 = score(doc=4249,freq=1.0), product of:
              0.105825625 = queryWeight, product of:
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.013654524 = queryNorm
              0.968778 = fieldWeight in 4249, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.125 = fieldNorm(doc=4249)
          0.046968423 = weight(abstract_txt:learning in 4249) [ClassicSimilarity], result of:
            0.046968423 = score(doc=4249,freq=1.0), product of:
              0.07923701 = queryWeight, product of:
                1.2237244 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.013654524 = queryNorm
              0.59275866 = fieldWeight in 4249, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.125 = fieldNorm(doc=4249)
          0.12269817 = weight(abstract_txt:syntactic in 4249) [ClassicSimilarity], result of:
            0.12269817 = score(doc=4249,freq=1.0), product of:
              0.15029672 = queryWeight, product of:
                1.6853664 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.013654524 = queryNorm
              0.81637293 = fieldWeight in 4249, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.125 = fieldNorm(doc=4249)
          0.14301135 = weight(abstract_txt:speech in 4249) [ClassicSimilarity], result of:
            0.14301135 = score(doc=4249,freq=1.0), product of:
              0.1664579 = queryWeight, product of:
                1.7736658 = boost
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.013654524 = queryNorm
              0.8591442 = fieldWeight in 4249, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.125 = fieldNorm(doc=4249)
        0.16 = coord(4/25)
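
The content scores above combine more factors than the author scores: each matching term contributes queryWeight (boost * idf * queryNorm) times fieldWeight (tf * idf * fieldNorm), and the sum is scaled by coord, the fraction of query terms that matched. The following is a minimal sketch of that arithmetic, checked against the first entry (doc=86). The idf and tf formulas are assumptions chosen because they reproduce the listed figures, and all function and variable names are hypothetical.

```python
import math

MAX_DOCS = 44421          # maxDocs from the listing
QUERY_NORM = 0.013654524  # queryNorm from the listing


def classic_idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed idf form: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, boost, doc_freq, field_norm, query_norm=QUERY_NORM):
    # score = queryWeight * fieldWeight for a single matching term.
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm, total_query_terms):
    # Sum the per-term scores, then scale by coord = matched / total terms.
    matched_sum = sum(term_score(f, b, df, field_norm) for f, b, df in terms)
    coord = len(terms) / total_query_terms
    return matched_sum * coord


# (freq, boost, docFreq) for learning, syntactic, speech, grammatical in doc=86.
fang_terms = [
    (1.0, 1.2237244, 1052),
    (5.0, 1.6853664, 175),
    (1.0, 1.7736658, 124),
    (4.0, 5.1692624, 39),
]
fang_score = document_score(fang_terms, field_norm=0.0625, total_query_terms=25)
print(f"{fang_score:.4f}")
```

The result should agree with the 0.12777916 listed for doc=86 up to floating-point rounding. The term "grammatical" dominates the sum because it has both the largest boost and the largest idf.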