Document (#20874)

Author
Chandrasekar, R.
Srinivas, B.
Title
Automatic induction of rules for text simplification
Source
Knowledge-based systems. 10(1997) no.3, S.183-190
Year
1997
Abstract
Explores methods to automatically transform sentences in order to make them simpler. These methods involve the use of a rule-based system, driven by the syntax of the text in the domain of interest. Hand-crafting rules for every domain is time-consuming and impractical. Describes an algorithm and an implementation by which generalized rules for simplification are automatically induced from annotated training materials using a novel partial parsing technique, which combines constituent structure and dependency information. The algorithm employs example-based generalisations on linguistically motivated structures
Footnote
Contribution to an issue devoted to papers from the International Conference on Knowledge Based Computer systems, 16-18 Dec 1996, Mumbai, India
Theme
Computerlinguistik

Similar documents (content)

  1. Jiang, X.; Tan, A.-H.: CRCTOL: a semantic-based domain ontology learning system (2009) 0.17
    0.1652666 = sum of:
      0.1652666 = product of:
        0.51645815 = sum of:
          0.05840409 = weight(abstract_txt:generalized in 307) [ClassicSimilarity], result of:
            0.05840409 = score(doc=307,freq=1.0), product of:
              0.15112495 = queryWeight, product of:
                1.0363815 = boost
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.02063467 = queryNorm
              0.38646227 = fieldWeight in 307, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
          0.06124932 = weight(abstract_txt:employs in 307) [ClassicSimilarity], result of:
            0.06124932 = score(doc=307,freq=1.0), product of:
              0.1559941 = queryWeight, product of:
                1.0529449 = boost
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.02063467 = queryNorm
              0.39263868 = fieldWeight in 307, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
          0.077042244 = weight(abstract_txt:parsing in 307) [ClassicSimilarity], result of:
            0.077042244 = score(doc=307,freq=1.0), product of:
              0.18177184 = queryWeight, product of:
                1.1366189 = boost
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.02063467 = queryNorm
              0.42384037 = fieldWeight in 307, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
          0.037827305 = weight(abstract_txt:text in 307) [ClassicSimilarity], result of:
            0.037827305 = score(doc=307,freq=3.0), product of:
              0.098828115 = queryWeight, product of:
                1.1852413 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02063467 = queryNorm
              0.38275853 = fieldWeight in 307, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
          0.033298492 = weight(abstract_txt:methods in 307) [ClassicSimilarity], result of:
            0.033298492 = score(doc=307,freq=2.0), product of:
              0.10390993 = queryWeight, product of:
                1.2153324 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.02063467 = queryNorm
              0.32045534 = fieldWeight in 307, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
          0.07000288 = weight(abstract_txt:domain in 307) [ClassicSimilarity], result of:
            0.07000288 = score(doc=307,freq=4.0), product of:
              0.13534471 = queryWeight, product of:
                1.3870343 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.02063467 = queryNorm
              0.5172192 = fieldWeight in 307, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
          0.055715483 = weight(abstract_txt:automatically in 307) [ClassicSimilarity], result of:
            0.055715483 = score(doc=307,freq=1.0), product of:
              0.18451624 = queryWeight, product of:
                1.6195108 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.02063467 = queryNorm
              0.30195436 = fieldWeight in 307, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
          0.12291835 = weight(abstract_txt:algorithm in 307) [ClassicSimilarity], result of:
            0.12291835 = score(doc=307,freq=4.0), product of:
              0.19698894 = queryWeight, product of:
                1.6733526 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.02063467 = queryNorm
              0.62398607 = fieldWeight in 307, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0546875 = fieldNorm(doc=307)
        0.32 = coord(8/25)
    
  2. Finegan-Dollak, C.; Radev, D.R.: Sentence simplification, compression, and disaggregation for summarization of sophisticated documents (2016) 0.16
    0.16063781 = sum of:
      0.16063781 = product of:
        0.80318904 = sum of:
          0.18356968 = weight(abstract_txt:sentences in 4122) [ClassicSimilarity], result of:
            0.18356968 = score(doc=4122,freq=8.0), product of:
              0.1483258 = queryWeight, product of:
                1.0267386 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.02063467 = queryNorm
              1.2376113 = fieldWeight in 4122, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0625 = fieldNorm(doc=4122)
          0.026909247 = weight(abstract_txt:methods in 4122) [ClassicSimilarity], result of:
            0.026909247 = score(doc=4122,freq=1.0), product of:
              0.10390993 = queryWeight, product of:
                1.2153324 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.02063467 = queryNorm
              0.25896704 = fieldWeight in 4122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=4122)
          0.09933303 = weight(abstract_txt:algorithm in 4122) [ClassicSimilarity], result of:
            0.09933303 = score(doc=4122,freq=2.0), product of:
              0.19698894 = queryWeight, product of:
                1.6733526 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.02063467 = queryNorm
              0.5042569 = fieldWeight in 4122, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0625 = fieldNorm(doc=4122)
          0.08156519 = weight(abstract_txt:rules in 4122) [ClassicSimilarity], result of:
            0.08156519 = score(doc=4122,freq=1.0), product of:
              0.24912827 = queryWeight, product of:
                2.3047493 = boost
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.02063467 = queryNorm
              0.32740238 = fieldWeight in 4122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.0625 = fieldNorm(doc=4122)
          0.41181192 = weight(abstract_txt:simplification in 4122) [ClassicSimilarity], result of:
            0.41181192 = score(doc=4122,freq=3.0), product of:
              0.4441008 = queryWeight, product of:
                2.5125072 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.02063467 = queryNorm
              0.9272938 = fieldWeight in 4122, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.0625 = fieldNorm(doc=4122)
        0.2 = coord(5/25)
    
  3. Kauchak, D.; Leroy, G.; Hogue, A.: Measuring text difficulty using parse-tree frequency (2017) 0.11
    0.11065296 = sum of:
      0.11065296 = product of:
        0.5532648 = sum of:
          0.12980337 = weight(abstract_txt:sentences in 4786) [ClassicSimilarity], result of:
            0.12980337 = score(doc=4786,freq=4.0), product of:
              0.1483258 = queryWeight, product of:
                1.0267386 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.02063467 = queryNorm
              0.8751234 = fieldWeight in 4786, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0625 = fieldNorm(doc=4786)
          0.07269391 = weight(abstract_txt:motivated in 4786) [ClassicSimilarity], result of:
            0.07269391 = score(doc=4786,freq=1.0), product of:
              0.15997227 = queryWeight, product of:
                1.0662864 = boost
                7.270651 = idf(docFreq=83, maxDocs=44421)
                0.02063467 = queryNorm
              0.45441568 = fieldWeight in 4786, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.270651 = idf(docFreq=83, maxDocs=44421)
                0.0625 = fieldNorm(doc=4786)
          0.08804829 = weight(abstract_txt:parsing in 4786) [ClassicSimilarity], result of:
            0.08804829 = score(doc=4786,freq=1.0), product of:
              0.18177184 = queryWeight, product of:
                1.1366189 = boost
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.02063467 = queryNorm
              0.484389 = fieldWeight in 4786, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.0625 = fieldNorm(doc=4786)
          0.024959547 = weight(abstract_txt:text in 4786) [ClassicSimilarity], result of:
            0.024959547 = score(doc=4786,freq=1.0), product of:
              0.098828115 = queryWeight, product of:
                1.1852413 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02063467 = queryNorm
              0.25255513 = fieldWeight in 4786, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=4786)
          0.23775972 = weight(abstract_txt:simplification in 4786) [ClassicSimilarity], result of:
            0.23775972 = score(doc=4786,freq=1.0), product of:
              0.4441008 = queryWeight, product of:
                2.5125072 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.02063467 = queryNorm
              0.53537333 = fieldWeight in 4786, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.0625 = fieldNorm(doc=4786)
        0.2 = coord(5/25)
    
  4. Cui, H.; Heidorn, P.B.: ¬The reusability of induced knowledge for the automatic semantic markup of taxonomic descriptions (2007) 0.09
    0.08805957 = sum of:
      0.08805957 = product of:
        0.44029784 = sum of:
          0.14774829 = weight(abstract_txt:induced in 1084) [ClassicSimilarity], result of:
            0.14774829 = score(doc=1084,freq=2.0), product of:
              0.20372814 = queryWeight, product of:
                1.2033087 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.02063467 = queryNorm
              0.7252228 = fieldWeight in 1084, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.0625 = fieldNorm(doc=1084)
          0.026909247 = weight(abstract_txt:methods in 1084) [ClassicSimilarity], result of:
            0.026909247 = score(doc=1084,freq=1.0), product of:
              0.10390993 = queryWeight, product of:
                1.2153324 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.02063467 = queryNorm
              0.25896704 = fieldWeight in 1084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=1084)
          0.04000165 = weight(abstract_txt:domain in 1084) [ClassicSimilarity], result of:
            0.04000165 = score(doc=1084,freq=1.0), product of:
              0.13534471 = queryWeight, product of:
                1.3870343 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.02063467 = queryNorm
              0.29555383 = fieldWeight in 1084, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0625 = fieldNorm(doc=1084)
          0.11028805 = weight(abstract_txt:automatically in 1084) [ClassicSimilarity], result of:
            0.11028805 = score(doc=1084,freq=3.0), product of:
              0.18451624 = queryWeight, product of:
                1.6195108 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.02063467 = queryNorm
              0.5977146 = fieldWeight in 1084, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0625 = fieldNorm(doc=1084)
          0.1153506 = weight(abstract_txt:rules in 1084) [ClassicSimilarity], result of:
            0.1153506 = score(doc=1084,freq=2.0), product of:
              0.24912827 = queryWeight, product of:
                2.3047493 = boost
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.02063467 = queryNorm
              0.4630169 = fieldWeight in 1084, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.0625 = fieldNorm(doc=1084)
        0.2 = coord(5/25)
    
  5. Ku, C.-H.; Leroy, G.: ¬A crime reports analysis system to identify related crimes (2011) 0.08
    0.07736213 = sum of:
      0.07736213 = product of:
        0.38681063 = sum of:
          0.07233951 = weight(abstract_txt:consuming in 629) [ClassicSimilarity], result of:
            0.07233951 = score(doc=629,freq=1.0), product of:
              0.15945192 = queryWeight, product of:
                1.0645509 = boost
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.02063467 = queryNorm
              0.45367602 = fieldWeight in 629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.0625 = fieldNorm(doc=629)
          0.024959547 = weight(abstract_txt:text in 629) [ClassicSimilarity], result of:
            0.024959547 = score(doc=629,freq=1.0), product of:
              0.098828115 = queryWeight, product of:
                1.1852413 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02063467 = queryNorm
              0.25255513 = fieldWeight in 629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=629)
          0.04000165 = weight(abstract_txt:domain in 629) [ClassicSimilarity], result of:
            0.04000165 = score(doc=629,freq=1.0), product of:
              0.13534471 = queryWeight, product of:
                1.3870343 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.02063467 = queryNorm
              0.29555383 = fieldWeight in 629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0625 = fieldNorm(doc=629)
          0.06367484 = weight(abstract_txt:automatically in 629) [ClassicSimilarity], result of:
            0.06367484 = score(doc=629,freq=1.0), product of:
              0.18451624 = queryWeight, product of:
                1.6195108 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.02063467 = queryNorm
              0.3450907 = fieldWeight in 629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0625 = fieldNorm(doc=629)
          0.18583508 = weight(abstract_txt:algorithm in 629) [ClassicSimilarity], result of:
            0.18583508 = score(doc=629,freq=7.0), product of:
              0.19698894 = queryWeight, product of:
                1.6733526 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.02063467 = queryNorm
              0.94337827 = fieldWeight in 629, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0625 = fieldNorm(doc=629)
        0.2 = coord(5/25)