Document (#32951)

Author
Dorr, B.J.
Gaasterland, T.
Title
Exploiting aspectual features and connecting words for summarization-inspired temporal-relation extraction
Source
Information processing and management. 43(2007) no.6, S.1681-1704
Year
2007
Abstract
This paper presents a model that incorporates contemporary theories of tense and aspect and develops a new framework for extracting temporal relations between two sentence-internal events, given their tense, aspect, and a temporal connecting word relating the two events. A linguistic constraint on event combination has been implemented to detect incorrect parser analyses and potentially apply syntactic reanalysis or semantic reinterpretation - in preparation for subsequent processing for multi-document summarization. An important contribution of this work is the extension of two different existing theoretical frameworks - Hornstein's 1990 theory of tense analysis and Allen's 1984 theory on event ordering - and the combination of both into a unified system for representing and constraining combinations of different event types (points, closed intervals, and open-ended intervals). We show that our theoretical results have been verified in a large-scale corpus analysis. The framework is designed to inform a temporally motivated sentence-ordering module in an implemented multi-document summarization system.
Theme
Automatisches Abstracting

Similar documents (author)

  1. Dorr, B.J.: Large-scale dictionary construction for foreign language tutoring and interlingual machine translation (1997) 5.81
    5.814733 = sum of:
      5.814733 = weight(author_txt:dorr in 4244) [ClassicSimilarity], result of:
        5.814733 = fieldWeight in 4244, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.625 = fieldNorm(doc=4244)
    
  2. Dorr, B.J.; Olsen, M.B.: Multilingual generation : the role of telicity in lexical choice and syntactic realization (1996) 4.65
    4.6517863 = sum of:
      4.6517863 = weight(author_txt:dorr in 536) [ClassicSimilarity], result of:
        4.6517863 = fieldWeight in 536, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.5 = fieldNorm(doc=536)
    
  3. Oard, D.W.; Dorr, B.J.: Evaluating cross-laguage text filtering effectiveness (1998) 4.65
    4.6517863 = sum of:
      4.6517863 = weight(author_txt:dorr in 214) [ClassicSimilarity], result of:
        4.6517863 = fieldWeight in 214, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.5 = fieldNorm(doc=214)
    
  4. Schacter, J.; Chung, G.K.W.K.; Dorr, A.: Children's Internet searching on complex problems : performance and process analyses (1998) 3.49
    3.4888396 = sum of:
      3.4888396 = weight(author_txt:dorr in 4552) [ClassicSimilarity], result of:
        3.4888396 = fieldWeight in 4552, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.375 = fieldNorm(doc=4552)
    
  5. Zajic, D.M.; Dorr, B.J.; Lin, J.: Single-document and multi-document summarization techniques for email threads using sentence compression (2008) 3.49
    3.4888396 = sum of:
      3.4888396 = weight(author_txt:dorr in 3105) [ClassicSimilarity], result of:
        3.4888396 = fieldWeight in 3105, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.303573 = idf(docFreq=10, maxDocs=44421)
          0.375 = fieldNorm(doc=3105)
    

Similar documents (content)

  1. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.35
    0.3485676 = sum of:
      0.3485676 = product of:
        1.0892738 = sum of:
          0.044419583 = weight(abstract_txt:document in 782) [ClassicSimilarity], result of:
            0.044419583 = score(doc=782,freq=4.0), product of:
              0.08275367 = queryWeight, product of:
                1.1289814 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.017069599 = queryNorm
              0.53676873 = fieldWeight in 782, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.07445938 = weight(abstract_txt:framework in 782) [ClassicSimilarity], result of:
            0.07445938 = score(doc=782,freq=8.0), product of:
              0.09268452 = queryWeight, product of:
                1.1948042 = boost
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.017069599 = queryNorm
              0.8033637 = fieldWeight in 782, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.07606439 = weight(abstract_txt:implemented in 782) [ClassicSimilarity], result of:
            0.07606439 = score(doc=782,freq=2.0), product of:
              0.14923427 = queryWeight, product of:
                1.5160984 = boost
                5.7665734 = idf(docFreq=377, maxDocs=44421)
                0.017069599 = queryNorm
              0.5096979 = fieldWeight in 782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7665734 = idf(docFreq=377, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.10178994 = weight(abstract_txt:multi in 782) [ClassicSimilarity], result of:
            0.10178994 = score(doc=782,freq=3.0), product of:
              0.1583144 = queryWeight, product of:
                1.5615408 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.017069599 = queryNorm
              0.64296067 = fieldWeight in 782, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.06920801 = weight(abstract_txt:events in 782) [ClassicSimilarity], result of:
            0.06920801 = score(doc=782,freq=1.0), product of:
              0.17654762 = queryWeight, product of:
                1.6490129 = boost
                6.272122 = idf(docFreq=227, maxDocs=44421)
                0.017069599 = queryNorm
              0.39200762 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.272122 = idf(docFreq=227, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.12746549 = weight(abstract_txt:sentence in 782) [ClassicSimilarity], result of:
            0.12746549 = score(doc=782,freq=2.0), product of:
              0.2105439 = queryWeight, product of:
                1.8007958 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.017069599 = queryNorm
              0.60541046 = fieldWeight in 782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.38049227 = weight(abstract_txt:event in 782) [ClassicSimilarity], result of:
            0.38049227 = score(doc=782,freq=7.0), product of:
              0.32909346 = queryWeight, product of:
                2.757391 = boost
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.017069599 = queryNorm
              1.156183 = fieldWeight in 782, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.21537481 = weight(abstract_txt:summarization in 782) [ClassicSimilarity], result of:
            0.21537481 = score(doc=782,freq=2.0), product of:
              0.341907 = queryWeight, product of:
                2.8105593 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.017069599 = queryNorm
              0.6299222 = fieldWeight in 782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
        0.32 = coord(8/25)
    
  2. Zajic, D.; Dorr, B.J.; Lin, J.; Schwartz, R.: Multi-candidate reduction : sentence compression as a tool for document summarization tasks (2007) 0.24
    0.24244723 = sum of:
      0.24244723 = product of:
        1.0101968 = sum of:
          0.06662938 = weight(abstract_txt:document in 1944) [ClassicSimilarity], result of:
            0.06662938 = score(doc=1944,freq=4.0), product of:
              0.08275367 = queryWeight, product of:
                1.1289814 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.017069599 = queryNorm
              0.80515313 = fieldWeight in 1944, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.09375 = fieldNorm(doc=1944)
          0.05584453 = weight(abstract_txt:framework in 1944) [ClassicSimilarity], result of:
            0.05584453 = score(doc=1944,freq=2.0), product of:
              0.09268452 = queryWeight, product of:
                1.1948042 = boost
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.017069599 = queryNorm
              0.60252273 = fieldWeight in 1944, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.09375 = fieldNorm(doc=1944)
          0.08157965 = weight(abstract_txt:combination in 1944) [ClassicSimilarity], result of:
            0.08157965 = score(doc=1944,freq=1.0), product of:
              0.1503435 = queryWeight, product of:
                1.5217224 = boost
                5.787965 = idf(docFreq=369, maxDocs=44421)
                0.017069599 = queryNorm
              0.54262173 = fieldWeight in 1944, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.787965 = idf(docFreq=369, maxDocs=44421)
                0.09375 = fieldNorm(doc=1944)
          0.17630534 = weight(abstract_txt:multi in 1944) [ClassicSimilarity], result of:
            0.17630534 = score(doc=1944,freq=4.0), product of:
              0.1583144 = queryWeight, product of:
                1.5615408 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.017069599 = queryNorm
              1.1136405 = fieldWeight in 1944, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.09375 = fieldNorm(doc=1944)
          0.23416904 = weight(abstract_txt:sentence in 1944) [ClassicSimilarity], result of:
            0.23416904 = score(doc=1944,freq=3.0), product of:
              0.2105439 = queryWeight, product of:
                1.8007958 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.017069599 = queryNorm
              1.11221 = fieldWeight in 1944, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.09375 = fieldNorm(doc=1944)
          0.39566883 = weight(abstract_txt:summarization in 1944) [ClassicSimilarity], result of:
            0.39566883 = score(doc=1944,freq=3.0), product of:
              0.341907 = queryWeight, product of:
                2.8105593 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.017069599 = queryNorm
              1.1572411 = fieldWeight in 1944, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.09375 = fieldNorm(doc=1944)
        0.24 = coord(6/25)
    
  3. Zajic, D.M.; Dorr, B.J.; Lin, J.: Single-document and multi-document summarization techniques for email threads using sentence compression (2008) 0.21
    0.21020511 = sum of:
      0.21020511 = product of:
        0.8758546 = sum of:
          0.039261736 = weight(abstract_txt:document in 3105) [ClassicSimilarity], result of:
            0.039261736 = score(doc=3105,freq=2.0), product of:
              0.08275367 = queryWeight, product of:
                1.1289814 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.017069599 = queryNorm
              0.47444102 = fieldWeight in 3105, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=3105)
          0.032906707 = weight(abstract_txt:framework in 3105) [ClassicSimilarity], result of:
            0.032906707 = score(doc=3105,freq=1.0), product of:
              0.09268452 = queryWeight, product of:
                1.1948042 = boost
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.017069599 = queryNorm
              0.35503995 = fieldWeight in 3105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.078125 = fieldNorm(doc=3105)
          0.06723206 = weight(abstract_txt:implemented in 3105) [ClassicSimilarity], result of:
            0.06723206 = score(doc=3105,freq=1.0), product of:
              0.14923427 = queryWeight, product of:
                1.5160984 = boost
                5.7665734 = idf(docFreq=377, maxDocs=44421)
                0.017069599 = queryNorm
              0.45051354 = fieldWeight in 3105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7665734 = idf(docFreq=377, maxDocs=44421)
                0.078125 = fieldNorm(doc=3105)
          0.073460564 = weight(abstract_txt:multi in 3105) [ClassicSimilarity], result of:
            0.073460564 = score(doc=3105,freq=1.0), product of:
              0.1583144 = queryWeight, product of:
                1.5615408 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.017069599 = queryNorm
              0.4640169 = fieldWeight in 3105, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.078125 = fieldNorm(doc=3105)
          0.15933186 = weight(abstract_txt:sentence in 3105) [ClassicSimilarity], result of:
            0.15933186 = score(doc=3105,freq=2.0), product of:
              0.2105439 = queryWeight, product of:
                1.8007958 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.017069599 = queryNorm
              0.7567631 = fieldWeight in 3105, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.078125 = fieldNorm(doc=3105)
          0.5036617 = weight(abstract_txt:summarization in 3105) [ClassicSimilarity], result of:
            0.5036617 = score(doc=3105,freq=7.0), product of:
              0.341907 = queryWeight, product of:
                2.8105593 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.017069599 = queryNorm
              1.4730957 = fieldWeight in 3105, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.078125 = fieldNorm(doc=3105)
        0.24 = coord(6/25)
    
  4. Kar, M.; Nunes, S.; Ribeiro, C.: Summarization of changes in dynamic text collections using Latent Dirichlet Allocation model (2015) 0.14
    0.13614334 = sum of:
      0.13614334 = product of:
        0.6807167 = sum of:
          0.037246954 = weight(abstract_txt:document in 3676) [ClassicSimilarity], result of:
            0.037246954 = score(doc=3676,freq=5.0), product of:
              0.08275367 = queryWeight, product of:
                1.1289814 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.017069599 = queryNorm
              0.45009425 = fieldWeight in 3676, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.046875 = fieldNorm(doc=3676)
          0.13429132 = weight(abstract_txt:intervals in 3676) [ClassicSimilarity], result of:
            0.13429132 = score(doc=3676,freq=1.0), product of:
              0.33272243 = queryWeight, product of:
                2.2637796 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.017069599 = queryNorm
              0.4036137 = fieldWeight in 3676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.046875 = fieldNorm(doc=3676)
          0.14591588 = weight(abstract_txt:temporal in 3676) [ClassicSimilarity], result of:
            0.14591588 = score(doc=3676,freq=2.0), product of:
              0.319501 = queryWeight, product of:
                2.7169077 = boost
                6.889283 = idf(docFreq=122, maxDocs=44421)
                0.017069599 = queryNorm
              0.45669928 = fieldWeight in 3676, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.889283 = idf(docFreq=122, maxDocs=44421)
                0.046875 = fieldNorm(doc=3676)
          0.10785942 = weight(abstract_txt:event in 3676) [ClassicSimilarity], result of:
            0.10785942 = score(doc=3676,freq=1.0), product of:
              0.32909346 = queryWeight, product of:
                2.757391 = boost
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.017069599 = queryNorm
              0.32774708 = fieldWeight in 3676, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.046875 = fieldNorm(doc=3676)
          0.2554031 = weight(abstract_txt:summarization in 3676) [ClassicSimilarity], result of:
            0.2554031 = score(doc=3676,freq=5.0), product of:
              0.341907 = queryWeight, product of:
                2.8105593 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.017069599 = queryNorm
              0.74699587 = fieldWeight in 3676, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.046875 = fieldNorm(doc=3676)
        0.2 = coord(5/25)
    
  5. Xiong, S.; Ji, D.: Query-focused multi-document summarization using hypergraph-based ranking (2016) 0.12
    0.11640598 = sum of:
      0.11640598 = product of:
        0.5820299 = sum of:
          0.039261736 = weight(abstract_txt:document in 3972) [ClassicSimilarity], result of:
            0.039261736 = score(doc=3972,freq=2.0), product of:
              0.08275367 = queryWeight, product of:
                1.1289814 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.017069599 = queryNorm
              0.47444102 = fieldWeight in 3972, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=3972)
          0.05699609 = weight(abstract_txt:framework in 3972) [ClassicSimilarity], result of:
            0.05699609 = score(doc=3972,freq=3.0), product of:
              0.09268452 = queryWeight, product of:
                1.1948042 = boost
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.017069599 = queryNorm
              0.6149472 = fieldWeight in 3972, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.078125 = fieldNorm(doc=3972)
          0.10388892 = weight(abstract_txt:multi in 3972) [ClassicSimilarity], result of:
            0.10388892 = score(doc=3972,freq=2.0), product of:
              0.1583144 = queryWeight, product of:
                1.5615408 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.017069599 = queryNorm
              0.656219 = fieldWeight in 3972, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.078125 = fieldNorm(doc=3972)
          0.112664625 = weight(abstract_txt:sentence in 3972) [ClassicSimilarity], result of:
            0.112664625 = score(doc=3972,freq=1.0), product of:
              0.2105439 = queryWeight, product of:
                1.8007958 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.017069599 = queryNorm
              0.53511226 = fieldWeight in 3972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.078125 = fieldNorm(doc=3972)
          0.2692185 = weight(abstract_txt:summarization in 3972) [ClassicSimilarity], result of:
            0.2692185 = score(doc=3972,freq=2.0), product of:
              0.341907 = queryWeight, product of:
                2.8105593 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.017069599 = queryNorm
              0.78740275 = fieldWeight in 3972, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.078125 = fieldNorm(doc=3972)
        0.2 = coord(5/25)