Document (#29541)

Author
Moens, M.F.
Dumortier, J.
Title
Use of a text grammar for generating highlight abstracts of magazine articles
Source
Journal of documentation. 56(2000) no.5, S.520-539
Year
2000
Abstract
Browsing a database of article abstracts is one way to select and buy relevant magazine articles online. Our research contributes to the design and development of text grammars for abstracting texts in unlimited subject domains. We developed a system that parses texts based on the text grammar of a specific text type and that extracts sentences and statements which are relevant for inclusion in the abstracts. The system employs knowledge of the discourse patterns that are typical of news stories. The results are encouraging and demonstrate the importance of discourse structures in text summarisation.
Content
Vgl. auch: http://www.emeraldinsight.com/10.1108/EUM0000000007126
Theme
Automatisches Abstracting
Computerlinguistik

Similar documents (author)

  1. Moens, M.F.: Automatic indexing and abstracting of document texts (2000) 5.87
    5.871439 = sum of:
      5.871439 = weight(author_txt:moens in 6892) [ClassicSimilarity], result of:
        5.871439 = fieldWeight in 6892, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.625 = fieldNorm(doc=6892)
    
  2. Moens, M.-F.: Summarizing court decisions (2007) 4.70
    4.697151 = sum of:
      4.697151 = weight(author_txt:moens in 954) [ClassicSimilarity], result of:
        4.697151 = fieldWeight in 954, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.5 = fieldNorm(doc=954)
    
  3. Moens, M.-F.; Dumortier, J.: Text categorization : the assignment of subject descriptors to magazine articles (2000) 4.11
    4.1100073 = sum of:
      4.1100073 = weight(author_txt:moens in 3329) [ClassicSimilarity], result of:
        4.1100073 = fieldWeight in 3329, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.4375 = fieldNorm(doc=3329)
    
  4. Moens, M.-F.; Uyttendaele, C.: Automatic text structuring and categorization as a first step in summarizing legal cases (1997) 4.11
    4.1100073 = sum of:
      4.1100073 = weight(author_txt:moens in 2256) [ClassicSimilarity], result of:
        4.1100073 = fieldWeight in 2256, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.4375 = fieldNorm(doc=2256)
    
  5. Moens, M.-F.; Uyttendaele, C.; Dumotier, J.: Abstracting of legal cases : the potential of clustering based on the selection of representative objects (1999) 3.52
    3.5228634 = sum of:
      3.5228634 = weight(author_txt:moens in 2944) [ClassicSimilarity], result of:
        3.5228634 = fieldWeight in 2944, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.394302 = idf(docFreq=9, maxDocs=44218)
          0.375 = fieldNorm(doc=2944)
    

Similar documents (content)

  1. Moens, M.-F.; Angheluta, R.; Dumortier, J.: Generic technologies for single-and multi-document summarization (2005) 0.22
    0.22482479 = sum of:
      0.22482479 = product of:
        0.7025775 = sum of:
          0.025917562 = weight(abstract_txt:system in 1026) [ClassicSimilarity], result of:
            0.025917562 = score(doc=1026,freq=2.0), product of:
              0.06956036 = queryWeight, product of:
                1.0378904 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.019873895 = queryNorm
              0.372591 = fieldWeight in 1026, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=1026)
          0.14172715 = weight(abstract_txt:sentences in 1026) [ClassicSimilarity], result of:
            0.14172715 = score(doc=1026,freq=3.0), product of:
              0.1497019 = queryWeight, product of:
                1.0766369 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.019873895 = queryNorm
              0.9467291 = fieldWeight in 1026, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.078125 = fieldNorm(doc=1026)
          0.013485279 = weight(abstract_txt:that in 1026) [ClassicSimilarity], result of:
            0.013485279 = score(doc=1026,freq=2.0), product of:
              0.05151133 = queryWeight, product of:
                1.0938748 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019873895 = queryNorm
              0.26179248 = fieldWeight in 1026, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=1026)
          0.04759944 = weight(abstract_txt:relevant in 1026) [ClassicSimilarity], result of:
            0.04759944 = score(doc=1026,freq=1.0), product of:
              0.13143477 = queryWeight, product of:
                1.4266773 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.019873895 = queryNorm
              0.36215258 = fieldWeight in 1026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.078125 = fieldNorm(doc=1026)
          0.052259896 = weight(abstract_txt:articles in 1026) [ClassicSimilarity], result of:
            0.052259896 = score(doc=1026,freq=1.0), product of:
              0.13987972 = queryWeight, product of:
                1.4717972 = boost
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.019873895 = queryNorm
              0.37360597 = fieldWeight in 1026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.078125 = fieldNorm(doc=1026)
          0.122162595 = weight(abstract_txt:texts in 1026) [ClassicSimilarity], result of:
            0.122162595 = score(doc=1026,freq=2.0), product of:
              0.19555002 = queryWeight, product of:
                1.7402016 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.019873895 = queryNorm
              0.62471277 = fieldWeight in 1026, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.078125 = fieldNorm(doc=1026)
          0.22042546 = weight(abstract_txt:magazine in 1026) [ClassicSimilarity], result of:
            0.22042546 = score(doc=1026,freq=1.0), product of:
              0.3651603 = queryWeight, product of:
                2.378003 = boost
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.019873895 = queryNorm
              0.60364026 = fieldWeight in 1026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.078125 = fieldNorm(doc=1026)
          0.079000115 = weight(abstract_txt:text in 1026) [ClassicSimilarity], result of:
            0.079000115 = score(doc=1026,freq=1.0), product of:
              0.25005805 = queryWeight, product of:
                3.1114373 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019873895 = queryNorm
              0.3159271 = fieldWeight in 1026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=1026)
        0.32 = coord(8/25)
    
  2. Moens, M.-F.; Uyttendaele, C.: Automatic text structuring and categorization as a first step in summarizing legal cases (1997) 0.22
    0.21535353 = sum of:
      0.21535353 = product of:
        0.8973064 = sum of:
          0.018326482 = weight(abstract_txt:system in 2256) [ClassicSimilarity], result of:
            0.018326482 = score(doc=2256,freq=1.0), product of:
              0.06956036 = queryWeight, product of:
                1.0378904 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.019873895 = queryNorm
              0.2634616 = fieldWeight in 2256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.078125 = fieldNorm(doc=2256)
          0.13049884 = weight(abstract_txt:abstracting in 2256) [ClassicSimilarity], result of:
            0.13049884 = score(doc=2256,freq=3.0), product of:
              0.14168692 = queryWeight, product of:
                1.0474191 = boost
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.019873895 = queryNorm
              0.92103666 = fieldWeight in 2256, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.078125 = fieldNorm(doc=2256)
          0.14563656 = weight(abstract_txt:extracts in 2256) [ClassicSimilarity], result of:
            0.14563656 = score(doc=2256,freq=2.0), product of:
              0.17450291 = queryWeight, product of:
                1.1624036 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.019873895 = queryNorm
              0.8345796 = fieldWeight in 2256, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.078125 = fieldNorm(doc=2256)
          0.082444645 = weight(abstract_txt:relevant in 2256) [ClassicSimilarity], result of:
            0.082444645 = score(doc=2256,freq=3.0), product of:
              0.13143477 = queryWeight, product of:
                1.4266773 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.019873895 = queryNorm
              0.62726665 = fieldWeight in 2256, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.078125 = fieldNorm(doc=2256)
          0.2969538 = weight(abstract_txt:grammar in 2256) [ClassicSimilarity], result of:
            0.2969538 = score(doc=2256,freq=2.0), product of:
              0.35352895 = queryWeight, product of:
                2.3398235 = boost
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.019873895 = queryNorm
              0.83997023 = fieldWeight in 2256, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.602543 = idf(docFreq=59, maxDocs=44218)
                0.078125 = fieldNorm(doc=2256)
          0.22344606 = weight(abstract_txt:text in 2256) [ClassicSimilarity], result of:
            0.22344606 = score(doc=2256,freq=8.0), product of:
              0.25005805 = queryWeight, product of:
                3.1114373 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019873895 = queryNorm
              0.89357674 = fieldWeight in 2256, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=2256)
        0.24 = coord(6/25)
    
  3. Atanassova, I.; Bertin, M.; Larivière, V.: On the composition of scientific abstracts (2016) 0.21
    0.20760228 = sum of:
      0.20760228 = product of:
        0.7414367 = sum of:
          0.046132654 = weight(abstract_txt:contributes in 3028) [ClassicSimilarity], result of:
            0.046132654 = score(doc=3028,freq=1.0), product of:
              0.12959035 = queryWeight, product of:
                1.0017098 = boost
                6.5095015 = idf(docFreq=178, maxDocs=44218)
                0.019873895 = queryNorm
              0.35598835 = fieldWeight in 3028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5095015 = idf(docFreq=178, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3028)
          0.052740484 = weight(abstract_txt:abstracting in 3028) [ClassicSimilarity], result of:
            0.052740484 = score(doc=3028,freq=1.0), product of:
              0.14168692 = queryWeight, product of:
                1.0474191 = boost
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.019873895 = queryNorm
              0.37223256 = fieldWeight in 3028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3028)
          0.12807827 = weight(abstract_txt:sentences in 3028) [ClassicSimilarity], result of:
            0.12807827 = score(doc=3028,freq=5.0), product of:
              0.1497019 = queryWeight, product of:
                1.0766369 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.019873895 = queryNorm
              0.8555554 = fieldWeight in 3028, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3028)
          0.017660052 = weight(abstract_txt:that in 3028) [ClassicSimilarity], result of:
            0.017660052 = score(doc=3028,freq=7.0), product of:
              0.05151133 = queryWeight, product of:
                1.0938748 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019873895 = queryNorm
              0.3428382 = fieldWeight in 3028, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3028)
          0.08179968 = weight(abstract_txt:articles in 3028) [ClassicSimilarity], result of:
            0.08179968 = score(doc=3028,freq=5.0), product of:
              0.13987972 = queryWeight, product of:
                1.4717972 = boost
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.019873895 = queryNorm
              0.5847858 = fieldWeight in 3028, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7821565 = idf(docFreq=1006, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3028)
          0.319243 = weight(abstract_txt:abstracts in 3028) [ClassicSimilarity], result of:
            0.319243 = score(doc=3028,freq=9.0), product of:
              0.32629284 = queryWeight, product of:
                2.7530875 = boost
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.019873895 = queryNorm
              0.97839415 = fieldWeight in 3028, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3028)
          0.09578255 = weight(abstract_txt:text in 3028) [ClassicSimilarity], result of:
            0.09578255 = score(doc=3028,freq=3.0), product of:
              0.25005805 = queryWeight, product of:
                3.1114373 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019873895 = queryNorm
              0.38304123 = fieldWeight in 3028, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0546875 = fieldNorm(doc=3028)
        0.28 = coord(7/25)
    
  4. Pack, T.: Shortcuts to finding short stories : searching fiction online (1992) 0.18
    0.1834392 = sum of:
      0.1834392 = product of:
        1.146495 = sum of:
          0.026970558 = weight(abstract_txt:that in 4685) [ClassicSimilarity], result of:
            0.026970558 = score(doc=4685,freq=2.0), product of:
              0.05151133 = queryWeight, product of:
                1.0938748 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019873895 = queryNorm
              0.52358496 = fieldWeight in 4685, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.15625 = fieldNorm(doc=4685)
          0.1920261 = weight(abstract_txt:stories in 4685) [ClassicSimilarity], result of:
            0.1920261 = score(doc=4685,freq=1.0), product of:
              0.16654025 = queryWeight, product of:
                1.1355734 = boost
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.019873895 = queryNorm
              1.1530311 = fieldWeight in 4685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.15625 = fieldNorm(doc=4685)
          0.6234573 = weight(abstract_txt:magazine in 4685) [ClassicSimilarity], result of:
            0.6234573 = score(doc=4685,freq=2.0), product of:
              0.3651603 = queryWeight, product of:
                2.378003 = boost
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.019873895 = queryNorm
              1.7073524 = fieldWeight in 4685, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.7265954 = idf(docFreq=52, maxDocs=44218)
                0.15625 = fieldNorm(doc=4685)
          0.304041 = weight(abstract_txt:abstracts in 4685) [ClassicSimilarity], result of:
            0.304041 = score(doc=4685,freq=1.0), product of:
              0.32629284 = queryWeight, product of:
                2.7530875 = boost
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.019873895 = queryNorm
              0.93180406 = fieldWeight in 4685, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.963546 = idf(docFreq=308, maxDocs=44218)
                0.15625 = fieldNorm(doc=4685)
        0.16 = coord(4/25)
    
  5. Moens, M.-F.; Uyttendaele, C.; Dumotier, J.: Abstracting of legal cases : the potential of clustering based on the selection of representative objects (1999) 0.18
    0.17722823 = sum of:
      0.17722823 = product of:
        0.63295794 = sum of:
          0.07534355 = weight(abstract_txt:abstracting in 2944) [ClassicSimilarity], result of:
            0.07534355 = score(doc=2944,freq=1.0), product of:
              0.14168692 = queryWeight, product of:
                1.0474191 = boost
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.019873895 = queryNorm
              0.5317608 = fieldWeight in 2944, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.078125 = fieldNorm(doc=2944)
          0.0818262 = weight(abstract_txt:sentences in 2944) [ClassicSimilarity], result of:
            0.0818262 = score(doc=2944,freq=1.0), product of:
              0.1497019 = queryWeight, product of:
                1.0766369 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.019873895 = queryNorm
              0.5465943 = fieldWeight in 2944, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.078125 = fieldNorm(doc=2944)
          0.009535532 = weight(abstract_txt:that in 2944) [ClassicSimilarity], result of:
            0.009535532 = score(doc=2944,freq=1.0), product of:
              0.05151133 = queryWeight, product of:
                1.0938748 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.019873895 = queryNorm
              0.18511525 = fieldWeight in 2944, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=2944)
          0.102980606 = weight(abstract_txt:extracts in 2944) [ClassicSimilarity], result of:
            0.102980606 = score(doc=2944,freq=1.0), product of:
              0.17450291 = queryWeight, product of:
                1.1624036 = boost
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.019873895 = queryNorm
              0.5901369 = fieldWeight in 2944, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5537524 = idf(docFreq=62, maxDocs=44218)
                0.078125 = fieldNorm(doc=2944)
          0.04759944 = weight(abstract_txt:relevant in 2944) [ClassicSimilarity], result of:
            0.04759944 = score(doc=2944,freq=1.0), product of:
              0.13143477 = queryWeight, product of:
                1.4266773 = boost
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.019873895 = queryNorm
              0.36215258 = fieldWeight in 2944, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.635553 = idf(docFreq=1165, maxDocs=44218)
                0.078125 = fieldNorm(doc=2944)
          0.122162595 = weight(abstract_txt:texts in 2944) [ClassicSimilarity], result of:
            0.122162595 = score(doc=2944,freq=2.0), product of:
              0.19555002 = queryWeight, product of:
                1.7402016 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.019873895 = queryNorm
              0.62471277 = fieldWeight in 2944, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.078125 = fieldNorm(doc=2944)
          0.19350997 = weight(abstract_txt:text in 2944) [ClassicSimilarity], result of:
            0.19350997 = score(doc=2944,freq=6.0), product of:
              0.25005805 = queryWeight, product of:
                3.1114373 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.019873895 = queryNorm
              0.77386016 = fieldWeight in 2944, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=2944)
        0.28 = coord(7/25)