Document (#29541)

Author
Moens, M.F.
Dumortier, J.
Title
Use of a text grammar for generating highlight abstracts of magazine articles
Source
Journal of documentation. 56(2000) no.5, pp.520-539
Year
2000
Abstract
Browsing a database of article abstracts is one way to select and buy relevant magazine articles online. Our research contributes to the design and development of text grammars for abstracting texts in unlimited subject domains. We developed a system that parses texts based on the text grammar of a specific text type and that extracts sentences and statements which are relevant for inclusion in the abstracts. The system employs knowledge of the discourse patterns that are typical of news stories. The results are encouraging and demonstrate the importance of discourse structures in text summarisation.
Content
See also: http://www.emeraldinsight.com/10.1108/EUM0000000007126
Theme
Automatic abstracting
Computational linguistics

Similar documents (author)

  1. Moens, M.F.: Automatic indexing and abstracting of document texts (2000) 5.87
    5.874302 = sum of:
      5.874302 = weight(author_txt:moens in 892) [ClassicSimilarity], result of:
        5.874302 = fieldWeight in 892, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.625 = fieldNorm(doc=892)
    
  2. Moens, M.-F.: Summarizing court decisions (2007) 4.70
    4.6994414 = sum of:
      4.6994414 = weight(author_txt:moens in 1954) [ClassicSimilarity], result of:
        4.6994414 = fieldWeight in 1954, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.5 = fieldNorm(doc=1954)
    
  3. Moens, M.-F.; Dumortier, J.: Text categorization : the assignment of subject descriptors to magazine articles (2000) 4.11
    4.1120114 = sum of:
      4.1120114 = weight(author_txt:moens in 3397) [ClassicSimilarity], result of:
        4.1120114 = fieldWeight in 3397, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.4375 = fieldNorm(doc=3397)
    
  4. Moens, M.-F.; Uyttendaele, C.: Automatic text structuring and categorization as a first step in summarizing legal cases (1997) 4.11
    4.1120114 = sum of:
      4.1120114 = weight(author_txt:moens in 3256) [ClassicSimilarity], result of:
        4.1120114 = fieldWeight in 3256, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.4375 = fieldNorm(doc=3256)
    
  5. Moens, M.-F.; Uyttendaele, C.; Dumortier, J.: Abstracting of legal cases : the potential of clustering based on the selection of representative objects (1999) 3.52
    3.524581 = sum of:
      3.524581 = weight(author_txt:moens in 3944) [ClassicSimilarity], result of:
        3.524581 = fieldWeight in 3944, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.398883 = idf(docFreq=9, maxDocs=44421)
          0.375 = fieldNorm(doc=3944)
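
The author scores above can be recomputed by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity definitions (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))). The function names are illustrative and not part of any library. The fieldNorm values are copied from the listing rather than derived, since they are stored in the index in a lossy encoded form.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """idf as shown in the explain output: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """tf is the square root of the raw term frequency."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, the product shown in each tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# (doc id, fieldNorm) pairs copied from the five author matches listed above.
AUTHOR_MATCHES = [(892, 0.625), (1954, 0.5), (3397, 0.4375), (3256, 0.4375), (3944, 0.375)]

for doc_id, norm in AUTHOR_MATCHES:
    # Every match has termFreq=1.0, docFreq=9 and maxDocs=44421.
    print(doc_id, round(field_weight(1.0, 9, 44421, norm), 4))
```

Because the query has a single term and tf and idf are the same for every match, the ranking among these five records is decided entirely by fieldNorm. The recomputed values should agree with the listed ones up to single-precision rounding.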
    

Similar documents (content)

  1. Moens, M.-F.; Angheluta, R.; Dumortier, J.: Generic technologies for single- and multi-document summarization (2005) 0.22
    0.22476655 = sum of:
      0.22476655 = product of:
        0.7023955 = sum of:
          0.025954576 = weight(abstract_txt:system in 2026) [ClassicSimilarity], result of:
            0.025954576 = score(doc=2026,freq=2.0), product of:
              0.06965007 = queryWeight, product of:
                1.0425311 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.019808207 = queryNorm
              0.37264252 = fieldWeight in 2026, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.078125 = fieldNorm(doc=2026)
          0.14214948 = weight(abstract_txt:sentences in 2026) [ClassicSimilarity], result of:
            0.14214948 = score(doc=2026,freq=3.0), product of:
              0.15004978 = queryWeight, product of:
                1.0820091 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.019808207 = queryNorm
              0.94734883 = fieldWeight in 2026, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.078125 = fieldNorm(doc=2026)
          0.013421494 = weight(abstract_txt:that in 2026) [ClassicSimilarity], result of:
            0.013421494 = score(doc=2026,freq=2.0), product of:
              0.051366094 = queryWeight, product of:
                1.0965089 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.019808207 = queryNorm
              0.2612909 = fieldWeight in 2026, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=2026)
          0.047473367 = weight(abstract_txt:relevant in 2026) [ClassicSimilarity], result of:
            0.047473367 = score(doc=2026,freq=1.0), product of:
              0.13124686 = queryWeight, product of:
                1.431109 = boost
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.019808207 = queryNorm
              0.3617105 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.078125 = fieldNorm(doc=2026)
          0.052235838 = weight(abstract_txt:articles in 2026) [ClassicSimilarity], result of:
            0.052235838 = score(doc=2026,freq=1.0), product of:
              0.13988397 = queryWeight, product of:
                1.477448 = boost
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.019808207 = queryNorm
              0.37342262 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.078125 = fieldNorm(doc=2026)
          0.12121424 = weight(abstract_txt:texts in 2026) [ClassicSimilarity], result of:
            0.12121424 = score(doc=2026,freq=2.0), product of:
              0.19460233 = queryWeight, product of:
                1.7426182 = boost
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.019808207 = queryNorm
              0.62288177 = fieldWeight in 2026, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.078125 = fieldNorm(doc=2026)
          0.22104134 = weight(abstract_txt:magazine in 2026) [ClassicSimilarity], result of:
            0.22104134 = score(doc=2026,freq=1.0), product of:
              0.3659636 = queryWeight, product of:
                2.389721 = boost
                7.731176 = idf(docFreq=52, maxDocs=44421)
                0.019808207 = queryNorm
              0.6039981 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.731176 = idf(docFreq=52, maxDocs=44421)
                0.078125 = fieldNorm(doc=2026)
          0.07890516 = weight(abstract_txt:text in 2026) [ClassicSimilarity], result of:
            0.07890516 = score(doc=2026,freq=1.0), product of:
              0.24994196 = queryWeight, product of:
                3.1226106 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019808207 = queryNorm
              0.3156939 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=2026)
        0.32 = coord(8/25)
    
  2. Moens, M.-F.; Uyttendaele, C.: Automatic text structuring and categorization as a first step in summarizing legal cases (1997) 0.22
    0.21563827 = sum of:
      0.21563827 = product of:
        0.8984928 = sum of:
          0.01835266 = weight(abstract_txt:system in 3256) [ClassicSimilarity], result of:
            0.01835266 = score(doc=3256,freq=1.0), product of:
              0.06965007 = queryWeight, product of:
                1.0425311 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.019808207 = queryNorm
              0.26349807 = fieldWeight in 3256, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.078125 = fieldNorm(doc=3256)
          0.13089491 = weight(abstract_txt:abstracting in 3256) [ClassicSimilarity], result of:
            0.13089491 = score(doc=3256,freq=3.0), product of:
              0.14202137 = queryWeight, product of:
                1.0526648 = boost
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.019808207 = queryNorm
              0.9216565 = fieldWeight in 3256, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.078125 = fieldNorm(doc=3256)
          0.14604943 = weight(abstract_txt:extracts in 3256) [ClassicSimilarity], result of:
            0.14604943 = score(doc=3256,freq=2.0), product of:
              0.17489156 = queryWeight, product of:
                1.1681474 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.019808207 = queryNorm
              0.83508563 = fieldWeight in 3256, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.078125 = fieldNorm(doc=3256)
          0.08222628 = weight(abstract_txt:relevant in 3256) [ClassicSimilarity], result of:
            0.08222628 = score(doc=3256,freq=3.0), product of:
              0.13124686 = queryWeight, product of:
                1.431109 = boost
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.019808207 = queryNorm
              0.6265009 = fieldWeight in 3256, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.078125 = fieldNorm(doc=3256)
          0.29779205 = weight(abstract_txt:grammar in 3256) [ClassicSimilarity], result of:
            0.29779205 = score(doc=3256,freq=2.0), product of:
              0.35431346 = queryWeight, product of:
                2.3513758 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.019808207 = queryNorm
              0.8404762 = fieldWeight in 3256, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.078125 = fieldNorm(doc=3256)
          0.22317748 = weight(abstract_txt:text in 3256) [ClassicSimilarity], result of:
            0.22317748 = score(doc=3256,freq=8.0), product of:
              0.24994196 = queryWeight, product of:
                3.1226106 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019808207 = queryNorm
              0.8929172 = fieldWeight in 3256, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=3256)
        0.24 = coord(6/25)
    
  3. Atanassova, I.; Bertin, M.; Larivière, V.: On the composition of scientific abstracts (2016) 0.21
    0.20747536 = sum of:
      0.20747536 = product of:
        0.7409834 = sum of:
          0.04535132 = weight(abstract_txt:contributes in 4028) [ClassicSimilarity], result of:
            0.04535132 = score(doc=4028,freq=1.0), product of:
              0.1281662 = queryWeight, product of:
                6.470359 = idf(docFreq=186, maxDocs=44421)
                0.019808207 = queryNorm
              0.35384774 = fieldWeight in 4028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.470359 = idf(docFreq=186, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4028)
          0.052900553 = weight(abstract_txt:abstracting in 4028) [ClassicSimilarity], result of:
            0.052900553 = score(doc=4028,freq=1.0), product of:
              0.14202137 = queryWeight, product of:
                1.0526648 = boost
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.019808207 = queryNorm
              0.37248304 = fieldWeight in 4028, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4028)
          0.12845993 = weight(abstract_txt:sentences in 4028) [ClassicSimilarity], result of:
            0.12845993 = score(doc=4028,freq=5.0), product of:
              0.15004978 = queryWeight, product of:
                1.0820091 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.019808207 = queryNorm
              0.85611546 = fieldWeight in 4028, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4028)
          0.017576518 = weight(abstract_txt:that in 4028) [ClassicSimilarity], result of:
            0.017576518 = score(doc=4028,freq=7.0), product of:
              0.051366094 = queryWeight, product of:
                1.0965089 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.019808207 = queryNorm
              0.34218132 = fieldWeight in 4028, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4028)
          0.081762016 = weight(abstract_txt:articles in 4028) [ClassicSimilarity], result of:
            0.081762016 = score(doc=4028,freq=5.0), product of:
              0.13988397 = queryWeight, product of:
                1.477448 = boost
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.019808207 = queryNorm
              0.5844988 = fieldWeight in 4028, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.7798095 = idf(docFreq=1013, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4028)
          0.31926566 = weight(abstract_txt:abstracts in 4028) [ClassicSimilarity], result of:
            0.31926566 = score(doc=4028,freq=9.0), product of:
              0.32641837 = queryWeight, product of:
                2.7641473 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.019808207 = queryNorm
              0.9780873 = fieldWeight in 4028, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4028)
          0.095667414 = weight(abstract_txt:text in 4028) [ClassicSimilarity], result of:
            0.095667414 = score(doc=4028,freq=3.0), product of:
              0.24994196 = queryWeight, product of:
                3.1226106 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019808207 = queryNorm
              0.38275853 = fieldWeight in 4028, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4028)
        0.28 = coord(7/25)
    
  4. Pack, T.: Shortcuts to finding short stories : searching fiction online (1992) 0.18
    0.18362385 = sum of:
      0.18362385 = product of:
        1.147649 = sum of:
          0.026842987 = weight(abstract_txt:that in 4684) [ClassicSimilarity], result of:
            0.026842987 = score(doc=4684,freq=2.0), product of:
              0.051366094 = queryWeight, product of:
                1.0965089 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.019808207 = queryNorm
              0.5225818 = fieldWeight in 4684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.15625 = fieldNorm(doc=4684)
          0.19154425 = weight(abstract_txt:stories in 4684) [ClassicSimilarity], result of:
            0.19154425 = score(doc=4684,freq=1.0), product of:
              0.16631764 = queryWeight, product of:
                1.1391538 = boost
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.019808207 = queryNorm
              1.1516773 = fieldWeight in 4684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.15625 = fieldNorm(doc=4684)
          0.62519926 = weight(abstract_txt:magazine in 4684) [ClassicSimilarity], result of:
            0.62519926 = score(doc=4684,freq=2.0), product of:
              0.3659636 = queryWeight, product of:
                2.389721 = boost
                7.731176 = idf(docFreq=52, maxDocs=44421)
                0.019808207 = queryNorm
              1.7083646 = fieldWeight in 4684, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.731176 = idf(docFreq=52, maxDocs=44421)
                0.15625 = fieldNorm(doc=4684)
          0.30406252 = weight(abstract_txt:abstracts in 4684) [ClassicSimilarity], result of:
            0.30406252 = score(doc=4684,freq=1.0), product of:
              0.32641837 = queryWeight, product of:
                2.7641473 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.019808207 = queryNorm
              0.93151164 = fieldWeight in 4684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.15625 = fieldNorm(doc=4684)
        0.16 = coord(4/25)
    
  5. Moens, M.-F.; Uyttendaele, C.; Dumortier, J.: Abstracting of legal cases : the potential of clustering based on the selection of representative objects (1999) 0.18
    0.17706366 = sum of:
      0.17706366 = product of:
        0.63237023 = sum of:
          0.075572215 = weight(abstract_txt:abstracting in 3944) [ClassicSimilarity], result of:
            0.075572215 = score(doc=3944,freq=1.0), product of:
              0.14202137 = queryWeight, product of:
                1.0526648 = boost
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.019808207 = queryNorm
              0.5321186 = fieldWeight in 3944, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.078125 = fieldNorm(doc=3944)
          0.082070045 = weight(abstract_txt:sentences in 3944) [ClassicSimilarity], result of:
            0.082070045 = score(doc=3944,freq=1.0), product of:
              0.15004978 = queryWeight, product of:
                1.0820091 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.019808207 = queryNorm
              0.5469521 = fieldWeight in 3944, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.078125 = fieldNorm(doc=3944)
          0.0094904285 = weight(abstract_txt:that in 3944) [ClassicSimilarity], result of:
            0.0094904285 = score(doc=3944,freq=1.0), product of:
              0.051366094 = queryWeight, product of:
                1.0965089 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.019808207 = queryNorm
              0.18476056 = fieldWeight in 3944, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=3944)
          0.10327255 = weight(abstract_txt:extracts in 3944) [ClassicSimilarity], result of:
            0.10327255 = score(doc=3944,freq=1.0), product of:
              0.17489156 = queryWeight, product of:
                1.1681474 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.019808207 = queryNorm
              0.59049475 = fieldWeight in 3944, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.078125 = fieldNorm(doc=3944)
          0.047473367 = weight(abstract_txt:relevant in 3944) [ClassicSimilarity], result of:
            0.047473367 = score(doc=3944,freq=1.0), product of:
              0.13124686 = queryWeight, product of:
                1.431109 = boost
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.019808207 = queryNorm
              0.3617105 = fieldWeight in 3944, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6298943 = idf(docFreq=1177, maxDocs=44421)
                0.078125 = fieldNorm(doc=3944)
          0.12121424 = weight(abstract_txt:texts in 3944) [ClassicSimilarity], result of:
            0.12121424 = score(doc=3944,freq=2.0), product of:
              0.19460233 = queryWeight, product of:
                1.7426182 = boost
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.019808207 = queryNorm
              0.62288177 = fieldWeight in 3944, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6376824 = idf(docFreq=429, maxDocs=44421)
                0.078125 = fieldNorm(doc=3944)
          0.19327739 = weight(abstract_txt:text in 3944) [ClassicSimilarity], result of:
            0.19327739 = score(doc=3944,freq=6.0), product of:
              0.24994196 = queryWeight, product of:
                3.1226106 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019808207 = queryNorm
              0.7732891 = fieldWeight in 3944, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=3944)
        0.28 = coord(7/25)
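
The content scores combine several matching terms. As a rough sketch, assuming the same ClassicSimilarity definitions, each term contributes queryWeight * fieldWeight, where queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm. The sum is then scaled by coord, the fraction of the 25 query terms that matched. The helper names below are hypothetical. The boost, docFreq, freq, fieldNorm and queryNorm values are copied from the tree for doc 4684 (entry 4 above).

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.019808207  # copied from the explain output
QUERY_TERMS = 25          # denominator of coord(n/25)


def idf(doc_freq: int) -> float:
    """1 + ln(maxDocs / (docFreq + 1)), matching the idf values listed."""
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


def document_score(matches, field_norm: float) -> float:
    """coord * sum of per-term scores; matches holds (boost, docFreq, freq)."""
    coord = len(matches) / QUERY_TERMS
    return coord * sum(term_score(b, df, f, field_norm) for b, df, f in matches)


# The four matching terms of doc 4684 (fieldNorm 0.15625).
MATCHES_4684 = [
    (1.0965089, 11344, 2.0),  # that
    (1.1391538, 75, 1.0),     # stories
    (2.389721, 52, 2.0),      # magazine
    (2.7641473, 310, 1.0),    # abstracts
]

score_4684 = document_score(MATCHES_4684, 0.15625)
print(round(score_4684, 4))
```

The coord factor explains why doc 4684 ranks fourth even though its raw sum (1.147649) is the largest in the list: only 4 of the 25 query terms match, so the sum is multiplied by 0.16.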