Document (#38519)

Author
Moreno, J.M.T.
Title
Automatic text summarization
Imprint
Hoboken : Wiley
Year
2014
Pages
320 S
Isbn
978-1-84821-668-6
Abstract
This new textbook examines the motivations and the different algorithms for automatic document summarization (ADS). We performed a recent state of the art. The book shows the main problems of ADS, difficulties and the solutions provided by the community. It presents recent advances in ADS, as well as current applications and trends. The approaches are statistical, linguistic and symbolic. Several exemples are included in order to clarify the theoretical concepts. The books currently available in the area of Automatic Document Summarization are not recent. Powerful algorithms have been developed in recent years that include several applications of ADS. The development of recent technology has impacted on the development of algorithms and their applications. The massive use of social networks and the new forms of the technology requires the adaptation of the classical methods of text summarizers. This is a new textbook on Automatic Text Summarization, based on teaching materials used in two or one-semester courses. It presents a extensive state-of-art and describes the new systems on the subject. Previous automatic summarization books have been either collections of specialized papers, or else authored books with only a chapter or two devoted to the field as a whole. In other hand, the classic books on the subject are not recent.
Content
Automatic Text Summarization Some Important Concepts 23 Single document Summarization 53 Guided Multi-Document Summarization 109 Emerging systems 151 Source and DomainSpecific Summarization 179 Text Abstracting 219 Evaluating Document Summaries 243 Conclusion 275 Information Retrieval NLP and Automatic Text Summarization 281 Automatic Text Summarization Resources 305
Theme
Automatisches Indexieren
DDC
025.4
LCC
P98.5 .A87

Similar documents (author)

  1. Moreno, R.B. -> Bailón-Moreno, R.: 5.49
    5.486953 = sum of:
      5.486953 = weight(author_txt:moreno in 7608) [ClassicSimilarity], result of:
        5.486953 = fieldWeight in 7608, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.868255 = idf(docFreq=16, maxDocs=44421)
          0.4375 = fieldNorm(doc=7608)
    
  2. Moreno, R.R. -> Bailón-Moreno, R.: 5.49
    5.486953 = sum of:
      5.486953 = weight(author_txt:moreno in 1055) [ClassicSimilarity], result of:
        5.486953 = fieldWeight in 1055, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.868255 = idf(docFreq=16, maxDocs=44421)
          0.4375 = fieldNorm(doc=1055)
    
  3. Schneider, J. Moreno => Moreno Schneider, J.: 4.70
    4.703102 = sum of:
      4.703102 = weight(author_txt:moreno in 1196) [ClassicSimilarity], result of:
        4.703102 = fieldWeight in 1196, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.868255 = idf(docFreq=16, maxDocs=44421)
          0.375 = fieldNorm(doc=1196)
    
  4. Fernandez, F.S.; Moreno, A.G.: History of information science in Spain : a selected bibliography (1997) 4.43
    4.4341273 = sum of:
      4.4341273 = weight(author_txt:moreno in 1052) [ClassicSimilarity], result of:
        4.4341273 = fieldWeight in 1052, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.868255 = idf(docFreq=16, maxDocs=44421)
          0.5 = fieldNorm(doc=1052)
    
  5. Moreno, N.; Vallecillo, A.: Towards interoperable Web engineering methods (2008) 4.43
    4.4341273 = sum of:
      4.4341273 = weight(author_txt:moreno in 2860) [ClassicSimilarity], result of:
        4.4341273 = fieldWeight in 2860, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.868255 = idf(docFreq=16, maxDocs=44421)
          0.5 = fieldNorm(doc=2860)
    

Similar documents (content)

  1. Sankarasubramaniam, Y.; Ramanathan, K.; Ghosh, S.: Text summarization using Wikipedia (2014) 0.27
    0.2732173 = sum of:
      0.2732173 = product of:
        0.9757761 = sum of:
          0.019105518 = weight(abstract_txt:been in 3693) [ClassicSimilarity], result of:
            0.019105518 = score(doc=3693,freq=2.0), product of:
              0.059802942 = queryWeight, product of:
                1.0186839 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.016242087 = queryNorm
              0.31947455 = fieldWeight in 3693, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.0625 = fieldNorm(doc=3693)
          0.02265435 = weight(abstract_txt:document in 3693) [ClassicSimilarity], result of:
            0.02265435 = score(doc=3693,freq=1.0), product of:
              0.0844101 = queryWeight, product of:
                1.2102509 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.016242087 = queryNorm
              0.26838437 = fieldWeight in 3693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=3693)
          0.038072873 = weight(abstract_txt:several in 3693) [ClassicSimilarity], result of:
            0.038072873 = score(doc=3693,freq=2.0), product of:
              0.09470228 = queryWeight, product of:
                1.2819126 = boost
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.016242087 = queryNorm
              0.40202698 = fieldWeight in 3693, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.0625 = fieldNorm(doc=3693)
          0.049045645 = weight(abstract_txt:text in 3693) [ClassicSimilarity], result of:
            0.049045645 = score(doc=3693,freq=3.0), product of:
              0.11212014 = queryWeight, product of:
                1.7083058 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016242087 = queryNorm
              0.4374383 = fieldWeight in 3693, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=3693)
          0.112399116 = weight(abstract_txt:algorithms in 3693) [ClassicSimilarity], result of:
            0.112399116 = score(doc=3693,freq=2.0), product of:
              0.22309458 = queryWeight, product of:
                2.4097297 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.016242087 = queryNorm
              0.5038182 = fieldWeight in 3693, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.0625 = fieldNorm(doc=3693)
          0.10032013 = weight(abstract_txt:automatic in 3693) [ClassicSimilarity], result of:
            0.10032013 = score(doc=3693,freq=1.0), product of:
              0.30893376 = queryWeight, product of:
                3.660841 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.016242087 = queryNorm
              0.32473022 = fieldWeight in 3693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=3693)
          0.63417846 = weight(abstract_txt:summarization in 3693) [ClassicSimilarity], result of:
            0.63417846 = score(doc=3693,freq=6.0), product of:
              0.5812512 = queryWeight, product of:
                5.021461 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.016242087 = queryNorm
              1.0910574 = fieldWeight in 3693, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.0625 = fieldNorm(doc=3693)
        0.28 = coord(7/25)
    
  2. Smeaton, A.F.: Progress in the application of natural language processing to information retrieval tasks (1992) 0.26
    0.2586346 = sum of:
      0.2586346 = product of:
        1.6164663 = sum of:
          0.1201368 = weight(abstract_txt:text in 7079) [ClassicSimilarity], result of:
            0.1201368 = score(doc=7079,freq=2.0), product of:
              0.11212014 = queryWeight, product of:
                1.7083058 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016242087 = queryNorm
              1.0715007 = fieldWeight in 7079, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.1875 = fieldNorm(doc=7079)
          0.42562228 = weight(abstract_txt:automatic in 7079) [ClassicSimilarity], result of:
            0.42562228 = score(doc=7079,freq=2.0), product of:
              0.30893376 = queryWeight, product of:
                3.660841 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.016242087 = queryNorm
              1.3777137 = fieldWeight in 7079, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.1875 = fieldNorm(doc=7079)
          0.29400048 = weight(abstract_txt:recent in 7079) [ClassicSimilarity], result of:
            0.29400048 = score(doc=7079,freq=1.0), product of:
              0.32321012 = queryWeight, product of:
                4.1018643 = boost
                4.8513412 = idf(docFreq=943, maxDocs=44421)
                0.016242087 = queryNorm
              0.9096265 = fieldWeight in 7079, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8513412 = idf(docFreq=943, maxDocs=44421)
                0.1875 = fieldNorm(doc=7079)
          0.77670676 = weight(abstract_txt:summarization in 7079) [ClassicSimilarity], result of:
            0.77670676 = score(doc=7079,freq=1.0), product of:
              0.5812512 = queryWeight, product of:
                5.021461 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.016242087 = queryNorm
              1.3362669 = fieldWeight in 7079, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.1875 = fieldNorm(doc=7079)
        0.16 = coord(4/25)
    
  3. Shen, D.; Yang, Q.; Chen, Z.: Noise reduction through summarization for Web-page classification (2007) 0.23
    0.23190346 = sum of:
      0.23190346 = product of:
        1.1595173 = sum of:
          0.03365198 = weight(abstract_txt:several in 1953) [ClassicSimilarity], result of:
            0.03365198 = score(doc=1953,freq=1.0), product of:
              0.09470228 = queryWeight, product of:
                1.2819126 = boost
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.016242087 = queryNorm
              0.355345 = fieldWeight in 1953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.078125 = fieldNorm(doc=1953)
          0.04020273 = weight(abstract_txt:state in 1953) [ClassicSimilarity], result of:
            0.04020273 = score(doc=1953,freq=1.0), product of:
              0.10662451 = queryWeight, product of:
                1.3602123 = boost
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.016242087 = queryNorm
              0.37704962 = fieldWeight in 1953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.078125 = fieldNorm(doc=1953)
          0.07079129 = weight(abstract_txt:text in 1953) [ClassicSimilarity], result of:
            0.07079129 = score(doc=1953,freq=4.0), product of:
              0.11212014 = queryWeight, product of:
                1.7083058 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016242087 = queryNorm
              0.6313878 = fieldWeight in 1953, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=1953)
          0.22214827 = weight(abstract_txt:algorithms in 1953) [ClassicSimilarity], result of:
            0.22214827 = score(doc=1953,freq=5.0), product of:
              0.22309458 = queryWeight, product of:
                2.4097297 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.016242087 = queryNorm
              0.99575824 = fieldWeight in 1953, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.078125 = fieldNorm(doc=1953)
          0.79272306 = weight(abstract_txt:summarization in 1953) [ClassicSimilarity], result of:
            0.79272306 = score(doc=1953,freq=6.0), product of:
              0.5812512 = queryWeight, product of:
                5.021461 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.016242087 = queryNorm
              1.3638217 = fieldWeight in 1953, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.078125 = fieldNorm(doc=1953)
        0.2 = coord(5/25)
    
  4. Oh, H.; Nam, S.; Zhu, Y.: Structured abstract summarization of scientific articles : summarization using full-text section information (2023) 0.20
    0.2001298 = sum of:
      0.2001298 = product of:
        0.8338742 = sum of:
          0.013509642 = weight(abstract_txt:been in 1890) [ClassicSimilarity], result of:
            0.013509642 = score(doc=1890,freq=1.0), product of:
              0.059802942 = queryWeight, product of:
                1.0186839 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.016242087 = queryNorm
              0.22590263 = fieldWeight in 1890, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.0625 = fieldNorm(doc=1890)
          0.032162186 = weight(abstract_txt:state in 1890) [ClassicSimilarity], result of:
            0.032162186 = score(doc=1890,freq=1.0), product of:
              0.10662451 = queryWeight, product of:
                1.3602123 = boost
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.016242087 = queryNorm
              0.3016397 = fieldWeight in 1890, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.0625 = fieldNorm(doc=1890)
          0.049045645 = weight(abstract_txt:text in 1890) [ClassicSimilarity], result of:
            0.049045645 = score(doc=1890,freq=3.0), product of:
              0.11212014 = queryWeight, product of:
                1.7083058 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016242087 = queryNorm
              0.4374383 = fieldWeight in 1890, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=1890)
          0.07947818 = weight(abstract_txt:algorithms in 1890) [ClassicSimilarity], result of:
            0.07947818 = score(doc=1890,freq=1.0), product of:
              0.22309458 = queryWeight, product of:
                2.4097297 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.016242087 = queryNorm
              0.3562533 = fieldWeight in 1890, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.0625 = fieldNorm(doc=1890)
          0.14187409 = weight(abstract_txt:automatic in 1890) [ClassicSimilarity], result of:
            0.14187409 = score(doc=1890,freq=2.0), product of:
              0.30893376 = queryWeight, product of:
                3.660841 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.016242087 = queryNorm
              0.45923787 = fieldWeight in 1890, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=1890)
          0.5178045 = weight(abstract_txt:summarization in 1890) [ClassicSimilarity], result of:
            0.5178045 = score(doc=1890,freq=4.0), product of:
              0.5812512 = queryWeight, product of:
                5.021461 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.016242087 = queryNorm
              0.8908446 = fieldWeight in 1890, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.0625 = fieldNorm(doc=1890)
        0.24 = coord(6/25)
    
  5. Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents (2008) 0.19
    0.18684497 = sum of:
      0.18684497 = product of:
        0.93422484 = sum of:
          0.019105518 = weight(abstract_txt:been in 2719) [ClassicSimilarity], result of:
            0.019105518 = score(doc=2719,freq=2.0), product of:
              0.059802942 = queryWeight, product of:
                1.0186839 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.016242087 = queryNorm
              0.31947455 = fieldWeight in 2719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.059937783 = weight(abstract_txt:document in 2719) [ClassicSimilarity], result of:
            0.059937783 = score(doc=2719,freq=7.0), product of:
              0.0844101 = queryWeight, product of:
                1.2102509 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.016242087 = queryNorm
              0.7100783 = fieldWeight in 2719, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.028316516 = weight(abstract_txt:text in 2719) [ClassicSimilarity], result of:
            0.028316516 = score(doc=2719,freq=1.0), product of:
              0.11212014 = queryWeight, product of:
                1.7083058 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016242087 = queryNorm
              0.25255513 = fieldWeight in 2719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.14187409 = weight(abstract_txt:automatic in 2719) [ClassicSimilarity], result of:
            0.14187409 = score(doc=2719,freq=2.0), product of:
              0.30893376 = queryWeight, product of:
                3.660841 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.016242087 = queryNorm
              0.45923787 = fieldWeight in 2719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.68499094 = weight(abstract_txt:summarization in 2719) [ClassicSimilarity], result of:
            0.68499094 = score(doc=2719,freq=7.0), product of:
              0.5812512 = queryWeight, product of:
                5.021461 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.016242087 = queryNorm
              1.1784766 = fieldWeight in 2719, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
        0.2 = coord(5/25)