Document (#25650)

Author
Haag, M.
Title
Automatic text summarization : Evaluation des Copernic Summarizer und mögliche Einsatzfelder in der Fachinformation der DaimlerCrysler AG
Imprint
Aachen : Shaker Verlag
Year
2002
Pages
211 S
Isbn
3-8265-9952-7
Series
Wirtschaftsinformatik
Abstract
An evaluation of the Copernic Summarizer, a software for automatically summarizing text in various data formats, is being presented. It shall be assessed if and how the Copernic Summarizer can reasonably be used in the DaimlerChrysler Information Division in order to enhance the quality of its information services. First, an introduction into Automatic Text Summarization is given and the Copernic Summarizer is being presented. Various methods for evaluating Automatic Text Summarization systems and software ergonomics are presented. Two evaluation forms are developed with which the employees of the Information Division shall evaluate the quality and relevance of the extracted keywords and summaries as well as the software's usability. The quality and relevance assessment is done by comparing the original text to the summaries. Finally, a recommendation is given concerning the use of the Copernic Summarizer.
Footnote
Diplomarbeit an der HBI Stuttgart. - Vgl. auch: nfd 53(2002) H.4, S.243-244
Theme
Automatisches Abstracting
Object
Copernic Summarizer

Similar documents (content)

  1. Aker, A.; Gaizauskas, R.: Generating descriptive multi-document summaries of geo-located entities using entity type models (2015) 0.31
    0.3075587 = sum of:
      0.3075587 = product of:
        1.2814945 = sum of:
          0.033941817 = weight(abstract_txt:quality in 2726) [ClassicSimilarity], result of:
            0.033941817 = score(doc=2726,freq=1.0), product of:
              0.11692909 = queryWeight, product of:
                2.2545757 = boost
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.01116671 = queryNorm
              0.2902769 = fieldWeight in 2726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.0625 = fieldNorm(doc=2726)
          0.13638295 = weight(abstract_txt:summaries in 2726) [ClassicSimilarity], result of:
            0.13638295 = score(doc=2726,freq=3.0), product of:
              0.17900634 = queryWeight, product of:
                2.2776768 = boost
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.01116671 = queryNorm
              0.7618889 = fieldWeight in 2726, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.0625 = fieldNorm(doc=2726)
          0.047518823 = weight(abstract_txt:automatic in 2726) [ClassicSimilarity], result of:
            0.047518823 = score(doc=2726,freq=1.0), product of:
              0.14633323 = queryWeight, product of:
                2.522174 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.01116671 = queryNorm
              0.32473022 = fieldWeight in 2726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=2726)
          0.052690204 = weight(abstract_txt:text in 2726) [ClassicSimilarity], result of:
            0.052690204 = score(doc=2726,freq=2.0), product of:
              0.14752264 = queryWeight, product of:
                3.2693186 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.01116671 = queryNorm
              0.3571669 = fieldWeight in 2726, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=2726)
          0.2452694 = weight(abstract_txt:summarization in 2726) [ClassicSimilarity], result of:
            0.2452694 = score(doc=2726,freq=4.0), product of:
              0.27532232 = queryWeight, product of:
                3.4595869 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.01116671 = queryNorm
              0.8908446 = fieldWeight in 2726, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.0625 = fieldNorm(doc=2726)
          0.7656913 = weight(abstract_txt:summarizer in 2726) [ClassicSimilarity], result of:
            0.7656913 = score(doc=2726,freq=3.0), product of:
              0.7674395 = queryWeight, product of:
                7.4567566 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.01116671 = queryNorm
              0.997722 = fieldWeight in 2726, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=2726)
        0.24 = coord(6/25)
    
  2. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.22
    0.21542628 = sum of:
      0.21542628 = product of:
        0.5984063 = sum of:
          0.008305231 = weight(abstract_txt:information in 782) [ClassicSimilarity], result of:
            0.008305231 = score(doc=782,freq=3.0), product of:
              0.031717084 = queryWeight, product of:
                1.1742219 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01116671 = queryNorm
              0.26185355 = fieldWeight in 782, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.053791117 = weight(abstract_txt:summarizing in 782) [ClassicSimilarity], result of:
            0.053791117 = score(doc=782,freq=1.0), product of:
              0.11020445 = queryWeight, product of:
                1.2636956 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.01116671 = queryNorm
              0.48810294 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.020699136 = weight(abstract_txt:being in 782) [ClassicSimilarity], result of:
            0.020699136 = score(doc=782,freq=1.0), product of:
              0.07345763 = queryWeight, product of:
                1.4590708 = boost
                4.5085335 = idf(docFreq=1329, maxDocs=44421)
                0.01116671 = queryNorm
              0.28178334 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5085335 = idf(docFreq=1329, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.023412347 = weight(abstract_txt:given in 782) [ClassicSimilarity], result of:
            0.023412347 = score(doc=782,freq=1.0), product of:
              0.07974413 = queryWeight, product of:
                1.5202229 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.01116671 = queryNorm
              0.29359335 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.043070465 = weight(abstract_txt:evaluation in 782) [ClassicSimilarity], result of:
            0.043070465 = score(doc=782,freq=2.0), product of:
              0.10877829 = queryWeight, product of:
                2.1745763 = boost
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.01116671 = queryNorm
              0.39594725 = fieldWeight in 782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.19287463 = weight(abstract_txt:summaries in 782) [ClassicSimilarity], result of:
            0.19287463 = score(doc=782,freq=6.0), product of:
              0.17900634 = queryWeight, product of:
                2.2776768 = boost
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.01116671 = queryNorm
              1.0774738 = fieldWeight in 782, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.03530292 = weight(abstract_txt:presented in 782) [ClassicSimilarity], result of:
            0.03530292 = score(doc=782,freq=1.0), product of:
              0.120034546 = queryWeight, product of:
                2.2843184 = boost
                4.7057014 = idf(docFreq=1091, maxDocs=44421)
                0.01116671 = queryNorm
              0.29410633 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7057014 = idf(docFreq=1091, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.047518823 = weight(abstract_txt:automatic in 782) [ClassicSimilarity], result of:
            0.047518823 = score(doc=782,freq=1.0), product of:
              0.14633323 = queryWeight, product of:
                2.522174 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.01116671 = queryNorm
              0.32473022 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.17343165 = weight(abstract_txt:summarization in 782) [ClassicSimilarity], result of:
            0.17343165 = score(doc=782,freq=2.0), product of:
              0.27532232 = queryWeight, product of:
                3.4595869 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.01116671 = queryNorm
              0.6299222 = fieldWeight in 782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
        0.36 = coord(9/25)
    
  3. Sankarasubramaniam, Y.; Ramanathan, K.; Ghosh, S.: Text summarization using Wikipedia (2014) 0.18
    0.17769144 = sum of:
      0.17769144 = product of:
        0.8884572 = sum of:
          0.033941817 = weight(abstract_txt:quality in 3693) [ClassicSimilarity], result of:
            0.033941817 = score(doc=3693,freq=1.0), product of:
              0.11692909 = queryWeight, product of:
                2.2545757 = boost
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.01116671 = queryNorm
              0.2902769 = fieldWeight in 3693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6444306 = idf(docFreq=1160, maxDocs=44421)
                0.0625 = fieldNorm(doc=3693)
          0.047518823 = weight(abstract_txt:automatic in 3693) [ClassicSimilarity], result of:
            0.047518823 = score(doc=3693,freq=1.0), product of:
              0.14633323 = queryWeight, product of:
                2.522174 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.01116671 = queryNorm
              0.32473022 = fieldWeight in 3693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=3693)
          0.06453206 = weight(abstract_txt:text in 3693) [ClassicSimilarity], result of:
            0.06453206 = score(doc=3693,freq=3.0), product of:
              0.14752264 = queryWeight, product of:
                3.2693186 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.01116671 = queryNorm
              0.4374383 = fieldWeight in 3693, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=3693)
          0.30039245 = weight(abstract_txt:summarization in 3693) [ClassicSimilarity], result of:
            0.30039245 = score(doc=3693,freq=6.0), product of:
              0.27532232 = queryWeight, product of:
                3.4595869 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.01116671 = queryNorm
              1.0910574 = fieldWeight in 3693, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.0625 = fieldNorm(doc=3693)
          0.44207206 = weight(abstract_txt:summarizer in 3693) [ClassicSimilarity], result of:
            0.44207206 = score(doc=3693,freq=1.0), product of:
              0.7674395 = queryWeight, product of:
                7.4567566 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.01116671 = queryNorm
              0.5760351 = fieldWeight in 3693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=3693)
        0.2 = coord(5/25)
    
  4. Maybury, M.T.: Generating summaries from event data (1995) 0.17
    0.16766842 = sum of:
      0.16766842 = product of:
        0.5988158 = sum of:
          0.01340251 = weight(abstract_txt:information in 2417) [ClassicSimilarity], result of:
            0.01340251 = score(doc=2417,freq=5.0), product of:
              0.031717084 = queryWeight, product of:
                1.1742219 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01116671 = queryNorm
              0.4225644 = fieldWeight in 2417, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.078125 = fieldNorm(doc=2417)
          0.02587392 = weight(abstract_txt:being in 2417) [ClassicSimilarity], result of:
            0.02587392 = score(doc=2417,freq=1.0), product of:
              0.07345763 = queryWeight, product of:
                1.4590708 = boost
                4.5085335 = idf(docFreq=1329, maxDocs=44421)
                0.01116671 = queryNorm
              0.35222918 = fieldWeight in 2417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5085335 = idf(docFreq=1329, maxDocs=44421)
                0.078125 = fieldNorm(doc=2417)
          0.029265434 = weight(abstract_txt:given in 2417) [ClassicSimilarity], result of:
            0.029265434 = score(doc=2417,freq=1.0), product of:
              0.07974413 = queryWeight, product of:
                1.5202229 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.01116671 = queryNorm
              0.3669917 = fieldWeight in 2417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.078125 = fieldNorm(doc=2417)
          0.09842592 = weight(abstract_txt:summaries in 2417) [ClassicSimilarity], result of:
            0.09842592 = score(doc=2417,freq=1.0), product of:
              0.17900634 = queryWeight, product of:
                2.2776768 = boost
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.01116671 = queryNorm
              0.549846 = fieldWeight in 2417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.078125 = fieldNorm(doc=2417)
          0.059398524 = weight(abstract_txt:automatic in 2417) [ClassicSimilarity], result of:
            0.059398524 = score(doc=2417,freq=1.0), product of:
              0.14633323 = queryWeight, product of:
                2.522174 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.01116671 = queryNorm
              0.40591276 = fieldWeight in 2417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.078125 = fieldNorm(doc=2417)
          0.06586275 = weight(abstract_txt:text in 2417) [ClassicSimilarity], result of:
            0.06586275 = score(doc=2417,freq=2.0), product of:
              0.14752264 = queryWeight, product of:
                3.2693186 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.01116671 = queryNorm
              0.4464586 = fieldWeight in 2417, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=2417)
          0.30658674 = weight(abstract_txt:summarization in 2417) [ClassicSimilarity], result of:
            0.30658674 = score(doc=2417,freq=4.0), product of:
              0.27532232 = queryWeight, product of:
                3.4595869 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.01116671 = queryNorm
              1.1135557 = fieldWeight in 2417, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.078125 = fieldNorm(doc=2417)
        0.28 = coord(7/25)
    
  5. Finegan-Dollak, C.; Radev, D.R.: Sentence simplification, compression, and disaggregation for summarization of sophisticated documents (2016) 0.17
    0.16598555 = sum of:
      0.16598555 = product of:
        0.82992774 = sum of:
          0.0047950274 = weight(abstract_txt:information in 4122) [ClassicSimilarity], result of:
            0.0047950274 = score(doc=4122,freq=1.0), product of:
              0.031717084 = queryWeight, product of:
                1.1742219 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01116671 = queryNorm
              0.15118122 = fieldWeight in 4122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=4122)
          0.033110056 = weight(abstract_txt:given in 4122) [ClassicSimilarity], result of:
            0.033110056 = score(doc=4122,freq=2.0), product of:
              0.07974413 = queryWeight, product of:
                1.5202229 = boost
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.01116671 = queryNorm
              0.4152037 = fieldWeight in 4122, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6974936 = idf(docFreq=1100, maxDocs=44421)
                0.0625 = fieldNorm(doc=4122)
          0.030455418 = weight(abstract_txt:evaluation in 4122) [ClassicSimilarity], result of:
            0.030455418 = score(doc=4122,freq=1.0), product of:
              0.10877829 = queryWeight, product of:
                2.1745763 = boost
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.01116671 = queryNorm
              0.279977 = fieldWeight in 4122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.0625 = fieldNorm(doc=4122)
          0.13638295 = weight(abstract_txt:summaries in 4122) [ClassicSimilarity], result of:
            0.13638295 = score(doc=4122,freq=3.0), product of:
              0.17900634 = queryWeight, product of:
                2.2776768 = boost
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.01116671 = queryNorm
              0.7618889 = fieldWeight in 4122, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.0625 = fieldNorm(doc=4122)
          0.6251843 = weight(abstract_txt:summarizer in 4122) [ClassicSimilarity], result of:
            0.6251843 = score(doc=4122,freq=2.0), product of:
              0.7674395 = queryWeight, product of:
                7.4567566 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.01116671 = queryNorm
              0.8146366 = fieldWeight in 4122, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=4122)
        0.2 = coord(5/25)