Document (#25650)

Author
Haag, M.
Title
Automatic text summarization : Evaluation des Copernic Summarizer und mögliche Einsatzfelder in der Fachinformation der DaimlerCrysler AG
Imprint
Aachen : Shaker Verlag
Year
2002
Pages
211 S
Isbn
3-8265-9952-7
Series
Wirtschaftsinformatik
Abstract
An evaluation of the Copernic Summarizer, a software for automatically summarizing text in various data formats, is being presented. It shall be assessed if and how the Copernic Summarizer can reasonably be used in the DaimlerChrysler Information Division in order to enhance the quality of its information services. First, an introduction into Automatic Text Summarization is given and the Copernic Summarizer is being presented. Various methods for evaluating Automatic Text Summarization systems and software ergonomics are presented. Two evaluation forms are developed with which the employees of the Information Division shall evaluate the quality and relevance of the extracted keywords and summaries as well as the software's usability. The quality and relevance assessment is done by comparing the original text to the summaries. Finally, a recommendation is given concerning the use of the Copernic Summarizer.
Footnote
Diplomarbeit an der HBI Stuttgart. - Vgl. auch: nfd 53(2002) H.4, S.243-244
Theme
Automatisches Abstracting
Object
Copernic Summarizer

Similar documents (content)

  1. Aker, A.; Gaizauskas, R.: Generating descriptive multi-document summaries of geo-located entities using entity type models (2015) 0.31
    0.30759305 = sum of:
      0.30759305 = product of:
        1.2816377 = sum of:
          0.034144238 = weight(abstract_txt:quality in 1726) [ClassicSimilarity], result of:
            0.034144238 = score(doc=1726,freq=1.0), product of:
              0.117413476 = queryWeight, product of:
                2.258874 = boost
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.011171371 = queryNorm
              0.2908034 = fieldWeight in 1726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.13618623 = weight(abstract_txt:summaries in 1726) [ClassicSimilarity], result of:
            0.13618623 = score(doc=1726,freq=3.0), product of:
              0.17886455 = queryWeight, product of:
                2.276405 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.011171371 = queryNorm
              0.7613931 = fieldWeight in 1726, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.0475407 = weight(abstract_txt:automatic in 1726) [ClassicSimilarity], result of:
            0.0475407 = score(doc=1726,freq=1.0), product of:
              0.14640301 = queryWeight, product of:
                2.5223656 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.011171371 = queryNorm
              0.32472485 = fieldWeight in 1726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.05283397 = weight(abstract_txt:text in 1726) [ClassicSimilarity], result of:
            0.05283397 = score(doc=1726,freq=2.0), product of:
              0.14781599 = queryWeight, product of:
                3.2720363 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.011171371 = queryNorm
              0.3574307 = fieldWeight in 1726, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.24599229 = weight(abstract_txt:summarization in 1726) [ClassicSimilarity], result of:
            0.24599229 = score(doc=1726,freq=4.0), product of:
              0.27590993 = queryWeight, product of:
                3.4627147 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.011171371 = queryNorm
              0.89156735 = fieldWeight in 1726, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
          0.76494026 = weight(abstract_txt:summarizer in 1726) [ClassicSimilarity], result of:
            0.76494026 = score(doc=1726,freq=3.0), product of:
              0.76706797 = queryWeight, product of:
                7.4537416 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.011171371 = queryNorm
              0.9972262 = fieldWeight in 1726, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=1726)
        0.24 = coord(6/25)
    
  2. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.22
    0.21563423 = sum of:
      0.21563423 = product of:
        0.59898394 = sum of:
          0.008330578 = weight(abstract_txt:information in 657) [ClassicSimilarity], result of:
            0.008330578 = score(doc=657,freq=3.0), product of:
              0.031786986 = queryWeight, product of:
                1.1753243 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.011171371 = queryNorm
              0.26207513 = fieldWeight in 657, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.05372392 = weight(abstract_txt:summarizing in 657) [ClassicSimilarity], result of:
            0.05372392 = score(doc=657,freq=1.0), product of:
              0.11013137 = queryWeight, product of:
                1.2630714 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.011171371 = queryNorm
              0.4878167 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.020750605 = weight(abstract_txt:being in 657) [ClassicSimilarity], result of:
            0.020750605 = score(doc=657,freq=1.0), product of:
              0.07359185 = queryWeight, product of:
                1.4601661 = boost
                4.5115004 = idf(docFreq=1319, maxDocs=44218)
                0.011171371 = queryNorm
              0.28196877 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5115004 = idf(docFreq=1319, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.023478592 = weight(abstract_txt:given in 657) [ClassicSimilarity], result of:
            0.023478592 = score(doc=657,freq=1.0), product of:
              0.079908065 = queryWeight, product of:
                1.5215377 = boost
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.011171371 = queryNorm
              0.29382005 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.04327846 = weight(abstract_txt:evaluation in 657) [ClassicSimilarity], result of:
            0.04327846 = score(doc=657,freq=2.0), product of:
              0.10914676 = queryWeight, product of:
                2.1779027 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.011171371 = queryNorm
              0.3965162 = fieldWeight in 657, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.19259642 = weight(abstract_txt:summaries in 657) [ClassicSimilarity], result of:
            0.19259642 = score(doc=657,freq=6.0), product of:
              0.17886455 = queryWeight, product of:
                2.276405 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.011171371 = queryNorm
              1.0767725 = fieldWeight in 657, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.03534186 = weight(abstract_txt:presented in 657) [ClassicSimilarity], result of:
            0.03534186 = score(doc=657,freq=1.0), product of:
              0.12014321 = queryWeight, product of:
                2.2849813 = boost
                4.7066307 = idf(docFreq=1085, maxDocs=44218)
                0.011171371 = queryNorm
              0.29416442 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7066307 = idf(docFreq=1085, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.0475407 = weight(abstract_txt:automatic in 657) [ClassicSimilarity], result of:
            0.0475407 = score(doc=657,freq=1.0), product of:
              0.14640301 = queryWeight, product of:
                2.5223656 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.011171371 = queryNorm
              0.32472485 = fieldWeight in 657, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
          0.17394282 = weight(abstract_txt:summarization in 657) [ClassicSimilarity], result of:
            0.17394282 = score(doc=657,freq=2.0), product of:
              0.27590993 = queryWeight, product of:
                3.4627147 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.011171371 = queryNorm
              0.6304333 = fieldWeight in 657, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=657)
        0.36 = coord(9/25)
    
  3. Sankarasubramaniam, Y.; Ramanathan, K.; Ghosh, S.: Text summarization using Wikipedia (2014) 0.18
    0.17786185 = sum of:
      0.17786185 = product of:
        0.8893093 = sum of:
          0.034144238 = weight(abstract_txt:quality in 2693) [ClassicSimilarity], result of:
            0.034144238 = score(doc=2693,freq=1.0), product of:
              0.117413476 = queryWeight, product of:
                2.258874 = boost
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.011171371 = queryNorm
              0.2908034 = fieldWeight in 2693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6528544 = idf(docFreq=1145, maxDocs=44218)
                0.0625 = fieldNorm(doc=2693)
          0.0475407 = weight(abstract_txt:automatic in 2693) [ClassicSimilarity], result of:
            0.0475407 = score(doc=2693,freq=1.0), product of:
              0.14640301 = queryWeight, product of:
                2.5223656 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.011171371 = queryNorm
              0.32472485 = fieldWeight in 2693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.0625 = fieldNorm(doc=2693)
          0.064708136 = weight(abstract_txt:text in 2693) [ClassicSimilarity], result of:
            0.064708136 = score(doc=2693,freq=3.0), product of:
              0.14781599 = queryWeight, product of:
                3.2720363 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.011171371 = queryNorm
              0.4377614 = fieldWeight in 2693, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2693)
          0.3012778 = weight(abstract_txt:summarization in 2693) [ClassicSimilarity], result of:
            0.3012778 = score(doc=2693,freq=6.0), product of:
              0.27590993 = queryWeight, product of:
                3.4627147 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.011171371 = queryNorm
              1.0919425 = fieldWeight in 2693, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=2693)
          0.44163847 = weight(abstract_txt:summarizer in 2693) [ClassicSimilarity], result of:
            0.44163847 = score(doc=2693,freq=1.0), product of:
              0.76706797 = queryWeight, product of:
                7.4537416 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.011171371 = queryNorm
              0.5757488 = fieldWeight in 2693, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=2693)
        0.2 = coord(5/25)
    
  4. Maybury, M.T.: Generating summaries from event data (1995) 0.17
    0.16799231 = sum of:
      0.16799231 = product of:
        0.59997255 = sum of:
          0.013443413 = weight(abstract_txt:information in 2349) [ClassicSimilarity], result of:
            0.013443413 = score(doc=2349,freq=5.0), product of:
              0.031786986 = queryWeight, product of:
                1.1753243 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.011171371 = queryNorm
              0.42292193 = fieldWeight in 2349, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
          0.025938256 = weight(abstract_txt:being in 2349) [ClassicSimilarity], result of:
            0.025938256 = score(doc=2349,freq=1.0), product of:
              0.07359185 = queryWeight, product of:
                1.4601661 = boost
                4.5115004 = idf(docFreq=1319, maxDocs=44218)
                0.011171371 = queryNorm
              0.35246098 = fieldWeight in 2349, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5115004 = idf(docFreq=1319, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
          0.02934824 = weight(abstract_txt:given in 2349) [ClassicSimilarity], result of:
            0.02934824 = score(doc=2349,freq=1.0), product of:
              0.079908065 = queryWeight, product of:
                1.5215377 = boost
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.011171371 = queryNorm
              0.36727506 = fieldWeight in 2349, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
          0.09828395 = weight(abstract_txt:summaries in 2349) [ClassicSimilarity], result of:
            0.09828395 = score(doc=2349,freq=1.0), product of:
              0.17886455 = queryWeight, product of:
                2.276405 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.011171371 = queryNorm
              0.5494881 = fieldWeight in 2349, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
          0.059425876 = weight(abstract_txt:automatic in 2349) [ClassicSimilarity], result of:
            0.059425876 = score(doc=2349,freq=1.0), product of:
              0.14640301 = queryWeight, product of:
                2.5223656 = boost
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.011171371 = queryNorm
              0.40590608 = fieldWeight in 2349, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1955976 = idf(docFreq=665, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
          0.06604246 = weight(abstract_txt:text in 2349) [ClassicSimilarity], result of:
            0.06604246 = score(doc=2349,freq=2.0), product of:
              0.14781599 = queryWeight, product of:
                3.2720363 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.011171371 = queryNorm
              0.44678837 = fieldWeight in 2349, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
          0.30749035 = weight(abstract_txt:summarization in 2349) [ClassicSimilarity], result of:
            0.30749035 = score(doc=2349,freq=4.0), product of:
              0.27590993 = queryWeight, product of:
                3.4627147 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.011171371 = queryNorm
              1.1144592 = fieldWeight in 2349, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.078125 = fieldNorm(doc=2349)
        0.28 = coord(7/25)
    
  5. Finegan-Dollak, C.; Radev, D.R.: Sentence simplification, compression, and disaggregation for summarization of sophisticated documents (2016) 0.17
    0.16587465 = sum of:
      0.16587465 = product of:
        0.82937324 = sum of:
          0.004809662 = weight(abstract_txt:information in 3122) [ClassicSimilarity], result of:
            0.004809662 = score(doc=3122,freq=1.0), product of:
              0.031786986 = queryWeight, product of:
                1.1753243 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.011171371 = queryNorm
              0.15130915 = fieldWeight in 3122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=3122)
          0.033203743 = weight(abstract_txt:given in 3122) [ClassicSimilarity], result of:
            0.033203743 = score(doc=3122,freq=2.0), product of:
              0.079908065 = queryWeight, product of:
                1.5215377 = boost
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.011171371 = queryNorm
              0.4155243 = fieldWeight in 3122, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.0625 = fieldNorm(doc=3122)
          0.03060249 = weight(abstract_txt:evaluation in 3122) [ClassicSimilarity], result of:
            0.03060249 = score(doc=3122,freq=1.0), product of:
              0.10914676 = queryWeight, product of:
                2.1779027 = boost
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.011171371 = queryNorm
              0.2803793 = fieldWeight in 3122, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4860687 = idf(docFreq=1353, maxDocs=44218)
                0.0625 = fieldNorm(doc=3122)
          0.13618623 = weight(abstract_txt:summaries in 3122) [ClassicSimilarity], result of:
            0.13618623 = score(doc=3122,freq=3.0), product of:
              0.17886455 = queryWeight, product of:
                2.276405 = boost
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.011171371 = queryNorm
              0.7613931 = fieldWeight in 3122, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.033448 = idf(docFreq=105, maxDocs=44218)
                0.0625 = fieldNorm(doc=3122)
          0.6245711 = weight(abstract_txt:summarizer in 3122) [ClassicSimilarity], result of:
            0.6245711 = score(doc=3122,freq=2.0), product of:
              0.76706797 = queryWeight, product of:
                7.4537416 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.011171371 = queryNorm
              0.81423175 = fieldWeight in 3122, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.0625 = fieldNorm(doc=3122)
        0.2 = coord(5/25)