Document (#37979)

Editor
Poibeau, T. u.a.
Title
Multi-source, multilingual information extraction and summarization
Imprint
Berlin : Springer
Year
2013
Pages
XX, 323 S
Isbn
978-3-642-28568-4
Series
Theory and applications of natural language processing
Abstract
Information extraction (IE) and text summarization (TS) are powerful technologies for finding relevant pieces of information in text and presenting them to the user in condensed form. The ongoing information explosion makes IE and TS critical for successful functioning within the information society. These technologies face particular challenges due to the inherent multi-source nature of the information explosion. The technologies must now handle not isolated texts or individual narratives, but rather large-scale repositories and streams---in general, in multiple languages---containing a multiplicity of perspectives, opinions, or commentaries on particular topics, entities or events. There is thus a need to adapt existing techniques and develop new ones to deal with these challenges. This volume contains a selection of papers that present a variety of methodologies for content identification and extraction, as well as for content fusion and regeneration. The chapters cover various aspects of the challenges, depending on the nature of the information sought---names vs. events,--- and the nature of the sources---news streams vs. image captions vs. scientific research papers, etc. This volume aims to offer a broad and representative sample of studies from this very active research field.
Footnote
Rez. in: JASIST 64(2013) no.7, S.1519-1521 (José L. Vicedo, David Tomás)
Theme
Computerlinguistik
RSWK
Natürlichsprachiges System / Information Extraction / Automatische Inhaltsanalyse / Zusammenfassung / Aufsatzsammlung
BK
54.75 (Sprachverarbeitung) <Informatik>
DDC
006.312 / DDC22ger
005.74 / DDC22ger
RVK
ST 530
ST 306
AN 95300

Similar documents (content)

  1. Moens, M.-F.; Angheluta, R.; Dumortier, J.: Generic technologies for single-and multi-document summarization (2005) 0.18
    0.17735684 = sum of:
      0.17735684 = product of:
        0.73898685 = sum of:
          0.026523061 = weight(abstract_txt:text in 2026) [ClassicSimilarity], result of:
            0.026523061 = score(doc=2026,freq=1.0), product of:
              0.084015116 = queryWeight, product of:
                1.0427781 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019938355 = queryNorm
              0.3156939 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=2026)
          0.041506566 = weight(abstract_txt:content in 2026) [ClassicSimilarity], result of:
            0.041506566 = score(doc=2026,freq=2.0), product of:
              0.089882724 = queryWeight, product of:
                1.0785774 = boost
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.019938355 = queryNorm
              0.46178582 = fieldWeight in 2026, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.078125 = fieldNorm(doc=2026)
          0.08422217 = weight(abstract_txt:multi in 2026) [ClassicSimilarity], result of:
            0.08422217 = score(doc=2026,freq=1.0), product of:
              0.18150668 = queryWeight, product of:
                1.5327083 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.019938355 = queryNorm
              0.4640169 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.078125 = fieldNorm(doc=2026)
          0.29100528 = weight(abstract_txt:summarization in 2026) [ClassicSimilarity], result of:
            0.29100528 = score(doc=2026,freq=4.0), product of:
              0.2613298 = queryWeight, product of:
                1.8391098 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.019938355 = queryNorm
              1.1135557 = fieldWeight in 2026, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.078125 = fieldNorm(doc=2026)
          0.10486112 = weight(abstract_txt:technologies in 2026) [ClassicSimilarity], result of:
            0.10486112 = score(doc=2026,freq=2.0), product of:
              0.19085567 = queryWeight, product of:
                1.9249141 = boost
                4.972839 = idf(docFreq=835, maxDocs=44421)
                0.019938355 = queryNorm
              0.54942626 = fieldWeight in 2026, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.972839 = idf(docFreq=835, maxDocs=44421)
                0.078125 = fieldNorm(doc=2026)
          0.19086865 = weight(abstract_txt:extraction in 2026) [ClassicSimilarity], result of:
            0.19086865 = score(doc=2026,freq=1.0), product of:
              0.39455548 = queryWeight, product of:
                3.1958194 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.019938355 = queryNorm
              0.48375618 = fieldWeight in 2026, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.078125 = fieldNorm(doc=2026)
        0.24 = coord(6/25)
    
  2. Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents (2008) 0.18
    0.17639883 = sum of:
      0.17639883 = product of:
        0.7349951 = sum of:
          0.021218449 = weight(abstract_txt:text in 2719) [ClassicSimilarity], result of:
            0.021218449 = score(doc=2719,freq=1.0), product of:
              0.084015116 = queryWeight, product of:
                1.0427781 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019938355 = queryNorm
              0.25255513 = fieldWeight in 2719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.0981388 = weight(abstract_txt:condensed in 2719) [ClassicSimilarity], result of:
            0.0981388 = score(doc=2719,freq=1.0), product of:
              0.18511097 = queryWeight, product of:
                1.0944963 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.019938355 = queryNorm
              0.530162 = fieldWeight in 2719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.06019101 = weight(abstract_txt:source in 2719) [ClassicSimilarity], result of:
            0.06019101 = score(doc=2719,freq=2.0), product of:
              0.13362655 = queryWeight, product of:
                1.3151025 = boost
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.019938355 = queryNorm
              0.450442 = fieldWeight in 2719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.30797106 = weight(abstract_txt:summarization in 2719) [ClassicSimilarity], result of:
            0.30797106 = score(doc=2719,freq=7.0), product of:
              0.2613298 = queryWeight, product of:
                1.8391098 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.019938355 = queryNorm
              1.1784766 = fieldWeight in 2719, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.031532556 = weight(abstract_txt:information in 2719) [ClassicSimilarity], result of:
            0.031532556 = score(doc=2719,freq=3.0), product of:
              0.12042058 = queryWeight, product of:
                2.4968565 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.019938355 = queryNorm
              0.26185355 = fieldWeight in 2719, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.21594322 = weight(abstract_txt:extraction in 2719) [ClassicSimilarity], result of:
            0.21594322 = score(doc=2719,freq=2.0), product of:
              0.39455548 = queryWeight, product of:
                3.1958194 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.019938355 = queryNorm
              0.5473076 = fieldWeight in 2719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
        0.24 = coord(6/25)
    
  3. Huo, W.: Automatic multi-word term extraction and its application to Web-page summarization (2012) 0.17
    0.16763864 = sum of:
      0.16763864 = product of:
        0.8381932 = sum of:
          0.026523061 = weight(abstract_txt:text in 1563) [ClassicSimilarity], result of:
            0.026523061 = score(doc=1563,freq=1.0), product of:
              0.084015116 = queryWeight, product of:
                1.0427781 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019938355 = queryNorm
              0.3156939 = fieldWeight in 1563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
          0.20630133 = weight(abstract_txt:multi in 1563) [ClassicSimilarity], result of:
            0.20630133 = score(doc=1563,freq=6.0), product of:
              0.18150668 = queryWeight, product of:
                1.5327083 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.019938355 = queryNorm
              1.1366047 = fieldWeight in 1563, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
          0.25201797 = weight(abstract_txt:summarization in 1563) [ClassicSimilarity], result of:
            0.25201797 = score(doc=1563,freq=3.0), product of:
              0.2613298 = queryWeight, product of:
                1.8391098 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.019938355 = queryNorm
              0.9643675 = fieldWeight in 1563, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
          0.022756664 = weight(abstract_txt:information in 1563) [ClassicSimilarity], result of:
            0.022756664 = score(doc=1563,freq=1.0), product of:
              0.12042058 = queryWeight, product of:
                2.4968565 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.019938355 = queryNorm
              0.18897653 = fieldWeight in 1563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
          0.33059418 = weight(abstract_txt:extraction in 1563) [ClassicSimilarity], result of:
            0.33059418 = score(doc=1563,freq=3.0), product of:
              0.39455548 = queryWeight, product of:
                3.1958194 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.019938355 = queryNorm
              0.83789027 = fieldWeight in 1563, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
        0.2 = coord(5/25)
    
  4. Multilingual information management : current levels and future abilities. A report Commissioned by the US National Science Foundation and also delivered to the European Commission's Language Engineering Office and the US Defense Advanced Research Projects Agency, April 1999 (1999) 0.16
    0.16284025 = sum of:
      0.16284025 = product of:
        0.5815723 = sum of:
          0.026256492 = weight(abstract_txt:text in 68) [ClassicSimilarity], result of:
            0.026256492 = score(doc=68,freq=2.0), product of:
              0.084015116 = queryWeight, product of:
                1.0427781 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019938355 = queryNorm
              0.31252104 = fieldWeight in 68, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0546875 = fieldNorm(doc=68)
          0.10211395 = weight(abstract_txt:multi in 68) [ClassicSimilarity], result of:
            0.10211395 = score(doc=68,freq=3.0), product of:
              0.18150668 = queryWeight, product of:
                1.5327083 = boost
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.019938355 = queryNorm
              0.5625906 = fieldWeight in 68, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9394164 = idf(docFreq=317, maxDocs=44421)
                0.0546875 = fieldNorm(doc=68)
          0.14404026 = weight(abstract_txt:summarization in 68) [ClassicSimilarity], result of:
            0.14404026 = score(doc=68,freq=2.0), product of:
              0.2613298 = queryWeight, product of:
                1.8391098 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.019938355 = queryNorm
              0.5511819 = fieldWeight in 68, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.0546875 = fieldNorm(doc=68)
          0.07340278 = weight(abstract_txt:technologies in 68) [ClassicSimilarity], result of:
            0.07340278 = score(doc=68,freq=2.0), product of:
              0.19085567 = queryWeight, product of:
                1.9249141 = boost
                4.972839 = idf(docFreq=835, maxDocs=44421)
                0.019938355 = queryNorm
              0.38459837 = fieldWeight in 68, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.972839 = idf(docFreq=835, maxDocs=44421)
                0.0546875 = fieldNorm(doc=68)
          0.057094865 = weight(abstract_txt:challenges in 68) [ClassicSimilarity], result of:
            0.057094865 = score(doc=68,freq=1.0), product of:
              0.20337836 = queryWeight, product of:
                1.987061 = boost
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.019938355 = queryNorm
              0.28073224 = fieldWeight in 68, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.0546875 = fieldNorm(doc=68)
          0.045055892 = weight(abstract_txt:information in 68) [ClassicSimilarity], result of:
            0.045055892 = score(doc=68,freq=8.0), product of:
              0.12042058 = queryWeight, product of:
                2.4968565 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.019938355 = queryNorm
              0.37415442 = fieldWeight in 68, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0546875 = fieldNorm(doc=68)
          0.13360806 = weight(abstract_txt:extraction in 68) [ClassicSimilarity], result of:
            0.13360806 = score(doc=68,freq=1.0), product of:
              0.39455548 = queryWeight, product of:
                3.1958194 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.019938355 = queryNorm
              0.33862934 = fieldWeight in 68, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0546875 = fieldNorm(doc=68)
        0.28 = coord(7/25)
    
  5. Ercan, G.; Cicekli, I.: Using lexical chains for keyword extraction (2007) 0.16
    0.1617742 = sum of:
      0.1617742 = product of:
        0.80887103 = sum of:
          0.055127148 = weight(abstract_txt:text in 1951) [ClassicSimilarity], result of:
            0.055127148 = score(doc=1951,freq=3.0), product of:
              0.084015116 = queryWeight, product of:
                1.0427781 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019938355 = queryNorm
              0.6561575 = fieldWeight in 1951, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=1951)
          0.03521949 = weight(abstract_txt:content in 1951) [ClassicSimilarity], result of:
            0.03521949 = score(doc=1951,freq=1.0), product of:
              0.089882724 = queryWeight, product of:
                1.0785774 = boost
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.019938355 = queryNorm
              0.39183828 = fieldWeight in 1951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.09375 = fieldNorm(doc=1951)
          0.1472082 = weight(abstract_txt:condensed in 1951) [ClassicSimilarity], result of:
            0.1472082 = score(doc=1951,freq=1.0), product of:
              0.18511097 = queryWeight, product of:
                1.0944963 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.019938355 = queryNorm
              0.79524297 = fieldWeight in 1951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.09375 = fieldNorm(doc=1951)
          0.17460318 = weight(abstract_txt:summarization in 1951) [ClassicSimilarity], result of:
            0.17460318 = score(doc=1951,freq=1.0), product of:
              0.2613298 = queryWeight, product of:
                1.8391098 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.019938355 = queryNorm
              0.66813344 = fieldWeight in 1951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.09375 = fieldNorm(doc=1951)
          0.39671305 = weight(abstract_txt:extraction in 1951) [ClassicSimilarity], result of:
            0.39671305 = score(doc=1951,freq=3.0), product of:
              0.39455548 = queryWeight, product of:
                3.1958194 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.019938355 = queryNorm
              1.0054684 = fieldWeight in 1951, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.09375 = fieldNorm(doc=1951)
        0.2 = coord(5/25)