Document (#34072)

Author
Otterbacher, J.
Radev, D.
Kareem, O.
Title
Hierarchical summarization for delivering information to mobile devices
Source
Information processing and management. 44(2008) no.2, S.931-947
Year
2008
Abstract
Access to information via handheld devices supports decision making away from one's computer. However, limitations include small screens and constrained wireless bandwidth. We present a summarization method that transforms online content for delivery to small devices. Unlike previous algorithms, ours assumes nothing about document formatting, and induces a hierarchical structure based on the relative importance of sentences within the document. As compared to delivering full documents, the method reduces the bytes transferred by half. An experiment also demonstrates that when given hierarchical summaries, users are no less accurate in answering questions about the documents.

Similar documents (author)

  1. Otterbacher, J.; Radev, D.: Exploring fact-focused relevance and novelty detection (2008) 4.57
    4.5682592 = sum of:
      4.5682592 = weight(author_txt:radev in 3210) [ClassicSimilarity], result of:
        4.5682592 = fieldWeight in 3210, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.1365185 = idf(docFreq=12, maxDocs=44421)
          0.5 = fieldNorm(doc=3210)
    
  2. Finegan-Dollak, C.; Radev, D.R.: Sentence simplification, compression, and disaggregation for summarization of sophisticated documents (2016) 4.00
    3.9972267 = sum of:
      3.9972267 = weight(author_txt:radev in 4122) [ClassicSimilarity], result of:
        3.9972267 = fieldWeight in 4122, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.1365185 = idf(docFreq=12, maxDocs=44421)
          0.4375 = fieldNorm(doc=4122)
    
  3. Radev, D.R.; Libner, K.; Fan, W.: Getting answers to natural language questions on the Web (2002) 3.43
    3.4261944 = sum of:
      3.4261944 = weight(author_txt:radev in 204) [ClassicSimilarity], result of:
        3.4261944 = fieldWeight in 204, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.1365185 = idf(docFreq=12, maxDocs=44421)
          0.375 = fieldNorm(doc=204)
    
  4. Otterbacher, J.; Erkan, G.; Radev, D.R.: Biased LexRank : passage retrieval using random walks with question-based priors (2009) 3.43
    3.4261944 = sum of:
      3.4261944 = weight(author_txt:radev in 3450) [ClassicSimilarity], result of:
        3.4261944 = fieldWeight in 3450, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.1365185 = idf(docFreq=12, maxDocs=44421)
          0.375 = fieldNorm(doc=3450)
    
  5. Lam, W.; Chan, K.; Radev, D.; Saggion, H.; Teufel, S.: Context-based generic cross-lingual retrieval of documents and automated summaries (2005) 2.86
    2.8551621 = sum of:
      2.8551621 = weight(author_txt:radev in 2965) [ClassicSimilarity], result of:
        2.8551621 = fieldWeight in 2965, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.1365185 = idf(docFreq=12, maxDocs=44421)
          0.3125 = fieldNorm(doc=2965)
    

Similar documents (content)

  1. Yang, C.C.; Wang, F.L.: Hierarchical summarization of large documents (2008) 0.13
    0.13446155 = sum of:
      0.13446155 = product of:
        0.6723077 = sum of:
          0.058446098 = weight(abstract_txt:summaries in 2719) [ClassicSimilarity], result of:
            0.058446098 = score(doc=2719,freq=1.0), product of:
              0.13286924 = queryWeight, product of:
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.018878758 = queryNorm
              0.4398768 = fieldWeight in 2719, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.033242155 = weight(abstract_txt:documents in 2719) [ClassicSimilarity], result of:
            0.033242155 = score(doc=2719,freq=2.0), product of:
              0.09121093 = queryWeight, product of:
                1.1717263 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.018878758 = queryNorm
              0.3644536 = fieldWeight in 2719, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.07024461 = weight(abstract_txt:document in 2719) [ClassicSimilarity], result of:
            0.07024461 = score(doc=2719,freq=7.0), product of:
              0.098925166 = queryWeight, product of:
                1.2202706 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.018878758 = queryNorm
              0.7100783 = fieldWeight in 2719, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.3211125 = weight(abstract_txt:summarization in 2719) [ClassicSimilarity], result of:
            0.3211125 = score(doc=2719,freq=7.0), product of:
              0.27248102 = queryWeight, product of:
                2.025214 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.018878758 = queryNorm
              1.1784766 = fieldWeight in 2719, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
          0.18926235 = weight(abstract_txt:hierarchical in 2719) [ClassicSimilarity], result of:
            0.18926235 = score(doc=2719,freq=4.0), product of:
              0.26423115 = queryWeight, product of:
                2.4425328 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.018878758 = queryNorm
              0.7162757 = fieldWeight in 2719, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.0625 = fieldNorm(doc=2719)
        0.2 = coord(5/25)
    
  2. Ou, S.; Khoo, C.S.G.; Goh, D.H.: Multi-document summarization of news articles using an event-based framework (2006) 0.13
    0.13056694 = sum of:
      0.13056694 = product of:
        0.54402894 = sum of:
          0.14316313 = weight(abstract_txt:summaries in 782) [ClassicSimilarity], result of:
            0.14316313 = score(doc=782,freq=6.0), product of:
              0.13286924 = queryWeight, product of:
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.018878758 = queryNorm
              1.0774738 = fieldWeight in 782, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.053099938 = weight(abstract_txt:document in 782) [ClassicSimilarity], result of:
            0.053099938 = score(doc=782,freq=4.0), product of:
              0.098925166 = queryWeight, product of:
                1.2202706 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.018878758 = queryNorm
              0.53676873 = fieldWeight in 782, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.030529825 = weight(abstract_txt:method in 782) [ClassicSimilarity], result of:
            0.030529825 = score(doc=782,freq=1.0), product of:
              0.108579285 = queryWeight, product of:
                1.278428 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.018878758 = queryNorm
              0.2811754 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.050962985 = weight(abstract_txt:small in 782) [ClassicSimilarity], result of:
            0.050962985 = score(doc=782,freq=1.0), product of:
              0.15279202 = queryWeight, product of:
                1.5165373 = boost
                5.3367167 = idf(docFreq=580, maxDocs=44421)
                0.018878758 = queryNorm
              0.3335448 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3367167 = idf(docFreq=580, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.17164186 = weight(abstract_txt:summarization in 782) [ClassicSimilarity], result of:
            0.17164186 = score(doc=782,freq=2.0), product of:
              0.27248102 = queryWeight, product of:
                2.025214 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.018878758 = queryNorm
              0.6299222 = fieldWeight in 782, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.09463117 = weight(abstract_txt:hierarchical in 782) [ClassicSimilarity], result of:
            0.09463117 = score(doc=782,freq=1.0), product of:
              0.26423115 = queryWeight, product of:
                2.4425328 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.018878758 = queryNorm
              0.35813785 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
        0.24 = coord(6/25)
    
  3. Pons-Porrata, A.; Berlanga-Llavori, R.; Ruiz-Shulcloper, J.: Topic discovery based on text mining techniques (2007) 0.10
    0.10317507 = sum of:
      0.10317507 = product of:
        0.51587534 = sum of:
          0.103319086 = weight(abstract_txt:summaries in 1916) [ClassicSimilarity], result of:
            0.103319086 = score(doc=1916,freq=2.0), product of:
              0.13286924 = queryWeight, product of:
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.018878758 = queryNorm
              0.7775997 = fieldWeight in 1916, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.078125 = fieldNorm(doc=1916)
          0.04155269 = weight(abstract_txt:documents in 1916) [ClassicSimilarity], result of:
            0.04155269 = score(doc=1916,freq=2.0), product of:
              0.09121093 = queryWeight, product of:
                1.1717263 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.018878758 = queryNorm
              0.455567 = fieldWeight in 1916, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=1916)
          0.03816228 = weight(abstract_txt:method in 1916) [ClassicSimilarity], result of:
            0.03816228 = score(doc=1916,freq=1.0), product of:
              0.108579285 = queryWeight, product of:
                1.278428 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.018878758 = queryNorm
              0.35146925 = fieldWeight in 1916, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.078125 = fieldNorm(doc=1916)
          0.21455231 = weight(abstract_txt:summarization in 1916) [ClassicSimilarity], result of:
            0.21455231 = score(doc=1916,freq=2.0), product of:
              0.27248102 = queryWeight, product of:
                2.025214 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.018878758 = queryNorm
              0.78740275 = fieldWeight in 1916, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.078125 = fieldNorm(doc=1916)
          0.118288964 = weight(abstract_txt:hierarchical in 1916) [ClassicSimilarity], result of:
            0.118288964 = score(doc=1916,freq=1.0), product of:
              0.26423115 = queryWeight, product of:
                2.4425328 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.018878758 = queryNorm
              0.4476723 = fieldWeight in 1916, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.078125 = fieldNorm(doc=1916)
        0.2 = coord(5/25)
    
  4. Xiong, S.; Ji, D.: Query-focused multi-document summarization using hypergraph-based ranking (2016) 0.10
    0.09644305 = sum of:
      0.09644305 = product of:
        0.48221526 = sum of:
          0.07305762 = weight(abstract_txt:summaries in 3972) [ClassicSimilarity], result of:
            0.07305762 = score(doc=3972,freq=1.0), product of:
              0.13286924 = queryWeight, product of:
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.018878758 = queryNorm
              0.549846 = fieldWeight in 3972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.078125 = fieldNorm(doc=3972)
          0.02938219 = weight(abstract_txt:documents in 3972) [ClassicSimilarity], result of:
            0.02938219 = score(doc=3972,freq=1.0), product of:
              0.09121093 = queryWeight, product of:
                1.1717263 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.018878758 = queryNorm
              0.32213452 = fieldWeight in 3972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=3972)
          0.046934158 = weight(abstract_txt:document in 3972) [ClassicSimilarity], result of:
            0.046934158 = score(doc=3972,freq=2.0), product of:
              0.098925166 = queryWeight, product of:
                1.2202706 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.018878758 = queryNorm
              0.47444102 = fieldWeight in 3972, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=3972)
          0.21455231 = weight(abstract_txt:summarization in 3972) [ClassicSimilarity], result of:
            0.21455231 = score(doc=3972,freq=2.0), product of:
              0.27248102 = queryWeight, product of:
                2.025214 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.018878758 = queryNorm
              0.78740275 = fieldWeight in 3972, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.078125 = fieldNorm(doc=3972)
          0.118288964 = weight(abstract_txt:hierarchical in 3972) [ClassicSimilarity], result of:
            0.118288964 = score(doc=3972,freq=1.0), product of:
              0.26423115 = queryWeight, product of:
                2.4425328 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.018878758 = queryNorm
              0.4476723 = fieldWeight in 3972, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.078125 = fieldNorm(doc=3972)
        0.2 = coord(5/25)
    
  5. Chang, Y.-W.: Influence of human behavior and the principle of least effort on library and information science research (2016) 0.10
    0.09644305 = sum of:
      0.09644305 = product of:
        0.48221526 = sum of:
          0.07305762 = weight(abstract_txt:summaries in 3973) [ClassicSimilarity], result of:
            0.07305762 = score(doc=3973,freq=1.0), product of:
              0.13286924 = queryWeight, product of:
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.018878758 = queryNorm
              0.549846 = fieldWeight in 3973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.078125 = fieldNorm(doc=3973)
          0.02938219 = weight(abstract_txt:documents in 3973) [ClassicSimilarity], result of:
            0.02938219 = score(doc=3973,freq=1.0), product of:
              0.09121093 = queryWeight, product of:
                1.1717263 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.018878758 = queryNorm
              0.32213452 = fieldWeight in 3973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=3973)
          0.046934158 = weight(abstract_txt:document in 3973) [ClassicSimilarity], result of:
            0.046934158 = score(doc=3973,freq=2.0), product of:
              0.098925166 = queryWeight, product of:
                1.2202706 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.018878758 = queryNorm
              0.47444102 = fieldWeight in 3973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=3973)
          0.21455231 = weight(abstract_txt:summarization in 3973) [ClassicSimilarity], result of:
            0.21455231 = score(doc=3973,freq=2.0), product of:
              0.27248102 = queryWeight, product of:
                2.025214 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.018878758 = queryNorm
              0.78740275 = fieldWeight in 3973, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.078125 = fieldNorm(doc=3973)
          0.118288964 = weight(abstract_txt:hierarchical in 3973) [ClassicSimilarity], result of:
            0.118288964 = score(doc=3973,freq=1.0), product of:
              0.26423115 = queryWeight, product of:
                2.4425328 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.018878758 = queryNorm
              0.4476723 = fieldWeight in 3973, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.078125 = fieldNorm(doc=3973)
        0.2 = coord(5/25)