Document (#35982)

Author
Wang, W.
Hwang, D.
Title
Abstraction Assistant : an automatic text abstraction system
Source
Journal of the American Society for Information Science and Technology. 61(2010) no.9, pp. 1790-1799
Year
2010
Abstract
In the interest of standardization and quality assurance, it is desirable for authors and staff of access services to follow the American National Standards Institute (ANSI) guidelines in preparing abstracts. Using a statistical approach, an extraction system (the Abstraction Assistant) was developed to generate informative abstracts that meet the ANSI guidelines for structural content elements. System performance is evaluated by comparing the system-generated abstracts with the authors' original abstracts and with manually enhanced system abstracts on three criteria: balance (satisfaction of the ANSI standards), fluency (text coherence), and understandability (clarity). The results suggest that the system output can be used directly without manual modification, but several issues need to be addressed in further studies to make the system a better tool.
Theme
Automatic abstracting

Similar documents (author)

  1. Wang, H.; Wang, C.: Ontologies for universal information systems (1995) 4.62
    4.6221313 = sum of:
      4.6221313 = weight(author_txt:wang in 3262) [ClassicSimilarity], result of:
        4.6221313 = score(doc=3262,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.15298282 = queryNorm
          4.622132 = fieldWeight in 3262, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.5 = fieldNorm(doc=3262)
    
  2. Wang, F.; Wang, X.: Tracing theory diffusion : a text mining and citation-based analysis of TAM (2020) 4.62
    4.6221313 = sum of:
      4.6221313 = weight(author_txt:wang in 980) [ClassicSimilarity], result of:
        4.6221313 = score(doc=980,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.15298282 = queryNorm
          4.622132 = fieldWeight in 980, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.5 = fieldNorm(doc=980)
    
  3. Wang, C.: The online catalogue, subject access and user reactions : a review (1985) 4.09
    4.0854254 = sum of:
      4.0854254 = weight(author_txt:wang in 985) [ClassicSimilarity], result of:
        4.0854254 = score(doc=985,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.15298282 = queryNorm
          4.085426 = fieldWeight in 985, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.625 = fieldNorm(doc=985)
    
  4. Wang, C.: Bibliometrics : a textbook (1990) 4.09
    4.0854254 = sum of:
      4.0854254 = weight(author_txt:wang in 5108) [ClassicSimilarity], result of:
        4.0854254 = score(doc=5108,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.15298282 = queryNorm
          4.085426 = fieldWeight in 5108, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.625 = fieldNorm(doc=5108)
    
  5. Wang, P.: Users' information needs at different stages of a research project : a cognitive view (1997) 4.09
    4.0854254 = sum of:
      4.0854254 = weight(author_txt:wang in 1320) [ClassicSimilarity], result of:
        4.0854254 = score(doc=1320,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.15298282 = queryNorm
          4.085426 = fieldWeight in 1320, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.5366817 = idf(docFreq=174, maxDocs=44421)
            0.625 = fieldNorm(doc=1320)
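
The author-match explanations above follow the pattern of Lucene's ClassicSimilarity (TF-IDF). As a minimal sketch, assuming the standard ClassicSimilarity formulas tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)) and queryNorm = 1 / sqrt(sum of squared query weights), the Python below recomputes the listed scores from the values printed in the trees. The function names are illustrative and not part of Lucene; fieldNorm is taken as given from the listing rather than recomputed.

```python
import math


def idf(doc_freq, max_docs):
    # Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def single_term_query_norm(doc_freq, max_docs):
    # For a one-term query with boost 1, the sum of squared weights is idf^2,
    # so queryNorm reduces to 1 / idf.
    return 1.0 / math.sqrt(idf(doc_freq, max_docs) ** 2)


def term_score(freq, doc_freq, max_docs, field_norm):
    # score = queryWeight * fieldWeight, as in the explanation trees.
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * single_term_query_norm(doc_freq, max_docs)
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# Values copied from the listing: author_txt:wang, docFreq=174, maxDocs=44421.
two_hits = term_score(freq=2.0, doc_freq=174, max_docs=44421, field_norm=0.5)
one_hit = term_score(freq=1.0, doc_freq=174, max_docs=44421, field_norm=0.625)
print(round(two_hits, 4), round(one_hit, 4))
```

Under these assumptions, a record with two matching authors and a smaller fieldNorm (0.5) still outranks a record with one match and a larger fieldNorm (0.625), which is the ordering the list shows.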
    

Similar documents (content)

  1. Tenopir, C.; Jacsó, P.: Quality of abstracts (1993) 0.19
    0.1910272 = sum of:
      0.1910272 = product of:
        1.19392 = sum of:
          0.10725404 = weight(abstract_txt:informative in 5025) [ClassicSimilarity], result of:
            0.10725404 = score(doc=5025,freq=2.0), product of:
              0.11431659 = queryWeight, product of:
                1.1494739 = boost
                7.0764947 = idf(docFreq=101, maxDocs=44421)
                0.0140537415 = queryNorm
              0.9382194 = fieldWeight in 5025, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0764947 = idf(docFreq=101, maxDocs=44421)
                0.09375 = fieldNorm(doc=5025)
          0.04738832 = weight(abstract_txt:standards in 5025) [ClassicSimilarity], result of:
            0.04738832 = score(doc=5025,freq=1.0), product of:
              0.10526912 = queryWeight, product of:
                1.5599475 = boost
                4.8017445 = idf(docFreq=991, maxDocs=44421)
                0.0140537415 = queryNorm
              0.45016354 = fieldWeight in 5025, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8017445 = idf(docFreq=991, maxDocs=44421)
                0.09375 = fieldNorm(doc=5025)
          0.5322823 = weight(abstract_txt:ansi in 5025) [ClassicSimilarity], result of:
            0.5322823 = score(doc=5025,freq=2.0), product of:
              0.47969872 = queryWeight, product of:
                4.078396 = boost
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.0140537415 = queryNorm
              1.109618 = fieldWeight in 5025, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.09375 = fieldNorm(doc=5025)
          0.5069954 = weight(abstract_txt:abstracts in 5025) [ClassicSimilarity], result of:
            0.5069954 = score(doc=5025,freq=5.0), product of:
              0.4056761 = queryWeight, product of:
                4.8419375 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.0140537415 = queryNorm
              1.2497541 = fieldWeight in 5025, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.09375 = fieldNorm(doc=5025)
        0.16 = coord(4/25)
    
  2. Tibbo, H.R.: Abstracting across the disciplines : a content analysis of abstracts for the natural sciences, the social sciences, and the humanities with implications for abstracting standards and online information retrieval (1992) 0.12
    0.12465783 = sum of:
      0.12465783 = product of:
        1.0388153 = sum of:
          0.10943863 = weight(abstract_txt:standards in 2535) [ClassicSimilarity], result of:
            0.10943863 = score(doc=2535,freq=3.0), product of:
              0.10526912 = queryWeight, product of:
                1.5599475 = boost
                4.8017445 = idf(docFreq=991, maxDocs=44421)
                0.0140537415 = queryNorm
              1.0396081 = fieldWeight in 2535, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.8017445 = idf(docFreq=991, maxDocs=44421)
                0.125 = fieldNorm(doc=2535)
          0.5018406 = weight(abstract_txt:ansi in 2535) [ClassicSimilarity], result of:
            0.5018406 = score(doc=2535,freq=1.0), product of:
              0.47969872 = queryWeight, product of:
                4.078396 = boost
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.0140537415 = queryNorm
              1.0461578 = fieldWeight in 2535, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.125 = fieldNorm(doc=2535)
          0.42753598 = weight(abstract_txt:abstracts in 2535) [ClassicSimilarity], result of:
            0.42753598 = score(doc=2535,freq=2.0), product of:
              0.4056761 = queryWeight, product of:
                4.8419375 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.0140537415 = queryNorm
              1.0538851 = fieldWeight in 2535, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.125 = fieldNorm(doc=2535)
        0.12 = coord(3/25)
    
  3. Goh, A.; Hui, S.C.: TES: a text extraction system (1996) 0.11
    0.10705643 = sum of:
      0.10705643 = product of:
        0.53528214 = sum of:
          0.083833076 = weight(abstract_txt:extraction in 6667) [ClassicSimilarity], result of:
            0.083833076 = score(doc=6667,freq=2.0), product of:
              0.08752777 = queryWeight, product of:
                1.0058134 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0140537415 = queryNorm
              0.95778835 = fieldWeight in 6667, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.109375 = fieldNorm(doc=6667)
          0.073268846 = weight(abstract_txt:manually in 6667) [ClassicSimilarity], result of:
            0.073268846 = score(doc=6667,freq=1.0), product of:
              0.100807264 = queryWeight, product of:
                1.0794199 = boost
                6.6452217 = idf(docFreq=156, maxDocs=44421)
                0.0140537415 = queryNorm
              0.7268211 = fieldWeight in 6667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6452217 = idf(docFreq=156, maxDocs=44421)
                0.109375 = fieldNorm(doc=6667)
          0.046597704 = weight(abstract_txt:text in 6667) [ClassicSimilarity], result of:
            0.046597704 = score(doc=6667,freq=2.0), product of:
              0.07455131 = queryWeight, product of:
                1.3127654 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0140537415 = queryNorm
              0.6250421 = fieldWeight in 6667, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.109375 = fieldNorm(doc=6667)
          0.06705808 = weight(abstract_txt:system in 6667) [ClassicSimilarity], result of:
            0.06705808 = score(doc=6667,freq=1.0), product of:
              0.18177982 = queryWeight, product of:
                3.8350084 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0140537415 = queryNorm
              0.36889726 = fieldWeight in 6667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.109375 = fieldNorm(doc=6667)
          0.26452443 = weight(abstract_txt:abstracts in 6667) [ClassicSimilarity], result of:
            0.26452443 = score(doc=6667,freq=1.0), product of:
              0.4056761 = queryWeight, product of:
                4.8419375 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.0140537415 = queryNorm
              0.6520582 = fieldWeight in 6667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.109375 = fieldNorm(doc=6667)
        0.2 = coord(5/25)
    
  4. Goh, A.; Hui, S.C.; Chan, S.K.: A text extraction system for news reports (1996) 0.10
    0.10205741 = sum of:
      0.10205741 = product of:
        0.51028705 = sum of:
          0.07574385 = weight(abstract_txt:extraction in 6669) [ClassicSimilarity], result of:
            0.07574385 = score(doc=6669,freq=5.0), product of:
              0.08752777 = queryWeight, product of:
                1.0058134 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0140537415 = queryNorm
              0.8653694 = fieldWeight in 6669, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0625 = fieldNorm(doc=6669)
          0.05921017 = weight(abstract_txt:manually in 6669) [ClassicSimilarity], result of:
            0.05921017 = score(doc=6669,freq=2.0), product of:
              0.100807264 = queryWeight, product of:
                1.0794199 = boost
                6.6452217 = idf(docFreq=156, maxDocs=44421)
                0.0140537415 = queryNorm
              0.58736014 = fieldWeight in 6669, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6452217 = idf(docFreq=156, maxDocs=44421)
                0.0625 = fieldNorm(doc=6669)
          0.018828316 = weight(abstract_txt:text in 6669) [ClassicSimilarity], result of:
            0.018828316 = score(doc=6669,freq=1.0), product of:
              0.07455131 = queryWeight, product of:
                1.3127654 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0140537415 = queryNorm
              0.25255513 = fieldWeight in 6669, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=6669)
          0.05419111 = weight(abstract_txt:system in 6669) [ClassicSimilarity], result of:
            0.05419111 = score(doc=6669,freq=2.0), product of:
              0.18177982 = queryWeight, product of:
                3.8350084 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0140537415 = queryNorm
              0.298114 = fieldWeight in 6669, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0625 = fieldNorm(doc=6669)
          0.30231363 = weight(abstract_txt:abstracts in 6669) [ClassicSimilarity], result of:
            0.30231363 = score(doc=6669,freq=4.0), product of:
              0.4056761 = queryWeight, product of:
                4.8419375 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.0140537415 = queryNorm
              0.74520934 = fieldWeight in 6669, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.0625 = fieldNorm(doc=6669)
        0.2 = coord(5/25)
    
  5. Fidel, R.: Writing abstracts for free-text searching (1986) 0.10
    0.09955835 = sum of:
      0.09955835 = product of:
        0.6222397 = sum of:
          0.07584007 = weight(abstract_txt:informative in 683) [ClassicSimilarity], result of:
            0.07584007 = score(doc=683,freq=1.0), product of:
              0.11431659 = queryWeight, product of:
                1.1494739 = boost
                7.0764947 = idf(docFreq=101, maxDocs=44421)
                0.0140537415 = queryNorm
              0.6634214 = fieldWeight in 683, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0764947 = idf(docFreq=101, maxDocs=44421)
                0.09375 = fieldNorm(doc=683)
          0.039940886 = weight(abstract_txt:text in 683) [ClassicSimilarity], result of:
            0.039940886 = score(doc=683,freq=2.0), product of:
              0.07455131 = queryWeight, product of:
                1.3127654 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0140537415 = queryNorm
              0.5357503 = fieldWeight in 683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=683)
          0.11374187 = weight(abstract_txt:guidelines in 683) [ClassicSimilarity], result of:
            0.11374187 = score(doc=683,freq=2.0), product of:
              0.14978111 = queryWeight, product of:
                1.8607498 = boost
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.0140537415 = queryNorm
              0.75938725 = fieldWeight in 683, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.09375 = fieldNorm(doc=683)
          0.39271688 = weight(abstract_txt:abstracts in 683) [ClassicSimilarity], result of:
            0.39271688 = score(doc=683,freq=3.0), product of:
              0.4056761 = queryWeight, product of:
                4.8419375 = boost
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.0140537415 = queryNorm
              0.96805525 = fieldWeight in 683, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.09375 = fieldNorm(doc=683)
        0.16 = coord(4/25)
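
The content-match explanations add two factors to the per-term score: a per-term boost inside queryWeight, and a coord factor (matching query terms divided by total query terms) applied to the sum. The sketch below, again assuming the standard ClassicSimilarity formulas, recomputes the first content match (doc=5025) from the values printed in its tree. The helper names and the tuple layout are illustrative only; queryNorm and fieldNorm are copied from the listing rather than derived.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.0140537415  # copied from the listing


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def boosted_term_score(freq, doc_freq, boost, field_norm):
    # queryWeight = boost * idf * queryNorm; fieldWeight = tf * idf * fieldNorm.
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm, total_query_terms):
    # Sum the matching terms, then scale by coord = matched / total.
    total = sum(boosted_term_score(f, df, b, field_norm) for f, df, b in terms)
    coord = len(terms) / total_query_terms
    return total * coord


# (freq, docFreq, boost) for the four terms matched in doc 5025.
doc_5025_terms = [
    (2.0, 101, 1.1494739),  # informative
    (1.0, 991, 1.5599475),  # standards
    (2.0, 27, 4.078396),    # ansi
    (5.0, 310, 4.8419375),  # abstracts
]
score_5025 = document_score(doc_5025_terms, field_norm=0.09375, total_query_terms=25)
print(round(score_5025, 4))
```

Because coord scales with the number of matched terms, a record matching five of the 25 query terms (coord 0.2) can still score below one matching four (coord 0.16) when the four matched terms are rarer and more heavily boosted, as with "ansi" here.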