Document (#6683)

Author
Rau, L.F.
Jacobs, P.S.
Zernik, U.
Title
Information extraction and text summarization using linguistic knowledge acquisition
Source
Information processing and management. 25(1989) no.4, p.419-428
Year
1989
Abstract
Storing and accessing texts in a conceptual format has a number of advantages over traditional document retrieval methods. A conceptual format facilitates natural language access to text information. It can support imprecise and inexact queries, conceptual information summarisation, and, ultimately, document translation. Describes 2 methods which have been implemented in a prototype intelligent information retrieval system called SCISOR (System for Conceptual Information Summarisation, Organization and Retrieval). Describes the text processing, language acquisition, and summarisation components of SCISOR.
Theme
Computerlinguistik
Object
SCISOR

Similar documents (author)

  1. Jacobs, M.: Criteria for evaluating alternative MEDLINE search engines (1998) 5.41
    5.4105906 = sum of:
      5.4105906 = weight(author_txt:jacobs in 4264) [ClassicSimilarity], result of:
        5.4105906 = fieldWeight in 4264, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.656945 = idf(docFreq=20, maxDocs=44421)
          0.625 = fieldNorm(doc=4264)
    
  2. Jacobs, E.H.: Buying into classes : the practice of book selection in eighteenth-Century Britain (1999) 5.41
    5.4105906 = sum of:
      5.4105906 = weight(author_txt:jacobs in 154) [ClassicSimilarity], result of:
        5.4105906 = fieldWeight in 154, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.656945 = idf(docFreq=20, maxDocs=44421)
          0.625 = fieldNorm(doc=154)
    
  3. Jacobs, C.: If a picture is worth a thousand words, then ... (1999) 5.41
    5.4105906 = sum of:
      5.4105906 = weight(author_txt:jacobs in 321) [ClassicSimilarity], result of:
        5.4105906 = fieldWeight in 321, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.656945 = idf(docFreq=20, maxDocs=44421)
          0.625 = fieldNorm(doc=321)
    
  4. Jacobs, N.: Information technology and interests in scholarly communication : a discourse analysis (2001) 5.41
    5.4105906 = sum of:
      5.4105906 = weight(author_txt:jacobs in 848) [ClassicSimilarity], result of:
        5.4105906 = fieldWeight in 848, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.656945 = idf(docFreq=20, maxDocs=44421)
          0.625 = fieldNorm(doc=848)
    
  5. Jacobs, I.: From chaos, order: W3C standard helps organize knowledge : SKOS Connects Diverse Knowledge Organization Systems to Linked Data (2009) 5.41
    5.4105906 = sum of:
      5.4105906 = weight(author_txt:jacobs in 49) [ClassicSimilarity], result of:
        5.4105906 = fieldWeight in 49, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.656945 = idf(docFreq=20, maxDocs=44421)
          0.625 = fieldNorm(doc=49)
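
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity definitions (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))); the function names are illustrative and are not part of any library API. The fieldNorm value is taken from the listing as given, since it depends on index-time field length and encoding.

```python
import math

def classic_tf(freq):
    # Assumed ClassicSimilarity tf: square root of the raw term frequency.
    return math.sqrt(freq)

def classic_idf(doc_freq, max_docs):
    # Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as in the explanation tree.
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values copied from the author_txt:jacobs entries above.
author_idf = classic_idf(doc_freq=20, max_docs=44421)
author_score = field_weight(freq=1.0, doc_freq=20, max_docs=44421, field_norm=0.625)

print(f"idf   = {author_idf:.6f}")    # close to the 8.656945 reported above
print(f"score = {author_score:.6f}")  # close to the 5.4105906 reported above
```

All five author matches share the same score because each has termFreq=1.0, the same idf, and the same fieldNorm of 0.625.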
    

Similar documents (content)

  1. Salton, G.: Automatic text structuring and summarization (1997) 0.27
    0.2742651 = sum of:
      0.2742651 = product of:
        1.3713255 = sum of:
          0.062524855 = weight(abstract_txt:extraction in 1145) [ClassicSimilarity], result of:
            0.062524855 = score(doc=1145,freq=1.0), product of:
              0.107707255 = queryWeight, product of:
                1.0737883 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.01619906 = queryNorm
              0.5805074 = fieldWeight in 1145, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.09375 = fieldNorm(doc=1145)
          0.03746853 = weight(abstract_txt:methods in 1145) [ClassicSimilarity], result of:
            0.03746853 = score(doc=1145,freq=1.0), product of:
              0.09645637 = queryWeight, product of:
                1.4370657 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.01619906 = queryNorm
              0.38845056 = fieldWeight in 1145, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.09375 = fieldNorm(doc=1145)
          0.08341321 = weight(abstract_txt:document in 1145) [ClassicSimilarity], result of:
            0.08341321 = score(doc=1145,freq=4.0), product of:
              0.10359919 = queryWeight, product of:
                1.4893246 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.01619906 = queryNorm
              0.80515313 = fieldWeight in 1145, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.09375 = fieldNorm(doc=1145)
          0.11656766 = weight(abstract_txt:text in 1145) [ClassicSimilarity], result of:
            0.11656766 = score(doc=1145,freq=5.0), product of:
              0.1376086 = queryWeight, product of:
                2.102227 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.01619906 = queryNorm
              0.8470957 = fieldWeight in 1145, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=1145)
          1.0713513 = weight(abstract_txt:summarisation in 1145) [ClassicSimilarity], result of:
            1.0713513 = score(doc=1145,freq=3.0), product of:
              0.71586496 = queryWeight, product of:
                4.794821 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.01619906 = queryNorm
              1.496583 = fieldWeight in 1145, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.09375 = fieldNorm(doc=1145)
        0.2 = coord(5/25)
    
  2. Szlávik, Z.; Tombros, A.; Lalmas, M.: Summarisation of the logical structure of XML documents (2012) 0.22
    0.22155304 = sum of:
      0.22155304 = product of:
        1.1077652 = sum of:
          0.03532567 = weight(abstract_txt:methods in 3731) [ClassicSimilarity], result of:
            0.03532567 = score(doc=3731,freq=2.0), product of:
              0.09645637 = queryWeight, product of:
                1.4370657 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.01619906 = queryNorm
              0.3662347 = fieldWeight in 3731, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=3731)
          0.027804403 = weight(abstract_txt:document in 3731) [ClassicSimilarity], result of:
            0.027804403 = score(doc=3731,freq=1.0), product of:
              0.10359919 = queryWeight, product of:
                1.4893246 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.01619906 = queryNorm
              0.26838437 = fieldWeight in 3731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=3731)
          0.022130948 = weight(abstract_txt:retrieval in 3731) [ClassicSimilarity], result of:
            0.022130948 = score(doc=3731,freq=1.0), product of:
              0.10185392 = queryWeight, product of:
                1.8086132 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.01619906 = queryNorm
              0.21728125 = fieldWeight in 3731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=3731)
          0.012424402 = weight(abstract_txt:information in 3731) [ClassicSimilarity], result of:
            0.012424402 = score(doc=3731,freq=1.0), product of:
              0.08218218 = queryWeight, product of:
                2.0973456 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01619906 = queryNorm
              0.15118122 = fieldWeight in 3731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=3731)
          1.0100797 = weight(abstract_txt:summarisation in 3731) [ClassicSimilarity], result of:
            1.0100797 = score(doc=3731,freq=6.0), product of:
              0.71586496 = queryWeight, product of:
                4.794821 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.01619906 = queryNorm
              1.410992 = fieldWeight in 3731, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=3731)
        0.2 = coord(5/25)
    
  3. Huo, W.: Automatic multi-word term extraction and its application to Web-page summarization (2012) 0.20
    0.19809696 = sum of:
      0.19809696 = product of:
        0.4952424 = sum of:
          0.051699582 = weight(abstract_txt:translation in 1563) [ClassicSimilarity], result of:
            0.051699582 = score(doc=1563,freq=1.0), product of:
              0.10714914 = queryWeight, product of:
                1.0710026 = boost
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.01619906 = queryNorm
              0.48250115 = fieldWeight in 1563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.176015 = idf(docFreq=250, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
          0.09024686 = weight(abstract_txt:extraction in 1563) [ClassicSimilarity], result of:
            0.09024686 = score(doc=1563,freq=3.0), product of:
              0.107707255 = queryWeight, product of:
                1.0737883 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.01619906 = queryNorm
              0.83789027 = fieldWeight in 1563, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
          0.016840467 = weight(abstract_txt:system in 1563) [ClassicSimilarity], result of:
            0.016840467 = score(doc=1563,freq=1.0), product of:
              0.06391116 = queryWeight, product of:
                1.1697675 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.01619906 = queryNorm
              0.26349807 = fieldWeight in 1563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
          0.13759364 = weight(abstract_txt:summarization in 1563) [ClassicSimilarity], result of:
            0.13759364 = score(doc=1563,freq=3.0), product of:
              0.1426776 = queryWeight, product of:
                1.2358737 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.01619906 = queryNorm
              0.9643675 = fieldWeight in 1563, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
          0.031223776 = weight(abstract_txt:methods in 1563) [ClassicSimilarity], result of:
            0.031223776 = score(doc=1563,freq=1.0), product of:
              0.09645637 = queryWeight, product of:
                1.4370657 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.01619906 = queryNorm
              0.3237088 = fieldWeight in 1563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
          0.031849947 = weight(abstract_txt:language in 1563) [ClassicSimilarity], result of:
            0.031849947 = score(doc=1563,freq=1.0), product of:
              0.09774167 = queryWeight, product of:
                1.4466087 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.01619906 = queryNorm
              0.3258584 = fieldWeight in 1563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
          0.049151707 = weight(abstract_txt:document in 1563) [ClassicSimilarity], result of:
            0.049151707 = score(doc=1563,freq=2.0), product of:
              0.10359919 = queryWeight, product of:
                1.4893246 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.01619906 = queryNorm
              0.47444102 = fieldWeight in 1563, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
          0.027663684 = weight(abstract_txt:retrieval in 1563) [ClassicSimilarity], result of:
            0.027663684 = score(doc=1563,freq=1.0), product of:
              0.10185392 = queryWeight, product of:
                1.8086132 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.01619906 = queryNorm
              0.27160156 = fieldWeight in 1563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
          0.015530502 = weight(abstract_txt:information in 1563) [ClassicSimilarity], result of:
            0.015530502 = score(doc=1563,freq=1.0), product of:
              0.08218218 = queryWeight, product of:
                2.0973456 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01619906 = queryNorm
              0.18897653 = fieldWeight in 1563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
          0.043442197 = weight(abstract_txt:text in 1563) [ClassicSimilarity], result of:
            0.043442197 = score(doc=1563,freq=1.0), product of:
              0.1376086 = queryWeight, product of:
                2.102227 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.01619906 = queryNorm
              0.3156939 = fieldWeight in 1563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
        0.4 = coord(10/25)
    
  4. Sweeney, S.; Crestani, F.; Losada, D.E.: 'Show me more' : incremental length summarisation using novelty detection (2008) 0.18
    0.17857337 = sum of:
      0.17857337 = product of:
        0.89286685 = sum of:
          0.046092793 = weight(abstract_txt:accessing in 3054) [ClassicSimilarity], result of:
            0.046092793 = score(doc=3054,freq=1.0), product of:
              0.11517529 = queryWeight, product of:
                1.1103908 = boost
                6.40315 = idf(docFreq=199, maxDocs=44421)
                0.01619906 = queryNorm
              0.40019688 = fieldWeight in 3054, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.40315 = idf(docFreq=199, maxDocs=44421)
                0.0625 = fieldNorm(doc=3054)
          0.055608805 = weight(abstract_txt:document in 3054) [ClassicSimilarity], result of:
            0.055608805 = score(doc=3054,freq=4.0), product of:
              0.10359919 = queryWeight, product of:
                1.4893246 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.01619906 = queryNorm
              0.53676873 = fieldWeight in 3054, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=3054)
          0.027781809 = weight(abstract_txt:information in 3054) [ClassicSimilarity], result of:
            0.027781809 = score(doc=3054,freq=5.0), product of:
              0.08218218 = queryWeight, product of:
                2.0973456 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01619906 = queryNorm
              0.3380515 = fieldWeight in 3054, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=3054)
          0.049149238 = weight(abstract_txt:text in 3054) [ClassicSimilarity], result of:
            0.049149238 = score(doc=3054,freq=2.0), product of:
              0.1376086 = queryWeight, product of:
                2.102227 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.01619906 = queryNorm
              0.3571669 = fieldWeight in 3054, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=3054)
          0.71423423 = weight(abstract_txt:summarisation in 3054) [ClassicSimilarity], result of:
            0.71423423 = score(doc=3054,freq=3.0), product of:
              0.71586496 = queryWeight, product of:
                4.794821 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.01619906 = queryNorm
              0.997722 = fieldWeight in 3054, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=3054)
        0.2 = coord(5/25)
    
  5. Lihui, C.; Lian, C.W.: Using Web structure and summarisation techniques for Web content mining (2005) 0.18
    0.17799626 = sum of:
      0.17799626 = product of:
        0.7416511 = sum of:
          0.036496945 = weight(abstract_txt:prototype in 2046) [ClassicSimilarity], result of:
            0.036496945 = score(doc=2046,freq=1.0), product of:
              0.098576866 = queryWeight, product of:
                1.0272678 = boost
                5.9238153 = idf(docFreq=322, maxDocs=44421)
                0.01619906 = queryNorm
              0.37023845 = fieldWeight in 2046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9238153 = idf(docFreq=322, maxDocs=44421)
                0.0625 = fieldNorm(doc=2046)
          0.042961333 = weight(abstract_txt:intelligent in 2046) [ClassicSimilarity], result of:
            0.042961333 = score(doc=2046,freq=1.0), product of:
              0.10989784 = queryWeight, product of:
                1.0846528 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.01619906 = queryNorm
              0.39092064 = fieldWeight in 2046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.0625 = fieldNorm(doc=2046)
          0.039321363 = weight(abstract_txt:document in 2046) [ClassicSimilarity], result of:
            0.039321363 = score(doc=2046,freq=2.0), product of:
              0.10359919 = queryWeight, product of:
                1.4893246 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.01619906 = queryNorm
              0.3795528 = fieldWeight in 2046, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=2046)
          0.022130948 = weight(abstract_txt:retrieval in 2046) [ClassicSimilarity], result of:
            0.022130948 = score(doc=2046,freq=1.0), product of:
              0.10185392 = queryWeight, product of:
                1.8086132 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.01619906 = queryNorm
              0.21728125 = fieldWeight in 2046, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=2046)
          0.017570758 = weight(abstract_txt:information in 2046) [ClassicSimilarity], result of:
            0.017570758 = score(doc=2046,freq=2.0), product of:
              0.08218218 = queryWeight, product of:
                2.0973456 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.01619906 = queryNorm
              0.21380253 = fieldWeight in 2046, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=2046)
          0.58316976 = weight(abstract_txt:summarisation in 2046) [ClassicSimilarity], result of:
            0.58316976 = score(doc=2046,freq=2.0), product of:
              0.71586496 = queryWeight, product of:
                4.794821 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.01619906 = queryNorm
              0.8146366 = fieldWeight in 2046, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0625 = fieldNorm(doc=2046)
        0.24 = coord(6/25)
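
The content scores combine several matched terms. As a rough sketch under the same assumed ClassicSimilarity definitions, the total for the first entry above (doc=1145) can be recomputed as coord times the sum of queryWeight * fieldWeight over the matched terms. The boost, freq, docFreq, queryNorm, and fieldNorm values are copied from the listing; queryNorm in particular is taken as given because it depends on all 25 query terms, not only the ones that matched. The helper names are hypothetical.

```python
import math

def idf(doc_freq, max_docs):
    # Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost, freq, doc_freq, max_docs, query_norm, field_norm):
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm          # queryWeight
    field_weight = math.sqrt(freq) * term_idf * field_norm  # fieldWeight
    return query_weight * field_weight

MAX_DOCS = 44421
QUERY_NORM = 0.01619906   # copied from the listing, not recomputed
FIELD_NORM = 0.09375      # fieldNorm(doc=1145)

# term: (boost, freq, docFreq), copied from the doc=1145 explanation.
matched_terms = {
    "extraction": (1.0737883, 1.0, 246),
    "methods": (1.4370657, 1.0, 1915),
    "document": (1.4893246, 4.0, 1647),
    "text": (2.102227, 5.0, 2122),
    "summarisation": (4.794821, 3.0, 11),
}

per_term = {
    term: term_score(boost, freq, df, MAX_DOCS, QUERY_NORM, FIELD_NORM)
    for term, (boost, freq, df) in matched_terms.items()
}

# coord(5/25): 5 of the 25 query terms matched this document.
coord = len(matched_terms) / 25
total = coord * sum(per_term.values())

for term, value in per_term.items():
    print(f"{term:15s} {value:.6f}")
print(f"total = {total:.7f}")  # close to the 0.2742651 reported above
```

The rare term "summarisation" (docFreq=11) dominates the total, which is why the four entries that contain it outrank or match the third entry, which matches twice as many query terms (coord 10/25) but only contains the more common American spelling "summarization".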