Document (#789)

Author
Craven, T.C.
Title
Customized extracts based on Boolean queries and sentence dependency structures
Source
International classification. 16(1989), S.11-14
Year
1989
Abstract
A method is described for using Boolean queries in automatically deriving customized extracts from a text in which semantic dependencies between sentences have been coded. Each sentence in the structured text is treated as defining a separate extract. This extract consists of the sentence and all other sentences on which the sentence is directly or indirectly dependent for its meaning. Extracts from a text that satisfy a given Boolean query are merged to eliminate duplicate sentences. A prototype implementation of the method has been developed within an experimental text structure management system (TEXNET)
Object
TEXNET

Similar documents (author)

  1. Craven, T.C.: ¬An online index entry format based on multiple search terms (1987) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:craven in 437) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 437, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=437)
    
  2. Craven, T.C.: Adapting of string indexing systems for retrieval using proximity operators (1988) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:craven in 704) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 704, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=704)
    
  3. Craven, T.C.: Research in document classification and indexing (Canada) 1971-1980 (1981) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:craven in 1210) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 1210, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=1210)
    
  4. Craven, T.C.: NEPHIS: a nested phrase indexing system (1977) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:craven in 1332) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 1332, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=1332)
    
  5. Craven, T.C.: Changing technologies: impact on information: the case of string indexing (1985) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:craven in 1347) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 1347, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=1347)
    

Similar documents (content)

  1. Craven, T.C.: Condensed representation of sentences in graphic displays of text structures (1990) 0.19
    0.18842117 = sum of:
      0.18842117 = product of:
        0.9421059 = sum of:
          0.024221104 = weight(abstract_txt:been in 3869) [ClassicSimilarity], result of:
            0.024221104 = score(doc=3869,freq=1.0), product of:
              0.061268125 = queryWeight, product of:
                1.1666762 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.014529242 = queryNorm
              0.3953296 = fieldWeight in 3869, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.109375 = fieldNorm(doc=3869)
          0.22847554 = weight(abstract_txt:texnet in 3869) [ClassicSimilarity], result of:
            0.22847554 = score(doc=3869,freq=1.0), product of:
              0.21709764 = queryWeight, product of:
                1.5529076 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.014529242 = queryNorm
              1.0524092 = fieldWeight in 3869, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.109375 = fieldNorm(doc=3869)
          0.095729016 = weight(abstract_txt:text in 3869) [ClassicSimilarity], result of:
            0.095729016 = score(doc=3869,freq=2.0), product of:
              0.15315612 = queryWeight, product of:
                2.6086464 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.014529242 = queryNorm
              0.6250421 = fieldWeight in 3869, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.109375 = fieldNorm(doc=3869)
          0.2640214 = weight(abstract_txt:sentences in 3869) [ClassicSimilarity], result of:
            0.2640214 = score(doc=3869,freq=1.0), product of:
              0.34479567 = queryWeight, product of:
                3.3896868 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.014529242 = queryNorm
              0.76573294 = fieldWeight in 3869, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.109375 = fieldNorm(doc=3869)
          0.32965887 = weight(abstract_txt:sentence in 3869) [ClassicSimilarity], result of:
            0.32965887 = score(doc=3869,freq=1.0), product of:
              0.44003966 = queryWeight, product of:
                4.4217477 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.014529242 = queryNorm
              0.7491572 = fieldWeight in 3869, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.109375 = fieldNorm(doc=3869)
        0.2 = coord(5/25)
    
  2. Craven, T.C.: ¬A computer-aided abstracting tool kit (1993) 0.18
    0.17513676 = sum of:
      0.17513676 = product of:
        1.0946047 = sum of:
          0.22847554 = weight(abstract_txt:texnet in 6505) [ClassicSimilarity], result of:
            0.22847554 = score(doc=6505,freq=1.0), product of:
              0.21709764 = queryWeight, product of:
                1.5529076 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.014529242 = queryNorm
              1.0524092 = fieldWeight in 6505, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.109375 = fieldNorm(doc=6505)
          0.06769063 = weight(abstract_txt:text in 6505) [ClassicSimilarity], result of:
            0.06769063 = score(doc=6505,freq=1.0), product of:
              0.15315612 = queryWeight, product of:
                2.6086464 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.014529242 = queryNorm
              0.44197148 = fieldWeight in 6505, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.109375 = fieldNorm(doc=6505)
          0.33223042 = weight(abstract_txt:extracts in 6505) [ClassicSimilarity], result of:
            0.33223042 = score(doc=6505,freq=1.0), product of:
              0.401879 = queryWeight, product of:
                3.6595387 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.014529242 = queryNorm
              0.82669264 = fieldWeight in 6505, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.109375 = fieldNorm(doc=6505)
          0.46620807 = weight(abstract_txt:sentence in 6505) [ClassicSimilarity], result of:
            0.46620807 = score(doc=6505,freq=2.0), product of:
              0.44003966 = queryWeight, product of:
                4.4217477 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.014529242 = queryNorm
              1.0594683 = fieldWeight in 6505, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.109375 = fieldNorm(doc=6505)
        0.16 = coord(4/25)
    
  3. Agarwal, B.; Ramampiaro, H.; Langseth, H.; Ruocco, M.: ¬A deep network model for paraphrase detection in short text messages (2018) 0.15
    0.15153238 = sum of:
      0.15153238 = product of:
        0.7576619 = sum of:
          0.07922356 = weight(abstract_txt:dependency in 43) [ClassicSimilarity], result of:
            0.07922356 = score(doc=43,freq=1.0), product of:
              0.155605 = queryWeight, product of:
                1.3147095 = boost
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.014529242 = queryNorm
              0.50913244 = fieldWeight in 43, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.08409682 = weight(abstract_txt:extract in 43) [ClassicSimilarity], result of:
            0.08409682 = score(doc=43,freq=1.0), product of:
              0.20400949 = queryWeight, product of:
                2.1289146 = boost
                6.595522 = idf(docFreq=164, maxDocs=44421)
                0.014529242 = queryNorm
              0.41222012 = fieldWeight in 43, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.595522 = idf(docFreq=164, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.054702293 = weight(abstract_txt:text in 43) [ClassicSimilarity], result of:
            0.054702293 = score(doc=43,freq=2.0), product of:
              0.15315612 = queryWeight, product of:
                2.6086464 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.014529242 = queryNorm
              0.3571669 = fieldWeight in 43, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.21336152 = weight(abstract_txt:sentences in 43) [ClassicSimilarity], result of:
            0.21336152 = score(doc=43,freq=2.0), product of:
              0.34479567 = queryWeight, product of:
                3.3896868 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.014529242 = queryNorm
              0.61880565 = fieldWeight in 43, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
          0.32627767 = weight(abstract_txt:sentence in 43) [ClassicSimilarity], result of:
            0.32627767 = score(doc=43,freq=3.0), product of:
              0.44003966 = queryWeight, product of:
                4.4217477 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.014529242 = queryNorm
              0.7414733 = fieldWeight in 43, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.0625 = fieldNorm(doc=43)
        0.2 = coord(5/25)
    
  4. Ko, Y.; Park, J.; Seo, J.: Improving text categorization using the importance of sentences (2004) 0.14
    0.13968945 = sum of:
      0.13968945 = product of:
        0.6984472 = sum of:
          0.037743222 = weight(abstract_txt:method in 3557) [ClassicSimilarity], result of:
            0.037743222 = score(doc=3557,freq=2.0), product of:
              0.09491758 = queryWeight, product of:
                1.4521329 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.014529242 = queryNorm
              0.39764208 = fieldWeight in 3557, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=3557)
          0.08409682 = weight(abstract_txt:extract in 3557) [ClassicSimilarity], result of:
            0.08409682 = score(doc=3557,freq=1.0), product of:
              0.20400949 = queryWeight, product of:
                2.1289146 = boost
                6.595522 = idf(docFreq=164, maxDocs=44421)
                0.014529242 = queryNorm
              0.41222012 = fieldWeight in 3557, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.595522 = idf(docFreq=164, maxDocs=44421)
                0.0625 = fieldNorm(doc=3557)
          0.08649193 = weight(abstract_txt:text in 3557) [ClassicSimilarity], result of:
            0.08649193 = score(doc=3557,freq=5.0), product of:
              0.15315612 = queryWeight, product of:
                2.6086464 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.014529242 = queryNorm
              0.56473047 = fieldWeight in 3557, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=3557)
          0.30173877 = weight(abstract_txt:sentences in 3557) [ClassicSimilarity], result of:
            0.30173877 = score(doc=3557,freq=4.0), product of:
              0.34479567 = queryWeight, product of:
                3.3896868 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.014529242 = queryNorm
              0.8751234 = fieldWeight in 3557, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0625 = fieldNorm(doc=3557)
          0.1883765 = weight(abstract_txt:sentence in 3557) [ClassicSimilarity], result of:
            0.1883765 = score(doc=3557,freq=1.0), product of:
              0.44003966 = queryWeight, product of:
                4.4217477 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.014529242 = queryNorm
              0.42808983 = fieldWeight in 3557, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.0625 = fieldNorm(doc=3557)
        0.2 = coord(5/25)
    
  5. Ou, S.; Khoo, S.G.; Goh, D.H.: Automatic multidocument summarization of research abstracts : design and user evaluation (2007) 0.13
    0.12711595 = sum of:
      0.12711595 = product of:
        0.7944747 = sum of:
          0.04622582 = weight(abstract_txt:method in 1522) [ClassicSimilarity], result of:
            0.04622582 = score(doc=1522,freq=3.0), product of:
              0.09491758 = queryWeight, product of:
                1.4521329 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.014529242 = queryNorm
              0.4870101 = fieldWeight in 1522, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=1522)
          0.21336152 = weight(abstract_txt:sentences in 1522) [ClassicSimilarity], result of:
            0.21336152 = score(doc=1522,freq=2.0), product of:
              0.34479567 = queryWeight, product of:
                3.3896868 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.014529242 = queryNorm
              0.61880565 = fieldWeight in 1522, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0625 = fieldNorm(doc=1522)
          0.2684827 = weight(abstract_txt:extracts in 1522) [ClassicSimilarity], result of:
            0.2684827 = score(doc=1522,freq=2.0), product of:
              0.401879 = queryWeight, product of:
                3.6595387 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.014529242 = queryNorm
              0.6680685 = fieldWeight in 1522, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.0625 = fieldNorm(doc=1522)
          0.26640463 = weight(abstract_txt:sentence in 1522) [ClassicSimilarity], result of:
            0.26640463 = score(doc=1522,freq=2.0), product of:
              0.44003966 = queryWeight, product of:
                4.4217477 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.014529242 = queryNorm
              0.60541046 = fieldWeight in 1522, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.0625 = fieldNorm(doc=1522)
        0.16 = coord(4/25)