Document (#16154)

Author
Ahonen, H.
Title
Automatic generation of SGML content models
Source
Electronic publishing. 8(1995) nos.2/3, S.195-206
Year
1996
Abstract
Examines the problem of the automatic generation of a document type definition (DTD) for a set of SGML documents. Describe various situations where documents have been tagged and not DTD is available, and discusses the requirements of various applications with respect to the generation process. Presents an automatic DTD generation tool that can be adjusted for the several tasks necessary in the application. Describes some experimental studies to illustrate how this method can be used to satisfy the needs of varying applications
Content
Paper presented at EP'96: the Electronic Publishing, Document Manipulation and Typography Conference, held in Palo Alto, CA, 24-26 Sep 96
Theme
Elektronisches Publizieren
Object
SGML
DTD

Similar documents (content)

  1. O'Connor, M.A.: Markup, SGML, and hypertext for full-text databases : pt.2 (1992) 0.15
    0.14594704 = sum of:
      0.14594704 = product of:
        0.912169 = sum of:
          0.06922673 = weight(abstract_txt:application in 5917) [ClassicSimilarity], result of:
            0.06922673 = score(doc=5917,freq=2.0), product of:
              0.085589714 = queryWeight, product of:
                4.5753803 = idf(docFreq=1243, maxDocs=44421)
                0.018706579 = queryNorm
              0.8088206 = fieldWeight in 5917, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5753803 = idf(docFreq=1243, maxDocs=44421)
                0.125 = fieldNorm(doc=5917)
          0.22955887 = weight(abstract_txt:tagged in 5917) [ClassicSimilarity], result of:
            0.22955887 = score(doc=5917,freq=1.0), product of:
              0.23979774 = queryWeight, product of:
                1.6738316 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.018706579 = queryNorm
              0.95730203 = fieldWeight in 5917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.125 = fieldNorm(doc=5917)
          0.0716554 = weight(abstract_txt:documents in 5917) [ClassicSimilarity], result of:
            0.0716554 = score(doc=5917,freq=1.0), product of:
              0.1390246 = queryWeight, product of:
                1.8023953 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.018706579 = queryNorm
              0.51541525 = fieldWeight in 5917, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.125 = fieldNorm(doc=5917)
          0.541728 = weight(abstract_txt:sgml in 5917) [ClassicSimilarity], result of:
            0.541728 = score(doc=5917,freq=3.0), product of:
              0.37131244 = queryWeight, product of:
                2.9456012 = boost
                6.738623 = idf(docFreq=142, maxDocs=44421)
                0.018706579 = queryNorm
              1.4589547 = fieldWeight in 5917, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.738623 = idf(docFreq=142, maxDocs=44421)
                0.125 = fieldNorm(doc=5917)
        0.16 = coord(4/25)
    
  2. Aker, A.; Gaizauskas, R.: Generating descriptive multi-document summaries of geo-located entities using entity type models (2015) 0.13
    0.12563376 = sum of:
      0.12563376 = product of:
        0.52347404 = sum of:
          0.034613363 = weight(abstract_txt:application in 2726) [ClassicSimilarity], result of:
            0.034613363 = score(doc=2726,freq=2.0), product of:
              0.085589714 = queryWeight, product of:
                4.5753803 = idf(docFreq=1243, maxDocs=44421)
                0.018706579 = queryNorm
              0.4044103 = fieldWeight in 2726, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5753803 = idf(docFreq=1243, maxDocs=44421)
                0.0625 = fieldNorm(doc=2726)
          0.07574878 = weight(abstract_txt:models in 2726) [ClassicSimilarity], result of:
            0.07574878 = score(doc=2726,freq=9.0), product of:
              0.08738535 = queryWeight, product of:
                1.0104353 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.018706579 = queryNorm
              0.86683613 = fieldWeight in 2726, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.0625 = fieldNorm(doc=2726)
          0.089920454 = weight(abstract_txt:type in 2726) [ClassicSimilarity], result of:
            0.089920454 = score(doc=2726,freq=8.0), product of:
              0.10189308 = queryWeight, product of:
                1.0910925 = boost
                4.992163 = idf(docFreq=819, maxDocs=44421)
                0.018706579 = queryNorm
              0.8824981 = fieldWeight in 2726, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.992163 = idf(docFreq=819, maxDocs=44421)
                0.0625 = fieldNorm(doc=2726)
          0.03496958 = weight(abstract_txt:tasks in 2726) [ClassicSimilarity], result of:
            0.03496958 = score(doc=2726,freq=1.0), product of:
              0.108574875 = queryWeight, product of:
                1.1262995 = boost
                5.1532483 = idf(docFreq=697, maxDocs=44421)
                0.018706579 = queryNorm
              0.32207802 = fieldWeight in 2726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1532483 = idf(docFreq=697, maxDocs=44421)
                0.0625 = fieldNorm(doc=2726)
          0.10752179 = weight(abstract_txt:automatic in 2726) [ClassicSimilarity], result of:
            0.10752179 = score(doc=2726,freq=1.0), product of:
              0.33111113 = queryWeight, product of:
                3.4067223 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.018706579 = queryNorm
              0.32473022 = fieldWeight in 2726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=2726)
          0.18070008 = weight(abstract_txt:generation in 2726) [ClassicSimilarity], result of:
            0.18070008 = score(doc=2726,freq=1.0), product of:
              0.51514316 = queryWeight, product of:
                4.9066286 = boost
                5.612423 = idf(docFreq=440, maxDocs=44421)
                0.018706579 = queryNorm
              0.35077643 = fieldWeight in 2726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.612423 = idf(docFreq=440, maxDocs=44421)
                0.0625 = fieldNorm(doc=2726)
        0.24 = coord(6/25)
    
  3. ISO 25964 Thesauri and interoperability with other vocabularies (2008) 0.13
    0.12527183 = sum of:
      0.12527183 = product of:
        0.34797728 = sum of:
          0.013107208 = weight(abstract_txt:where in 2169) [ClassicSimilarity], result of:
            0.013107208 = score(doc=2169,freq=1.0), product of:
              0.089597486 = queryWeight, product of:
                1.0231448 = boost
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.018706579 = queryNorm
              0.1462899 = fieldWeight in 2169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.681277 = idf(docFreq=1118, maxDocs=44421)
                0.03125 = fieldNorm(doc=2169)
          0.022480113 = weight(abstract_txt:type in 2169) [ClassicSimilarity], result of:
            0.022480113 = score(doc=2169,freq=2.0), product of:
              0.10189308 = queryWeight, product of:
                1.0910925 = boost
                4.992163 = idf(docFreq=819, maxDocs=44421)
                0.018706579 = queryNorm
              0.22062452 = fieldWeight in 2169, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.992163 = idf(docFreq=819, maxDocs=44421)
                0.03125 = fieldNorm(doc=2169)
          0.020024326 = weight(abstract_txt:necessary in 2169) [ClassicSimilarity], result of:
            0.020024326 = score(doc=2169,freq=1.0), product of:
              0.11884867 = queryWeight, product of:
                1.1783828 = boost
                5.391549 = idf(docFreq=549, maxDocs=44421)
                0.018706579 = queryNorm
              0.16848591 = fieldWeight in 2169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.391549 = idf(docFreq=549, maxDocs=44421)
                0.03125 = fieldNorm(doc=2169)
          0.021362908 = weight(abstract_txt:definition in 2169) [ClassicSimilarity], result of:
            0.021362908 = score(doc=2169,freq=1.0), product of:
              0.12408786 = queryWeight, product of:
                1.2040759 = boost
                5.509105 = idf(docFreq=488, maxDocs=44421)
                0.018706579 = queryNorm
              0.17215954 = fieldWeight in 2169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.509105 = idf(docFreq=488, maxDocs=44421)
                0.03125 = fieldNorm(doc=2169)
          0.033156157 = weight(abstract_txt:situations in 2169) [ClassicSimilarity], result of:
            0.033156157 = score(doc=2169,freq=1.0), product of:
              0.16634068 = queryWeight, product of:
                1.3940824 = boost
                6.3784575 = idf(docFreq=204, maxDocs=44421)
                0.018706579 = queryNorm
              0.1993268 = fieldWeight in 2169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3784575 = idf(docFreq=204, maxDocs=44421)
                0.03125 = fieldNorm(doc=2169)
          0.01791385 = weight(abstract_txt:documents in 2169) [ClassicSimilarity], result of:
            0.01791385 = score(doc=2169,freq=1.0), product of:
              0.1390246 = queryWeight, product of:
                1.8023953 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.018706579 = queryNorm
              0.12885381 = fieldWeight in 2169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.03125 = fieldNorm(doc=2169)
          0.038397577 = weight(abstract_txt:applications in 2169) [ClassicSimilarity], result of:
            0.038397577 = score(doc=2169,freq=2.0), product of:
              0.18343896 = queryWeight, product of:
                2.0703797 = boost
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.018706579 = queryNorm
              0.20932072 = fieldWeight in 2169, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.03125 = fieldNorm(doc=2169)
          0.053760894 = weight(abstract_txt:automatic in 2169) [ClassicSimilarity], result of:
            0.053760894 = score(doc=2169,freq=1.0), product of:
              0.33111113 = queryWeight, product of:
                3.4067223 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.018706579 = queryNorm
              0.16236511 = fieldWeight in 2169, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.03125 = fieldNorm(doc=2169)
          0.12777424 = weight(abstract_txt:generation in 2169) [ClassicSimilarity], result of:
            0.12777424 = score(doc=2169,freq=2.0), product of:
              0.51514316 = queryWeight, product of:
                4.9066286 = boost
                5.612423 = idf(docFreq=440, maxDocs=44421)
                0.018706579 = queryNorm
              0.24803638 = fieldWeight in 2169, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.612423 = idf(docFreq=440, maxDocs=44421)
                0.03125 = fieldNorm(doc=2169)
        0.36 = coord(9/25)
    
  4. Zhou, D.; Lawless, S.; Wu, X.; Zhao, W.; Liu, J.: ¬A study of user profile representation for personalized cross-language information retrieval (2016) 0.12
    0.11612197 = sum of:
      0.11612197 = product of:
        0.48384154 = sum of:
          0.04373358 = weight(abstract_txt:models in 4167) [ClassicSimilarity], result of:
            0.04373358 = score(doc=4167,freq=3.0), product of:
              0.08738535 = queryWeight, product of:
                1.0104353 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.018706579 = queryNorm
              0.5004681 = fieldWeight in 4167, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.0625 = fieldNorm(doc=4167)
          0.040170662 = weight(abstract_txt:experimental in 4167) [ClassicSimilarity], result of:
            0.040170662 = score(doc=4167,freq=1.0), product of:
              0.11908993 = queryWeight, product of:
                1.1795782 = boost
                5.397019 = idf(docFreq=546, maxDocs=44421)
                0.018706579 = queryNorm
              0.33731368 = fieldWeight in 4167, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.397019 = idf(docFreq=546, maxDocs=44421)
                0.0625 = fieldNorm(doc=4167)
          0.05066802 = weight(abstract_txt:documents in 4167) [ClassicSimilarity], result of:
            0.05066802 = score(doc=4167,freq=2.0), product of:
              0.1390246 = queryWeight, product of:
                1.8023953 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.018706579 = queryNorm
              0.3644536 = fieldWeight in 4167, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=4167)
          0.06104745 = weight(abstract_txt:various in 4167) [ClassicSimilarity], result of:
            0.06104745 = score(doc=4167,freq=2.0), product of:
              0.15741546 = queryWeight, product of:
                1.9179087 = boost
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.018706579 = queryNorm
              0.387811 = fieldWeight in 4167, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.387581 = idf(docFreq=1500, maxDocs=44421)
                0.0625 = fieldNorm(doc=4167)
          0.10752179 = weight(abstract_txt:automatic in 4167) [ClassicSimilarity], result of:
            0.10752179 = score(doc=4167,freq=1.0), product of:
              0.33111113 = queryWeight, product of:
                3.4067223 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.018706579 = queryNorm
              0.32473022 = fieldWeight in 4167, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=4167)
          0.18070008 = weight(abstract_txt:generation in 4167) [ClassicSimilarity], result of:
            0.18070008 = score(doc=4167,freq=1.0), product of:
              0.51514316 = queryWeight, product of:
                4.9066286 = boost
                5.612423 = idf(docFreq=440, maxDocs=44421)
                0.018706579 = queryNorm
              0.35077643 = fieldWeight in 4167, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.612423 = idf(docFreq=440, maxDocs=44421)
                0.0625 = fieldNorm(doc=4167)
        0.24 = coord(6/25)
    
  5. O'Kane, K.C.: World Wide Web-based information storage and retrieval (1996) 0.12
    0.115671515 = sum of:
      0.115671515 = product of:
        0.5783576 = sum of:
          0.050060812 = weight(abstract_txt:necessary in 4805) [ClassicSimilarity], result of:
            0.050060812 = score(doc=4805,freq=1.0), product of:
              0.11884867 = queryWeight, product of:
                1.1783828 = boost
                5.391549 = idf(docFreq=549, maxDocs=44421)
                0.018706579 = queryNorm
              0.42121476 = fieldWeight in 4805, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.391549 = idf(docFreq=549, maxDocs=44421)
                0.078125 = fieldNorm(doc=4805)
          0.100141466 = weight(abstract_txt:documents in 4805) [ClassicSimilarity], result of:
            0.100141466 = score(doc=4805,freq=5.0), product of:
              0.1390246 = queryWeight, product of:
                1.8023953 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.018706579 = queryNorm
              0.72031474 = fieldWeight in 4805, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=4805)
          0.06787796 = weight(abstract_txt:applications in 4805) [ClassicSimilarity], result of:
            0.06787796 = score(doc=4805,freq=1.0), product of:
              0.18343896 = queryWeight, product of:
                2.0703797 = boost
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.018706579 = queryNorm
              0.37003025 = fieldWeight in 4805, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7363873 = idf(docFreq=1058, maxDocs=44421)
                0.078125 = fieldNorm(doc=4805)
          0.13440223 = weight(abstract_txt:automatic in 4805) [ClassicSimilarity], result of:
            0.13440223 = score(doc=4805,freq=1.0), product of:
              0.33111113 = queryWeight, product of:
                3.4067223 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.018706579 = queryNorm
              0.40591276 = fieldWeight in 4805, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.078125 = fieldNorm(doc=4805)
          0.2258751 = weight(abstract_txt:generation in 4805) [ClassicSimilarity], result of:
            0.2258751 = score(doc=4805,freq=1.0), product of:
              0.51514316 = queryWeight, product of:
                4.9066286 = boost
                5.612423 = idf(docFreq=440, maxDocs=44421)
                0.018706579 = queryNorm
              0.43847054 = fieldWeight in 4805, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.612423 = idf(docFreq=440, maxDocs=44421)
                0.078125 = fieldNorm(doc=4805)
        0.2 = coord(5/25)