Document (#25462)

Author
Riggs, K.R.
Title
XML and free text
Source
Journal of the American Society for Information Science and technology. 53(2002) no.6, S.526-528
Year
2002
Abstract
We show several problems with marking free text, text that is either natural language or semigrammatical but unstructured. These problems prevent well-formed XML from marking text for readily available meaning. A solution is proposed to mark meaning in free text that is consistent with the intended simplicity of XML versus SGML.
Object
XML

Similar documents (author)

  1. Riggs, F.W.: Information and social science : the need for onomantics (1989) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:riggs in 2842) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 2842, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=2842)
    
  2. Riggs, F.W.: Onomantics and terminology : pt.1: their contributions to knowledge organization (1996) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:riggs in 3750) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 3750, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=3750)
    
  3. Riggs, F.W.: Onomantics and terminology : pt.2: core concepts (1996) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:riggs in 5387) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 5387, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=5387)
    
  4. Riggs, F.W.: Onomantics and terminology : pt.3: formats, borrowed terms and omissions (1996) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:riggs in 6040) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 6040, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=6040)
    
  5. Riggs, F.W.: Onomantics and terminology : pt.4: neologisms, neoterisms, meta-terms, phrases, and pleonisms (1997) 6.09
    6.094361 = sum of:
      6.094361 = weight(author_txt:riggs in 7466) [ClassicSimilarity], result of:
        6.094361 = fieldWeight in 7466, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.7509775 = idf(docFreq=6, maxDocs=44218)
          0.625 = fieldNorm(doc=7466)
    

Similar documents (content)

  1. Rishel, T.; Perkins, L.A.; Yenduri, S.; Zand, F.: Determining the context of text using augmented latent semantic indexing (2007) 0.14
    0.14428996 = sum of:
      0.14428996 = product of:
        0.4509061 = sum of:
          0.03736895 = weight(abstract_txt:language in 1316) [ClassicSimilarity], result of:
            0.03736895 = score(doc=1316,freq=3.0), product of:
              0.06603392 = queryWeight, product of:
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.015789704 = queryNorm
              0.56590533 = fieldWeight in 1316, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=1316)
          0.03567742 = weight(abstract_txt:show in 1316) [ClassicSimilarity], result of:
            0.03567742 = score(doc=1316,freq=2.0), product of:
              0.07329133 = queryWeight, product of:
                1.05352 = boost
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.015789704 = queryNorm
              0.4867891 = fieldWeight in 1316, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4059124 = idf(docFreq=1466, maxDocs=44218)
                0.078125 = fieldNorm(doc=1316)
          0.027743569 = weight(abstract_txt:several in 1316) [ClassicSimilarity], result of:
            0.027743569 = score(doc=1316,freq=1.0), product of:
              0.078086354 = queryWeight, product of:
                1.0874368 = boost
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.015789704 = queryNorm
              0.35529342 = fieldWeight in 1316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.078125 = fieldNorm(doc=1316)
          0.011098707 = weight(abstract_txt:that in 1316) [ClassicSimilarity], result of:
            0.011098707 = score(doc=1316,freq=2.0), product of:
              0.042395055 = queryWeight, product of:
                1.1331543 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015789704 = queryNorm
              0.26179248 = fieldWeight in 1316, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=1316)
          0.013031578 = weight(abstract_txt:with in 1316) [ClassicSimilarity], result of:
            0.013031578 = score(doc=1316,freq=2.0), product of:
              0.04718438 = queryWeight, product of:
                1.1954477 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.015789704 = queryNorm
              0.27618414 = fieldWeight in 1316, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.078125 = fieldNorm(doc=1316)
          0.06695614 = weight(abstract_txt:natural in 1316) [ClassicSimilarity], result of:
            0.06695614 = score(doc=1316,freq=3.0), product of:
              0.097413726 = queryWeight, product of:
                1.2145811 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.015789704 = queryNorm
              0.6873379 = fieldWeight in 1316, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.078125 = fieldNorm(doc=1316)
          0.10316308 = weight(abstract_txt:meaning in 1316) [ClassicSimilarity], result of:
            0.10316308 = score(doc=1316,freq=1.0), product of:
              0.2361347 = queryWeight, product of:
                2.6743076 = boost
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.015789704 = queryNorm
              0.43688235 = fieldWeight in 1316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.078125 = fieldNorm(doc=1316)
          0.15586667 = weight(abstract_txt:free in 1316) [ClassicSimilarity], result of:
            0.15586667 = score(doc=1316,freq=1.0), product of:
              0.35591218 = queryWeight, product of:
                4.0211334 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.015789704 = queryNorm
              0.43793574 = fieldWeight in 1316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.078125 = fieldNorm(doc=1316)
        0.32 = coord(8/25)
    
  2. Ashford, J.H.: Free text retrieval in the Welsh language : problems, and proposed working practice (1995) 0.14
    0.13506332 = sum of:
      0.13506332 = product of:
        0.6753166 = sum of:
          0.034519956 = weight(abstract_txt:language in 6509) [ClassicSimilarity], result of:
            0.034519956 = score(doc=6509,freq=1.0), product of:
              0.06603392 = queryWeight, product of:
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.015789704 = queryNorm
              0.5227609 = fieldWeight in 6509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.125 = fieldNorm(doc=6509)
          0.046216775 = weight(abstract_txt:proposed in 6509) [ClassicSimilarity], result of:
            0.046216775 = score(doc=6509,freq=1.0), product of:
              0.08021459 = queryWeight, product of:
                1.1021562 = boost
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.015789704 = queryNorm
              0.5761642 = fieldWeight in 6509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6093135 = idf(docFreq=1196, maxDocs=44218)
                0.125 = fieldNorm(doc=6509)
          0.0749144 = weight(abstract_txt:problems in 6509) [ClassicSimilarity], result of:
            0.0749144 = score(doc=6509,freq=1.0), product of:
              0.13945706 = queryWeight, product of:
                2.0551887 = boost
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.015789704 = queryNorm
              0.53718615 = fieldWeight in 6509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.125 = fieldNorm(doc=6509)
          0.24938667 = weight(abstract_txt:free in 6509) [ClassicSimilarity], result of:
            0.24938667 = score(doc=6509,freq=1.0), product of:
              0.35591218 = queryWeight, product of:
                4.0211334 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.015789704 = queryNorm
              0.7006972 = fieldWeight in 6509, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.125 = fieldNorm(doc=6509)
          0.2702788 = weight(abstract_txt:text in 6509) [ClassicSimilarity], result of:
            0.2702788 = score(doc=6509,freq=3.0), product of:
              0.30870563 = queryWeight, product of:
                4.834747 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.015789704 = queryNorm
              0.8755228 = fieldWeight in 6509, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.125 = fieldNorm(doc=6509)
        0.2 = coord(5/25)
    
  3. Gagan, D.: Scanning: a survival guide : 6: text scanning - editing and performance (1993) 0.13
    0.1331576 = sum of:
      0.1331576 = product of:
        0.832235 = sum of:
          0.044389706 = weight(abstract_txt:several in 6302) [ClassicSimilarity], result of:
            0.044389706 = score(doc=6302,freq=1.0), product of:
              0.078086354 = queryWeight, product of:
                1.0874368 = boost
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.015789704 = queryNorm
              0.56846946 = fieldWeight in 6302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.125 = fieldNorm(doc=6302)
          0.014743547 = weight(abstract_txt:with in 6302) [ClassicSimilarity], result of:
            0.014743547 = score(doc=6302,freq=1.0), product of:
              0.04718438 = queryWeight, product of:
                1.1954477 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.015789704 = queryNorm
              0.31246668 = fieldWeight in 6302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.125 = fieldNorm(doc=6302)
          0.55242 = weight(abstract_txt:marking in 6302) [ClassicSimilarity], result of:
            0.55242 = score(doc=6302,freq=1.0), product of:
              0.52833563 = queryWeight, product of:
                4.000243 = boost
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.015789704 = queryNorm
              1.0455854 = fieldWeight in 6302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.364683 = idf(docFreq=27, maxDocs=44218)
                0.125 = fieldNorm(doc=6302)
          0.22068174 = weight(abstract_txt:text in 6302) [ClassicSimilarity], result of:
            0.22068174 = score(doc=6302,freq=2.0), product of:
              0.30870563 = queryWeight, product of:
                4.834747 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.015789704 = queryNorm
              0.7148614 = fieldWeight in 6302, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.125 = fieldNorm(doc=6302)
        0.16 = coord(4/25)
    
  4. Ruge, G.; Schwarz, C.: Natural language access to free-text data bases (1989) 0.13
    0.13267975 = sum of:
      0.13267975 = product of:
        0.4738562 = sum of:
          0.030511616 = weight(abstract_txt:language in 3567) [ClassicSimilarity], result of:
            0.030511616 = score(doc=3567,freq=2.0), product of:
              0.06603392 = queryWeight, product of:
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.015789704 = queryNorm
              0.46205974 = fieldWeight in 3567, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1820874 = idf(docFreq=1834, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.007847971 = weight(abstract_txt:that in 3567) [ClassicSimilarity], result of:
            0.007847971 = score(doc=3567,freq=1.0), product of:
              0.042395055 = queryWeight, product of:
                1.1331543 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015789704 = queryNorm
              0.18511525 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.009214717 = weight(abstract_txt:with in 3567) [ClassicSimilarity], result of:
            0.009214717 = score(doc=3567,freq=1.0), product of:
              0.04718438 = queryWeight, product of:
                1.1954477 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.015789704 = queryNorm
              0.19529167 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.054669462 = weight(abstract_txt:natural in 3567) [ClassicSimilarity], result of:
            0.054669462 = score(doc=3567,freq=2.0), product of:
              0.097413726 = queryWeight, product of:
                1.2145811 = boost
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.015789704 = queryNorm
              0.561209 = fieldWeight in 3567, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0794845 = idf(docFreq=747, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.0468215 = weight(abstract_txt:problems in 3567) [ClassicSimilarity], result of:
            0.0468215 = score(doc=3567,freq=1.0), product of:
              0.13945706 = queryWeight, product of:
                2.0551887 = boost
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.015789704 = queryNorm
              0.33574134 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.15586667 = weight(abstract_txt:free in 3567) [ClassicSimilarity], result of:
            0.15586667 = score(doc=3567,freq=1.0), product of:
              0.35591218 = queryWeight, product of:
                4.0211334 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.015789704 = queryNorm
              0.43793574 = fieldWeight in 3567, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
          0.16892426 = weight(abstract_txt:text in 3567) [ClassicSimilarity], result of:
            0.16892426 = score(doc=3567,freq=3.0), product of:
              0.30870563 = queryWeight, product of:
                4.834747 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.015789704 = queryNorm
              0.54720175 = fieldWeight in 3567, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.078125 = fieldNorm(doc=3567)
        0.28 = coord(7/25)
    
  5. Karamuftuoglu, M.: Collaborative information retrieval : toward a social informatics view of IR interaction (1998) 0.13
    0.12867635 = sum of:
      0.12867635 = product of:
        0.45955843 = sum of:
          0.027743569 = weight(abstract_txt:several in 2151) [ClassicSimilarity], result of:
            0.027743569 = score(doc=2151,freq=1.0), product of:
              0.078086354 = queryWeight, product of:
                1.0874368 = boost
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.015789704 = queryNorm
              0.35529342 = fieldWeight in 2151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5477557 = idf(docFreq=1272, maxDocs=44218)
                0.078125 = fieldNorm(doc=2151)
          0.011098707 = weight(abstract_txt:that in 2151) [ClassicSimilarity], result of:
            0.011098707 = score(doc=2151,freq=2.0), product of:
              0.042395055 = queryWeight, product of:
                1.1331543 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.015789704 = queryNorm
              0.26179248 = fieldWeight in 2151, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=2151)
          0.009214717 = weight(abstract_txt:with in 2151) [ClassicSimilarity], result of:
            0.009214717 = score(doc=2151,freq=1.0), product of:
              0.04718438 = queryWeight, product of:
                1.1954477 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.015789704 = queryNorm
              0.19529167 = fieldWeight in 2151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.078125 = fieldNorm(doc=2151)
          0.10565019 = weight(abstract_txt:readily in 2151) [ClassicSimilarity], result of:
            0.10565019 = score(doc=2151,freq=1.0), product of:
              0.19042054 = queryWeight, product of:
                1.6981394 = boost
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.015789704 = queryNorm
              0.5548256 = fieldWeight in 2151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1017675 = idf(docFreq=98, maxDocs=44218)
                0.078125 = fieldNorm(doc=2151)
          0.0468215 = weight(abstract_txt:problems in 2151) [ClassicSimilarity], result of:
            0.0468215 = score(doc=2151,freq=1.0), product of:
              0.13945706 = queryWeight, product of:
                2.0551887 = boost
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.015789704 = queryNorm
              0.33574134 = fieldWeight in 2151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.297489 = idf(docFreq=1634, maxDocs=44218)
                0.078125 = fieldNorm(doc=2151)
          0.10316308 = weight(abstract_txt:meaning in 2151) [ClassicSimilarity], result of:
            0.10316308 = score(doc=2151,freq=1.0), product of:
              0.2361347 = queryWeight, product of:
                2.6743076 = boost
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.015789704 = queryNorm
              0.43688235 = fieldWeight in 2151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.592094 = idf(docFreq=447, maxDocs=44218)
                0.078125 = fieldNorm(doc=2151)
          0.15586667 = weight(abstract_txt:free in 2151) [ClassicSimilarity], result of:
            0.15586667 = score(doc=2151,freq=1.0), product of:
              0.35591218 = queryWeight, product of:
                4.0211334 = boost
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.015789704 = queryNorm
              0.43793574 = fieldWeight in 2151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6055775 = idf(docFreq=441, maxDocs=44218)
                0.078125 = fieldNorm(doc=2151)
        0.28 = coord(7/25)