Document (#25462)

Author
Riggs, K.R.
Title
XML and free text
Source
Journal of the American Society for Information Science and technology. 53(2002) no.6, S.526-528
Year
2002
Abstract
We show several problems with marking free text, text that is either natural language or semigrammatical but unstructured. These problems prevent well-formed XML from marking text for readily available meaning. A solution is proposed to mark meaning in free text that is consistent with the intended simplicity of XML versus SGML.
Object
XML

Similar documents (author)

  1. Riggs, F.W.: Information and social science : the need for onomantics (1989) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:riggs in 2910) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 2910, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=2910)
    
  2. Riggs, F.W.: Onomantics and terminology : pt.1: their contributions to knowledge organization (1996) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:riggs in 3818) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 3818, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=3818)
    
  3. Riggs, F.W.: Onomantics and terminology : pt.2: core concepts (1996) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:riggs in 5455) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 5455, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=5455)
    
  4. Riggs, F.W.: Onomantics and terminology : pt.3: formats, borrowed terms and omissions (1996) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:riggs in 6108) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 6108, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=6108)
    
  5. Riggs, F.W.: Onomantics and terminology : pt.4: neologisms, neoterisms, meta-terms, phrases, and pleonisms (1997) 6.10
    6.0972233 = sum of:
      6.0972233 = weight(author_txt:riggs in 535) [ClassicSimilarity], result of:
        6.0972233 = fieldWeight in 535, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.625 = fieldNorm(doc=535)
    

Similar documents (content)

  1. Rishel, T.; Perkins, L.A.; Yenduri, S.; Zand, F.: Determining the context of text using augmented latent semantic indexing (2007) 0.14
    0.14436224 = sum of:
      0.14436224 = product of:
        0.451132 = sum of:
          0.037191153 = weight(abstract_txt:language in 2316) [ClassicSimilarity], result of:
            0.037191153 = score(doc=2316,freq=3.0), product of:
              0.06589464 = queryWeight, product of:
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.01579833 = queryNorm
              0.5644033 = fieldWeight in 2316, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=2316)
          0.0356722 = weight(abstract_txt:show in 2316) [ClassicSimilarity], result of:
            0.0356722 = score(doc=2316,freq=2.0), product of:
              0.073362485 = queryWeight, product of:
                1.0551445 = boost
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.01579833 = queryNorm
              0.4862458 = fieldWeight in 2316, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.078125 = fieldNorm(doc=2316)
          0.027844723 = weight(abstract_txt:several in 2316) [ClassicSimilarity], result of:
            0.027844723 = score(doc=2316,freq=1.0), product of:
              0.078359686 = queryWeight, product of:
                1.090489 = boost
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.01579833 = queryNorm
              0.355345 = fieldWeight in 2316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.078125 = fieldNorm(doc=2316)
          0.011070445 = weight(abstract_txt:that in 2316) [ClassicSimilarity], result of:
            0.011070445 = score(doc=2316,freq=2.0), product of:
              0.042368274 = queryWeight, product of:
                1.1339929 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01579833 = queryNorm
              0.2612909 = fieldWeight in 2316, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=2316)
          0.013017092 = weight(abstract_txt:with in 2316) [ClassicSimilarity], result of:
            0.013017092 = score(doc=2316,freq=2.0), product of:
              0.04719979 = queryWeight, product of:
                1.1969059 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.01579833 = queryNorm
              0.2757871 = fieldWeight in 2316, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.078125 = fieldNorm(doc=2316)
          0.066774316 = weight(abstract_txt:natural in 2316) [ClassicSimilarity], result of:
            0.066774316 = score(doc=2316,freq=3.0), product of:
              0.09734119 = queryWeight, product of:
                1.2154113 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.01579833 = queryNorm
              0.68598217 = fieldWeight in 2316, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.078125 = fieldNorm(doc=2316)
          0.103377916 = weight(abstract_txt:meaning in 2316) [ClassicSimilarity], result of:
            0.103377916 = score(doc=2316,freq=1.0), product of:
              0.23671508 = queryWeight, product of:
                2.6804204 = boost
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.01579833 = queryNorm
              0.43671876 = fieldWeight in 2316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.078125 = fieldNorm(doc=2316)
          0.15618415 = weight(abstract_txt:free in 2316) [ClassicSimilarity], result of:
            0.15618415 = score(doc=2316,freq=1.0), product of:
              0.35677615 = queryWeight, product of:
                4.030264 = boost
                5.6033936 = idf(docFreq=444, maxDocs=44421)
                0.01579833 = queryNorm
              0.43776512 = fieldWeight in 2316, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033936 = idf(docFreq=444, maxDocs=44421)
                0.078125 = fieldNorm(doc=2316)
        0.32 = coord(8/25)
    
  2. Ashford, J.H.: Free text retrieval in the Welsh language : problems, and proposed working practice (1995) 0.14
    0.13529101 = sum of:
      0.13529101 = product of:
        0.676455 = sum of:
          0.034355715 = weight(abstract_txt:language in 6508) [ClassicSimilarity], result of:
            0.034355715 = score(doc=6508,freq=1.0), product of:
              0.06589464 = queryWeight, product of:
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.01579833 = queryNorm
              0.52137345 = fieldWeight in 6508, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.125 = fieldNorm(doc=6508)
          0.046327367 = weight(abstract_txt:proposed in 6508) [ClassicSimilarity], result of:
            0.046327367 = score(doc=6508,freq=1.0), product of:
              0.080428354 = queryWeight, product of:
                1.1047895 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.01579833 = queryNorm
              0.5760079 = fieldWeight in 6508, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.125 = fieldNorm(doc=6508)
          0.0753311 = weight(abstract_txt:problems in 6508) [ClassicSimilarity], result of:
            0.0753311 = score(doc=6508,freq=1.0), product of:
              0.14012328 = queryWeight, product of:
                2.062268 = boost
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.01579833 = queryNorm
              0.5376059 = fieldWeight in 6508, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.125 = fieldNorm(doc=6508)
          0.24989465 = weight(abstract_txt:free in 6508) [ClassicSimilarity], result of:
            0.24989465 = score(doc=6508,freq=1.0), product of:
              0.35677615 = queryWeight, product of:
                4.030264 = boost
                5.6033936 = idf(docFreq=444, maxDocs=44421)
                0.01579833 = queryNorm
              0.7004242 = fieldWeight in 6508, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033936 = idf(docFreq=444, maxDocs=44421)
                0.125 = fieldNorm(doc=6508)
          0.2705462 = weight(abstract_txt:text in 6508) [ClassicSimilarity], result of:
            0.2705462 = score(doc=6508,freq=3.0), product of:
              0.30923927 = queryWeight, product of:
                4.844035 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.01579833 = queryNorm
              0.8748766 = fieldWeight in 6508, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.125 = fieldNorm(doc=6508)
        0.2 = coord(5/25)
    
  3. Ruge, G.; Schwarz, C.: Natural language access to free-text data bases (1989) 0.13
    0.13279767 = sum of:
      0.13279767 = product of:
        0.47427738 = sum of:
          0.030366449 = weight(abstract_txt:language in 3635) [ClassicSimilarity], result of:
            0.030366449 = score(doc=3635,freq=2.0), product of:
              0.06589464 = queryWeight, product of:
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.01579833 = queryNorm
              0.46083337 = fieldWeight in 3635, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.007827986 = weight(abstract_txt:that in 3635) [ClassicSimilarity], result of:
            0.007827986 = score(doc=3635,freq=1.0), product of:
              0.042368274 = queryWeight, product of:
                1.1339929 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01579833 = queryNorm
              0.18476056 = fieldWeight in 3635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.009204474 = weight(abstract_txt:with in 3635) [ClassicSimilarity], result of:
            0.009204474 = score(doc=3635,freq=1.0), product of:
              0.04719979 = queryWeight, product of:
                1.1969059 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.01579833 = queryNorm
              0.19501092 = fieldWeight in 3635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.054521006 = weight(abstract_txt:natural in 3635) [ClassicSimilarity], result of:
            0.054521006 = score(doc=3635,freq=2.0), product of:
              0.09734119 = queryWeight, product of:
                1.2154113 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.01579833 = queryNorm
              0.5601021 = fieldWeight in 3635, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.047081936 = weight(abstract_txt:problems in 3635) [ClassicSimilarity], result of:
            0.047081936 = score(doc=3635,freq=1.0), product of:
              0.14012328 = queryWeight, product of:
                2.062268 = boost
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.01579833 = queryNorm
              0.33600366 = fieldWeight in 3635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.15618415 = weight(abstract_txt:free in 3635) [ClassicSimilarity], result of:
            0.15618415 = score(doc=3635,freq=1.0), product of:
              0.35677615 = queryWeight, product of:
                4.030264 = boost
                5.6033936 = idf(docFreq=444, maxDocs=44421)
                0.01579833 = queryNorm
              0.43776512 = fieldWeight in 3635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033936 = idf(docFreq=444, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.16909137 = weight(abstract_txt:text in 3635) [ClassicSimilarity], result of:
            0.16909137 = score(doc=3635,freq=3.0), product of:
              0.30923927 = queryWeight, product of:
                4.844035 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.01579833 = queryNorm
              0.5467979 = fieldWeight in 3635, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
        0.28 = coord(7/25)
    
  4. Gagan, D.: Scanning: a survival guide : 6: text scanning - editing and performance (1993) 0.13
    0.13253266 = sum of:
      0.13253266 = product of:
        0.82832915 = sum of:
          0.04455156 = weight(abstract_txt:several in 6301) [ClassicSimilarity], result of:
            0.04455156 = score(doc=6301,freq=1.0), product of:
              0.078359686 = queryWeight, product of:
                1.090489 = boost
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.01579833 = queryNorm
              0.568552 = fieldWeight in 6301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.125 = fieldNorm(doc=6301)
          0.0147271585 = weight(abstract_txt:with in 6301) [ClassicSimilarity], result of:
            0.0147271585 = score(doc=6301,freq=1.0), product of:
              0.04719979 = queryWeight, product of:
                1.1969059 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.01579833 = queryNorm
              0.31201747 = fieldWeight in 6301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.125 = fieldNorm(doc=6301)
          0.54815036 = weight(abstract_txt:marking in 6301) [ClassicSimilarity], result of:
            0.54815036 = score(doc=6301,freq=1.0), product of:
              0.5261714 = queryWeight, product of:
                3.9962585 = boost
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.01579833 = queryNorm
              1.0417715 = fieldWeight in 6301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.125 = fieldNorm(doc=6301)
          0.22090006 = weight(abstract_txt:text in 6301) [ClassicSimilarity], result of:
            0.22090006 = score(doc=6301,freq=2.0), product of:
              0.30923927 = queryWeight, product of:
                4.844035 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.01579833 = queryNorm
              0.7143338 = fieldWeight in 6301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.125 = fieldNorm(doc=6301)
        0.16 = coord(4/25)
    
  5. Karamuftuoglu, M.: Collaborative information retrieval : toward a social informatics view of IR interaction (1998) 0.13
    0.12906826 = sum of:
      0.12906826 = product of:
        0.46095806 = sum of:
          0.027844723 = weight(abstract_txt:several in 3151) [ClassicSimilarity], result of:
            0.027844723 = score(doc=3151,freq=1.0), product of:
              0.078359686 = queryWeight, product of:
                1.090489 = boost
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.01579833 = queryNorm
              0.355345 = fieldWeight in 3151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.548416 = idf(docFreq=1277, maxDocs=44421)
                0.078125 = fieldNorm(doc=3151)
          0.011070445 = weight(abstract_txt:that in 3151) [ClassicSimilarity], result of:
            0.011070445 = score(doc=3151,freq=2.0), product of:
              0.042368274 = queryWeight, product of:
                1.1339929 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01579833 = queryNorm
              0.2612909 = fieldWeight in 3151, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=3151)
          0.009204474 = weight(abstract_txt:with in 3151) [ClassicSimilarity], result of:
            0.009204474 = score(doc=3151,freq=1.0), product of:
              0.04719979 = queryWeight, product of:
                1.1969059 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.01579833 = queryNorm
              0.19501092 = fieldWeight in 3151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.078125 = fieldNorm(doc=3151)
          0.106194414 = weight(abstract_txt:readily in 3151) [ClassicSimilarity], result of:
            0.106194414 = score(doc=3151,freq=1.0), product of:
              0.19127807 = queryWeight, product of:
                1.7037566 = boost
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.01579833 = queryNorm
              0.5551834 = fieldWeight in 3151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.078125 = fieldNorm(doc=3151)
          0.047081936 = weight(abstract_txt:problems in 3151) [ClassicSimilarity], result of:
            0.047081936 = score(doc=3151,freq=1.0), product of:
              0.14012328 = queryWeight, product of:
                2.062268 = boost
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.01579833 = queryNorm
              0.33600366 = fieldWeight in 3151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.078125 = fieldNorm(doc=3151)
          0.103377916 = weight(abstract_txt:meaning in 3151) [ClassicSimilarity], result of:
            0.103377916 = score(doc=3151,freq=1.0), product of:
              0.23671508 = queryWeight, product of:
                2.6804204 = boost
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.01579833 = queryNorm
              0.43671876 = fieldWeight in 3151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.078125 = fieldNorm(doc=3151)
          0.15618415 = weight(abstract_txt:free in 3151) [ClassicSimilarity], result of:
            0.15618415 = score(doc=3151,freq=1.0), product of:
              0.35677615 = queryWeight, product of:
                4.030264 = boost
                5.6033936 = idf(docFreq=444, maxDocs=44421)
                0.01579833 = queryNorm
              0.43776512 = fieldWeight in 3151, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033936 = idf(docFreq=444, maxDocs=44421)
                0.078125 = fieldNorm(doc=3151)
        0.28 = coord(7/25)