Document (#21242)

Author
Yongcheng, W.
Xiaoming, G.
Lixia, W.
Title
Automatic indexing on subject of Chinese text
Source
Journal of the China Society for Scientific and Technical Information. 17(1998) no.3, S.219-225.
Year
1998
Abstract
Outlines the underlying ideas, the basic algorithm and structure of CSAIS 2.1, an automatic indexing system for the subjects of Chinese documents, developed by the authors in 1993
Footnote
[In Chinesisch]
Theme
Automatisches Indexieren

Similar documents (content)

  1. Li, Z.: Research on dynamic morphological indexing (1998) 0.39
    0.3891018 = sum of:
      0.3891018 = product of:
        1.3229461 = sum of:
          0.06259591 = weight(abstract_txt:documents in 4242) [ClassicSimilarity], result of:
            0.06259591 = score(doc=4242,freq=1.0), product of:
              0.12144754 = queryWeight, product of:
                1.222531 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.024092486 = queryNorm
              0.51541525 = fieldWeight in 4242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.125 = fieldNorm(doc=4242)
          0.16579628 = weight(abstract_txt:algorithm in 4242) [ClassicSimilarity], result of:
            0.16579628 = score(doc=4242,freq=1.0), product of:
              0.23249196 = queryWeight, product of:
                1.69149 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.024092486 = queryNorm
              0.71312696 = fieldWeight in 4242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.125 = fieldNorm(doc=4242)
          0.29405233 = weight(abstract_txt:indexing in 4242) [ClassicSimilarity], result of:
            0.29405233 = score(doc=4242,freq=4.0), product of:
              0.27037373 = queryWeight, product of:
                2.5796616 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.024092486 = queryNorm
              1.0875773 = fieldWeight in 4242, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.125 = fieldNorm(doc=4242)
          0.3542234 = weight(abstract_txt:automatic in 4242) [ClassicSimilarity], result of:
            0.3542234 = score(doc=4242,freq=2.0), product of:
              0.3856644 = queryWeight, product of:
                3.0809546 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.024092486 = queryNorm
              0.91847575 = fieldWeight in 4242, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.125 = fieldNorm(doc=4242)
          0.4462782 = weight(abstract_txt:chinese in 4242) [ClassicSimilarity], result of:
            0.4462782 = score(doc=4242,freq=1.0), product of:
              0.5668113 = queryWeight, product of:
                3.7350788 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.024092486 = queryNorm
              0.7873488 = fieldWeight in 4242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.125 = fieldNorm(doc=4242)
        0.29411766 = coord(5/17)
    
  2. Wan, T.-L.; Evens, M.; Wan, Y.-W.; Pao, Y.-Y.: Experiments with automatic indexing and a relational thesaurus in a Chinese information retrieval system (1997) 0.37
    0.37254754 = sum of:
      0.37254754 = product of:
        1.2666615 = sum of:
          0.044502895 = weight(abstract_txt:system in 956) [ClassicSimilarity], result of:
            0.044502895 = score(doc=956,freq=3.0), product of:
              0.081258535 = queryWeight, product of:
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.024092486 = queryNorm
              0.5476704 = fieldWeight in 956, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.09375 = fieldNorm(doc=956)
          0.04694694 = weight(abstract_txt:documents in 956) [ClassicSimilarity], result of:
            0.04694694 = score(doc=956,freq=1.0), product of:
              0.12144754 = queryWeight, product of:
                1.222531 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.024092486 = queryNorm
              0.38656145 = fieldWeight in 956, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.09375 = fieldNorm(doc=956)
          0.27010432 = weight(abstract_txt:indexing in 956) [ClassicSimilarity], result of:
            0.27010432 = score(doc=956,freq=6.0), product of:
              0.27037373 = queryWeight, product of:
                2.5796616 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.024092486 = queryNorm
              0.9990036 = fieldWeight in 956, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.09375 = fieldNorm(doc=956)
          0.32537496 = weight(abstract_txt:automatic in 956) [ClassicSimilarity], result of:
            0.32537496 = score(doc=956,freq=3.0), product of:
              0.3856644 = queryWeight, product of:
                3.0809546 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.024092486 = queryNorm
              0.8436738 = fieldWeight in 956, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.09375 = fieldNorm(doc=956)
          0.5797324 = weight(abstract_txt:chinese in 956) [ClassicSimilarity], result of:
            0.5797324 = score(doc=956,freq=3.0), product of:
              0.5668113 = queryWeight, product of:
                3.7350788 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.024092486 = queryNorm
              1.0227962 = fieldWeight in 956, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.09375 = fieldNorm(doc=956)
        0.29411766 = coord(5/17)
    
  3. Yang, C.C.; Li, K.W.: ¬A heuristic method based on a statistical approach for chinese text segmentation (2005) 0.37
    0.3678976 = sum of:
      0.3678976 = product of:
        1.0423765 = sum of:
          0.07793845 = weight(abstract_txt:text in 5580) [ClassicSimilarity], result of:
            0.07793845 = score(doc=5580,freq=7.0), product of:
              0.11663975 = queryWeight, product of:
                1.1980882 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.024092486 = queryNorm
              0.66819805 = fieldWeight in 5580, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.0329817 = weight(abstract_txt:developed in 5580) [ClassicSimilarity], result of:
            0.0329817 = score(doc=5580,freq=1.0), product of:
              0.12576509 = queryWeight, product of:
                1.2440721 = boost
                4.1959753 = idf(docFreq=1817, maxDocs=44421)
                0.024092486 = queryNorm
              0.26224846 = fieldWeight in 5580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1959753 = idf(docFreq=1817, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.063289024 = weight(abstract_txt:authors in 5580) [ClassicSimilarity], result of:
            0.063289024 = score(doc=5580,freq=2.0), product of:
              0.15414177 = queryWeight, product of:
                1.3772908 = boost
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.024092486 = queryNorm
              0.4105897 = fieldWeight in 5580, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6452923 = idf(docFreq=1159, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.07351308 = weight(abstract_txt:indexing in 5580) [ClassicSimilarity], result of:
            0.07351308 = score(doc=5580,freq=1.0), product of:
              0.27037373 = queryWeight, product of:
                2.5796616 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.024092486 = queryNorm
              0.27189434 = fieldWeight in 5580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.12523688 = weight(abstract_txt:automatic in 5580) [ClassicSimilarity], result of:
            0.12523688 = score(doc=5580,freq=1.0), product of:
              0.3856644 = queryWeight, product of:
                3.0809546 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.024092486 = queryNorm
              0.32473022 = fieldWeight in 5580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
          0.6694173 = weight(abstract_txt:chinese in 5580) [ClassicSimilarity], result of:
            0.6694173 = score(doc=5580,freq=9.0), product of:
              0.5668113 = queryWeight, product of:
                3.7350788 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.024092486 = queryNorm
              1.1810232 = fieldWeight in 5580, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0625 = fieldNorm(doc=5580)
        0.3529412 = coord(6/17)
    
  4. Wang, F.L.; Yang, C.C.: Mining Web data for Chinese segmentation (2007) 0.29
    0.29318658 = sum of:
      0.29318658 = product of:
        0.9968343 = sum of:
          0.029457968 = weight(abstract_txt:text in 1604) [ClassicSimilarity], result of:
            0.029457968 = score(doc=1604,freq=1.0), product of:
              0.11663975 = queryWeight, product of:
                1.1980882 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.024092486 = queryNorm
              0.25255513 = fieldWeight in 1604, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.06998436 = weight(abstract_txt:documents in 1604) [ClassicSimilarity], result of:
            0.06998436 = score(doc=1604,freq=5.0), product of:
              0.12144754 = queryWeight, product of:
                1.222531 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.024092486 = queryNorm
              0.5762518 = fieldWeight in 1604, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.20305815 = weight(abstract_txt:algorithm in 1604) [ClassicSimilarity], result of:
            0.20305815 = score(doc=1604,freq=6.0), product of:
              0.23249196 = queryWeight, product of:
                1.69149 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.024092486 = queryNorm
              0.8733986 = fieldWeight in 1604, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.103963204 = weight(abstract_txt:indexing in 1604) [ClassicSimilarity], result of:
            0.103963204 = score(doc=1604,freq=2.0), product of:
              0.27037373 = queryWeight, product of:
                2.5796616 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.024092486 = queryNorm
              0.38451666 = fieldWeight in 1604, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
          0.5903706 = weight(abstract_txt:chinese in 1604) [ClassicSimilarity], result of:
            0.5903706 = score(doc=1604,freq=7.0), product of:
              0.5668113 = queryWeight, product of:
                3.7350788 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.024092486 = queryNorm
              1.0415646 = fieldWeight in 1604, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.0625 = fieldNorm(doc=1604)
        0.29411766 = coord(5/17)
    
  5. Shen, Z.: CJK: the unique need of Chinese, Japanese, and Korean language cataloging (1993) 0.29
    0.28684393 = sum of:
      0.28684393 = product of:
        0.9752693 = sum of:
          0.048448615 = weight(abstract_txt:system in 4726) [ClassicSimilarity], result of:
            0.048448615 = score(doc=4726,freq=2.0), product of:
              0.081258535 = queryWeight, product of:
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.024092486 = queryNorm
              0.596228 = fieldWeight in 4726, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.125 = fieldNorm(doc=4726)
          0.0659634 = weight(abstract_txt:developed in 4726) [ClassicSimilarity], result of:
            0.0659634 = score(doc=4726,freq=1.0), product of:
              0.12576509 = queryWeight, product of:
                1.2440721 = boost
                4.1959753 = idf(docFreq=1817, maxDocs=44421)
                0.024092486 = queryNorm
              0.5244969 = fieldWeight in 4726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1959753 = idf(docFreq=1817, maxDocs=44421)
                0.125 = fieldNorm(doc=4726)
          0.19286351 = weight(abstract_txt:outlines in 4726) [ClassicSimilarity], result of:
            0.19286351 = score(doc=4726,freq=2.0), product of:
              0.2041024 = queryWeight, product of:
                1.5848551 = boost
                5.34536 = idf(docFreq=575, maxDocs=44421)
                0.024092486 = queryNorm
              0.944935 = fieldWeight in 4726, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.34536 = idf(docFreq=575, maxDocs=44421)
                0.125 = fieldNorm(doc=4726)
          0.22171558 = weight(abstract_txt:1993 in 4726) [ClassicSimilarity], result of:
            0.22171558 = score(doc=4726,freq=1.0), product of:
              0.28219905 = queryWeight, product of:
                1.8635595 = boost
                6.285367 = idf(docFreq=224, maxDocs=44421)
                0.024092486 = queryNorm
              0.7856709 = fieldWeight in 4726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.285367 = idf(docFreq=224, maxDocs=44421)
                0.125 = fieldNorm(doc=4726)
          0.4462782 = weight(abstract_txt:chinese in 4726) [ClassicSimilarity], result of:
            0.4462782 = score(doc=4726,freq=1.0), product of:
              0.5668113 = queryWeight, product of:
                3.7350788 = boost
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.024092486 = queryNorm
              0.7873488 = fieldWeight in 4726, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2987905 = idf(docFreq=221, maxDocs=44421)
                0.125 = fieldNorm(doc=4726)
        0.29411766 = coord(5/17)