Document (#21568)

Author
Jianchao, X.
Ming, H.
Milin, S.
Title
On indexing descriptors for document archive
Source
Journal of the China Society for Scientific and Technical Information. 17(1998) no.4, S.263-265
Year
1998
Abstract
Describes a method of indexing the descriptors of the full text of document archives. Explains how the method organizes the thesaurus of descriptors, and mixes both keyword and index terms from the thesaurus. Presents a procedure for weighting descriptors and discusses the technical issues involved
Footnote
[In Chinesisch]

Similar documents (content)

  1. Ferber, R.: Automated indexing with thesaurus descriptors : a co-occurence based approach to multilingual retrieval (1997) 0.34
    0.33988667 = sum of:
      0.33988667 = product of:
        1.1653258 = sum of:
          0.0077482015 = weight(abstract_txt:from in 5144) [ClassicSimilarity], result of:
            0.0077482015 = score(doc=5144,freq=2.0), product of:
              0.031768113 = queryWeight, product of:
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.011512693 = queryNorm
              0.2438987 = fieldWeight in 5144, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=5144)
          0.02438362 = weight(abstract_txt:terms in 5144) [ClassicSimilarity], result of:
            0.02438362 = score(doc=5144,freq=2.0), product of:
              0.06822176 = queryWeight, product of:
                1.4654323 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.011512693 = queryNorm
              0.35741702 = fieldWeight in 5144, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.0625 = fieldNorm(doc=5144)
          0.088451095 = weight(abstract_txt:weighting in 5144) [ClassicSimilarity], result of:
            0.088451095 = score(doc=5144,freq=1.0), product of:
              0.20292534 = queryWeight, product of:
                2.527391 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.011512693 = queryNorm
              0.43587998 = fieldWeight in 5144, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.0625 = fieldNorm(doc=5144)
          0.05840083 = weight(abstract_txt:document in 5144) [ClassicSimilarity], result of:
            0.05840083 = score(doc=5144,freq=2.0), product of:
              0.15386747 = queryWeight, product of:
                3.1123805 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.011512693 = queryNorm
              0.3795528 = fieldWeight in 5144, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=5144)
          0.08587424 = weight(abstract_txt:indexing in 5144) [ClassicSimilarity], result of:
            0.08587424 = score(doc=5144,freq=4.0), product of:
              0.1579184 = queryWeight, product of:
                3.1530848 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.011512693 = queryNorm
              0.5437887 = fieldWeight in 5144, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.0625 = fieldNorm(doc=5144)
          0.14418587 = weight(abstract_txt:thesaurus in 5144) [ClassicSimilarity], result of:
            0.14418587 = score(doc=5144,freq=4.0), product of:
              0.22308613 = queryWeight, product of:
                3.7476203 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.011512693 = queryNorm
              0.64632374 = fieldWeight in 5144, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.0625 = fieldNorm(doc=5144)
          0.7562819 = weight(abstract_txt:descriptors in 5144) [ClassicSimilarity], result of:
            0.7562819 = score(doc=5144,freq=6.0), product of:
              0.74124116 = queryWeight, product of:
                9.660821 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.011512693 = queryNorm
              1.0202913 = fieldWeight in 5144, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.0625 = fieldNorm(doc=5144)
        0.29166666 = coord(7/24)
    
  2. Loosjes, T.P.; Tichelaar, P.A.; Goossens, J.; Stuurman, P.: Ontsluiting op onderwerp (1977) 0.34
    0.33705196 = sum of:
      0.33705196 = product of:
        1.0111558 = sum of:
          0.006848508 = weight(abstract_txt:from in 978) [ClassicSimilarity], result of:
            0.006848508 = score(doc=978,freq=1.0), product of:
              0.031768113 = queryWeight, product of:
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.011512693 = queryNorm
              0.21557805 = fieldWeight in 978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.078125 = fieldNorm(doc=978)
          0.021507058 = weight(abstract_txt:text in 978) [ClassicSimilarity], result of:
            0.021507058 = score(doc=978,freq=1.0), product of:
              0.0681263 = queryWeight, product of:
                1.4644066 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.011512693 = queryNorm
              0.3156939 = fieldWeight in 978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=978)
          0.034926068 = weight(abstract_txt:index in 978) [ClassicSimilarity], result of:
            0.034926068 = score(doc=978,freq=1.0), product of:
              0.0941226 = queryWeight, product of:
                1.7212789 = boost
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.011512693 = queryNorm
              0.37106994 = fieldWeight in 978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7496953 = idf(docFreq=1044, maxDocs=44421)
                0.078125 = fieldNorm(doc=978)
          0.038936846 = weight(abstract_txt:full in 978) [ClassicSimilarity], result of:
            0.038936846 = score(doc=978,freq=1.0), product of:
              0.10119708 = queryWeight, product of:
                1.7847947 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.011512693 = queryNorm
              0.38476256 = fieldWeight in 978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.078125 = fieldNorm(doc=978)
          0.053671397 = weight(abstract_txt:indexing in 978) [ClassicSimilarity], result of:
            0.053671397 = score(doc=978,freq=1.0), product of:
              0.1579184 = queryWeight, product of:
                3.1530848 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.011512693 = queryNorm
              0.33986792 = fieldWeight in 978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.078125 = fieldNorm(doc=978)
          0.05935733 = weight(abstract_txt:method in 978) [ClassicSimilarity], result of:
            0.05935733 = score(doc=978,freq=1.0), product of:
              0.16888343 = queryWeight, product of:
                3.2607148 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.011512693 = queryNorm
              0.35146925 = fieldWeight in 978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.078125 = fieldNorm(doc=978)
          0.1274435 = weight(abstract_txt:thesaurus in 978) [ClassicSimilarity], result of:
            0.1274435 = score(doc=978,freq=2.0), product of:
              0.22308613 = queryWeight, product of:
                3.7476203 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.011512693 = queryNorm
              0.5712749 = fieldWeight in 978, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.078125 = fieldNorm(doc=978)
          0.6684651 = weight(abstract_txt:descriptors in 978) [ClassicSimilarity], result of:
            0.6684651 = score(doc=978,freq=3.0), product of:
              0.74124116 = queryWeight, product of:
                9.660821 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.011512693 = queryNorm
              0.90181863 = fieldWeight in 978, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.078125 = fieldNorm(doc=978)
        0.33333334 = coord(8/24)
    
  3. Lu, K.; Mao, J.; Li, G.: Toward effective automated weighted subject indexing : a comparison of different approaches in different environments (2018) 0.30
    0.29894096 = sum of:
      0.29894096 = product of:
        0.8968228 = sum of:
          0.0054788063 = weight(abstract_txt:from in 292) [ClassicSimilarity], result of:
            0.0054788063 = score(doc=292,freq=1.0), product of:
              0.031768113 = queryWeight, product of:
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.011512693 = queryNorm
              0.17246243 = fieldWeight in 292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=292)
          0.024332458 = weight(abstract_txt:text in 292) [ClassicSimilarity], result of:
            0.024332458 = score(doc=292,freq=2.0), product of:
              0.0681263 = queryWeight, product of:
                1.4644066 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.011512693 = queryNorm
              0.3571669 = fieldWeight in 292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=292)
          0.044052012 = weight(abstract_txt:full in 292) [ClassicSimilarity], result of:
            0.044052012 = score(doc=292,freq=2.0), product of:
              0.10119708 = queryWeight, product of:
                1.7847947 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.011512693 = queryNorm
              0.4353091 = fieldWeight in 292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.0625 = fieldNorm(doc=292)
          0.17690219 = weight(abstract_txt:weighting in 292) [ClassicSimilarity], result of:
            0.17690219 = score(doc=292,freq=4.0), product of:
              0.20292534 = queryWeight, product of:
                2.527391 = boost
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.011512693 = queryNorm
              0.87175995 = fieldWeight in 292, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9740796 = idf(docFreq=112, maxDocs=44421)
                0.0625 = fieldNorm(doc=292)
          0.04129562 = weight(abstract_txt:document in 292) [ClassicSimilarity], result of:
            0.04129562 = score(doc=292,freq=1.0), product of:
              0.15386747 = queryWeight, product of:
                3.1123805 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.011512693 = queryNorm
              0.26838437 = fieldWeight in 292, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=292)
          0.08587424 = weight(abstract_txt:indexing in 292) [ClassicSimilarity], result of:
            0.08587424 = score(doc=292,freq=4.0), product of:
              0.1579184 = queryWeight, product of:
                3.1530848 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.011512693 = queryNorm
              0.5437887 = fieldWeight in 292, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.0625 = fieldNorm(doc=292)
          0.082247935 = weight(abstract_txt:method in 292) [ClassicSimilarity], result of:
            0.082247935 = score(doc=292,freq=3.0), product of:
              0.16888343 = queryWeight, product of:
                3.2607148 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.011512693 = queryNorm
              0.4870101 = fieldWeight in 292, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=292)
          0.43663955 = weight(abstract_txt:descriptors in 292) [ClassicSimilarity], result of:
            0.43663955 = score(doc=292,freq=2.0), product of:
              0.74124116 = queryWeight, product of:
                9.660821 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.011512693 = queryNorm
              0.58906543 = fieldWeight in 292, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.0625 = fieldNorm(doc=292)
        0.33333334 = coord(8/24)
    
  4. Fagan, J.L.: ¬The effectiveness of a nonsyntactic approach to automatic phrase indexing for document retrieval (1989) 0.30
    0.29805708 = sum of:
      0.29805708 = product of:
        1.0219101 = sum of:
          0.0054788063 = weight(abstract_txt:from in 2845) [ClassicSimilarity], result of:
            0.0054788063 = score(doc=2845,freq=1.0), product of:
              0.031768113 = queryWeight, product of:
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.011512693 = queryNorm
              0.17246243 = fieldWeight in 2845, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0625 = fieldNorm(doc=2845)
          0.024332458 = weight(abstract_txt:text in 2845) [ClassicSimilarity], result of:
            0.024332458 = score(doc=2845,freq=2.0), product of:
              0.0681263 = queryWeight, product of:
                1.4644066 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.011512693 = queryNorm
              0.3571669 = fieldWeight in 2845, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=2845)
          0.12922797 = weight(abstract_txt:procedure in 2845) [ClassicSimilarity], result of:
            0.12922797 = score(doc=2845,freq=3.0), product of:
              0.181161 = queryWeight, product of:
                2.388013 = boost
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.011512693 = queryNorm
              0.7133321 = fieldWeight in 2845, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.0625 = fieldNorm(doc=2845)
          0.09233982 = weight(abstract_txt:document in 2845) [ClassicSimilarity], result of:
            0.09233982 = score(doc=2845,freq=5.0), product of:
              0.15386747 = queryWeight, product of:
                3.1123805 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.011512693 = queryNorm
              0.6001257 = fieldWeight in 2845, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=2845)
          0.08587424 = weight(abstract_txt:indexing in 2845) [ClassicSimilarity], result of:
            0.08587424 = score(doc=2845,freq=4.0), product of:
              0.1579184 = queryWeight, product of:
                3.1530848 = boost
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.011512693 = queryNorm
              0.5437887 = fieldWeight in 2845, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.3503094 = idf(docFreq=1557, maxDocs=44421)
                0.0625 = fieldNorm(doc=2845)
          0.06715516 = weight(abstract_txt:method in 2845) [ClassicSimilarity], result of:
            0.06715516 = score(doc=2845,freq=2.0), product of:
              0.16888343 = queryWeight, product of:
                3.2607148 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.011512693 = queryNorm
              0.39764208 = fieldWeight in 2845, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=2845)
          0.6175016 = weight(abstract_txt:descriptors in 2845) [ClassicSimilarity], result of:
            0.6175016 = score(doc=2845,freq=4.0), product of:
              0.74124116 = queryWeight, product of:
                9.660821 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.011512693 = queryNorm
              0.8330644 = fieldWeight in 2845, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.0625 = fieldNorm(doc=2845)
        0.29166666 = coord(7/24)
    
  5. Gopinath, M.A.: Descriptors and their role in information retrieval (1993) 0.26
    0.25868076 = sum of:
      0.25868076 = product of:
        1.2416676 = sum of:
          0.010957613 = weight(abstract_txt:from in 7801) [ClassicSimilarity], result of:
            0.010957613 = score(doc=7801,freq=1.0), product of:
              0.031768113 = queryWeight, product of:
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.011512693 = queryNorm
              0.34492487 = fieldWeight in 7801, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.125 = fieldNorm(doc=7801)
          0.03213064 = weight(abstract_txt:discusses in 7801) [ClassicSimilarity], result of:
            0.03213064 = score(doc=7801,freq=1.0), product of:
              0.065081924 = queryWeight, product of:
                1.4313126 = boost
                3.9495623 = idf(docFreq=2325, maxDocs=44421)
                0.011512693 = queryNorm
              0.4936953 = fieldWeight in 7801, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9495623 = idf(docFreq=2325, maxDocs=44421)
                0.125 = fieldNorm(doc=7801)
          0.034411293 = weight(abstract_txt:text in 7801) [ClassicSimilarity], result of:
            0.034411293 = score(doc=7801,freq=1.0), product of:
              0.0681263 = queryWeight, product of:
                1.4644066 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.011512693 = queryNorm
              0.50511026 = fieldWeight in 7801, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.125 = fieldNorm(doc=7801)
          0.09462392 = weight(abstract_txt:explains in 7801) [ClassicSimilarity], result of:
            0.09462392 = score(doc=7801,freq=1.0), product of:
              0.1337154 = queryWeight, product of:
                2.051611 = boost
                5.661213 = idf(docFreq=419, maxDocs=44421)
                0.011512693 = queryNorm
              0.7076516 = fieldWeight in 7801, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.661213 = idf(docFreq=419, maxDocs=44421)
                0.125 = fieldNorm(doc=7801)
          1.0695442 = weight(abstract_txt:descriptors in 7801) [ClassicSimilarity], result of:
            1.0695442 = score(doc=7801,freq=3.0), product of:
              0.74124116 = queryWeight, product of:
                9.660821 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.011512693 = queryNorm
              1.4429098 = fieldWeight in 7801, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.125 = fieldNorm(doc=7801)
        0.20833333 = coord(5/24)