Document (#35696)

Author
Amirhosseini, M.
Title
Quantitative evaluation of the movement from complexity toward simplicity in the structure of thesaurus descriptors
Source
Malaysian journal of library and information science. 20(2015), no. 3, pp. 47-62
Year
2015
Abstract
The concepts of simplicity and complexity play major roles in information storage and retrieval in knowledge organizations. This paper reports an investigation of these concepts in the structure of descriptors. The main purpose of simplicity is to decrease the number of words used to construct descriptors, since this affects semantic relations, recall and precision. ISO 25964 affirms this purpose by requiring that compound terms be split into simpler concepts. This work aims to elaborate the standard methods of evaluation by providing a more detailed evaluation of descriptor structure and by identifying the factors that affect simplicity and complexity in the structure of thesaurus descriptors. The research population is taken from the descriptors of the Commonwealth Agricultural Bureaux (CAB) Thesaurus, the Persian Cultural Thesaurus (ASFA) and the Chemical Thesaurus. The research was conducted using statistical and content analysis methods. We propose a new quantitative approach, together with novel indicators and indices involving Simplicity and Factoring Ratios, to evaluate descriptor structure. The results will be useful for verification, selection and maintenance in knowledge organizations, and the inquiry method can be further developed in the field of ontology evaluation.
Content
See also: https://www.researchgate.net/publication/285228543_Quantitative_evaluation_of_the_movement_from_complexity_toward_simplicity_in_the_structure_of_thesaurus_descriptors.
Theme
Conception and application of the thesaurus principle
Knowledge representation

Similar documents (content)

  1. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.21
    0.20760052 = sum of:
      0.20760052 = product of:
        0.6487516 = sum of:
          0.05409817 = weight(abstract_txt:compound in 1175) [ClassicSimilarity], result of:
            0.05409817 = score(doc=1175,freq=2.0), product of:
              0.09330502 = queryWeight, product of:
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.012446021 = queryNorm
              0.5797991 = fieldWeight in 1175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.016149383 = weight(abstract_txt:purpose in 1175) [ClassicSimilarity], result of:
            0.016149383 = score(doc=1175,freq=1.0), product of:
              0.066156715 = queryWeight, product of:
                1.1908292 = boost
                4.4636893 = idf(docFreq=1390, maxDocs=44421)
                0.012446021 = queryNorm
              0.244108 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4636893 = idf(docFreq=1390, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.012149802 = weight(abstract_txt:research in 1175) [ClassicSimilarity], result of:
            0.012149802 = score(doc=1175,freq=2.0), product of:
              0.049720615 = queryWeight, product of:
                1.2643764 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.012446021 = queryNorm
              0.24436146 = fieldWeight in 1175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.006324528 = weight(abstract_txt:this in 1175) [ClassicSimilarity], result of:
            0.006324528 = score(doc=1175,freq=1.0), product of:
              0.048062023 = queryWeight, product of:
                1.6048466 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.012446021 = queryNorm
              0.13159096 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.03628352 = weight(abstract_txt:concepts in 1175) [ClassicSimilarity], result of:
            0.03628352 = score(doc=1175,freq=2.0), product of:
              0.103109024 = queryWeight, product of:
                1.8207757 = boost
                4.549982 = idf(docFreq=1275, maxDocs=44421)
                0.012446021 = queryNorm
              0.3518947 = fieldWeight in 1175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.549982 = idf(docFreq=1275, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.13282304 = weight(abstract_txt:thesaurus in 1175) [ClassicSimilarity], result of:
            0.13282304 = score(doc=1175,freq=7.0), product of:
              0.17754 = queryWeight, product of:
                2.7588341 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.012446021 = queryNorm
              0.7481302 = fieldWeight in 1175, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.06508019 = weight(abstract_txt:structure in 1175) [ClassicSimilarity], result of:
            0.06508019 = score(doc=1175,freq=3.0), product of:
              0.15765536 = queryWeight, product of:
                2.9066107 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.012446021 = queryNorm
              0.41280034 = fieldWeight in 1175, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.325843 = weight(abstract_txt:descriptors in 1175) [ClassicSimilarity], result of:
            0.325843 = score(doc=1175,freq=3.0), product of:
              0.5161681 = queryWeight, product of:
                6.2228894 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.012446021 = queryNorm
              0.63127303 = fieldWeight in 1175, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
        0.32 = coord(8/25)
    
  2. Deokattey, S.; Dixit, D.K.; Bhanumurthy, K.: Co-word and facet analysis as tools for conceptualization in ontologies : a preliminary study of a micro-domain (2012) 0.18
    0.17609885 = sum of:
      0.17609885 = product of:
        0.7337452 = sum of:
          0.018895483 = weight(abstract_txt:method in 1841) [ClassicSimilarity], result of:
            0.018895483 = score(doc=1841,freq=1.0), product of:
              0.06720176 = queryWeight, product of:
                1.2001978 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.012446021 = queryNorm
              0.2811754 = fieldWeight in 1841, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=1841)
          0.009818522 = weight(abstract_txt:research in 1841) [ClassicSimilarity], result of:
            0.009818522 = score(doc=1841,freq=1.0), product of:
              0.049720615 = queryWeight, product of:
                1.2643764 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.012446021 = queryNorm
              0.19747387 = fieldWeight in 1841, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0625 = fieldNorm(doc=1841)
          0.0102219805 = weight(abstract_txt:this in 1841) [ClassicSimilarity], result of:
            0.0102219805 = score(doc=1841,freq=2.0), product of:
              0.048062023 = queryWeight, product of:
                1.6048466 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.012446021 = queryNorm
              0.21268311 = fieldWeight in 1841, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.0625 = fieldNorm(doc=1841)
          0.029321514 = weight(abstract_txt:concepts in 1841) [ClassicSimilarity], result of:
            0.029321514 = score(doc=1841,freq=1.0), product of:
              0.103109024 = queryWeight, product of:
                1.8207757 = boost
                4.549982 = idf(docFreq=1275, maxDocs=44421)
                0.012446021 = queryNorm
              0.28437388 = fieldWeight in 1841, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.549982 = idf(docFreq=1275, maxDocs=44421)
                0.0625 = fieldNorm(doc=1841)
          0.05737416 = weight(abstract_txt:thesaurus in 1841) [ClassicSimilarity], result of:
            0.05737416 = score(doc=1841,freq=1.0), product of:
              0.17754 = queryWeight, product of:
                2.7588341 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.012446021 = queryNorm
              0.32316187 = fieldWeight in 1841, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.0625 = fieldNorm(doc=1841)
          0.6081136 = weight(abstract_txt:descriptors in 1841) [ClassicSimilarity], result of:
            0.6081136 = score(doc=1841,freq=8.0), product of:
              0.5161681 = queryWeight, product of:
                6.2228894 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.012446021 = queryNorm
              1.1781309 = fieldWeight in 1841, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.0625 = fieldNorm(doc=1841)
        0.24 = coord(6/25)
    
  3. Amirhosseini, M.: Theoretical base of quantitative evaluation of unity in a thesaurus term network based on Kant's epistemology (2010) 0.17
    0.16811718 = sum of:
      0.16811718 = product of:
        0.5253662 = sum of:
          0.10817932 = weight(abstract_txt:ratios in 854) [ClassicSimilarity], result of:
            0.10817932 = score(doc=854,freq=3.0), product of:
              0.11835532 = queryWeight, product of:
                1.1262671 = boost
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.012446021 = queryNorm
              0.9140217 = fieldWeight in 854, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.0625 = fieldNorm(doc=854)
          0.009818522 = weight(abstract_txt:research in 854) [ClassicSimilarity], result of:
            0.009818522 = score(doc=854,freq=1.0), product of:
              0.049720615 = queryWeight, product of:
                1.2643764 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.012446021 = queryNorm
              0.19747387 = fieldWeight in 854, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0625 = fieldNorm(doc=854)
          0.10718815 = weight(abstract_txt:quantitative in 854) [ClassicSimilarity], result of:
            0.10718815 = score(doc=854,freq=6.0), product of:
              0.117631264 = queryWeight, product of:
                1.5879027 = boost
                5.9520745 = idf(docFreq=313, maxDocs=44421)
                0.012446021 = queryNorm
              0.9112216 = fieldWeight in 854, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.9520745 = idf(docFreq=313, maxDocs=44421)
                0.0625 = fieldNorm(doc=854)
          0.0144560635 = weight(abstract_txt:this in 854) [ClassicSimilarity], result of:
            0.0144560635 = score(doc=854,freq=4.0), product of:
              0.048062023 = queryWeight, product of:
                1.6048466 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.012446021 = queryNorm
              0.30077934 = fieldWeight in 854, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.0625 = fieldNorm(doc=854)
          0.029321514 = weight(abstract_txt:concepts in 854) [ClassicSimilarity], result of:
            0.029321514 = score(doc=854,freq=1.0), product of:
              0.103109024 = queryWeight, product of:
                1.8207757 = boost
                4.549982 = idf(docFreq=1275, maxDocs=44421)
                0.012446021 = queryNorm
              0.28437388 = fieldWeight in 854, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.549982 = idf(docFreq=1275, maxDocs=44421)
                0.0625 = fieldNorm(doc=854)
          0.09871248 = weight(abstract_txt:evaluation in 854) [ClassicSimilarity], result of:
            0.09871248 = score(doc=854,freq=7.0), product of:
              0.13326028 = queryWeight, product of:
                2.3901649 = boost
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.012446021 = queryNorm
              0.7407495 = fieldWeight in 854, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.0625 = fieldNorm(doc=854)
          0.11474832 = weight(abstract_txt:thesaurus in 854) [ClassicSimilarity], result of:
            0.11474832 = score(doc=854,freq=4.0), product of:
              0.17754 = queryWeight, product of:
                2.7588341 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.012446021 = queryNorm
              0.64632374 = fieldWeight in 854, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.0625 = fieldNorm(doc=854)
          0.042941786 = weight(abstract_txt:structure in 854) [ClassicSimilarity], result of:
            0.042941786 = score(doc=854,freq=1.0), product of:
              0.15765536 = queryWeight, product of:
                2.9066107 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.012446021 = queryNorm
              0.27237758 = fieldWeight in 854, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.0625 = fieldNorm(doc=854)
        0.32 = coord(8/25)
    
  4. Harter, S.P.; Cheng, Y.-R.: Colinked descriptors : improving vocabulary selection for end-user searching (1996) 0.14
    0.14426655 = sum of:
      0.14426655 = product of:
        0.7213327 = sum of:
          0.023619354 = weight(abstract_txt:method in 4284) [ClassicSimilarity], result of:
            0.023619354 = score(doc=4284,freq=1.0), product of:
              0.06720176 = queryWeight, product of:
                1.2001978 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.012446021 = queryNorm
              0.35146925 = fieldWeight in 4284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.078125 = fieldNorm(doc=4284)
          0.012273153 = weight(abstract_txt:research in 4284) [ClassicSimilarity], result of:
            0.012273153 = score(doc=4284,freq=1.0), product of:
              0.049720615 = queryWeight, product of:
                1.2643764 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.012446021 = queryNorm
              0.24684234 = fieldWeight in 4284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.078125 = fieldNorm(doc=4284)
          0.012777476 = weight(abstract_txt:this in 4284) [ClassicSimilarity], result of:
            0.012777476 = score(doc=4284,freq=2.0), product of:
              0.048062023 = queryWeight, product of:
                1.6048466 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.012446021 = queryNorm
              0.26585388 = fieldWeight in 4284, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.078125 = fieldNorm(doc=4284)
          0.0717177 = weight(abstract_txt:thesaurus in 4284) [ClassicSimilarity], result of:
            0.0717177 = score(doc=4284,freq=1.0), product of:
              0.17754 = queryWeight, product of:
                2.7588341 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.012446021 = queryNorm
              0.40395233 = fieldWeight in 4284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.078125 = fieldNorm(doc=4284)
          0.60094506 = weight(abstract_txt:descriptors in 4284) [ClassicSimilarity], result of:
            0.60094506 = score(doc=4284,freq=5.0), product of:
              0.5161681 = queryWeight, product of:
                6.2228894 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.012446021 = queryNorm
              1.1642429 = fieldWeight in 4284, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.078125 = fieldNorm(doc=4284)
        0.2 = coord(5/25)
    
  5. Riesthuis, G.J.A.: Information languages and multilingual subject access (2003) 0.12
    0.12190245 = sum of:
      0.12190245 = product of:
        0.76189035 = sum of:
          0.012649056 = weight(abstract_txt:this in 4963) [ClassicSimilarity], result of:
            0.012649056 = score(doc=4963,freq=1.0), product of:
              0.048062023 = queryWeight, product of:
                1.6048466 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.012446021 = queryNorm
              0.26318192 = fieldWeight in 4963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.109375 = fieldNorm(doc=4963)
          0.1419938 = weight(abstract_txt:thesaurus in 4963) [ClassicSimilarity], result of:
            0.1419938 = score(doc=4963,freq=2.0), product of:
              0.17754 = queryWeight, product of:
                2.7588341 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.012446021 = queryNorm
              0.79978484 = fieldWeight in 4963, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.109375 = fieldNorm(doc=4963)
          0.07514812 = weight(abstract_txt:structure in 4963) [ClassicSimilarity], result of:
            0.07514812 = score(doc=4963,freq=1.0), product of:
              0.15765536 = queryWeight, product of:
                2.9066107 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.012446021 = queryNorm
              0.47666076 = fieldWeight in 4963, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.109375 = fieldNorm(doc=4963)
          0.53209937 = weight(abstract_txt:descriptors in 4963) [ClassicSimilarity], result of:
            0.53209937 = score(doc=4963,freq=2.0), product of:
              0.5161681 = queryWeight, product of:
                6.2228894 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.012446021 = queryNorm
              1.0308645 = fieldWeight in 4963, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.109375 = fieldNorm(doc=4963)
        0.16 = coord(4/25)
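
The score trees above can be recomputed by hand. The following is a minimal sketch, assuming the standard ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm). The function names and the `terms_1175` table are illustrative only; the numeric inputs are copied from the listing for document 1175. Because Lucene works in 32-bit floats, the recomputed values may differ from the listing in the last digits.

```python
import math

MAX_DOCS = 44421          # maxDocs, from the listing
QUERY_NORM = 0.012446021  # queryNorm, from the listing


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, field_norm: float,
               boost: float = 1.0) -> float:
    """Score of one query term in one document: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = tf * term_idf * field_norm
    return query_weight * field_weight


# (termFreq, docFreq, boost) for each matching term in document 1175.
# "compound" shows no boost line in the listing, so 1.0 is assumed.
terms_1175 = {
    "compound":    (2.0, 66,    1.0),
    "purpose":     (1.0, 1390,  1.1908292),
    "research":    (2.0, 5124,  1.2643764),
    "this":        (1.0, 10885, 1.6048466),
    "concepts":    (2.0, 1275,  1.8207757),
    "thesaurus":   (7.0, 685,   2.7588341),
    "structure":   (3.0, 1545,  2.9066107),
    "descriptors": (3.0, 153,   6.2228894),
}
FIELD_NORM_1175 = 0.0546875  # fieldNorm(doc=1175), from the listing

term_sum = sum(term_score(f, df, FIELD_NORM_1175, b)
               for f, df, b in terms_1175.values())
coord = 8 / 25               # 8 of the 25 query terms matched
doc_score = term_sum * coord

# The listing reports 0.6487516 for the sum and 0.20760052 for the product.
print(f"sum of term scores: {term_sum:.7f}")
print(f"document score:     {doc_score:.8f}")
```

The same function applies to the other four entries by substituting their term frequencies, fieldNorm and coord values. The coord factor explains why a document with one very strong term match (for example "descriptors" in entries 2, 4 and 5) can still rank below entry 1, which matches more of the query terms.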