Document (#35253)

Author
Xu, Y.
Bernard, A.
Title
Knowledge organization through statistical computation : a new approach
Source
Knowledge organization. 36(2009) no.4, p.227-239
Year
2009
Abstract
Knowledge organization (KO) is an interdisciplinary field that includes problems of knowledge classification, such as how to classify newly emerged knowledge. Given the complexity and ambiguity of knowledge, classifying it by logical reasoning is sometimes inefficient. This paper proposes a statistical approach to knowledge organization in order to resolve the problems of classifying complex knowledge in large volumes. By integrating the classification process into a mathematical model, a knowledge classifier based on maximum entropy theory is constructed, and the experimental results show that the classification results obtained from the classifier are reliable. The proposed approach is formal and does not depend on specific contexts, so it could easily be adapted to knowledge classification in other domains within KO.
Content
Cf.: http://www.ergon-verlag.de/isko_ko/downloads/ko3620094e.pdf.
Theme
Automatisches Klassifizieren (automatic classification)

Similar documents (content)

  1. Mengle, S.S.R.; Goharian, N.: Ambiguity measure feature-selection algorithm (2009) 0.24
    0.24345003 = sum of:
      0.24345003 = product of:
        0.76078135 = sum of:
          0.017377883 = weight(abstract_txt:results in 3804) [ClassicSimilarity], result of:
            0.017377883 = score(doc=3804,freq=1.0), product of:
              0.07992942 = queryWeight, product of:
                1.0698636 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.021476727 = queryNorm
              0.21741535 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.07537293 = weight(abstract_txt:ambiguity in 3804) [ClassicSimilarity], result of:
            0.07537293 = score(doc=3804,freq=1.0), product of:
              0.16872354 = queryWeight, product of:
                1.099127 = boost
                7.1475906 = idf(docFreq=94, maxDocs=44421)
                0.021476727 = queryNorm
              0.4467244 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1475906 = idf(docFreq=94, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.032841902 = weight(abstract_txt:problems in 3804) [ClassicSimilarity], result of:
            0.032841902 = score(doc=3804,freq=1.0), product of:
              0.12217836 = queryWeight, product of:
                1.322733 = boost
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.021476727 = queryNorm
              0.26880294 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.070444815 = weight(abstract_txt:statistical in 3804) [ClassicSimilarity], result of:
            0.070444815 = score(doc=3804,freq=1.0), product of:
              0.2032083 = queryWeight, product of:
                1.7058694 = boost
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.021476727 = queryNorm
              0.3466631 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.045854904 = weight(abstract_txt:approach in 3804) [ClassicSimilarity], result of:
            0.045854904 = score(doc=3804,freq=2.0), product of:
              0.13867123 = queryWeight, product of:
                1.725893 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.021476727 = queryNorm
              0.33067352 = fieldWeight in 3804, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.115001105 = weight(abstract_txt:classify in 3804) [ClassicSimilarity], result of:
            0.115001105 = score(doc=3804,freq=1.0), product of:
              0.2817367 = queryWeight, product of:
                2.0086153 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.021476727 = queryNorm
              0.40818647 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.35135627 = weight(abstract_txt:classifier in 3804) [ClassicSimilarity], result of:
            0.35135627 = score(doc=3804,freq=5.0), product of:
              0.34691033 = queryWeight, product of:
                2.2288644 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.021476727 = queryNorm
              1.0128158 = fieldWeight in 3804, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
          0.052531514 = weight(abstract_txt:classification in 3804) [ClassicSimilarity], result of:
            0.052531514 = score(doc=3804,freq=1.0), product of:
              0.21053874 = queryWeight, product of:
                2.4555912 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.021476727 = queryNorm
              0.24950996 = fieldWeight in 3804, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=3804)
        0.32 = coord(8/25)
    
  2. Lan, K.C.; Ho, K.S.; Luk, R.W.P.; Leong, H.V.: Dialogue act recognition using maximum entropy (2008) 0.17
    0.1739821 = sum of:
      0.1739821 = product of:
        0.7249254 = sum of:
          0.017123828 = weight(abstract_txt:paper in 2717) [ClassicSimilarity], result of:
            0.017123828 = score(doc=2717,freq=1.0), product of:
              0.07914849 = queryWeight, product of:
                1.0646243 = boost
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.021476727 = queryNorm
              0.21635064 = fieldWeight in 2717, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.0625 = fieldNorm(doc=2717)
          0.07856565 = weight(abstract_txt:maximum in 2717) [ClassicSimilarity], result of:
            0.07856565 = score(doc=2717,freq=1.0), product of:
              0.17345516 = queryWeight, product of:
                1.1144322 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.021476727 = queryNorm
              0.45294502 = fieldWeight in 2717, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=2717)
          0.104255036 = weight(abstract_txt:entropy in 2717) [ClassicSimilarity], result of:
            0.104255036 = score(doc=2717,freq=1.0), product of:
              0.20945792 = queryWeight, product of:
                1.22464 = boost
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.021476727 = queryNorm
              0.49773738 = fieldWeight in 2717, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.0625 = fieldNorm(doc=2717)
          0.05616056 = weight(abstract_txt:approach in 2717) [ClassicSimilarity], result of:
            0.05616056 = score(doc=2717,freq=3.0), product of:
              0.13867123 = queryWeight, product of:
                1.725893 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.021476727 = queryNorm
              0.4049907 = fieldWeight in 2717, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=2717)
          0.35135627 = weight(abstract_txt:classifier in 2717) [ClassicSimilarity], result of:
            0.35135627 = score(doc=2717,freq=5.0), product of:
              0.34691033 = queryWeight, product of:
                2.2288644 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.021476727 = queryNorm
              1.0128158 = fieldWeight in 2717, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=2717)
          0.117464036 = weight(abstract_txt:classification in 2717) [ClassicSimilarity], result of:
            0.117464036 = score(doc=2717,freq=5.0), product of:
              0.21053874 = queryWeight, product of:
                2.4555912 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.021476727 = queryNorm
              0.55792123 = fieldWeight in 2717, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=2717)
        0.24 = coord(6/25)
    
  3. Brychcín, T.; Konopík, M.: HPS: High precision stemmer (2015) 0.17
    0.1733174 = sum of:
      0.1733174 = product of:
        0.6189907 = sum of:
          0.06277137 = weight(abstract_txt:reliable in 3686) [ClassicSimilarity], result of:
            0.06277137 = score(doc=3686,freq=1.0), product of:
              0.14935043 = queryWeight, product of:
                1.0341018 = boost
                6.724734 = idf(docFreq=144, maxDocs=44421)
                0.021476727 = queryNorm
              0.42029586 = fieldWeight in 3686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.724734 = idf(docFreq=144, maxDocs=44421)
                0.0625 = fieldNorm(doc=3686)
          0.024576036 = weight(abstract_txt:results in 3686) [ClassicSimilarity], result of:
            0.024576036 = score(doc=3686,freq=2.0), product of:
              0.07992942 = queryWeight, product of:
                1.0698636 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.021476727 = queryNorm
              0.30747172 = fieldWeight in 3686, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.0625 = fieldNorm(doc=3686)
          0.07856565 = weight(abstract_txt:maximum in 3686) [ClassicSimilarity], result of:
            0.07856565 = score(doc=3686,freq=1.0), product of:
              0.17345516 = queryWeight, product of:
                1.1144322 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.021476727 = queryNorm
              0.45294502 = fieldWeight in 3686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=3686)
          0.104255036 = weight(abstract_txt:entropy in 3686) [ClassicSimilarity], result of:
            0.104255036 = score(doc=3686,freq=1.0), product of:
              0.20945792 = queryWeight, product of:
                1.22464 = boost
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.021476727 = queryNorm
              0.49773738 = fieldWeight in 3686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.0625 = fieldNorm(doc=3686)
          0.070444815 = weight(abstract_txt:statistical in 3686) [ClassicSimilarity], result of:
            0.070444815 = score(doc=3686,freq=1.0), product of:
              0.2032083 = queryWeight, product of:
                1.7058694 = boost
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.021476727 = queryNorm
              0.3466631 = fieldWeight in 3686, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.0625 = fieldNorm(doc=3686)
          0.05616056 = weight(abstract_txt:approach in 3686) [ClassicSimilarity], result of:
            0.05616056 = score(doc=3686,freq=3.0), product of:
              0.13867123 = queryWeight, product of:
                1.725893 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.021476727 = queryNorm
              0.4049907 = fieldWeight in 3686, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=3686)
          0.22221722 = weight(abstract_txt:classifier in 3686) [ClassicSimilarity], result of:
            0.22221722 = score(doc=3686,freq=2.0), product of:
              0.34691033 = queryWeight, product of:
                2.2288644 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.021476727 = queryNorm
              0.640561 = fieldWeight in 3686, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=3686)
        0.28 = coord(7/25)
    
  4. Sánchez, D.; Batet, M.; Valls, A.; Gibert, K.: Ontology-driven web-based semantic similarity (2010) 0.17
    0.17224383 = sum of:
      0.17224383 = product of:
        0.538262 = sum of:
          0.054924946 = weight(abstract_txt:reliable in 1335) [ClassicSimilarity], result of:
            0.054924946 = score(doc=1335,freq=1.0), product of:
              0.14935043 = queryWeight, product of:
                1.0341018 = boost
                6.724734 = idf(docFreq=144, maxDocs=44421)
                0.021476727 = queryNorm
              0.36775887 = fieldWeight in 1335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.724734 = idf(docFreq=144, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1335)
          0.014983349 = weight(abstract_txt:paper in 1335) [ClassicSimilarity], result of:
            0.014983349 = score(doc=1335,freq=1.0), product of:
              0.07914849 = queryWeight, product of:
                1.0646243 = boost
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.021476727 = queryNorm
              0.18930681 = fieldWeight in 1335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1335)
          0.015205647 = weight(abstract_txt:results in 1335) [ClassicSimilarity], result of:
            0.015205647 = score(doc=1335,freq=1.0), product of:
              0.07992942 = queryWeight, product of:
                1.0698636 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.021476727 = queryNorm
              0.19023843 = fieldWeight in 1335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1335)
          0.09326924 = weight(abstract_txt:ambiguity in 1335) [ClassicSimilarity], result of:
            0.09326924 = score(doc=1335,freq=2.0), product of:
              0.16872354 = queryWeight, product of:
                1.099127 = boost
                7.1475906 = idf(docFreq=94, maxDocs=44421)
                0.021476727 = queryNorm
              0.55279326 = fieldWeight in 1335, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.1475906 = idf(docFreq=94, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1335)
          0.0828566 = weight(abstract_txt:computation in 1335) [ClassicSimilarity], result of:
            0.0828566 = score(doc=1335,freq=1.0), product of:
              0.19644673 = queryWeight, product of:
                1.1859939 = boost
                7.7124834 = idf(docFreq=53, maxDocs=44421)
                0.021476727 = queryNorm
              0.42177644 = fieldWeight in 1335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7124834 = idf(docFreq=53, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1335)
          0.028736666 = weight(abstract_txt:problems in 1335) [ClassicSimilarity], result of:
            0.028736666 = score(doc=1335,freq=1.0), product of:
              0.12217836 = queryWeight, product of:
                1.322733 = boost
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.021476727 = queryNorm
              0.23520258 = fieldWeight in 1335, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.300847 = idf(docFreq=1636, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1335)
          0.08717101 = weight(abstract_txt:statistical in 1335) [ClassicSimilarity], result of:
            0.08717101 = score(doc=1335,freq=2.0), product of:
              0.2032083 = queryWeight, product of:
                1.7058694 = boost
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.021476727 = queryNorm
              0.42897367 = fieldWeight in 1335, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5466094 = idf(docFreq=470, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1335)
          0.16111459 = weight(abstract_txt:knowledge in 1335) [ClassicSimilarity], result of:
            0.16111459 = score(doc=1335,freq=4.0), product of:
              0.41536507 = queryWeight, product of:
                5.4534965 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.021476727 = queryNorm
              0.3878867 = fieldWeight in 1335, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1335)
        0.32 = coord(8/25)
    
  5. Prabowo, R.; Jackson, M.; Burden, P.; Knoell, H.-D.: Ontology-based automatic classification for the Web pages : design, implementation and evaluation (2002) 0.15
    0.1546827 = sum of:
      0.1546827 = product of:
        0.6445112 = sum of:
          0.07382437 = weight(abstract_txt:classifying in 4383) [ClassicSimilarity], result of:
            0.07382437 = score(doc=4383,freq=1.0), product of:
              0.14340311 = queryWeight, product of:
                1.013303 = boost
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.021476727 = queryNorm
              0.5148031 = fieldWeight in 4383, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.078125 = fieldNorm(doc=4383)
          0.021404786 = weight(abstract_txt:paper in 4383) [ClassicSimilarity], result of:
            0.021404786 = score(doc=4383,freq=1.0), product of:
              0.07914849 = queryWeight, product of:
                1.0646243 = boost
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.021476727 = queryNorm
              0.2704383 = fieldWeight in 4383, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.078125 = fieldNorm(doc=4383)
          0.021722354 = weight(abstract_txt:results in 4383) [ClassicSimilarity], result of:
            0.021722354 = score(doc=4383,freq=1.0), product of:
              0.07992942 = queryWeight, product of:
                1.0698636 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.021476727 = queryNorm
              0.2717692 = fieldWeight in 4383, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.078125 = fieldNorm(doc=4383)
          0.04053039 = weight(abstract_txt:approach in 4383) [ClassicSimilarity], result of:
            0.04053039 = score(doc=4383,freq=1.0), product of:
              0.13867123 = queryWeight, product of:
                1.725893 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.021476727 = queryNorm
              0.29227686 = fieldWeight in 4383, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=4383)
          0.34019926 = weight(abstract_txt:classifier in 4383) [ClassicSimilarity], result of:
            0.34019926 = score(doc=4383,freq=3.0), product of:
              0.34691033 = queryWeight, product of:
                2.2288644 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.021476727 = queryNorm
              0.9806547 = fieldWeight in 4383, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.078125 = fieldNorm(doc=4383)
          0.14683004 = weight(abstract_txt:classification in 4383) [ClassicSimilarity], result of:
            0.14683004 = score(doc=4383,freq=5.0), product of:
              0.21053874 = queryWeight, product of:
                2.4555912 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.021476727 = queryNorm
              0.6974015 = fieldWeight in 4383, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.078125 = fieldNorm(doc=4383)
        0.24 = coord(6/25)
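
The score trees above follow the classic Lucene TF-IDF pattern: each matching term contributes queryWeight (boost x idf x queryNorm) times fieldWeight (tf x idf x fieldNorm), and the sum over terms is scaled by coord (matching terms / query terms). The following is a minimal Python sketch that recomputes the score of entry 2 (doc 2717) from the figures listed in its tree. The function names are illustrative and not part of Lucene; the formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are assumed from ClassicSimilarity and agree with the tf and idf values shown above.

```python
import math

def idf(doc_freq, max_docs):
    # Inverse document frequency, assumed ClassicSimilarity form.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def tf(freq):
    # Term-frequency factor: square root of the raw term count.
    return math.sqrt(freq)

def term_score(freq, doc_freq, boost,
               max_docs=44421, query_norm=0.021476727, field_norm=0.0625):
    # Defaults are copied from the tree for doc 2717.
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm      # "queryWeight" node
    field_weight = tf(freq) * term_idf * field_norm   # "fieldWeight" node
    return query_weight * field_weight

# Matching terms for doc 2717: term -> (freq, docFreq, boost)
terms_2717 = {
    "paper": (1.0, 3788, 1.0646243),
    "maximum": (1.0, 85, 1.1144322),
    "entropy": (1.0, 41, 1.22464),
    "approach": (3.0, 2864, 1.725893),
    "classifier": (5.0, 85, 2.2288644),
    "classification": (5.0, 2228, 2.4555912),
}

term_scores = {t: term_score(*args) for t, args in terms_2717.items()}

# coord(6/25): 6 of the 25 query terms occur in this document.
coord = len(terms_2717) / 25
doc_score = sum(term_scores.values()) * coord

print(doc_score)
```

Run as written, the sketch should reproduce the listed score for entry 2 (0.1739821) to within floating-point rounding. Small differences in the last digits are expected because Lucene computes in single precision.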