Document (#28384)

Author
Prabowo, R.
Jackson, M.
Burden, P.
Knoell, H.-D.
Title
Ontology-based automatic classification for the Web pages : design, implementation and evaluation
Source
http://csdl.computer.org/comp/proceedings/wise/2002/1766/00/17660182abs.htm
Year
2002
Abstract
In recent years, we have witnessed continual growth in the use of ontologies as a mechanism to enable machine reasoning. This paper describes an automatic classifier that uses ontologies to classify Web pages with respect to the Dewey Decimal Classification (DDC) and Library of Congress Classification (LCC) schemes. Firstly, we explain how these ontologies can be built in a modular fashion and mapped into DDC and LCC. Secondly, we propose the formal definition of a DDC-LCC and an ontology-classification-scheme mapping. Thirdly, we explain the way the classifier uses these ontologies to assist classification. Finally, an experiment in which the accuracy of the classifier was evaluated is presented. The experiment shows that our approach results in improved classification accuracy. This improvement, however, comes at the cost of a low coverage ratio due to the incompleteness of the ontologies used.
Content
Paper presented at: The Third International Conference on Web Information Systems Engineering (WISE'02), Dec. 12-14, 2002, Singapore, p. 182.
Theme
Automatic classification
Object
DDC
LCC

Similar documents (author)

  1. Jackson, P.: A thesaurus for enhanced geographic access (1991) 5.44
    5.4410844 = sum of:
      5.4410844 = weight(author_txt:jackson in 2297) [ClassicSimilarity], result of:
        5.4410844 = fieldWeight in 2297, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.705735 = idf(docFreq=19, maxDocs=44421)
          0.625 = fieldNorm(doc=2297)
    
  2. Jackson, S.L.: Dziatzko, Karl (1984) 5.44
    5.4410844 = sum of:
      5.4410844 = weight(author_txt:jackson in 2684) [ClassicSimilarity], result of:
        5.4410844 = fieldWeight in 2684, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.705735 = idf(docFreq=19, maxDocs=44421)
          0.625 = fieldNorm(doc=2684)
    
  3. Jackson, S.L.: Drtina, Jaroslav (1984) 5.44
    5.4410844 = sum of:
      5.4410844 = weight(author_txt:jackson in 5795) [ClassicSimilarity], result of:
        5.4410844 = fieldWeight in 5795, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.705735 = idf(docFreq=19, maxDocs=44421)
          0.625 = fieldNorm(doc=5795)
    
  4. Jackson, J.N.: Every-name indexing (1992) 5.44
    5.4410844 = sum of:
      5.4410844 = weight(author_txt:jackson in 6355) [ClassicSimilarity], result of:
        5.4410844 = fieldWeight in 6355, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.705735 = idf(docFreq=19, maxDocs=44421)
          0.625 = fieldNorm(doc=6355)
    
  5. Jackson, K.: Easy and rapid access to national bibliographies and catalogs with software from On-line Computer Systems (1990) 5.44
    5.4410844 = sum of:
      5.4410844 = weight(author_txt:jackson in 3666) [ClassicSimilarity], result of:
        5.4410844 = fieldWeight in 3666, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.705735 = idf(docFreq=19, maxDocs=44421)
          0.625 = fieldNorm(doc=3666)
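
The author scores above all follow the same pattern: fieldWeight = tf * idf * fieldNorm. The sketch below reproduces that arithmetic using the ClassicSimilarity formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are illustrative and not part of any library, and Lucene works in 32-bit floats, so the last digits may differ slightly.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight as shown in the explain output: tf * idf * fieldNorm."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the "author_txt:jackson" explanations above.
idf_jackson = classic_idf(doc_freq=19, max_docs=44421)
author_score = field_weight(freq=1.0, doc_freq=19, max_docs=44421, field_norm=0.625)
print(f"idf = {idf_jackson:.4f}, fieldWeight = {author_score:.4f}")
```

Because every listed document matches the single term "jackson" once and shares the same fieldNorm, all five author matches receive an identical score.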
    

Similar documents (content)

  1. Farazi, M.: Faceted lightweight ontologies : a formalization and some experiments (2010) 0.19
    0.18833844 = sum of:
      0.18833844 = product of:
        0.78474355 = sum of:
          0.04200754 = weight(abstract_txt:reasoning in 997) [ClassicSimilarity], result of:
            0.04200754 = score(doc=997,freq=1.0), product of:
              0.10600978 = queryWeight, product of:
                1.031173 = boost
                6.3401756 = idf(docFreq=212, maxDocs=44421)
                0.016214857 = queryNorm
              0.39626098 = fieldWeight in 997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3401756 = idf(docFreq=212, maxDocs=44421)
                0.0625 = fieldNorm(doc=997)
          0.015032364 = weight(abstract_txt:these in 997) [ClassicSimilarity], result of:
            0.015032364 = score(doc=997,freq=2.0), product of:
              0.053433377 = queryWeight, product of:
                1.0353326 = boost
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.016214857 = queryNorm
              0.2813291 = fieldWeight in 997, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.0625 = fieldNorm(doc=997)
          0.13607477 = weight(abstract_txt:ontology in 997) [ClassicSimilarity], result of:
            0.13607477 = score(doc=997,freq=6.0), product of:
              0.16091841 = queryWeight, product of:
                1.7967036 = boost
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.016214857 = queryNorm
              0.84561336 = fieldWeight in 997, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.0625 = fieldNorm(doc=997)
          0.06962361 = weight(abstract_txt:accuracy in 997) [ClassicSimilarity], result of:
            0.06962361 = score(doc=997,freq=1.0), product of:
              0.18705766 = queryWeight, product of:
                1.9371413 = boost
                5.9552646 = idf(docFreq=312, maxDocs=44421)
                0.016214857 = queryNorm
              0.37220404 = fieldWeight in 997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9552646 = idf(docFreq=312, maxDocs=44421)
                0.0625 = fieldNorm(doc=997)
          0.06292138 = weight(abstract_txt:classification in 997) [ClassicSimilarity], result of:
            0.06292138 = score(doc=997,freq=1.0), product of:
              0.25217983 = queryWeight, product of:
                3.8957348 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.016214857 = queryNorm
              0.24950996 = fieldWeight in 997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=997)
          0.45908388 = weight(abstract_txt:ontologies in 997) [ClassicSimilarity], result of:
            0.45908388 = score(doc=997,freq=8.0), product of:
              0.44635713 = queryWeight, product of:
                4.731347 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.016214857 = queryNorm
              1.0285125 = fieldWeight in 997, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=997)
        0.24 = coord(6/25)
    
  2. Chung, Y.-M.; Noh, Y.-H.: Developing a specialized directory system by automatically classifying Web documents (2003) 0.15
    0.15181313 = sum of:
      0.15181313 = product of:
        0.63255477 = sum of:
          0.047160324 = weight(abstract_txt:classifying in 2566) [ClassicSimilarity], result of:
            0.047160324 = score(doc=2566,freq=1.0), product of:
              0.11451058 = queryWeight, product of:
                1.0717201 = boost
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.016214857 = queryNorm
              0.4118425 = fieldWeight in 2566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.0625 = fieldNorm(doc=2566)
          0.0725579 = weight(abstract_txt:ratio in 2566) [ClassicSimilarity], result of:
            0.0725579 = score(doc=2566,freq=1.0), product of:
              0.15261044 = queryWeight, product of:
                1.2372307 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.016214857 = queryNorm
              0.47544518 = fieldWeight in 2566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.0625 = fieldNorm(doc=2566)
          0.046236124 = weight(abstract_txt:automatic in 2566) [ClassicSimilarity], result of:
            0.046236124 = score(doc=2566,freq=1.0), product of:
              0.14238319 = queryWeight, product of:
                1.6900631 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.016214857 = queryNorm
              0.32473022 = fieldWeight in 2566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=2566)
          0.05973577 = weight(abstract_txt:experiment in 2566) [ClassicSimilarity], result of:
            0.05973577 = score(doc=2566,freq=1.0), product of:
              0.16889913 = queryWeight, product of:
                1.840718 = boost
                5.658835 = idf(docFreq=420, maxDocs=44421)
                0.016214857 = queryNorm
              0.35367718 = fieldWeight in 2566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.658835 = idf(docFreq=420, maxDocs=44421)
                0.0625 = fieldNorm(doc=2566)
          0.26616815 = weight(abstract_txt:classifier in 2566) [ClassicSimilarity], result of:
            0.26616815 = score(doc=2566,freq=2.0), product of:
              0.4155235 = queryWeight, product of:
                3.5360386 = boost
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.016214857 = queryNorm
              0.640561 = fieldWeight in 2566, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=2566)
          0.14069648 = weight(abstract_txt:classification in 2566) [ClassicSimilarity], result of:
            0.14069648 = score(doc=2566,freq=5.0), product of:
              0.25217983 = queryWeight, product of:
                3.8957348 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.016214857 = queryNorm
              0.55792123 = fieldWeight in 2566, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=2566)
        0.24 = coord(6/25)
    
  3. Giunchiglia, F.; Zaihrayeu, I.; Farazi, F.: Converting classifications into OWL ontologies (2009) 0.12
    0.123956956 = sum of:
      0.123956956 = product of:
        0.6197848 = sum of:
          0.074259534 = weight(abstract_txt:reasoning in 690) [ClassicSimilarity], result of:
            0.074259534 = score(doc=690,freq=2.0), product of:
              0.10600978 = queryWeight, product of:
                1.031173 = boost
                6.3401756 = idf(docFreq=212, maxDocs=44421)
                0.016214857 = queryNorm
              0.70049703 = fieldWeight in 690, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.3401756 = idf(docFreq=212, maxDocs=44421)
                0.078125 = fieldNorm(doc=690)
          0.013286858 = weight(abstract_txt:these in 690) [ClassicSimilarity], result of:
            0.013286858 = score(doc=690,freq=1.0), product of:
              0.053433377 = queryWeight, product of:
                1.0353326 = boost
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.016214857 = queryNorm
              0.24866214 = fieldWeight in 690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.078125 = fieldNorm(doc=690)
          0.06944036 = weight(abstract_txt:ontology in 690) [ClassicSimilarity], result of:
            0.06944036 = score(doc=690,freq=1.0), product of:
              0.16091841 = queryWeight, product of:
                1.7967036 = boost
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.016214857 = queryNorm
              0.43152526 = fieldWeight in 690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.078125 = fieldNorm(doc=690)
          0.1758706 = weight(abstract_txt:classification in 690) [ClassicSimilarity], result of:
            0.1758706 = score(doc=690,freq=5.0), product of:
              0.25217983 = queryWeight, product of:
                3.8957348 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.016214857 = queryNorm
              0.6974015 = fieldWeight in 690, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.078125 = fieldNorm(doc=690)
          0.28692743 = weight(abstract_txt:ontologies in 690) [ClassicSimilarity], result of:
            0.28692743 = score(doc=690,freq=2.0), product of:
              0.44635713 = queryWeight, product of:
                4.731347 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.016214857 = queryNorm
              0.6428203 = fieldWeight in 690, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.078125 = fieldNorm(doc=690)
        0.2 = coord(5/25)
    
  4. Prieto-Díaz, R.: A faceted approach to building ontologies (2002) 0.12
    0.11802492 = sum of:
      0.11802492 = product of:
        0.5901246 = sum of:
          0.010629486 = weight(abstract_txt:these in 3259) [ClassicSimilarity], result of:
            0.010629486 = score(doc=3259,freq=1.0), product of:
              0.053433377 = queryWeight, product of:
                1.0353326 = boost
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.016214857 = queryNorm
              0.19892971 = fieldWeight in 3259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.0625 = fieldNorm(doc=3259)
          0.078562796 = weight(abstract_txt:ontology in 3259) [ClassicSimilarity], result of:
            0.078562796 = score(doc=3259,freq=2.0), product of:
              0.16091841 = queryWeight, product of:
                1.7967036 = boost
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.016214857 = queryNorm
              0.4882151 = fieldWeight in 3259, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.0625 = fieldNorm(doc=3259)
          0.07507323 = weight(abstract_txt:explain in 3259) [ClassicSimilarity], result of:
            0.07507323 = score(doc=3259,freq=1.0), product of:
              0.19669554 = queryWeight, product of:
                1.9864188 = boost
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.016214857 = queryNorm
              0.38167226 = fieldWeight in 3259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.106756 = idf(docFreq=268, maxDocs=44421)
                0.0625 = fieldNorm(doc=3259)
          0.06292138 = weight(abstract_txt:classification in 3259) [ClassicSimilarity], result of:
            0.06292138 = score(doc=3259,freq=1.0), product of:
              0.25217983 = queryWeight, product of:
                3.8957348 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.016214857 = queryNorm
              0.24950996 = fieldWeight in 3259, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=3259)
          0.3629377 = weight(abstract_txt:ontologies in 3259) [ClassicSimilarity], result of:
            0.3629377 = score(doc=3259,freq=5.0), product of:
              0.44635713 = queryWeight, product of:
                4.731347 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.016214857 = queryNorm
              0.81311053 = fieldWeight in 3259, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=3259)
        0.2 = coord(5/25)
    
  5. Almeida Campos, M.L. de; Machado Campos, M.L.; Dávila, A.M.R.; Espanha Gomes, H.; Campos, L.M.; Lira e Oliveira, L. de: Information sciences methodological aspects applied to ontology reuse tools : a study based on genomic annotations in the domain of trypanosomatides (2013) 0.12
    0.116507925 = sum of:
      0.116507925 = product of:
        0.5825396 = sum of:
          0.038311806 = weight(abstract_txt:improvement in 1635) [ClassicSimilarity], result of:
            0.038311806 = score(doc=1635,freq=1.0), product of:
              0.09969718 = queryWeight, product of:
                6.148508 = idf(docFreq=257, maxDocs=44421)
                0.016214857 = queryNorm
              0.38428175 = fieldWeight in 1635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.148508 = idf(docFreq=257, maxDocs=44421)
                0.0625 = fieldNorm(doc=1635)
          0.046236124 = weight(abstract_txt:automatic in 1635) [ClassicSimilarity], result of:
            0.046236124 = score(doc=1635,freq=1.0), product of:
              0.14238319 = queryWeight, product of:
                1.6900631 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.016214857 = queryNorm
              0.32473022 = fieldWeight in 1635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=1635)
          0.15712559 = weight(abstract_txt:ontology in 1635) [ClassicSimilarity], result of:
            0.15712559 = score(doc=1635,freq=8.0), product of:
              0.16091841 = queryWeight, product of:
                1.7967036 = boost
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.016214857 = queryNorm
              0.9764302 = fieldWeight in 1635, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.0625 = fieldNorm(doc=1635)
          0.05973577 = weight(abstract_txt:experiment in 1635) [ClassicSimilarity], result of:
            0.05973577 = score(doc=1635,freq=1.0), product of:
              0.16889913 = queryWeight, product of:
                1.840718 = boost
                5.658835 = idf(docFreq=420, maxDocs=44421)
                0.016214857 = queryNorm
              0.35367718 = fieldWeight in 1635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.658835 = idf(docFreq=420, maxDocs=44421)
                0.0625 = fieldNorm(doc=1635)
          0.2811303 = weight(abstract_txt:ontologies in 1635) [ClassicSimilarity], result of:
            0.2811303 = score(doc=1635,freq=3.0), product of:
              0.44635713 = queryWeight, product of:
                4.731347 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.016214857 = queryNorm
              0.6298327 = fieldWeight in 1635, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=1635)
        0.2 = coord(5/25)
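
The content scores above combine several per-term scores. Each term contributes queryWeight * fieldWeight, where queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm; the sum is then scaled by coord, the fraction of query terms that matched. Where no boost line appears (as for "improvement"), the boost is 1.0. The sketch below recomputes the score of the first result (doc 997) from the values in its explanation. Function and variable names are illustrative assumptions, and small floating-point differences from Lucene's 32-bit arithmetic are expected.

```python
import math

QUERY_NORM = 0.016214857  # queryNorm copied from the explain output
MAX_DOCS = 44421          # maxDocs copied from the explain output


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (term, freq, docFreq, boost) for doc 997, whose fieldNorm is 0.0625.
terms_doc_997 = [
    ("reasoning", 1.0, 212, 1.031173),
    ("these", 2.0, 5006, 1.0353326),
    ("ontology", 6.0, 481, 1.7967036),
    ("accuracy", 1.0, 312, 1.9371413),
    ("classification", 1.0, 2228, 3.8957348),
    ("ontologies", 8.0, 358, 4.731347),
]

raw_sum = sum(term_score(f, df, b, 0.0625) for _, f, df, b in terms_doc_997)
coord = len(terms_doc_997) / 25  # 6 of the 25 query terms matched
content_score = raw_sum * coord
print(f"sum = {raw_sum:.4f}, coord = {coord:.2f}, score = {content_score:.4f}")
```

The coord factor explains why the final scores are much lower than the raw sums: matching only 5 or 6 of 25 query terms scales the sum by 0.2 or 0.24.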