Document (#35029)

Author
Liu, W.
Weichselbraun, A.
Scharl, A.
Chang, E.
Title
Semi-automatic ontology extension using spreading activation
Source
Journal of universal knowledge management. 0(2005) no.1, pp. 50-58
Year
2005
Abstract
This paper describes a system to semi-automatically extend and refine ontologies by mining textual data from the Web sites of international online media. Expanding a seed ontology creates a semantic network through co-occurrence analysis, trigger phrase analysis, and disambiguation based on the WordNet lexical dictionary. Spreading activation then processes this semantic network to find the most probable candidates for inclusion in an extended ontology. Approaches to identifying hierarchical relationships, such as subsumption, head noun analysis and WordNet consultation, are used to confirm and classify the relationships found. Using a seed ontology on "climate change" as an example, this paper demonstrates how spreading activation improves the results by naturally integrating these methods.
Theme
Data Mining
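
The spreading activation step named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the graph, the edge weights, and the parameters decay, threshold and max_steps are hypothetical values chosen only to show how activation flows from seed concepts to candidate terms in a weighted semantic network.

```python
from collections import defaultdict


def spread_activation(graph, seeds, decay=0.5, threshold=0.01, max_steps=3):
    """Propagate activation from seed nodes through a weighted graph.

    graph maps node -> {neighbor: edge weight in [0, 1]}.
    decay, threshold and max_steps are illustrative assumptions.
    """
    activation = defaultdict(float)
    for node in seeds:
        activation[node] = 1.0
    frontier = dict.fromkeys(seeds, 1.0)
    for _ in range(max_steps):
        next_frontier = defaultdict(float)
        for node, energy in frontier.items():
            for neighbor, weight in graph.get(node, {}).items():
                pulse = energy * weight * decay
                if pulse >= threshold:  # drop pulses too weak to matter
                    next_frontier[neighbor] += pulse
        if not next_frontier:
            break
        for node, pulse in next_frontier.items():
            activation[node] += pulse
        frontier = next_frontier
    return dict(activation)


def rank_candidates(graph, seeds, **kwargs):
    """Return non-seed nodes sorted by accumulated activation, highest first."""
    activation = spread_activation(graph, seeds, **kwargs)
    candidates = {n: a for n, a in activation.items() if n not in seeds}
    return sorted(candidates.items(), key=lambda item: item[1], reverse=True)


# Hypothetical network around the "climate change" seed concept.
graph = {
    "climate change": {"global warming": 0.9, "emissions": 0.6},
    "global warming": {"greenhouse gas": 0.8},
    "emissions": {"greenhouse gas": 0.7, "traffic": 0.2},
}
ranked = rank_candidates(graph, ["climate change"])
for term, score in ranked:
    print(f"{term}: {score:.3f}")
```

In this toy network "greenhouse gas" is not linked to the seed directly, yet it accumulates activation over two separate paths, which is the sense in which spreading activation can combine evidence from several sources.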

Similar documents (author)

  1. Chang, R.: DBase, relational data models, and MARC records (1992) 4.79
    4.78651 = sum of:
      4.78651 = weight(author_txt:chang in 5056) [ClassicSimilarity], result of:
        4.78651 = fieldWeight in 5056, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.6584163 = idf(docFreq=56, maxDocs=44421)
          0.625 = fieldNorm(doc=5056)
    
  2. Chang, R.: The development of indexing technology (1993) 4.79
    4.78651 = sum of:
      4.78651 = weight(author_txt:chang in 7023) [ClassicSimilarity], result of:
        4.78651 = fieldWeight in 7023, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.6584163 = idf(docFreq=56, maxDocs=44421)
          0.625 = fieldNorm(doc=7023)
    
  3. Chang, R.: Keyword searching and indexing (1993) 4.79
    4.78651 = sum of:
      4.78651 = weight(author_txt:chang in 7222) [ClassicSimilarity], result of:
        4.78651 = fieldWeight in 7222, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.6584163 = idf(docFreq=56, maxDocs=44421)
          0.625 = fieldNorm(doc=7222)
    
  4. Chang, R.H.: To classify or not to classify? : a new look at an old problem (1989) 4.79
    4.78651 = sum of:
      4.78651 = weight(author_txt:chang in 2578) [ClassicSimilarity], result of:
        4.78651 = fieldWeight in 2578, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.6584163 = idf(docFreq=56, maxDocs=44421)
          0.625 = fieldNorm(doc=2578)
    
  5. Chang, S.H.: The current state of Web search engines (1999) 4.79
    4.78651 = sum of:
      4.78651 = weight(author_txt:chang in 1509) [ClassicSimilarity], result of:
        4.78651 = fieldWeight in 1509, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.6584163 = idf(docFreq=56, maxDocs=44421)
          0.625 = fieldNorm(doc=1509)
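
The author scores above can be reproduced by hand. The sketch below assumes the classic Lucene TF-IDF formulas, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which are consistent with the figures in the listing. The function names are illustrative, and fieldNorm is taken from the listing as given rather than recomputed.

```python
import math


def classic_tf(freq):
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency (assumed form, matches the listing)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def field_weight(freq, doc_freq, max_docs, field_norm):
    """fieldWeight = tf * idf * fieldNorm, as in the explain output."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:chang entries above.
author_idf = classic_idf(doc_freq=56, max_docs=44421)
author_score = field_weight(freq=1.0, doc_freq=56, max_docs=44421, field_norm=0.625)
print(round(author_idf, 4), round(author_score, 4))
```

All five author matches share the same freq, docFreq and fieldNorm, which is why they receive the identical score of 4.79.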
    

Similar documents (content)

  1. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.14
    0.13795322 = sum of:
      0.13795322 = product of:
        0.5748051 = sum of:
          0.0047860676 = weight(abstract_txt:this in 1175) [ClassicSimilarity], result of:
            0.0047860676 = score(doc=1175,freq=1.0), product of:
              0.03637079 = queryWeight, product of:
                1.0186777 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.014838089 = queryNorm
              0.13159096 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.04587767 = weight(abstract_txt:semantic in 1175) [ClassicSimilarity], result of:
            0.04587767 = score(doc=1175,freq=5.0), product of:
              0.08384568 = queryWeight, product of:
                1.2628598 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.014838089 = queryNorm
              0.54716796 = fieldWeight in 1175, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.022642195 = weight(abstract_txt:network in 1175) [ClassicSimilarity], result of:
            0.022642195 = score(doc=1175,freq=1.0), product of:
              0.08953967 = queryWeight, product of:
                1.3050362 = boost
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.014838089 = queryNorm
              0.25287333 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.03581287 = weight(abstract_txt:relationships in 1175) [ClassicSimilarity], result of:
            0.03581287 = score(doc=1175,freq=2.0), product of:
              0.09647598 = queryWeight, product of:
                1.3546416 = boost
                4.7997303 = idf(docFreq=993, maxDocs=44421)
                0.014838089 = queryNorm
              0.37121022 = fieldWeight in 1175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7997303 = idf(docFreq=993, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.2306934 = weight(abstract_txt:activation in 1175) [ClassicSimilarity], result of:
            0.2306934 = score(doc=1175,freq=1.0), product of:
              0.48171517 = queryWeight, product of:
                3.7072818 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.014838089 = queryNorm
              0.47890002 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.23499288 = weight(abstract_txt:spreading in 1175) [ClassicSimilarity], result of:
            0.23499288 = score(doc=1175,freq=1.0), product of:
              0.48768196 = queryWeight, product of:
                3.7301712 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.014838089 = queryNorm
              0.48185682 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
        0.24 = coord(6/25)
    
  2. Na, J.-C.; Neoh, H.L.: Effectiveness of UMLS semantic network as a seed ontology for building a medical domain ontology (2008) 0.13
    0.12800036 = sum of:
      0.12800036 = product of:
        0.6400018 = sum of:
          0.009473956 = weight(abstract_txt:this in 2910) [ClassicSimilarity], result of:
            0.009473956 = score(doc=2910,freq=3.0), product of:
              0.03637079 = queryWeight, product of:
                1.0186777 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.014838089 = queryNorm
              0.26048255 = fieldWeight in 2910, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.0625 = fieldNorm(doc=2910)
          0.07034441 = weight(abstract_txt:semantic in 2910) [ClassicSimilarity], result of:
            0.07034441 = score(doc=2910,freq=9.0), product of:
              0.08384568 = queryWeight, product of:
                1.2628598 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.014838089 = queryNorm
              0.8389747 = fieldWeight in 2910, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=2910)
          0.06338494 = weight(abstract_txt:network in 2910) [ClassicSimilarity], result of:
            0.06338494 = score(doc=2910,freq=6.0), product of:
              0.08953967 = queryWeight, product of:
                1.3050362 = boost
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.014838089 = queryNorm
              0.7078979 = fieldWeight in 2910, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.0625 = fieldNorm(doc=2910)
          0.28071582 = weight(abstract_txt:seed in 2910) [ClassicSimilarity], result of:
            0.28071582 = score(doc=2910,freq=3.0), product of:
              0.30423746 = queryWeight, product of:
                2.4055874 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.014838089 = queryNorm
              0.9226866 = fieldWeight in 2910, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0625 = fieldNorm(doc=2910)
          0.2160826 = weight(abstract_txt:ontology in 2910) [ClassicSimilarity], result of:
            0.2160826 = score(doc=2910,freq=6.0), product of:
              0.25553358 = queryWeight, product of:
                3.1178396 = boost
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.014838089 = queryNorm
              0.84561336 = fieldWeight in 2910, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.0625 = fieldNorm(doc=2910)
        0.2 = coord(5/25)
    
  3. Kulyukin, V.A.; Settle, A.: Ranked retrieval with semantic networks and vector spaces (2001) 0.12
    0.118728444 = sum of:
      0.118728444 = product of:
        0.9894037 = sum of:
          0.058031175 = weight(abstract_txt:semantic in 934) [ClassicSimilarity], result of:
            0.058031175 = score(doc=934,freq=2.0), product of:
              0.08384568 = queryWeight, product of:
                1.2628598 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.014838089 = queryNorm
              0.6921188 = fieldWeight in 934, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.109375 = fieldNorm(doc=934)
          0.4613868 = weight(abstract_txt:activation in 934) [ClassicSimilarity], result of:
            0.4613868 = score(doc=934,freq=1.0), product of:
              0.48171517 = queryWeight, product of:
                3.7072818 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.014838089 = queryNorm
              0.95780003 = fieldWeight in 934, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.109375 = fieldNorm(doc=934)
          0.46998575 = weight(abstract_txt:spreading in 934) [ClassicSimilarity], result of:
            0.46998575 = score(doc=934,freq=1.0), product of:
              0.48768196 = queryWeight, product of:
                3.7301712 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.014838089 = queryNorm
              0.96371365 = fieldWeight in 934, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.109375 = fieldNorm(doc=934)
        0.12 = coord(3/25)
    
  4. Chen, H.; Ng, T.: An algorithmic approach to concept exploration in a large knowledge network (automatic thesaurus consultation) : symbolic branch-and-bound search versus connectionist Hopfield Net Activation (1995) 0.12
    0.11845127 = sum of:
      0.11845127 = product of:
        0.74032044 = sum of:
          0.029310169 = weight(abstract_txt:semantic in 2271) [ClassicSimilarity], result of:
            0.029310169 = score(doc=2271,freq=1.0), product of:
              0.08384568 = queryWeight, product of:
                1.2628598 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.014838089 = queryNorm
              0.34957278 = fieldWeight in 2271, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.078125 = fieldNorm(doc=2271)
          0.045744143 = weight(abstract_txt:network in 2271) [ClassicSimilarity], result of:
            0.045744143 = score(doc=2271,freq=2.0), product of:
              0.08953967 = queryWeight, product of:
                1.3050362 = boost
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.014838089 = queryNorm
              0.5108813 = fieldWeight in 2271, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6239696 = idf(docFreq=1184, maxDocs=44421)
                0.078125 = fieldNorm(doc=2271)
          0.32956198 = weight(abstract_txt:activation in 2271) [ClassicSimilarity], result of:
            0.32956198 = score(doc=2271,freq=1.0), product of:
              0.48171517 = queryWeight, product of:
                3.7072818 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.014838089 = queryNorm
              0.6841428 = fieldWeight in 2271, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.078125 = fieldNorm(doc=2271)
          0.33570412 = weight(abstract_txt:spreading in 2271) [ClassicSimilarity], result of:
            0.33570412 = score(doc=2271,freq=1.0), product of:
              0.48768196 = queryWeight, product of:
                3.7301712 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.014838089 = queryNorm
              0.6883669 = fieldWeight in 2271, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.078125 = fieldNorm(doc=2271)
        0.16 = coord(4/25)
    
  5. Vlachidis, A.; Tudhope, D.: A knowledge-based approach to information extraction for semantic interoperability in the archaeology domain (2016) 0.10
    0.10497537 = sum of:
      0.10497537 = product of:
        0.37491205 = sum of:
          0.046965063 = weight(abstract_txt:phrase in 3895) [ClassicSimilarity], result of:
            0.046965063 = score(doc=3895,freq=1.0), product of:
              0.10574222 = queryWeight, product of:
                1.0028224 = boost
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.014838089 = queryNorm
              0.44414672 = fieldWeight in 3895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1063476 = idf(docFreq=98, maxDocs=44421)
                0.0625 = fieldNorm(doc=3895)
          0.07294935 = weight(abstract_txt:disambiguation in 3895) [ClassicSimilarity], result of:
            0.07294935 = score(doc=3895,freq=2.0), product of:
              0.11256485 = queryWeight, product of:
                1.0346684 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.014838089 = queryNorm
              0.6480651 = fieldWeight in 3895, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0625 = fieldNorm(doc=3895)
          0.061381653 = weight(abstract_txt:noun in 3895) [ClassicSimilarity], result of:
            0.061381653 = score(doc=3895,freq=1.0), product of:
              0.12640305 = queryWeight, product of:
                1.0964241 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.014838089 = queryNorm
              0.48560262 = fieldWeight in 3895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.0625 = fieldNorm(doc=3895)
          0.057435967 = weight(abstract_txt:semantic in 3895) [ClassicSimilarity], result of:
            0.057435967 = score(doc=3895,freq=6.0), product of:
              0.08384568 = queryWeight, product of:
                1.2628598 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.014838089 = queryNorm
              0.68501997 = fieldWeight in 3895, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=3895)
          0.028941168 = weight(abstract_txt:relationships in 3895) [ClassicSimilarity], result of:
            0.028941168 = score(doc=3895,freq=1.0), product of:
              0.09647598 = queryWeight, product of:
                1.3546416 = boost
                4.7997303 = idf(docFreq=993, maxDocs=44421)
                0.014838089 = queryNorm
              0.29998314 = fieldWeight in 3895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7997303 = idf(docFreq=993, maxDocs=44421)
                0.0625 = fieldNorm(doc=3895)
          0.019023513 = weight(abstract_txt:analysis in 3895) [ClassicSimilarity], result of:
            0.019023513 = score(doc=3895,freq=1.0), product of:
              0.08348967 = queryWeight, product of:
                1.543394 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.014838089 = queryNorm
              0.2278547 = fieldWeight in 3895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=3895)
          0.08821535 = weight(abstract_txt:ontology in 3895) [ClassicSimilarity], result of:
            0.08821535 = score(doc=3895,freq=1.0), product of:
              0.25553358 = queryWeight, product of:
                3.1178396 = boost
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.014838089 = queryNorm
              0.3452202 = fieldWeight in 3895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.0625 = fieldNorm(doc=3895)
        0.28 = coord(7/25)
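
The content scores combine per-term products with a coordination factor. As a worked check, the sketch below recomputes the score of entry 3 (Kulyukin and Settle) from the numbers printed in its listing, assuming the structure shown there: each term contributes queryWeight * fieldWeight, with queryWeight = boost * idf * queryNorm and fieldWeight = sqrt(freq) * idf * fieldNorm, and the sum is multiplied by coord = matched terms / query terms. The function and variable names are illustrative.

```python
import math

QUERY_NORM = 0.014838089   # queryNorm, copied from the listing
TERMS_IN_QUERY = 25        # denominator of coord(3/25)
FIELD_NORM = 0.109375      # fieldNorm(doc=934)


def term_score(freq, boost, idf, field_norm, query_norm=QUERY_NORM):
    """Score of one matching term: queryWeight * fieldWeight."""
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


# (term, freq, boost, idf) for the three terms matched in doc 934.
matched = [
    ("semantic", 2.0, 1.2628598, 4.4745317),
    ("activation", 1.0, 3.7072818, 8.757029),
    ("spreading", 1.0, 3.7301712, 8.811096),
]

total = sum(term_score(freq, boost, idf, FIELD_NORM) for _, freq, boost, idf in matched)
coord = len(matched) / TERMS_IN_QUERY
score = coord * total
print(round(total, 4), round(score, 4))
```

The rare terms "spreading" and "activation" dominate the sum because idf enters the product twice, once through queryWeight and once through fieldWeight, while the coord factor penalizes documents that match only a few of the 25 query terms.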