Document (#32923)

Author
Kim, J.-M.
Shin, H.
Kim, H.-J.
Title
Schema and constraints-based matching and merging of Topic Maps
Source
Information processing and management. 43(2007) no.4, S.930-945
Year
2007
Abstract
In this paper, we propose a multi-strategic matching and merging approach to find correspondences between ontologies based on the syntactic or semantic characteristics and constraints of the Topic Maps. Our multi-strategic matching approach consists of a linguistic module and a Topic Map constraints-based module. A linguistic module computes similarities between concepts using morphological analysis, string normalization and tokenization and language-dependent heuristics. A Topic Map constraints-based module takes advantage of several Topic Maps-dependent techniques such as a topic property-based matching, a hierarchy-based matching, and an association-based matching. This is a composite matching procedure and need not generate a cross-pair of all topics from the ontologies because unmatched pairs of topics can be removed by characteristics and constraints of the Topic Maps. Merging between Topic Maps follows the matching operations. We set up the MERGE function to integrate two Topic Maps into a new Topic Map, which satisfies such merge requirements as entity preservation, property preservation, relation preservation, and conflict resolution. For our experiments, we used oriental philosophy ontologies, western philosophy ontologies, Yahoo western philosophy dictionary, and Wikipedia philosophy ontology as input ontologies. Our experiments show that the automatically generated matching results conform to the outputs generated manually by domain experts and can be of great benefit to the following merging operations.
Theme
Semantische Interoperabilität

Similar documents (author)

  1. Shin, H.-s.: Quality of Korean cataloging records in shared databases (2003) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:shin in 498) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 498, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=498)
    
  2. Shin, D.-H.: Next generation of information infrastructure : a comparative case study of Korea versus the United States of America (2008) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:shin in 3365) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 3365, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=3365)
    
  3. Leydesdorff, L.; Shin, J.C.: How to evaluate universities in terms of their relative citation impacts : fractional counting of citations and the normalization of differences among disciplines (2011) 4.88
    4.8777785 = sum of:
      4.8777785 = weight(author_txt:shin in 466) [ClassicSimilarity], result of:
        4.8777785 = fieldWeight in 466, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.5 = fieldNorm(doc=466)
    
  4. Keselman, A.; Rosemblat, G.; Kilicoglu, H.; Fiszman, M.; Jin, H.; Shin, D.; Rindflesch, T.C.: Adapting semantic natural language processing technology to address information overload in influenza epidemic management (2010) 2.44
    2.4388893 = sum of:
      2.4388893 = weight(author_txt:shin in 2312) [ClassicSimilarity], result of:
        2.4388893 = fieldWeight in 2312, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.25 = fieldNorm(doc=2312)
    
  5. Rosemblat, G.; Resnick, M.P.; Auston, I.; Shin, D.; Sneiderman, C.; Fizsman, M.; Rindflesch, T.C.: Extending SemRep to the public health domain (2013) 2.44
    2.4388893 = sum of:
      2.4388893 = weight(author_txt:shin in 3096) [ClassicSimilarity], result of:
        2.4388893 = fieldWeight in 3096, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.755557 = idf(docFreq=6, maxDocs=44421)
          0.25 = fieldNorm(doc=3096)
    

Similar documents (content)

  1. Widhalm, R.; Mueck, T.A.: Merging topics in well-formed XML topic maps (2003) 0.24
    0.23995547 = sum of:
      0.23995547 = product of:
        1.1997774 = sum of:
          0.028129105 = weight(abstract_txt:topics in 3186) [ClassicSimilarity], result of:
            0.028129105 = score(doc=3186,freq=1.0), product of:
              0.07089419 = queryWeight, product of:
                1.1294329 = boost
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.012359332 = queryNorm
              0.39677587 = fieldWeight in 3186, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.078125 = fieldNorm(doc=3186)
          0.3685622 = weight(abstract_txt:merging in 3186) [ClassicSimilarity], result of:
            0.3685622 = score(doc=3186,freq=4.0), product of:
              0.31273076 = queryWeight, product of:
                3.3547132 = boost
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.012359332 = queryNorm
              1.1785288 = fieldWeight in 3186, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.078125 = fieldNorm(doc=3186)
          0.2352516 = weight(abstract_txt:constraints in 3186) [ClassicSimilarity], result of:
            0.2352516 = score(doc=3186,freq=2.0), product of:
              0.31465295 = queryWeight, product of:
                3.7621925 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.012359332 = queryNorm
              0.7476542 = fieldWeight in 3186, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.078125 = fieldNorm(doc=3186)
          0.22837754 = weight(abstract_txt:maps in 3186) [ClassicSimilarity], result of:
            0.22837754 = score(doc=3186,freq=3.0), product of:
              0.28637975 = queryWeight, product of:
                3.9317589 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.012359332 = queryNorm
              0.79746395 = fieldWeight in 3186, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.078125 = fieldNorm(doc=3186)
          0.33945695 = weight(abstract_txt:topic in 3186) [ClassicSimilarity], result of:
            0.33945695 = score(doc=3186,freq=6.0), product of:
              0.35099646 = queryWeight, product of:
                5.61942 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.012359332 = queryNorm
              0.9671235 = fieldWeight in 3186, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.078125 = fieldNorm(doc=3186)
        0.2 = coord(5/25)
    
  2. Baião Salgado Silva, G.; Lima, G.Â. Borém de Oliveira: Using topic maps in establishing compatibility of semantically structured hypertext contents (2012) 0.22
    0.2224802 = sum of:
      0.2224802 = product of:
        0.7945721 = sum of:
          0.01994133 = weight(abstract_txt:characteristics in 1633) [ClassicSimilarity], result of:
            0.01994133 = score(doc=1633,freq=1.0), product of:
              0.065405786 = queryWeight, product of:
                1.0848337 = boost
                4.8781815 = idf(docFreq=918, maxDocs=44421)
                0.012359332 = queryNorm
              0.30488634 = fieldWeight in 1633, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8781815 = idf(docFreq=918, maxDocs=44421)
                0.0625 = fieldNorm(doc=1633)
          0.015060312 = weight(abstract_txt:between in 1633) [ClassicSimilarity], result of:
            0.015060312 = score(doc=1633,freq=2.0), product of:
              0.049282167 = queryWeight, product of:
                1.1533089 = boost
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.012359332 = queryNorm
              0.30559355 = fieldWeight in 1633, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.0625 = fieldNorm(doc=1633)
          0.04688327 = weight(abstract_txt:property in 1633) [ClassicSimilarity], result of:
            0.04688327 = score(doc=1633,freq=1.0), product of:
              0.11564459 = queryWeight, product of:
                1.4425064 = boost
                6.4865317 = idf(docFreq=183, maxDocs=44421)
                0.012359332 = queryNorm
              0.40540823 = fieldWeight in 1633, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4865317 = idf(docFreq=183, maxDocs=44421)
                0.0625 = fieldNorm(doc=1633)
          0.18701267 = weight(abstract_txt:merge in 1633) [ClassicSimilarity], result of:
            0.18701267 = score(doc=1633,freq=3.0), product of:
              0.20167576 = queryWeight, product of:
                1.9049429 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.012359332 = queryNorm
              0.9272938 = fieldWeight in 1633, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.0625 = fieldNorm(doc=1633)
          0.0193905 = weight(abstract_txt:based in 1633) [ClassicSimilarity], result of:
            0.0193905 = score(doc=1633,freq=1.0), product of:
              0.097468 = queryWeight, product of:
                2.4775372 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.012359332 = queryNorm
              0.1989422 = fieldWeight in 1633, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=1633)
          0.2583797 = weight(abstract_txt:maps in 1633) [ClassicSimilarity], result of:
            0.2583797 = score(doc=1633,freq=6.0), product of:
              0.28637975 = queryWeight, product of:
                3.9317589 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.012359332 = queryNorm
              0.9022275 = fieldWeight in 1633, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.0625 = fieldNorm(doc=1633)
          0.24790427 = weight(abstract_txt:topic in 1633) [ClassicSimilarity], result of:
            0.24790427 = score(doc=1633,freq=5.0), product of:
              0.35099646 = queryWeight, product of:
                5.61942 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.012359332 = queryNorm
              0.7062871 = fieldWeight in 1633, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=1633)
        0.28 = coord(7/25)
    
  3. Green, R.: Topical relevance relationships : 2: an exploratory study and preliminary typology (1995) 0.20
    0.19823816 = sum of:
      0.19823816 = product of:
        0.82599235 = sum of:
          0.03978056 = weight(abstract_txt:topics in 3792) [ClassicSimilarity], result of:
            0.03978056 = score(doc=3792,freq=2.0), product of:
              0.07089419 = queryWeight, product of:
                1.1294329 = boost
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.012359332 = queryNorm
              0.5611258 = fieldWeight in 3792, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.078125 = fieldNorm(doc=3792)
          0.01882539 = weight(abstract_txt:between in 3792) [ClassicSimilarity], result of:
            0.01882539 = score(doc=3792,freq=2.0), product of:
              0.049282167 = queryWeight, product of:
                1.1533089 = boost
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.012359332 = queryNorm
              0.38199192 = fieldWeight in 3792, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.078125 = fieldNorm(doc=3792)
          0.035903227 = weight(abstract_txt:generated in 3792) [ClassicSimilarity], result of:
            0.035903227 = score(doc=3792,freq=1.0), product of:
              0.083418496 = queryWeight, product of:
                1.2251416 = boost
                5.509105 = idf(docFreq=488, maxDocs=44421)
                0.012359332 = queryNorm
              0.43039885 = fieldWeight in 3792, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.509105 = idf(docFreq=488, maxDocs=44421)
                0.078125 = fieldNorm(doc=3792)
          0.16634801 = weight(abstract_txt:constraints in 3792) [ClassicSimilarity], result of:
            0.16634801 = score(doc=3792,freq=1.0), product of:
              0.31465295 = queryWeight, product of:
                3.7621925 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.012359332 = queryNorm
              0.5286714 = fieldWeight in 3792, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.078125 = fieldNorm(doc=3792)
          0.19598554 = weight(abstract_txt:topic in 3792) [ClassicSimilarity], result of:
            0.19598554 = score(doc=3792,freq=2.0), product of:
              0.35099646 = queryWeight, product of:
                5.61942 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.012359332 = queryNorm
              0.558369 = fieldWeight in 3792, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.078125 = fieldNorm(doc=3792)
          0.36914966 = weight(abstract_txt:matching in 3792) [ClassicSimilarity], result of:
            0.36914966 = score(doc=3792,freq=3.0), product of:
              0.45151493 = queryWeight, product of:
                6.0464077 = boost
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.012359332 = queryNorm
              0.81758016 = fieldWeight in 3792, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.0419855 = idf(docFreq=286, maxDocs=44421)
                0.078125 = fieldNorm(doc=3792)
        0.24 = coord(6/25)
    
  4. Cregan, A.: ¬An OWL DL construction for the ISO Topic Map Data Model (2005) 0.17
    0.16524418 = sum of:
      0.16524418 = product of:
        0.82622087 = sum of:
          0.015060312 = weight(abstract_txt:between in 718) [ClassicSimilarity], result of:
            0.015060312 = score(doc=718,freq=2.0), product of:
              0.049282167 = queryWeight, product of:
                1.1533089 = boost
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.012359332 = queryNorm
              0.30559355 = fieldWeight in 718, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.0625 = fieldNorm(doc=718)
          0.0193905 = weight(abstract_txt:based in 718) [ClassicSimilarity], result of:
            0.0193905 = score(doc=718,freq=1.0), product of:
              0.097468 = queryWeight, product of:
                2.4775372 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.012359332 = queryNorm
              0.1989422 = fieldWeight in 718, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=718)
          0.18820128 = weight(abstract_txt:constraints in 718) [ClassicSimilarity], result of:
            0.18820128 = score(doc=718,freq=2.0), product of:
              0.31465295 = queryWeight, product of:
                3.7621925 = boost
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.012359332 = queryNorm
              0.5981234 = fieldWeight in 718, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7669935 = idf(docFreq=138, maxDocs=44421)
                0.0625 = fieldNorm(doc=718)
          0.23586732 = weight(abstract_txt:maps in 718) [ClassicSimilarity], result of:
            0.23586732 = score(doc=718,freq=5.0), product of:
              0.28637975 = queryWeight, product of:
                3.9317589 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.012359332 = queryNorm
              0.8236173 = fieldWeight in 718, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.0625 = fieldNorm(doc=718)
          0.36770147 = weight(abstract_txt:topic in 718) [ClassicSimilarity], result of:
            0.36770147 = score(doc=718,freq=11.0), product of:
              0.35099646 = queryWeight, product of:
                5.61942 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.012359332 = queryNorm
              1.0475931 = fieldWeight in 718, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=718)
        0.2 = coord(5/25)
    
  5. Pepper, S.; Groenmo, G.O.: Towards a general theory of scope (2002) 0.15
    0.15176699 = sum of:
      0.15176699 = product of:
        0.7588349 = sum of:
          0.022503283 = weight(abstract_txt:topics in 1539) [ClassicSimilarity], result of:
            0.022503283 = score(doc=1539,freq=1.0), product of:
              0.07089419 = queryWeight, product of:
                1.1294329 = boost
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.012359332 = queryNorm
              0.3174207 = fieldWeight in 1539, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.0625 = fieldNorm(doc=1539)
          0.107971825 = weight(abstract_txt:merge in 1539) [ClassicSimilarity], result of:
            0.107971825 = score(doc=1539,freq=1.0), product of:
              0.20167576 = queryWeight, product of:
                1.9049429 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.012359332 = queryNorm
              0.53537333 = fieldWeight in 1539, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.0625 = fieldNorm(doc=1539)
          0.0193905 = weight(abstract_txt:based in 1539) [ClassicSimilarity], result of:
            0.0193905 = score(doc=1539,freq=1.0), product of:
              0.097468 = queryWeight, product of:
                2.4775372 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.012359332 = queryNorm
              0.1989422 = fieldWeight in 1539, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=1539)
          0.2583797 = weight(abstract_txt:maps in 1539) [ClassicSimilarity], result of:
            0.2583797 = score(doc=1539,freq=6.0), product of:
              0.28637975 = queryWeight, product of:
                3.9317589 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.012359332 = queryNorm
              0.9022275 = fieldWeight in 1539, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.0625 = fieldNorm(doc=1539)
          0.3505896 = weight(abstract_txt:topic in 1539) [ClassicSimilarity], result of:
            0.3505896 = score(doc=1539,freq=10.0), product of:
              0.35099646 = queryWeight, product of:
                5.61942 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.012359332 = queryNorm
              0.9988408 = fieldWeight in 1539, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=1539)
        0.2 = coord(5/25)