Document (#40380)

Author
El idrissi esserhrouchni, O. et al.
Frikh, B.
Ouhbi, B.
Title
OntologyLine : a new framework for learning non-taxonomic relations of domain ontology
Source
Knowledge discovery, knowledge engineering and knowledge management: 7th International Joint Conference, IC3K 2015, Lisbon, Portugal, November 12-14, 2015, Revised Selected Papers. Eds.: A. Fred et al.
Imprint
Cham : Springer
Year
2016
Pages
pp. 345-364
Series
Communications in computer and information science; 631
Abstract
Domain ontology learning has been introduced as a technology that aims at reducing the knowledge acquisition bottleneck in the construction of domain ontologies. However, the discovery and labelling of non-taxonomic relations have been identified as among the most difficult problems in this learning process. In this paper, we propose OntologyLine, a new system for discovering non-taxonomic relations and building a domain ontology from scratch. The proposed system adapts Open Information Extraction algorithms to extract and label relations between domain concepts. OntologyLine was tested in two different domains: the financial and cancer domains. It was evaluated against a gold standard ontology and compared to a state-of-the-art ontology learning algorithm. The experimental results show that OntologyLine is more effective at acquiring non-taxonomic relations and gives better results in terms of precision, recall and F-measure.
Theme
Knowledge representation

Similar documents (content)

  1. Xu, Y.; Li, G.; Mou, L.; Lu, Y.: Learning non-taxonomic relations on demand for ontology extension (2014) 0.46
    
  2. Zarrad, R.; Doggaz, N.; Zagrouba, E.: Wikipedia HTML structure analysis for ontology construction (2018) 0.40
    
  3. Jiang, X.; Tan, A.-H.: CRCTOL: a semantic-based domain ontology learning system (2009) 0.38
    
  4. Na, J.-C.; Neoh, H.L.: Effectiveness of UMLS semantic network as a seed ontology for building a medical domain ontology (2008) 0.31
    
  5. Suchanek, F.M.; Kasneci, G.; Weikum, G.: YAGO: a large ontology from Wikipedia and WordNet (2008) 0.20