Document (#36886)

Author
Schulz, S.
Schober, D.
Tudose, I.
Stenzhorn, H.
Title
¬The pitfalls of thesaurus ontologization : the case of the NCI thesaurus
Issue
Published online 2010 November 13.
Source
AMIA Annual Symposium 2010: Improving Health: Informatics and IT Changing the World: Proceedings
Year
2010
Pages
S.727-731
Abstract
Thesauri that are "ontologized" into OWL-DL semantics are highly amenable to modeling errors resulting from falsely interpreting existential restrictions. We investigated the OWL-DL representation of the NCI Thesaurus (NCIT) in order to assess the correctness of existential restrictions. A random sample of 354 axioms using the someValuesFrom operator was taken. According to a rating performed by two domain experts, roughly half of these examples, and in consequence more than 76,000 axioms in the OWL-DL version, make incorrect assertions if interpreted according to description logics semantics. These axioms therefore constitute a huge source for unintended models, rendering most logic-based reasoning unreliable. After identifying typical error patterns we discuss some possible improvements. Our recommendation is to either amend the problematic axioms in the OWL-DL formalization or to consider some less strict representational format.
Content
Vgl.: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3041372/.
Theme
Wissensrepräsentation
Field
Medizin
Object
NCI Thesaurus

Similar documents (author)

  1. Schulz, H.: Zur Charakterisierung der BBK/A (1988) 4.66
    4.6581078 = sum of:
      4.6581078 = weight(author_txt:schulz in 90) [ClassicSimilarity], result of:
        4.6581078 = fieldWeight in 90, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4529724 = idf(docFreq=69, maxDocs=44421)
          0.625 = fieldNorm(doc=90)
    
  2. Schulz, U.: Was ist eine sinnvolle Schlagwortsyntax (eine Polemik) (1991) 4.66
    4.6581078 = sum of:
      4.6581078 = weight(author_txt:schulz in 129) [ClassicSimilarity], result of:
        4.6581078 = fieldWeight in 129, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4529724 = idf(docFreq=69, maxDocs=44421)
          0.625 = fieldNorm(doc=129)
    
  3. Schulz, U.: Einführung in die Grundlagen der inhaltlichen Erschließung mit BISMAS am Fachbereich BID der FHS Hannover (1991) 4.66
    4.6581078 = sum of:
      4.6581078 = weight(author_txt:schulz in 457) [ClassicSimilarity], result of:
        4.6581078 = fieldWeight in 457, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4529724 = idf(docFreq=69, maxDocs=44421)
          0.625 = fieldNorm(doc=457)
    
  4. Schulz, U.: ¬Die niederländische Basisklassifikation: eine Alternative für die "Sachgruppen" im Fremddatenangebot der Deutschen Bibliothek (1991) 4.66
    4.6581078 = sum of:
      4.6581078 = weight(author_txt:schulz in 948) [ClassicSimilarity], result of:
        4.6581078 = fieldWeight in 948, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4529724 = idf(docFreq=69, maxDocs=44421)
          0.625 = fieldNorm(doc=948)
    
  5. Schulz, H.: ¬Die Adaption der BBK : Ergänzungen zur Methodik (1983) 4.66
    4.6581078 = sum of:
      4.6581078 = weight(author_txt:schulz in 1124) [ClassicSimilarity], result of:
        4.6581078 = fieldWeight in 1124, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.4529724 = idf(docFreq=69, maxDocs=44421)
          0.625 = fieldNorm(doc=1124)
    

Similar documents (content)

  1. Blanco, E.; Moldovan, D.: ¬A model for composing semantic relations (2011) 0.11
    0.11416306 = sum of:
      0.11416306 = product of:
        0.9513589 = sum of:
          0.068384886 = weight(abstract_txt:semantics in 762) [ClassicSimilarity], result of:
            0.068384886 = score(doc=762,freq=1.0), product of:
              0.14520663 = queryWeight, product of:
                1.6387192 = boost
                6.0281444 = idf(docFreq=290, maxDocs=44421)
                0.014699355 = queryNorm
              0.4709488 = fieldWeight in 762, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0281444 = idf(docFreq=290, maxDocs=44421)
                0.078125 = fieldNorm(doc=762)
          0.14321613 = weight(abstract_txt:restrictions in 762) [ClassicSimilarity], result of:
            0.14321613 = score(doc=762,freq=1.0), product of:
              0.23768823 = queryWeight, product of:
                2.096598 = boost
                7.7124834 = idf(docFreq=53, maxDocs=44421)
                0.014699355 = queryNorm
              0.60253775 = fieldWeight in 762, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7124834 = idf(docFreq=53, maxDocs=44421)
                0.078125 = fieldNorm(doc=762)
          0.7397579 = weight(abstract_txt:axioms in 762) [ClassicSimilarity], result of:
            0.7397579 = score(doc=762,freq=3.0), product of:
              0.6204532 = queryWeight, product of:
                4.7904997 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.014699355 = queryNorm
              1.1922864 = fieldWeight in 762, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.078125 = fieldNorm(doc=762)
        0.12 = coord(3/25)
    
  2. Das, S.; Naskar, D.; Roy, S.: Reorganizing educational institutional domain using faceted ontological principles (2022) 0.09
    0.09268877 = sum of:
      0.09268877 = product of:
        0.7724064 = sum of:
          0.010877888 = weight(abstract_txt:some in 2100) [ClassicSimilarity], result of:
            0.010877888 = score(doc=2100,freq=1.0), product of:
              0.054072615 = queryWeight, product of:
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.014699355 = queryNorm
              0.20117185 = fieldWeight in 2100, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2100)
          0.093012184 = weight(abstract_txt:formalization in 2100) [ClassicSimilarity], result of:
            0.093012184 = score(doc=2100,freq=2.0), product of:
              0.14243637 = queryWeight, product of:
                1.1476429 = boost
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.014699355 = queryNorm
              0.65300864 = fieldWeight in 2100, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2100)
          0.66851634 = weight(abstract_txt:axioms in 2100) [ClassicSimilarity], result of:
            0.66851634 = score(doc=2100,freq=5.0), product of:
              0.6204532 = queryWeight, product of:
                4.7904997 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.014699355 = queryNorm
              1.0774646 = fieldWeight in 2100, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2100)
        0.12 = coord(3/25)
    
  3. Martínez-González, M.M.; Alvite-Díez, M.L.: Thesauri and Semantic Web : discussion of the evolution of thesauri toward their integration with the Semantic Web (2019) 0.05
    0.05128171 = sum of:
      0.05128171 = product of:
        0.4273476 = sum of:
          0.012431871 = weight(abstract_txt:some in 997) [ClassicSimilarity], result of:
            0.012431871 = score(doc=997,freq=1.0), product of:
              0.054072615 = queryWeight, product of:
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.014699355 = queryNorm
              0.22991067 = fieldWeight in 997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.0625 = fieldNorm(doc=997)
          0.07323618 = weight(abstract_txt:thesaurus in 997) [ClassicSimilarity], result of:
            0.07323618 = score(doc=997,freq=2.0), product of:
              0.16024725 = queryWeight, product of:
                2.1083963 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.014699355 = queryNorm
              0.4570199 = fieldWeight in 997, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.0625 = fieldNorm(doc=997)
          0.34167954 = weight(abstract_txt:axioms in 997) [ClassicSimilarity], result of:
            0.34167954 = score(doc=997,freq=1.0), product of:
              0.6204532 = queryWeight, product of:
                4.7904997 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.014699355 = queryNorm
              0.5506935 = fieldWeight in 997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.0625 = fieldNorm(doc=997)
        0.12 = coord(3/25)
    
  4. Fischer, D.H.: Converting a thesaurus to OWL : Notes on the paper "The National Cancer Institute's Thesaurus and Ontology" (2004) 0.05
    0.050583918 = sum of:
      0.050583918 = product of:
        0.25291958 = sum of:
          0.041753042 = weight(abstract_txt:strict in 3362) [ClassicSimilarity], result of:
            0.041753042 = score(doc=3362,freq=1.0), product of:
              0.13166846 = queryWeight, product of:
                1.1034107 = boost
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.014699355 = queryNorm
              0.31710738 = fieldWeight in 3362, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.0390625 = fieldNorm(doc=3362)
          0.0595239 = weight(abstract_txt:assertions in 3362) [ClassicSimilarity], result of:
            0.0595239 = score(doc=3362,freq=1.0), product of:
              0.16678254 = queryWeight, product of:
                1.241857 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.014699355 = queryNorm
              0.35689527 = fieldWeight in 3362, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.0390625 = fieldNorm(doc=3362)
          0.031817485 = weight(abstract_txt:according in 3362) [ClassicSimilarity], result of:
            0.031817485 = score(doc=3362,freq=2.0), product of:
              0.10985005 = queryWeight, product of:
                1.4253169 = boost
                5.2431293 = idf(docFreq=637, maxDocs=44421)
                0.014699355 = queryNorm
              0.28964472 = fieldWeight in 3362, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2431293 = idf(docFreq=637, maxDocs=44421)
                0.0390625 = fieldNorm(doc=3362)
          0.034192443 = weight(abstract_txt:semantics in 3362) [ClassicSimilarity], result of:
            0.034192443 = score(doc=3362,freq=1.0), product of:
              0.14520663 = queryWeight, product of:
                1.6387192 = boost
                6.0281444 = idf(docFreq=290, maxDocs=44421)
                0.014699355 = queryNorm
              0.2354744 = fieldWeight in 3362, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0281444 = idf(docFreq=290, maxDocs=44421)
                0.0390625 = fieldNorm(doc=3362)
          0.08563272 = weight(abstract_txt:thesaurus in 3362) [ClassicSimilarity], result of:
            0.08563272 = score(doc=3362,freq=7.0), product of:
              0.16024725 = queryWeight, product of:
                2.1083963 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.014699355 = queryNorm
              0.5343787 = fieldWeight in 3362, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.0390625 = fieldNorm(doc=3362)
        0.2 = coord(5/25)
    
  5. Gnoli, C.: ISKO News (2007) 0.05
    0.048705366 = sum of:
      0.048705366 = product of:
        0.6088171 = sum of:
          0.010877888 = weight(abstract_txt:some in 2092) [ClassicSimilarity], result of:
            0.010877888 = score(doc=2092,freq=1.0), product of:
              0.054072615 = queryWeight, product of:
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.014699355 = queryNorm
              0.20117185 = fieldWeight in 2092, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6785707 = idf(docFreq=3049, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2092)
          0.5979392 = weight(abstract_txt:axioms in 2092) [ClassicSimilarity], result of:
            0.5979392 = score(doc=2092,freq=4.0), product of:
              0.6204532 = queryWeight, product of:
                4.7904997 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.014699355 = queryNorm
              0.96371365 = fieldWeight in 2092, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2092)
        0.08 = coord(2/25)