Document (#31017)

Author
Endres-Niggemeyer, B.
Jauris-Heipke, S.
Pinsky, S.M.
Ulbricht, U.
Title
Wissen gewinnen durch Wissen : Ontologiebasierte Informationsextraktion
Source
Information - Wissenschaft und Praxis. 57(2006) H.6/7, S.301-308
Year
2006
Abstract
Die ontologiebasierte Informationsextraktion, über die hier berichtet wird, ist Teil eines Systems zum automatischen Zusammenfassen, das sich am Vorgehen kompetenter Menschen orientiert. Dahinter steht die Annahme, dass Menschen die Ergebnisse eines Systems leichter übernehmen können, wenn sie mit Verfahren erarbeitet worden sind, die sie selbst auch benutzen. Das erste Anwendungsgebiet ist Knochenmarktransplantation (KMT). Im Kern des Systems Summit-BMT (Summarize It in Bone Marrow Transplantation) steht eine Ontologie des Fachgebietes. Sie ist als MySQL-Datenbank realisiert und versorgt menschliche Benutzer und Systemkomponenten mit Wissen. Summit-BMT unterstützt die Frageformulierung mit einem empirisch fundierten Szenario-Interface. Die Retrievalergebnisse werden durch ein Textpassagenretrieval vorselektiert und dann kognitiv fundierten Agenten unterbreitet, die unter Einsatz ihrer Wissensbasis / Ontologie genauer prüfen, ob die Propositionen aus der Benutzerfrage getroffen werden. Die relevanten Textclips aus dem Duelldokument werden in das Szenarioformular eingetragen und mit einem Link zu ihrem Vorkommen im Original präsentiert. In diesem Artikel stehen die Ontologie und ihr Gebrauch zur wissensbasierten Informationsextraktion im Mittelpunkt. Die Ontologiedatenbank hält unterschiedliche Wissenstypen so bereit, dass sie leicht kombiniert werden können: Konzepte, Propositionen und ihre syntaktisch-semantischen Schemata, Unifikatoren, Paraphrasen und Definitionen von Frage-Szenarios. Auf sie stützen sich die Systemagenten, welche von Menschen adaptierte Zusammenfassungsstrategien ausführen. Mängel in anderen Verarbeitungsschritten führen zu Verlusten, aber die eigentliche Qualität der Ergebnisse steht und fällt mit der Qualität der Ontologie. Erste Tests der Extraktionsleistung fallen verblüffend positiv aus.
Theme
Automatisches Abstracting
Wissensrepräsentation

Similar documents (author)

  1. Endres-Niggemeyer, B.: Sprachverarbeitung im Informationsbereich (1989) 5.96
    5.960057 = sum of:
      5.960057 = sum of:
        2.8797019 = weight(author_txt:niggemeyer in 4859) [ClassicSimilarity], result of:
          2.8797019 = score(doc=4859,freq=1.0), product of:
            0.6910589 = queryWeight, product of:
              8.334172 = idf(docFreq=28, maxDocs=44421)
              0.08291872 = queryNorm
            4.167086 = fieldWeight in 4859, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.334172 = idf(docFreq=28, maxDocs=44421)
              0.5 = fieldNorm(doc=4859)
        3.080355 = weight(author_txt:endres in 4859) [ClassicSimilarity], result of:
          3.080355 = score(doc=4859,freq=1.0), product of:
            0.7227984 = queryWeight, product of:
              1.0227066 = boost
              8.523414 = idf(docFreq=23, maxDocs=44421)
              0.08291872 = queryNorm
            4.261707 = fieldWeight in 4859, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.523414 = idf(docFreq=23, maxDocs=44421)
              0.5 = fieldNorm(doc=4859)
    
  2. Endres-Niggemeyer, B.: ¬An empirical process model of abstracting (1992) 5.96
    5.960057 = sum of:
      5.960057 = sum of:
        2.8797019 = weight(author_txt:niggemeyer in 448) [ClassicSimilarity], result of:
          2.8797019 = score(doc=448,freq=1.0), product of:
            0.6910589 = queryWeight, product of:
              8.334172 = idf(docFreq=28, maxDocs=44421)
              0.08291872 = queryNorm
            4.167086 = fieldWeight in 448, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.334172 = idf(docFreq=28, maxDocs=44421)
              0.5 = fieldNorm(doc=448)
        3.080355 = weight(author_txt:endres in 448) [ClassicSimilarity], result of:
          3.080355 = score(doc=448,freq=1.0), product of:
            0.7227984 = queryWeight, product of:
              1.0227066 = boost
              8.523414 = idf(docFreq=23, maxDocs=44421)
              0.08291872 = queryNorm
            4.261707 = fieldWeight in 448, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.523414 = idf(docFreq=23, maxDocs=44421)
              0.5 = fieldNorm(doc=448)
    
  3. Endres-Niggemeyer, B.: Summarising text for intelligent communication : results of the Dagstuhl seminar (1994) 5.96
    5.960057 = sum of:
      5.960057 = sum of:
        2.8797019 = weight(author_txt:niggemeyer in 481) [ClassicSimilarity], result of:
          2.8797019 = score(doc=481,freq=1.0), product of:
            0.6910589 = queryWeight, product of:
              8.334172 = idf(docFreq=28, maxDocs=44421)
              0.08291872 = queryNorm
            4.167086 = fieldWeight in 481, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.334172 = idf(docFreq=28, maxDocs=44421)
              0.5 = fieldNorm(doc=481)
        3.080355 = weight(author_txt:endres in 481) [ClassicSimilarity], result of:
          3.080355 = score(doc=481,freq=1.0), product of:
            0.7227984 = queryWeight, product of:
              1.0227066 = boost
              8.523414 = idf(docFreq=23, maxDocs=44421)
              0.08291872 = queryNorm
            4.261707 = fieldWeight in 481, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.523414 = idf(docFreq=23, maxDocs=44421)
              0.5 = fieldNorm(doc=481)
    
  4. Endres-Niggemeyer, B.: Wissensbasierte Ansätze zur Formalerfassung (1988) 5.96
    5.960057 = sum of:
      5.960057 = sum of:
        2.8797019 = weight(author_txt:niggemeyer in 592) [ClassicSimilarity], result of:
          2.8797019 = score(doc=592,freq=1.0), product of:
            0.6910589 = queryWeight, product of:
              8.334172 = idf(docFreq=28, maxDocs=44421)
              0.08291872 = queryNorm
            4.167086 = fieldWeight in 592, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.334172 = idf(docFreq=28, maxDocs=44421)
              0.5 = fieldNorm(doc=592)
        3.080355 = weight(author_txt:endres in 592) [ClassicSimilarity], result of:
          3.080355 = score(doc=592,freq=1.0), product of:
            0.7227984 = queryWeight, product of:
              1.0227066 = boost
              8.523414 = idf(docFreq=23, maxDocs=44421)
              0.08291872 = queryNorm
            4.261707 = fieldWeight in 592, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.523414 = idf(docFreq=23, maxDocs=44421)
              0.5 = fieldNorm(doc=592)
    
  5. Endres-Niggemeyer, B.: Content analysis : a special case of text compression (1989) 5.96
    5.960057 = sum of:
      5.960057 = sum of:
        2.8797019 = weight(author_txt:niggemeyer in 3617) [ClassicSimilarity], result of:
          2.8797019 = score(doc=3617,freq=1.0), product of:
            0.6910589 = queryWeight, product of:
              8.334172 = idf(docFreq=28, maxDocs=44421)
              0.08291872 = queryNorm
            4.167086 = fieldWeight in 3617, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.334172 = idf(docFreq=28, maxDocs=44421)
              0.5 = fieldNorm(doc=3617)
        3.080355 = weight(author_txt:endres in 3617) [ClassicSimilarity], result of:
          3.080355 = score(doc=3617,freq=1.0), product of:
            0.7227984 = queryWeight, product of:
              1.0227066 = boost
              8.523414 = idf(docFreq=23, maxDocs=44421)
              0.08291872 = queryNorm
            4.261707 = fieldWeight in 3617, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.523414 = idf(docFreq=23, maxDocs=44421)
              0.5 = fieldNorm(doc=3617)
    

Similar documents (content)

  1. Endres-Niggemeyer, B.; Ziegert, C.: SummIt-BMT : (Summarize It in BMT) in Diagnose und Therapie, Abschlussbericht (2002) 0.20
    0.1957738 = sum of:
      0.1957738 = product of:
        0.97886896 = sum of:
          0.12881738 = weight(abstract_txt:zusammenfassen in 497) [ClassicSimilarity], result of:
            0.12881738 = score(doc=497,freq=2.0), product of:
              0.15506123 = queryWeight, product of:
                1.0102445 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.016330538 = queryNorm
              0.8307517 = fieldWeight in 497, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=497)
          0.094185345 = weight(abstract_txt:syntaktisch in 497) [ClassicSimilarity], result of:
            0.094185345 = score(doc=497,freq=1.0), product of:
              0.15855713 = queryWeight, product of:
                1.0215691 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.016330538 = queryNorm
              0.5940152 = fieldWeight in 497, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.0625 = fieldNorm(doc=497)
          0.050111905 = weight(abstract_txt:werden in 497) [ClassicSimilarity], result of:
            0.050111905 = score(doc=497,freq=7.0), product of:
              0.086392924 = queryWeight, product of:
                1.508148 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.016330538 = queryNorm
              0.5800464 = fieldWeight in 497, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.0625 = fieldNorm(doc=497)
          0.3643506 = weight(abstract_txt:summit in 497) [ClassicSimilarity], result of:
            0.3643506 = score(doc=497,freq=4.0), product of:
              0.31012246 = queryWeight, product of:
                2.020489 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.016330538 = queryNorm
              1.1748604 = fieldWeight in 497, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=497)
          0.34140372 = weight(abstract_txt:ontologie in 497) [ClassicSimilarity], result of:
            0.34140372 = score(doc=497,freq=3.0), product of:
              0.4118022 = queryWeight, product of:
                3.2926776 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.016330538 = queryNorm
              0.82904786 = fieldWeight in 497, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0625 = fieldNorm(doc=497)
        0.2 = coord(5/25)
    
  2. Stollberg, M.: Ontologiebasierte Wissensmodellierung : Verwendung als semantischer Grundbaustein des Semantic Web (2002) 0.14
    0.14395194 = sum of:
      0.14395194 = product of:
        0.59979975 = sum of:
          0.063660055 = weight(abstract_txt:anwendungsgebiet in 495) [ClassicSimilarity], result of:
            0.063660055 = score(doc=495,freq=1.0), product of:
              0.16705324 = queryWeight, product of:
                1.0485818 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.016330538 = queryNorm
              0.38107646 = fieldWeight in 495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0390625 = fieldNorm(doc=495)
          0.022578288 = weight(abstract_txt:ergebnisse in 495) [ClassicSimilarity], result of:
            0.022578288 = score(doc=495,freq=1.0), product of:
              0.105458334 = queryWeight, product of:
                1.1782306 = boost
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.016330538 = queryNorm
              0.21409677 = fieldWeight in 495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.0390625 = fieldNorm(doc=495)
          0.048808604 = weight(abstract_txt:werden in 495) [ClassicSimilarity], result of:
            0.048808604 = score(doc=495,freq=17.0), product of:
              0.086392924 = queryWeight, product of:
                1.508148 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.016330538 = queryNorm
              0.56496066 = fieldWeight in 495, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.0390625 = fieldNorm(doc=495)
          0.08491443 = weight(abstract_txt:fundierten in 495) [ClassicSimilarity], result of:
            0.08491443 = score(doc=495,freq=1.0), product of:
              0.25503975 = queryWeight, product of:
                1.8322883 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.016330538 = queryNorm
              0.33294585 = fieldWeight in 495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0390625 = fieldNorm(doc=495)
          0.13345148 = weight(abstract_txt:ontologiebasierte in 495) [ClassicSimilarity], result of:
            0.13345148 = score(doc=495,freq=1.0), product of:
              0.3447486 = queryWeight, product of:
                2.1303017 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.016330538 = queryNorm
              0.38709795 = fieldWeight in 495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0390625 = fieldNorm(doc=495)
          0.24638692 = weight(abstract_txt:ontologie in 495) [ClassicSimilarity], result of:
            0.24638692 = score(doc=495,freq=4.0), product of:
              0.4118022 = queryWeight, product of:
                3.2926776 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.016330538 = queryNorm
              0.59831375 = fieldWeight in 495, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0390625 = fieldNorm(doc=495)
        0.24 = coord(6/25)
    
  3. Werrmann, J.: Modellierung im Kontext : Ontologie-basiertes Information Retrieval (2011) 0.08
    0.07572727 = sum of:
      0.07572727 = product of:
        0.6310606 = sum of:
          0.03314591 = weight(abstract_txt:werden in 2141) [ClassicSimilarity], result of:
            0.03314591 = score(doc=2141,freq=1.0), product of:
              0.086392924 = queryWeight, product of:
                1.508148 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.016330538 = queryNorm
              0.38366464 = fieldWeight in 2141, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.109375 = fieldNorm(doc=2141)
          0.110093445 = weight(abstract_txt:wissen in 2141) [ClassicSimilarity], result of:
            0.110093445 = score(doc=2141,freq=2.0), product of:
              0.1386893 = queryWeight, product of:
                1.6548437 = boost
                5.131986 = idf(docFreq=712, maxDocs=44421)
                0.016330538 = queryNorm
              0.7938136 = fieldWeight in 2141, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.131986 = idf(docFreq=712, maxDocs=44421)
                0.109375 = fieldNorm(doc=2141)
          0.48782122 = weight(abstract_txt:ontologie in 2141) [ClassicSimilarity], result of:
            0.48782122 = score(doc=2141,freq=2.0), product of:
              0.4118022 = queryWeight, product of:
                3.2926776 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.016330538 = queryNorm
              1.1846008 = fieldWeight in 2141, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.109375 = fieldNorm(doc=2141)
        0.12 = coord(3/25)
    
  4. Aprin, L.: Entwicklung eines semantisch operierenden Risikomanagement-Informationssystems am Beispiel der Europäischen Organisation für Kernforschung (CERN) (2012) 0.07
    0.06705425 = sum of:
      0.06705425 = product of:
        0.33527124 = sum of:
          0.03906893 = weight(abstract_txt:qualität in 3286) [ClassicSimilarity], result of:
            0.03906893 = score(doc=3286,freq=1.0), product of:
              0.1346027 = queryWeight, product of:
                1.3311186 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.016330538 = queryNorm
              0.2902537 = fieldWeight in 3286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.046875 = fieldNorm(doc=3286)
          0.01420539 = weight(abstract_txt:werden in 3286) [ClassicSimilarity], result of:
            0.01420539 = score(doc=3286,freq=1.0), product of:
              0.086392924 = queryWeight, product of:
                1.508148 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.016330538 = queryNorm
              0.1644277 = fieldWeight in 3286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.046875 = fieldNorm(doc=3286)
          0.08827114 = weight(abstract_txt:wissen in 3286) [ClassicSimilarity], result of:
            0.08827114 = score(doc=3286,freq=7.0), product of:
              0.1386893 = queryWeight, product of:
                1.6548437 = boost
                5.131986 = idf(docFreq=712, maxDocs=44421)
                0.016330538 = queryNorm
              0.63646686 = fieldWeight in 3286, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.131986 = idf(docFreq=712, maxDocs=44421)
                0.046875 = fieldNorm(doc=3286)
          0.045893636 = weight(abstract_txt:steht in 3286) [ClassicSimilarity], result of:
            0.045893636 = score(doc=3286,freq=1.0), product of:
              0.17153975 = queryWeight, product of:
                1.8404241 = boost
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.016330538 = queryNorm
              0.26753935 = fieldWeight in 3286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.046875 = fieldNorm(doc=3286)
          0.14783216 = weight(abstract_txt:ontologie in 3286) [ClassicSimilarity], result of:
            0.14783216 = score(doc=3286,freq=1.0), product of:
              0.4118022 = queryWeight, product of:
                3.2926776 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.016330538 = queryNorm
              0.35898826 = fieldWeight in 3286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.046875 = fieldNorm(doc=3286)
        0.2 = coord(5/25)
    
  5. Smith, B.; Siebert, D.; Ceusters, W.: Was die philosophische Ontologie zur biomedizinischen Informatik beitragen kann (2004) 0.06
    0.06129356 = sum of:
      0.06129356 = product of:
        0.5107797 = sum of:
          0.051088836 = weight(abstract_txt:ergebnisse in 3181) [ClassicSimilarity], result of:
            0.051088836 = score(doc=3181,freq=2.0), product of:
              0.105458334 = queryWeight, product of:
                1.1782306 = boost
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.016330538 = queryNorm
              0.4844457 = fieldWeight in 3181, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4808774 = idf(docFreq=502, maxDocs=44421)
                0.0625 = fieldNorm(doc=3181)
          0.01894052 = weight(abstract_txt:werden in 3181) [ClassicSimilarity], result of:
            0.01894052 = score(doc=3181,freq=1.0), product of:
              0.086392924 = queryWeight, product of:
                1.508148 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.016330538 = queryNorm
              0.21923694 = fieldWeight in 3181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.0625 = fieldNorm(doc=3181)
          0.44075033 = weight(abstract_txt:ontologie in 3181) [ClassicSimilarity], result of:
            0.44075033 = score(doc=3181,freq=5.0), product of:
              0.4118022 = queryWeight, product of:
                3.2926776 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.016330538 = queryNorm
              1.0702962 = fieldWeight in 3181, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0625 = fieldNorm(doc=3181)
        0.12 = coord(3/25)