Document (#30982)

Strötgen, R.
Mandl, T.
Schneider, R.
Entwicklung und Evaluierung eines Question Answering Systems im Rahmen des Cross Language Evaluation Forum (CLEF)
Effektive Information Retrieval Verfahren in Theorie und Praxis: ausgewählte und erweiterte Beiträge des Vierten Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2005), Hildesheim, 20.7.2005. Hrsg.: T. Mandl u. C. Womser-Hacker
Konstanz : UVK Verlagsgesellschaft
Schriften zur Informationswissenschaft; Bd.45
Question answering systems attempt to return a correct answer to a specific question. To do so, they search a document collection and extract a small fragment of a document. This paper describes the development of a modular system for multilingual question answering. The development strategy aimed to make a modular system usable as quickly as possible by drawing on many freely available resources. The system integrates modules for named entity recognition, indexing and retrieval, electronic dictionaries, online translation tools, and text corpora for training and testing. It also implements its own approaches to question and answer taxonomies, passage retrieval, and the ranking of alternative answers.
Multilingual problems

Similar documents (author)

  1. Hellweg, H.; Krause, J.; Mandl, T.; Marx, J.; Müller, M.N.O.; Mutschke, P.; Strötgen, R.: Treatment of semantic heterogeneity in information retrieval (2001) 1.91
    1.9095825 = sum of:
      1.9095825 = product of:
        2.8643737 = sum of:
          1.1039107 = weight(author_txt:mandl in 560) [ClassicSimilarity], result of:
            1.1039107 = score(doc=560,freq=1.0), product of:
              0.53615665 = queryWeight, product of:
                1.1289947 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.05766305 = queryNorm
              2.058933 = fieldWeight in 560, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.25 = fieldNorm(doc=560)
          1.760463 = weight(author_txt:strötgen in 560) [ClassicSimilarity], result of:
            1.760463 = score(doc=560,freq=1.0), product of:
              0.7318471 = queryWeight, product of:
                1.3190347 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.05766305 = queryNorm
              2.4055066 = fieldWeight in 560, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.25 = fieldNorm(doc=560)
        0.6666667 = coord(2/3)
  2. Strötgen, R.: Treatment of semantic heterogeneity using meta-data extraction and query translation (2002) 1.47
    1.4670525 = sum of:
      1.4670525 = product of:
        4.4011574 = sum of:
          4.4011574 = weight(author_txt:strötgen in 4595) [ClassicSimilarity], result of:
            4.4011574 = score(doc=4595,freq=1.0), product of:
              0.7318471 = queryWeight, product of:
                1.3190347 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.05766305 = queryNorm
              6.0137663 = fieldWeight in 4595, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.625 = fieldNorm(doc=4595)
        0.33333334 = coord(1/3)
  3. Strötgen, R.: Anfragetransfers zur Integration von Internetquellen in Digitalen Bibliotheken auf der Grundlage statistischer Termrelationen (2007) 1.47
    1.4670525 = sum of:
      1.4670525 = product of:
        4.4011574 = sum of:
          4.4011574 = weight(author_txt:strötgen in 713) [ClassicSimilarity], result of:
            4.4011574 = score(doc=713,freq=1.0), product of:
              0.7318471 = queryWeight, product of:
                1.3190347 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.05766305 = queryNorm
              6.0137663 = fieldWeight in 713, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.625 = fieldNorm(doc=713)
        0.33333334 = coord(1/3)
  4. Strötgen, R.; Kokkelink, S.: Metadatenextraktion aus Internetquellen : Heterogenitätsbehandlung im Projekt CARMEN (2001) 1.17
    1.173642 = sum of:
      1.173642 = product of:
        3.520926 = sum of:
          3.520926 = weight(author_txt:strötgen in 6808) [ClassicSimilarity], result of:
            3.520926 = score(doc=6808,freq=1.0), product of:
              0.7318471 = queryWeight, product of:
                1.3190347 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.05766305 = queryNorm
              4.811013 = fieldWeight in 6808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.5 = fieldNorm(doc=6808)
        0.33333334 = coord(1/3)
  5. Mandl, T.: Einsatz neuronaler Netze als Transferkomponenten beim Retrieval in heterogenen Dokumentbeständen (2000) 0.92
    0.9199256 = sum of:
      0.9199256 = product of:
        2.7597766 = sum of:
          2.7597766 = weight(author_txt:mandl in 563) [ClassicSimilarity], result of:
            2.7597766 = score(doc=563,freq=1.0), product of:
              0.53615665 = queryWeight, product of:
                1.1289947 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.05766305 = queryNorm
              5.1473327 = fieldWeight in 563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.625 = fieldNorm(doc=563)
        0.33333334 = coord(1/3)
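
The score breakdowns in this list all follow the same pattern, which can be checked by hand. The Python sketch below recomputes the score of hit 1 from the factors printed in its explanation. The names `TermMatch` and `document_score` are illustrative assumptions and not part of Lucene; the arithmetic (queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and the sum scaled by coord) is read directly off the explanation tree.

```python
from dataclasses import dataclass


@dataclass
class TermMatch:
    """One 'weight(field:term in doc)' node of the explanation (hypothetical helper)."""
    boost: float
    idf: float
    query_norm: float
    tf: float
    field_norm: float

    def query_weight(self) -> float:
        # queryWeight = boost * idf * queryNorm
        return self.boost * self.idf * self.query_norm

    def field_weight(self) -> float:
        # fieldWeight = tf * idf * fieldNorm
        return self.tf * self.idf * self.field_norm

    def score(self) -> float:
        # Per-term score is the product of the two weights.
        return self.query_weight() * self.field_weight()


def document_score(matches, total_clauses):
    """Sum the per-term scores, then scale by coord = matched / total clauses."""
    coord = len(matches) / total_clauses
    return sum(m.score() for m in matches) * coord


# Factors copied from hit 1 (doc=560) of the author-based list.
mandl = TermMatch(boost=1.1289947, idf=8.235732, query_norm=0.05766305,
                  tf=1.0, field_norm=0.25)
stroetgen = TermMatch(boost=1.3190347, idf=9.622026, query_norm=0.05766305,
                      tf=1.0, field_norm=0.25)

# Two of the three author clauses match, hence coord(2/3).
hit1_score = document_score([mandl, stroetgen], total_clauses=3)

if __name__ == "__main__":
    print(round(hit1_score, 6))
```

The result should agree with the listed 1.9095825 up to rounding. Hits 2 to 5 match only one author clause, so the same computation applies with a single `TermMatch` and coord(1/3).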

Similar documents (content)

  1. Fischer, H.G.: CONDOR: Modell eines integrierten DB-/IR-Systems für strukturierte und unstrukturierte Daten (1982) 0.11
    0.11414047 = sum of:
      0.11414047 = product of:
        0.71337795 = sum of:
          0.11494691 = weight(abstract_txt:implementiert in 5196) [ClassicSimilarity], result of:
            0.11494691 = score(doc=5196,freq=1.0), product of:
              0.13784796 = queryWeight, product of:
                1.0569955 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.017105993 = queryNorm
              0.8338673 = fieldWeight in 5196, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.109375 = fieldNorm(doc=5196)
          0.053500086 = weight(abstract_txt:systems in 5196) [ClassicSimilarity], result of:
            0.053500086 = score(doc=5196,freq=3.0), product of:
              0.08278884 = queryWeight, product of:
                1.4187945 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.017105993 = queryNorm
              0.6462234 = fieldWeight in 5196, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.109375 = fieldNorm(doc=5196)
          0.099538155 = weight(abstract_txt:eines in 5196) [ClassicSimilarity], result of:
            0.099538155 = score(doc=5196,freq=1.0), product of:
              0.19879946 = queryWeight, product of:
                2.5386953 = boost
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.017105993 = queryNorm
              0.5006963 = fieldWeight in 5196, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.109375 = fieldNorm(doc=5196)
          0.4453928 = weight(abstract_txt:modularen in 5196) [ClassicSimilarity], result of:
            0.4453928 = score(doc=5196,freq=1.0), product of:
              0.42845732 = queryWeight, product of:
                2.635371 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.017105993 = queryNorm
              1.0395266 = fieldWeight in 5196, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.109375 = fieldNorm(doc=5196)
        0.16 = coord(4/25)
  2. Saint-Dizier, P.; Moens, M.-F.: Knowledge and reasoning for question answering : research perspectives (2011) 0.08
    0.075208664 = sum of:
      0.075208664 = product of:
        0.6267389 = sum of:
          0.026475677 = weight(abstract_txt:systems in 3746) [ClassicSimilarity], result of:
            0.026475677 = score(doc=3746,freq=1.0), product of:
              0.08278884 = queryWeight, product of:
                1.4187945 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.017105993 = queryNorm
              0.31979766 = fieldWeight in 3746, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.09375 = fieldNorm(doc=3746)
          0.2099187 = weight(abstract_txt:question in 3746) [ClassicSimilarity], result of:
            0.2099187 = score(doc=3746,freq=5.0), product of:
              0.19250904 = queryWeight, product of:
                2.1635113 = boost
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.017105993 = queryNorm
              1.0904355 = fieldWeight in 3746, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.09375 = fieldNorm(doc=3746)
          0.3903445 = weight(abstract_txt:answering in 3746) [ClassicSimilarity], result of:
            0.3903445 = score(doc=3746,freq=4.0), product of:
              0.31358296 = queryWeight, product of:
                2.7612762 = boost
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.017105993 = queryNorm
              1.2447886 = fieldWeight in 3746, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.09375 = fieldNorm(doc=3746)
        0.12 = coord(3/25)
  3. Galitsky, B.: Can many agents answer questions better than one? (2005) 0.07
    0.073450245 = sum of:
      0.073450245 = product of:
        0.6120854 = sum of:
          0.022063065 = weight(abstract_txt:systems in 4094) [ClassicSimilarity], result of:
            0.022063065 = score(doc=4094,freq=1.0), product of:
              0.08278884 = queryWeight, product of:
                1.4187945 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.017105993 = queryNorm
              0.26649806 = fieldWeight in 4094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.078125 = fieldNorm(doc=4094)
          0.19162866 = weight(abstract_txt:question in 4094) [ClassicSimilarity], result of:
            0.19162866 = score(doc=4094,freq=6.0), product of:
              0.19250904 = queryWeight, product of:
                2.1635113 = boost
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.017105993 = queryNorm
              0.99542683 = fieldWeight in 4094, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.078125 = fieldNorm(doc=4094)
          0.39839366 = weight(abstract_txt:answering in 4094) [ClassicSimilarity], result of:
            0.39839366 = score(doc=4094,freq=6.0), product of:
              0.31358296 = queryWeight, product of:
                2.7612762 = boost
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.017105993 = queryNorm
              1.270457 = fieldWeight in 4094, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.078125 = fieldNorm(doc=4094)
        0.12 = coord(3/25)
  4. Schneider, R.: Question answering : das Retrieval der Zukunft? (2007) 0.07
    0.06911822 = sum of:
      0.06911822 = product of:
        0.57598513 = sum of:
          0.099075764 = weight(abstract_txt:entwicklung in 6953) [ClassicSimilarity], result of:
            0.099075764 = score(doc=6953,freq=1.0), product of:
              0.18006149 = queryWeight, product of:
                2.0923967 = boost
                5.030701 = idf(docFreq=788, maxDocs=44421)
                0.017105993 = queryNorm
              0.55023295 = fieldWeight in 6953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.030701 = idf(docFreq=788, maxDocs=44421)
                0.109375 = fieldNorm(doc=6953)
          0.15489161 = weight(abstract_txt:question in 6953) [ClassicSimilarity], result of:
            0.15489161 = score(doc=6953,freq=2.0), product of:
              0.19250904 = queryWeight, product of:
                2.1635113 = boost
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.017105993 = queryNorm
              0.8045939 = fieldWeight in 6953, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.109375 = fieldNorm(doc=6953)
          0.32201776 = weight(abstract_txt:answering in 6953) [ClassicSimilarity], result of:
            0.32201776 = score(doc=6953,freq=2.0), product of:
              0.31358296 = queryWeight, product of:
                2.7612762 = boost
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.017105993 = queryNorm
              1.0268981 = fieldWeight in 6953, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.109375 = fieldNorm(doc=6953)
        0.12 = coord(3/25)
  5. Mayer, A.-K.; Leichner, N.; Peter, J.; Krampen, G.: Mit "BLInk" zu fachlicher Informationskompetenz : ein Blended Learning-Kurs für die wissenschaftliche Psychologie und verwandte Fächer (2015) 0.07
    0.06900973 = sum of:
      0.06900973 = product of:
        0.5750811 = sum of:
          0.34809375 = weight(abstract_txt:trainings in 3597) [ClassicSimilarity], result of:
            0.34809375 = score(doc=3597,freq=2.0), product of:
              0.20950529 = queryWeight, product of:
                1.3030782 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.017105993 = queryNorm
              1.6615034 = fieldWeight in 3597, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.125 = fieldNorm(doc=3597)
          0.113229446 = weight(abstract_txt:entwicklung in 3597) [ClassicSimilarity], result of:
            0.113229446 = score(doc=3597,freq=1.0), product of:
              0.18006149 = queryWeight, product of:
                2.0923967 = boost
                5.030701 = idf(docFreq=788, maxDocs=44421)
                0.017105993 = queryNorm
              0.62883765 = fieldWeight in 3597, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.030701 = idf(docFreq=788, maxDocs=44421)
                0.125 = fieldNorm(doc=3597)
          0.1137579 = weight(abstract_txt:eines in 3597) [ClassicSimilarity], result of:
            0.1137579 = score(doc=3597,freq=1.0), product of:
              0.19879946 = queryWeight, product of:
                2.5386953 = boost
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.017105993 = queryNorm
              0.5722244 = fieldWeight in 3597, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.577795 = idf(docFreq=1240, maxDocs=44421)
                0.125 = fieldNorm(doc=3597)
        0.12 = coord(3/25)
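
The content-based scores can also be derived from the raw counts (freq, docFreq, maxDocs) rather than from the printed tf and idf values. The sketch below does this for hit 1 (doc=5196). The formulas idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq) are assumptions inferred from the listed numbers, which they reproduce; the function and variable names are illustrative only. The query terms, such as `modularen`, are the German word forms indexed in the `abstract_txt` field.

```python
import math

MAX_DOCS = 44421          # maxDocs, as printed in every idf line
QUERY_NORM = 0.017105993  # queryNorm of the content-based query
QUERY_CLAUSES = 25        # denominator of coord(n/25)


def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed formula; it reproduces the listed idf values.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def tf(freq):
    # Assumed formula; e.g. freq=3.0 gives the listed 1.7320508.
    return math.sqrt(freq)


def term_score(freq, doc_freq, boost, field_norm):
    i = idf(doc_freq)
    query_weight = boost * i * QUERY_NORM
    field_weight = tf(freq) * i * field_norm
    return query_weight * field_weight


# (term, freq, docFreq, boost) copied from hit 1 of the content-based list.
condor_terms = [
    ("implementiert", 1.0, 58, 1.0569955),
    ("systems", 3.0, 3984, 1.4187945),
    ("eines", 1.0, 1240, 2.5386953),
    ("modularen", 1.0, 8, 2.635371),
]
FIELD_NORM = 0.109375  # fieldNorm(doc=5196)

raw_sum = sum(term_score(freq, df, boost, FIELD_NORM)
              for _, freq, df, boost in condor_terms)

# Only 4 of the 25 query clauses match, so the sum is scaled by 4/25 = 0.16.
condor_score = raw_sum * len(condor_terms) / QUERY_CLAUSES

if __name__ == "__main__":
    print(round(condor_score, 6))
```

The rare term `modularen` (docFreq=8) contributes most of the raw sum, while the coord factor of 0.16 explains why the final scores in this list stay far below those of the author-based list.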