Document (#26851)

Author
Ferret, O.
Grau, B.
Hurault-Plantet, M.
Illouz, G.
Jacquemin, C.
Monceaux, L.
Robba, I.
Vilnat, A.
Title
How NLP can improve question answering
Source
Knowledge organization. 29(2002) nos.3/4, pp.135-155
Year
2002
Abstract
Answering open-domain factual questions requires Natural Language Processing for refining document selection and answer identification. With our system QALC, we have participated in the Question Answering track of the TREC8, TREC9 and TREC10 evaluations. QALC performs an analysis of documents relying on multiword term searches and their linguistic variation, both to minimize the number of documents selected and to provide additional clues when comparing question and sentence representations. This comparison process also makes use of the results of a syntactic parsing of the questions and of Named Entity recognition functionalities. Answer extraction relies on the application of syntactic patterns chosen according to the kind of information that is sought, and categorized depending on the syntactic form of the question. These patterns allow QALC to handle linguistic variations nicely at the answer level.
Theme
Computerlinguistik
Retrievalstudien
Sprachretrieval
Object
TREC

Similar documents (author)

  1. Grau, O.: Infos lokal gewoben : die WWW-Sprache HTML und die passende Software (1994) 6.01
    6.0137663 = sum of:
      6.0137663 = weight(author_txt:grau in 7565) [ClassicSimilarity], result of:
        6.0137663 = fieldWeight in 7565, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.625 = fieldNorm(doc=7565)
    
  2. Grau, O.: Alles integriert : Informationssurfen im World Wide Web (1994) 6.01
    6.0137663 = sum of:
      6.0137663 = weight(author_txt:grau in 7612) [ClassicSimilarity], result of:
        6.0137663 = fieldWeight in 7612, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.625 = fieldNorm(doc=7612)
    
  3. Grau, B.: Finding answers to questions, in text collections or Web, in open domain or specialty domains (2012) 6.01
    6.0137663 = sum of:
      6.0137663 = weight(author_txt:grau in 1107) [ClassicSimilarity], result of:
        6.0137663 = fieldWeight in 1107, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.625 = fieldNorm(doc=1107)
    
  4. Grau, J.E.; Mehrotra, R.: Similar shape retrieval using a structural feature index (1993) 4.81
    4.811013 = sum of:
      4.811013 = weight(author_txt:grau in 7331) [ClassicSimilarity], result of:
        4.811013 = fieldWeight in 7331, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.5 = fieldNorm(doc=7331)
    
  5. Ferret, O.; Grau, B.; Masson, N.: Utilisation d'un réseau de cooccurrences lexicales pour améliorer une analyse thématique fondée sur la distribution des mots (1999) 3.61
    3.60826 = sum of:
      3.60826 = weight(author_txt:grau in 295) [ClassicSimilarity], result of:
        3.60826 = fieldWeight in 295, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.622026 = idf(docFreq=7, maxDocs=44421)
          0.375 = fieldNorm(doc=295)
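
The author-match scores above all follow the same arithmetic: fieldWeight = tf × idf × fieldNorm. The sketch below reproduces it, assuming the usual ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); these are consistent with the values listed (idf 9.622026 for docFreq=7, maxDocs=44421). The function names are illustrative only and are not part of any library.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Assumed ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    # Assumed ClassicSimilarity tf: square root of the term frequency
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm, as in the explain listings above
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


if __name__ == "__main__":
    # fieldNorm values taken from the listings for docs 7565, 7331 and 295
    for norm in (0.625, 0.5, 0.375):
        print(norm, round(field_weight(1.0, 7, 44421, norm), 4))
```

Because term frequency and idf are identical for every hit on author_txt:grau, the ranking in this list is decided by fieldNorm alone, which is larger for records with fewer author entries.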
    

Similar documents (content)

  1. Grau, B.: Finding answers to questions, in text collections or Web, in open domain or specialty domains (2012) 0.28
    0.28444117 = sum of:
      0.28444117 = product of:
        0.88887864 = sum of:
          0.10402361 = weight(abstract_txt:factual in 1107) [ClassicSimilarity], result of:
            0.10402361 = score(doc=1107,freq=2.0), product of:
              0.15667228 = queryWeight, product of:
                1.0967051 = boost
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.019017681 = queryNorm
              0.6639567 = fieldWeight in 1107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.0625 = fieldNorm(doc=1107)
          0.093806945 = weight(abstract_txt:clues in 1107) [ClassicSimilarity], result of:
            0.093806945 = score(doc=1107,freq=1.0), product of:
              0.18424861 = queryWeight, product of:
                1.1893122 = boost
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.019017681 = queryNorm
              0.50913244 = fieldWeight in 1107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.0625 = fieldNorm(doc=1107)
          0.024330692 = weight(abstract_txt:documents in 1107) [ClassicSimilarity], result of:
            0.024330692 = score(doc=1107,freq=1.0), product of:
              0.094412 = queryWeight, product of:
                1.2039886 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.019017681 = queryNorm
              0.25770763 = fieldWeight in 1107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.0625 = fieldNorm(doc=1107)
          0.057130236 = weight(abstract_txt:questions in 1107) [ClassicSimilarity], result of:
            0.057130236 = score(doc=1107,freq=2.0), product of:
              0.13238077 = queryWeight, product of:
                1.4256773 = boost
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.019017681 = queryNorm
              0.43155995 = fieldWeight in 1107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.0625 = fieldNorm(doc=1107)
          0.06815876 = weight(abstract_txt:linguistic in 1107) [ClassicSimilarity], result of:
            0.06815876 = score(doc=1107,freq=1.0), product of:
              0.18761693 = queryWeight, product of:
                1.697246 = boost
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.019017681 = queryNorm
              0.36328685 = fieldWeight in 1107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.0625 = fieldNorm(doc=1107)
          0.107547104 = weight(abstract_txt:answer in 1107) [ClassicSimilarity], result of:
            0.107547104 = score(doc=1107,freq=1.0), product of:
              0.29108542 = queryWeight, product of:
                2.5891943 = boost
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.019017681 = queryNorm
              0.36946923 = fieldWeight in 1107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.0625 = fieldNorm(doc=1107)
          0.2154279 = weight(abstract_txt:answering in 1107) [ClassicSimilarity], result of:
            0.2154279 = score(doc=1107,freq=2.0), product of:
              0.36712387 = queryWeight, product of:
                2.9077744 = boost
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.019017681 = queryNorm
              0.58679897 = fieldWeight in 1107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.0625 = fieldNorm(doc=1107)
          0.21845336 = weight(abstract_txt:question in 1107) [ClassicSimilarity], result of:
            0.21845336 = score(doc=1107,freq=5.0), product of:
              0.30050385 = queryWeight, product of:
                3.0377274 = boost
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.019017681 = queryNorm
              0.72695696 = fieldWeight in 1107, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.0625 = fieldNorm(doc=1107)
        0.32 = coord(8/25)
    
  2. Lin, J.; Katz, B.: Building a reusable test collection for question answering (2006) 0.18
    0.18367137 = sum of:
      0.18367137 = product of:
        0.91835684 = sum of:
          0.052677494 = weight(abstract_txt:documents in 45) [ClassicSimilarity], result of:
            0.052677494 = score(doc=45,freq=3.0), product of:
              0.094412 = queryWeight, product of:
                1.2039886 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.019017681 = queryNorm
              0.55795336 = fieldWeight in 45, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=45)
          0.050496474 = weight(abstract_txt:questions in 45) [ClassicSimilarity], result of:
            0.050496474 = score(doc=45,freq=1.0), product of:
              0.13238077 = queryWeight, product of:
                1.4256773 = boost
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.019017681 = queryNorm
              0.38144872 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.078125 = fieldNorm(doc=45)
          0.19011822 = weight(abstract_txt:answer in 45) [ClassicSimilarity], result of:
            0.19011822 = score(doc=45,freq=2.0), product of:
              0.29108542 = queryWeight, product of:
                2.5891943 = boost
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.019017681 = queryNorm
              0.6531355 = fieldWeight in 45, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.078125 = fieldNorm(doc=45)
          0.38082635 = weight(abstract_txt:answering in 45) [ClassicSimilarity], result of:
            0.38082635 = score(doc=45,freq=4.0), product of:
              0.36712387 = queryWeight, product of:
                2.9077744 = boost
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.019017681 = queryNorm
              1.0373238 = fieldWeight in 45, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.078125 = fieldNorm(doc=45)
          0.2442383 = weight(abstract_txt:question in 45) [ClassicSimilarity], result of:
            0.2442383 = score(doc=45,freq=4.0), product of:
              0.30050385 = queryWeight, product of:
                3.0377274 = boost
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.019017681 = queryNorm
              0.8127626 = fieldWeight in 45, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.078125 = fieldNorm(doc=45)
        0.2 = coord(5/25)
    
  3. Saint-Dizier, P.; Moens, M.-F.: Knowledge and reasoning for question answering : research perspectives (2011) 0.18
    0.17970355 = sum of:
      0.17970355 = product of:
        1.1231472 = sum of:
          0.1103337 = weight(abstract_txt:factual in 3746) [ClassicSimilarity], result of:
            0.1103337 = score(doc=3746,freq=1.0), product of:
              0.15667228 = queryWeight, product of:
                1.0967051 = boost
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.019017681 = queryNorm
              0.70423245 = fieldWeight in 3746, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.09375 = fieldNorm(doc=3746)
          0.22814186 = weight(abstract_txt:answer in 3746) [ClassicSimilarity], result of:
            0.22814186 = score(doc=3746,freq=2.0), product of:
              0.29108542 = queryWeight, product of:
                2.5891943 = boost
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.019017681 = queryNorm
              0.7837626 = fieldWeight in 3746, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.09375 = fieldNorm(doc=3746)
          0.45699164 = weight(abstract_txt:answering in 3746) [ClassicSimilarity], result of:
            0.45699164 = score(doc=3746,freq=4.0), product of:
              0.36712387 = queryWeight, product of:
                2.9077744 = boost
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.019017681 = queryNorm
              1.2447886 = fieldWeight in 3746, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.09375 = fieldNorm(doc=3746)
          0.32768008 = weight(abstract_txt:question in 3746) [ClassicSimilarity], result of:
            0.32768008 = score(doc=3746,freq=5.0), product of:
              0.30050385 = queryWeight, product of:
                3.0377274 = boost
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.019017681 = queryNorm
              1.0904355 = fieldWeight in 3746, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.09375 = fieldNorm(doc=3746)
        0.16 = coord(4/25)
    
  4. Liu, Z.; Jansen, B.J.: ASK: A taxonomy of accuracy, social, and knowledge information seeking posts in social question and answering (2017) 0.18
    0.1786976 = sum of:
      0.1786976 = product of:
        0.893488 = sum of:
          0.08079436 = weight(abstract_txt:questions in 4345) [ClassicSimilarity], result of:
            0.08079436 = score(doc=4345,freq=4.0), product of:
              0.13238077 = queryWeight, product of:
                1.4256773 = boost
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.019017681 = queryNorm
              0.61031795 = fieldWeight in 4345, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.0625 = fieldNorm(doc=4345)
          0.107547104 = weight(abstract_txt:answer in 4345) [ClassicSimilarity], result of:
            0.107547104 = score(doc=4345,freq=1.0), product of:
              0.29108542 = queryWeight, product of:
                2.5891943 = boost
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.019017681 = queryNorm
              0.36946923 = fieldWeight in 4345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.0625 = fieldNorm(doc=4345)
          0.20509484 = weight(abstract_txt:syntactic in 4345) [ClassicSimilarity], result of:
            0.20509484 = score(doc=4345,freq=2.0), product of:
              0.3552885 = queryWeight, product of:
                2.86052 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.019017681 = queryNorm
              0.5772628 = fieldWeight in 4345, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0625 = fieldNorm(doc=4345)
          0.30466107 = weight(abstract_txt:answering in 4345) [ClassicSimilarity], result of:
            0.30466107 = score(doc=4345,freq=4.0), product of:
              0.36712387 = queryWeight, product of:
                2.9077744 = boost
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.019017681 = queryNorm
              0.8298591 = fieldWeight in 4345, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.0625 = fieldNorm(doc=4345)
          0.19539063 = weight(abstract_txt:question in 4345) [ClassicSimilarity], result of:
            0.19539063 = score(doc=4345,freq=4.0), product of:
              0.30050385 = queryWeight, product of:
                3.0377274 = boost
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.019017681 = queryNorm
              0.6502101 = fieldWeight in 4345, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.0625 = fieldNorm(doc=4345)
        0.2 = coord(5/25)
    
  5. Galitsky, B.: Can many agents answer questions better than one? (2005) 0.16
    0.1609855 = sum of:
      0.1609855 = product of:
        1.0061594 = sum of:
          0.050496474 = weight(abstract_txt:questions in 4094) [ClassicSimilarity], result of:
            0.050496474 = score(doc=4094,freq=1.0), product of:
              0.13238077 = queryWeight, product of:
                1.4256773 = boost
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.019017681 = queryNorm
              0.38144872 = fieldWeight in 4094, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.078125 = fieldNorm(doc=4094)
          0.19011822 = weight(abstract_txt:answer in 4094) [ClassicSimilarity], result of:
            0.19011822 = score(doc=4094,freq=2.0), product of:
              0.29108542 = queryWeight, product of:
                2.5891943 = boost
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.019017681 = queryNorm
              0.6531355 = fieldWeight in 4094, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9115076 = idf(docFreq=326, maxDocs=44421)
                0.078125 = fieldNorm(doc=4094)
          0.4664151 = weight(abstract_txt:answering in 4094) [ClassicSimilarity], result of:
            0.4664151 = score(doc=4094,freq=6.0), product of:
              0.36712387 = queryWeight, product of:
                2.9077744 = boost
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.019017681 = queryNorm
              1.270457 = fieldWeight in 4094, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.6388726 = idf(docFreq=157, maxDocs=44421)
                0.078125 = fieldNorm(doc=4094)
          0.2991296 = weight(abstract_txt:question in 4094) [ClassicSimilarity], result of:
            0.2991296 = score(doc=4094,freq=6.0), product of:
              0.30050385 = queryWeight, product of:
                3.0377274 = boost
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.019017681 = queryNorm
              0.99542683 = fieldWeight in 4094, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.078125 = fieldNorm(doc=4094)
        0.16 = coord(4/25)
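
The content-match scores combine several terms: each term contributes queryWeight × fieldWeight, the contributions are summed, and the sum is scaled by coord (matched terms / query terms). As a rough check, the sketch below recomputes the score of entry 3 (doc 3746) from the boosts, document frequencies, term frequencies, fieldNorm and queryNorm shown in its listing. It assumes the same tf and idf definitions as ClassicSimilarity (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))); the helper names are hypothetical.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.019017681  # queryNorm as printed in the listings


def idf(doc_freq: int) -> float:
    # Assumed ClassicSimilarity idf
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    # fieldWeight = tf * idf * fieldNorm, with tf = sqrt(freq)
    fld_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * fld_weight


def document_score(terms, field_norm: float, query_terms: int) -> float:
    # Sum the per-term scores, then apply coord = matched / total query terms
    total = sum(term_score(b, df, f, field_norm) for b, df, f in terms)
    return total * (len(terms) / query_terms)


# (boost, docFreq, termFreq) for the terms matched in doc 3746
DOC_3746_TERMS = [
    (1.0967051, 65, 1.0),   # factual
    (2.5891943, 326, 2.0),  # answer
    (2.9077744, 157, 4.0),  # answering
    (3.0377274, 664, 5.0),  # question
]

if __name__ == "__main__":
    print(document_score(DOC_3746_TERMS, field_norm=0.09375, query_terms=25))
```

The result should agree with the listed 0.17970355 up to floating-point rounding; the listings use single precision, so the last digits may differ.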