Document (#38593)

Author
Nédellec, C.
Bossy, R.
Valsamou, D.
Ranoux, M.
Golik, W.
Sourdille, P.
Title
Information extraction from bibliography for marker-assisted selection in wheat
Source
Metadata and semantics research: 8th Research Conference, MTSR 2014, Karlsruhe, Germany, November 27-29, 2014, Proceedings. Eds.: S. Closs et al.
Imprint
Cham : Springer
Year
2014
Pages
pp. 301-313
Series
Communications in computer and information science; 478
Abstract
Improvement of most animal and plant species of agronomical interest in the near future has become an international challenge, both because of the increasing demand for feeding a growing world population and because of the need to mitigate the reduction of industrial resources. The recent advent of genomic tools has improved the discovery of linkage between molecular markers and genes that are involved in the control of traits of agronomical interest, such as grain number or disease resistance. This information is mostly published in scientific papers but is rarely available in databases. Here, we present a method aiming to automatically extract this information from the scientific literature, relying on a knowledge model of the target information and on the WheatPhenotype ontology that we developed for this purpose. The information extraction results were evaluated and integrated into the on-line semantic search engine AlvisIR WheatMarker.
Field
Agricultural sciences

Similar documents (content)

  1. Hofmann-Apitius, M.: Direct use of information extraction from scientific text for modeling and simulation in the life sciences (2009) 0.14
    0.13877045 = sum of:
      0.13877045 = product of:
        0.6938522 = sum of:
          0.17417675 = weight(abstract_txt:disease in 3814) [ClassicSimilarity], result of:
            0.17417675 = score(doc=3814,freq=4.0), product of:
              0.18066995 = queryWeight, product of:
                1.082187 = boost
                7.7124834 = idf(docFreq=53, maxDocs=44421)
                0.021646583 = queryNorm
              0.9640604 = fieldWeight in 3814, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.7124834 = idf(docFreq=53, maxDocs=44421)
                0.0625 = fieldNorm(doc=3814)
          0.14217605 = weight(abstract_txt:molecular in 3814) [ClassicSimilarity], result of:
            0.14217605 = score(doc=3814,freq=2.0), product of:
              0.19881696 = queryWeight, product of:
                1.1352358 = boost
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.021646583 = queryNorm
              0.7151103 = fieldWeight in 3814, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.0625 = fieldNorm(doc=3814)
          0.2507727 = weight(abstract_txt:genes in 3814) [ClassicSimilarity], result of:
            0.2507727 = score(doc=3814,freq=3.0), product of:
              0.25354725 = queryWeight, product of:
                1.2820023 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.021646583 = queryNorm
              0.9890571 = fieldWeight in 3814, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.0625 = fieldNorm(doc=3814)
          0.09985897 = weight(abstract_txt:scientific in 3814) [ClassicSimilarity], result of:
            0.09985897 = score(doc=3814,freq=7.0), product of:
              0.13036105 = queryWeight, product of:
                1.3000147 = boost
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.021646583 = queryNorm
              0.76601845 = fieldWeight in 3814, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.0625 = fieldNorm(doc=3814)
          0.02686773 = weight(abstract_txt:information in 3814) [ClassicSimilarity], result of:
            0.02686773 = score(doc=3814,freq=4.0), product of:
              0.08885935 = queryWeight, product of:
                1.697055 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.021646583 = queryNorm
              0.30236244 = fieldWeight in 3814, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=3814)
        0.2 = coord(5/25)
    
  2. Dextre Clarke, S.G.: The Information Retrieval Thesaurus (2019) 0.10
    0.10095623 = sum of:
      0.10095623 = product of:
        0.420651 = sum of:
          0.014026124 = weight(abstract_txt:this in 210) [ClassicSimilarity], result of:
            0.014026124 = score(doc=210,freq=2.0), product of:
              0.052758772 = queryWeight, product of:
                1.0129018 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.021646583 = queryNorm
              0.26585388 = fieldWeight in 210, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.078125 = fieldNorm(doc=210)
          0.09165406 = weight(abstract_txt:industrial in 210) [ClassicSimilarity], result of:
            0.09165406 = score(doc=210,freq=1.0), product of:
              0.16109186 = queryWeight, product of:
                1.0218712 = boost
                7.282627 = idf(docFreq=82, maxDocs=44421)
                0.021646583 = queryNorm
              0.56895524 = fieldWeight in 210, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.282627 = idf(docFreq=82, maxDocs=44421)
                0.078125 = fieldNorm(doc=210)
          0.18097962 = weight(abstract_txt:genes in 210) [ClassicSimilarity], result of:
            0.18097962 = score(doc=210,freq=1.0), product of:
              0.25354725 = queryWeight, product of:
                1.2820023 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.021646583 = queryNorm
              0.71379054 = fieldWeight in 210, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.078125 = fieldNorm(doc=210)
          0.04717893 = weight(abstract_txt:scientific in 210) [ClassicSimilarity], result of:
            0.04717893 = score(doc=210,freq=1.0), product of:
              0.13036105 = queryWeight, product of:
                1.3000147 = boost
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.021646583 = queryNorm
              0.36190972 = fieldWeight in 210, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.078125 = fieldNorm(doc=210)
          0.063064314 = weight(abstract_txt:interest in 210) [ClassicSimilarity], result of:
            0.063064314 = score(doc=210,freq=1.0), product of:
              0.15818729 = queryWeight, product of:
                1.4320564 = boost
                5.1029587 = idf(docFreq=733, maxDocs=44421)
                0.021646583 = queryNorm
              0.39866865 = fieldWeight in 210, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1029587 = idf(docFreq=733, maxDocs=44421)
                0.078125 = fieldNorm(doc=210)
          0.023747941 = weight(abstract_txt:information in 210) [ClassicSimilarity], result of:
            0.023747941 = score(doc=210,freq=2.0), product of:
              0.08885935 = queryWeight, product of:
                1.697055 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.021646583 = queryNorm
              0.26725316 = fieldWeight in 210, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.078125 = fieldNorm(doc=210)
        0.24 = coord(6/25)
    
  3. Liu, R.-L.: A passage extractor for classification of disease aspect information (2013) 0.08
    0.078377366 = sum of:
      0.078377366 = product of:
        0.39188683 = sum of:
          0.007934375 = weight(abstract_txt:this in 2107) [ClassicSimilarity], result of:
            0.007934375 = score(doc=2107,freq=1.0), product of:
              0.052758772 = queryWeight, product of:
                1.0129018 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.021646583 = queryNorm
              0.15038967 = fieldWeight in 2107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.0625 = fieldNorm(doc=2107)
          0.21332209 = weight(abstract_txt:disease in 2107) [ClassicSimilarity], result of:
            0.21332209 = score(doc=2107,freq=6.0), product of:
              0.18066995 = queryWeight, product of:
                1.082187 = boost
                7.7124834 = idf(docFreq=53, maxDocs=44421)
                0.021646583 = queryNorm
              1.1807281 = fieldWeight in 2107, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.7124834 = idf(docFreq=53, maxDocs=44421)
                0.0625 = fieldNorm(doc=2107)
          0.05045145 = weight(abstract_txt:interest in 2107) [ClassicSimilarity], result of:
            0.05045145 = score(doc=2107,freq=1.0), product of:
              0.15818729 = queryWeight, product of:
                1.4320564 = boost
                5.1029587 = idf(docFreq=733, maxDocs=44421)
                0.021646583 = queryNorm
              0.31893492 = fieldWeight in 2107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1029587 = idf(docFreq=733, maxDocs=44421)
                0.0625 = fieldNorm(doc=2107)
          0.030039037 = weight(abstract_txt:information in 2107) [ClassicSimilarity], result of:
            0.030039037 = score(doc=2107,freq=5.0), product of:
              0.08885935 = queryWeight, product of:
                1.697055 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.021646583 = queryNorm
              0.3380515 = fieldWeight in 2107, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=2107)
          0.090139866 = weight(abstract_txt:extraction in 2107) [ClassicSimilarity], result of:
            0.090139866 = score(doc=2107,freq=1.0), product of:
              0.23291658 = queryWeight, product of:
                1.737699 = boost
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.021646583 = queryNorm
              0.38700494 = fieldWeight in 2107, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0625 = fieldNorm(doc=2107)
        0.2 = coord(5/25)
    
  4. Michon, J.: Biomedicine and the Semantic Web : a knowledge model for visual phenotype (2006) 0.08
    0.07780438 = sum of:
      0.07780438 = product of:
        0.3890219 = sum of:
          0.013742739 = weight(abstract_txt:this in 371) [ClassicSimilarity], result of:
            0.013742739 = score(doc=371,freq=3.0), product of:
              0.052758772 = queryWeight, product of:
                1.0129018 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.021646583 = queryNorm
              0.26048255 = fieldWeight in 371, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.0625 = fieldNorm(doc=371)
          0.08708838 = weight(abstract_txt:disease in 371) [ClassicSimilarity], result of:
            0.08708838 = score(doc=371,freq=1.0), product of:
              0.18066995 = queryWeight, product of:
                1.082187 = boost
                7.7124834 = idf(docFreq=53, maxDocs=44421)
                0.021646583 = queryNorm
              0.4820302 = fieldWeight in 371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7124834 = idf(docFreq=53, maxDocs=44421)
                0.0625 = fieldNorm(doc=371)
          0.10053365 = weight(abstract_txt:molecular in 371) [ClassicSimilarity], result of:
            0.10053365 = score(doc=371,freq=1.0), product of:
              0.19881696 = queryWeight, product of:
                1.1352358 = boost
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.021646583 = queryNorm
              0.50565934 = fieldWeight in 371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.0625 = fieldNorm(doc=371)
          0.15761812 = weight(abstract_txt:genomic in 371) [ClassicSimilarity], result of:
            0.15761812 = score(doc=371,freq=1.0), product of:
              0.26831806 = queryWeight, product of:
                1.3188163 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.021646583 = queryNorm
              0.5874302 = fieldWeight in 371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=371)
          0.030039037 = weight(abstract_txt:information in 371) [ClassicSimilarity], result of:
            0.030039037 = score(doc=371,freq=5.0), product of:
              0.08885935 = queryWeight, product of:
                1.697055 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.021646583 = queryNorm
              0.3380515 = fieldWeight in 371, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0625 = fieldNorm(doc=371)
        0.2 = coord(5/25)
    
  5. Sy, M.-F.; Ranwez, S.; Montmain, J.; Ragnault, A.; Crampes, M.; Ranwez, V.: User centered and ontology based information retrieval system for life sciences (2012) 0.08
    0.07562673 = sum of:
      0.07562673 = product of:
        0.37813365 = sum of:
          0.015524076 = weight(abstract_txt:this in 1699) [ClassicSimilarity], result of:
            0.015524076 = score(doc=1699,freq=5.0), product of:
              0.052758772 = queryWeight, product of:
                1.0129018 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.021646583 = queryNorm
              0.29424635 = fieldWeight in 1699, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1699)
          0.07172383 = weight(abstract_txt:aiming in 1699) [ClassicSimilarity], result of:
            0.07172383 = score(doc=1699,freq=1.0), product of:
              0.17351995 = queryWeight, product of:
                1.0605571 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.021646583 = queryNorm
              0.41334632 = fieldWeight in 1699, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1699)
          0.12668572 = weight(abstract_txt:genes in 1699) [ClassicSimilarity], result of:
            0.12668572 = score(doc=1699,freq=1.0), product of:
              0.25354725 = queryWeight, product of:
                1.2820023 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.021646583 = queryNorm
              0.49965334 = fieldWeight in 1699, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1699)
          0.13791586 = weight(abstract_txt:genomic in 1699) [ClassicSimilarity], result of:
            0.13791586 = score(doc=1699,freq=1.0), product of:
              0.26831806 = queryWeight, product of:
                1.3188163 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.021646583 = queryNorm
              0.5140014 = fieldWeight in 1699, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1699)
          0.026284156 = weight(abstract_txt:information in 1699) [ClassicSimilarity], result of:
            0.026284156 = score(doc=1699,freq=5.0), product of:
              0.08885935 = queryWeight, product of:
                1.697055 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.021646583 = queryNorm
              0.29579505 = fieldWeight in 1699, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1699)
        0.2 = coord(5/25)
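
The score traces above all follow the same arithmetic, which can be reproduced in a few lines. The sketch below is an illustration only: the formulas (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1)), and a final coord factor) are inferred from the numbers printed in the traces and from the usual definition of Lucene's ClassicSimilarity, not taken from the catalogue's own code. The function names and the `TERMS_DOC_3814` table are hypothetical; the input values are copied from the trace for document 3814.

```python
import math

# Constants copied from the trace for doc 3814 (first similar document).
MAX_DOCS = 44421
QUERY_NORM = 0.021646583
FIELD_NORM_3814 = 0.0625

# (term, freq, docFreq, boost) for each matching term, copied from the trace.
TERMS_DOC_3814 = [
    ("disease", 4.0, 53, 1.082187),
    ("molecular", 2.0, 36, 1.1352358),
    ("genes", 3.0, 12, 1.2820023),
    ("scientific", 7.0, 1174, 1.3000147),
    ("information", 4.0, 10748, 1.697055),
]


def idf(doc_freq, max_docs):
    """Inverse document frequency as it appears in the traces."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    """Score of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, max_docs, query_norm, field_norm, matched, total):
    """Sum of the term scores, scaled by coord(matched/total)."""
    summed = sum(
        term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm)
        for _term, freq, doc_freq, boost in terms
    )
    return summed * (matched / total)


if __name__ == "__main__":
    # The trace reports 0.17417675 for "disease" and 0.13877045 overall.
    print(term_score(4.0, 53, MAX_DOCS, 1.082187, QUERY_NORM, FIELD_NORM_3814))
    print(document_score(TERMS_DOC_3814, MAX_DOCS, QUERY_NORM,
                         FIELD_NORM_3814, matched=5, total=25))
```

Small differences in the last digits are to be expected, since the traces appear to have been computed in single precision while Python uses double precision. The coord factor explains why every final score is much lower than the sum of its term scores: only 5 or 6 of the 25 query terms match in each similar document.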