Document (#34849)

Author
Cortez, E.
Silva, A.S. da
Gonçalves, M.A.
Mesquita, F.
Moura, E.S. de
Title
¬A flexible approach for extracting metadata from bibliographic citations
Source
Journal of the American Society for Information Science and Technology. 60(2009) no.6, S.1144-1158
Year
2009
Abstract
In this article we present FLUX-CiM, a novel method for extracting components (e.g., author names, article titles, venues, page numbers) from bibliographic citations. Our method does not rely on patterns encoding specific delimiters used in a particular citation style. This feature yields a high degree of automation and flexibility, and allows FLUX-CiM to extract from citations in any given format. Differently from previous methods that are based on models learned from user-driven training, our method relies on a knowledge base automatically constructed from an existing set of sample metadata records from a given field (e.g., computer science, health sciences, social sciences, etc.). These records are usually available on the Web or other public data repositories. To demonstrate the effectiveness and applicability of our proposed method, we present a series of experiments in which we apply it to extract bibliographic data from citations in articles of different fields. Results of these experiments exhibit precision and recall levels above 94% for all fields, and perfect extraction for the large majority of citations tested. In addition, in a comparison against a state-of-the-art information-extraction method, ours produced superior results without the training phase required by that method. Finally, we present a strategy for using bibliographic data resulting from the extraction process with FLUX-CiM to automatically update and expand the knowledge base of a given domain. We show that this strategy can be used to achieve good extraction results even if only a very small initial sample of bibliographic records is available for building the knowledge base.
Theme
Formalerschließung
Object
FLUX-CiM

Similar documents (author)

  1. Cortez, E.; Herrera, M.R.; Silva, A.S. da; Moura, E.S. de; Neubert, M.: Lightweight methods for large-scale product categorization (2011) 2.45
    2.4460807 = sum of:
      2.4460807 = product of:
        3.261441 = sum of:
          0.6999512 = weight(author_txt:silva in 4758) [ClassicSimilarity], result of:
            0.6999512 = score(doc=4758,freq=1.0), product of:
              0.37294763 = queryWeight, product of:
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.049678445 = queryNorm
              1.8768082 = fieldWeight in 4758, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.25 = fieldNorm(doc=4758)
          1.0898302 = weight(author_txt:moura in 4758) [ClassicSimilarity], result of:
            1.0898302 = score(doc=4758,freq=1.0), product of:
              0.5010048 = queryWeight, product of:
                1.1590363 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.049678445 = queryNorm
              2.1752887 = fieldWeight in 4758, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.25 = fieldNorm(doc=4758)
          1.4716597 = weight(author_txt:cortez in 4758) [ClassicSimilarity], result of:
            1.4716597 = score(doc=4758,freq=1.0), product of:
              0.6120792 = queryWeight, product of:
                1.2810907 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.049678445 = queryNorm
              2.4043615 = fieldWeight in 4758, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.25 = fieldNorm(doc=4758)
        0.75 = coord(3/4)
    
  2. Moura, E.S. de; Fernandes, D.; Ribeiro-Neto, B.; Silva, A.S. da; Gonçalves, M.A.: Using structural information to improve search in Web collections (2010) 2.12
    2.1209507 = sum of:
      2.1209507 = product of:
        2.8279343 = sum of:
          0.6999512 = weight(author_txt:silva in 4119) [ClassicSimilarity], result of:
            0.6999512 = score(doc=4119,freq=1.0), product of:
              0.37294763 = queryWeight, product of:
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.049678445 = queryNorm
              1.8768082 = fieldWeight in 4119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.25 = fieldNorm(doc=4119)
          1.038153 = weight(author_txt:gonçalves in 4119) [ClassicSimilarity], result of:
            1.038153 = score(doc=4119,freq=1.0), product of:
              0.48503935 = queryWeight, product of:
                1.1404192 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.049678445 = queryNorm
              2.1403482 = fieldWeight in 4119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.25 = fieldNorm(doc=4119)
          1.0898302 = weight(author_txt:moura in 4119) [ClassicSimilarity], result of:
            1.0898302 = score(doc=4119,freq=1.0), product of:
              0.5010048 = queryWeight, product of:
                1.1590363 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.049678445 = queryNorm
              2.1752887 = fieldWeight in 4119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.25 = fieldNorm(doc=4119)
        0.75 = coord(3/4)
    
  3. Silva, R.M.; Gonçalves, M.A.; Veloso, A.: ¬A Two-stage active learning method for learning to rank (2014) 1.30
    1.3035781 = sum of:
      1.3035781 = product of:
        2.6071563 = sum of:
          1.0499268 = weight(author_txt:silva in 1184) [ClassicSimilarity], result of:
            1.0499268 = score(doc=1184,freq=1.0), product of:
              0.37294763 = queryWeight, product of:
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.049678445 = queryNorm
              2.8152122 = fieldWeight in 1184, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5072327 = idf(docFreq=65, maxDocs=44218)
                0.375 = fieldNorm(doc=1184)
          1.5572296 = weight(author_txt:gonçalves in 1184) [ClassicSimilarity], result of:
            1.5572296 = score(doc=1184,freq=1.0), product of:
              0.48503935 = queryWeight, product of:
                1.1404192 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.049678445 = queryNorm
              3.2105222 = fieldWeight in 1184, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.375 = fieldNorm(doc=1184)
        0.5 = coord(2/4)
    
  4. Calado, P.; Cristo, M.; Gonçalves, M.A.; Moura, E.S. de; Ribeiro-Neto, B.; Ziviani, N.: Link-based similarity measures for the classification of Web documents (2006) 1.06
    1.0639915 = sum of:
      1.0639915 = product of:
        2.127983 = sum of:
          1.038153 = weight(author_txt:gonçalves in 4921) [ClassicSimilarity], result of:
            1.038153 = score(doc=4921,freq=1.0), product of:
              0.48503935 = queryWeight, product of:
                1.1404192 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.049678445 = queryNorm
              2.1403482 = fieldWeight in 4921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.25 = fieldNorm(doc=4921)
          1.0898302 = weight(author_txt:moura in 4921) [ClassicSimilarity], result of:
            1.0898302 = score(doc=4921,freq=1.0), product of:
              0.5010048 = queryWeight, product of:
                1.1590363 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.049678445 = queryNorm
              2.1752887 = fieldWeight in 4921, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.25 = fieldNorm(doc=4921)
        0.5 = coord(2/4)
    
  5. Couto, T.; Cristo, M.; Gonçalves, M.A.; Calado, P.; Ziviani, N.; Moura, E.; Ribeiro-Neto, B.: ¬A comparative study of citations and links in document classification (2006) 1.06
    1.0639915 = sum of:
      1.0639915 = product of:
        2.127983 = sum of:
          1.038153 = weight(author_txt:gonçalves in 2531) [ClassicSimilarity], result of:
            1.038153 = score(doc=2531,freq=1.0), product of:
              0.48503935 = queryWeight, product of:
                1.1404192 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.049678445 = queryNorm
              2.1403482 = fieldWeight in 2531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.25 = fieldNorm(doc=2531)
          1.0898302 = weight(author_txt:moura in 2531) [ClassicSimilarity], result of:
            1.0898302 = score(doc=2531,freq=1.0), product of:
              0.5010048 = queryWeight, product of:
                1.1590363 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.049678445 = queryNorm
              2.1752887 = fieldWeight in 2531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.25 = fieldNorm(doc=2531)
        0.5 = coord(2/4)
    

Similar documents (content)

  1. Lawson, M.: Automatic extraction of citations from the text of English-language patents : an example of template mining (1996) 0.31
    0.31369016 = sum of:
      0.31369016 = product of:
        0.78422534 = sum of:
          0.032637365 = weight(abstract_txt:data in 2654) [ClassicSimilarity], result of:
            0.032637365 = score(doc=2654,freq=5.0), product of:
              0.06999689 = queryWeight, product of:
                1.076358 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.019491745 = queryNorm
              0.46626878 = fieldWeight in 2654, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=2654)
          0.016598178 = weight(abstract_txt:results in 2654) [ClassicSimilarity], result of:
            0.016598178 = score(doc=2654,freq=1.0), product of:
              0.07626038 = queryWeight, product of:
                1.1234838 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.019491745 = queryNorm
              0.21765138 = fieldWeight in 2654, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=2654)
          0.043994993 = weight(abstract_txt:automatically in 2654) [ClassicSimilarity], result of:
            0.043994993 = score(doc=2654,freq=1.0), product of:
              0.12759405 = queryWeight, product of:
                1.1865524 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.019491745 = queryNorm
              0.3448044 = fieldWeight in 2654, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=2654)
          0.044805862 = weight(abstract_txt:sample in 2654) [ClassicSimilarity], result of:
            0.044805862 = score(doc=2654,freq=1.0), product of:
              0.12915707 = queryWeight, product of:
                1.1937978 = boost
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.019491745 = queryNorm
              0.34690988 = fieldWeight in 2654, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.550558 = idf(docFreq=466, maxDocs=44218)
                0.0625 = fieldNorm(doc=2654)
          0.07522607 = weight(abstract_txt:extract in 2654) [ClassicSimilarity], result of:
            0.07522607 = score(doc=2654,freq=1.0), product of:
              0.18244858 = queryWeight, product of:
                1.4188678 = boost
                6.5970206 = idf(docFreq=163, maxDocs=44218)
                0.019491745 = queryNorm
              0.4123138 = fieldWeight in 2654, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5970206 = idf(docFreq=163, maxDocs=44218)
                0.0625 = fieldNorm(doc=2654)
          0.08737967 = weight(abstract_txt:extracting in 2654) [ClassicSimilarity], result of:
            0.08737967 = score(doc=2654,freq=1.0), product of:
              0.20160526 = queryWeight, product of:
                1.4914979 = boost
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.019491745 = queryNorm
              0.4334196 = fieldWeight in 2654, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.0625 = fieldNorm(doc=2654)
          0.049560636 = weight(abstract_txt:bibliographic in 2654) [ClassicSimilarity], result of:
            0.049560636 = score(doc=2654,freq=1.0), product of:
              0.18748485 = queryWeight, product of:
                2.2741797 = boost
                4.229516 = idf(docFreq=1749, maxDocs=44218)
                0.019491745 = queryNorm
              0.26434475 = fieldWeight in 2654, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.229516 = idf(docFreq=1749, maxDocs=44218)
                0.0625 = fieldNorm(doc=2654)
          0.17590132 = weight(abstract_txt:extraction in 2654) [ClassicSimilarity], result of:
            0.17590132 = score(doc=2654,freq=2.0), product of:
              0.32142106 = queryWeight, product of:
                2.6633232 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.019491745 = queryNorm
              0.54726136 = fieldWeight in 2654, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=2654)
          0.03520555 = weight(abstract_txt:from in 2654) [ClassicSimilarity], result of:
            0.03520555 = score(doc=2654,freq=2.0), product of:
              0.1441108 = queryWeight, product of:
                2.675015 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.019491745 = queryNorm
              0.24429502 = fieldWeight in 2654, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=2654)
          0.22291568 = weight(abstract_txt:citations in 2654) [ClassicSimilarity], result of:
            0.22291568 = score(doc=2654,freq=5.0), product of:
              0.2987528 = queryWeight, product of:
                2.8707654 = boost
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.019491745 = queryNorm
              0.74615425 = fieldWeight in 2654, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.0625 = fieldNorm(doc=2654)
        0.4 = coord(10/25)
    
  2. Cota, R.G.; Ferreira, A.A.; Nascimento, C.; Gonçalves, M.A.; Laender, A.H.F.: ¬An unsupervised heuristic-based hierarchical method for name disambiguation in bibliographic citations (2010) 0.24
    0.24199165 = sum of:
      0.24199165 = product of:
        0.672199 = sum of:
          0.105342224 = weight(abstract_txt:ours in 3986) [ClassicSimilarity], result of:
            0.105342224 = score(doc=3986,freq=1.0), product of:
              0.18125358 = queryWeight, product of:
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.019491745 = queryNorm
              0.581187 = fieldWeight in 3986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.298992 = idf(docFreq=10, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.035004336 = weight(abstract_txt:training in 3986) [ClassicSimilarity], result of:
            0.035004336 = score(doc=3986,freq=1.0), product of:
              0.10955768 = queryWeight, product of:
                1.0994946 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.019491745 = queryNorm
              0.319506 = fieldWeight in 3986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.028748887 = weight(abstract_txt:results in 3986) [ClassicSimilarity], result of:
            0.028748887 = score(doc=3986,freq=3.0), product of:
              0.07626038 = queryWeight, product of:
                1.1234838 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.019491745 = queryNorm
              0.37698326 = fieldWeight in 3986, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.03968335 = weight(abstract_txt:experiments in 3986) [ClassicSimilarity], result of:
            0.03968335 = score(doc=3986,freq=1.0), product of:
              0.11911519 = queryWeight, product of:
                1.1464504 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.019491745 = queryNorm
              0.33315104 = fieldWeight in 3986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.04571748 = weight(abstract_txt:present in 3986) [ClassicSimilarity], result of:
            0.04571748 = score(doc=3986,freq=2.0), product of:
              0.11893333 = queryWeight, product of:
                1.403037 = boost
                4.348943 = idf(docFreq=1552, maxDocs=44218)
                0.019491745 = queryNorm
              0.3843959 = fieldWeight in 3986, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.348943 = idf(docFreq=1552, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.049560636 = weight(abstract_txt:bibliographic in 3986) [ClassicSimilarity], result of:
            0.049560636 = score(doc=3986,freq=1.0), product of:
              0.18748485 = queryWeight, product of:
                2.2741797 = boost
                4.229516 = idf(docFreq=1749, maxDocs=44218)
                0.019491745 = queryNorm
              0.26434475 = fieldWeight in 3986, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.229516 = idf(docFreq=1749, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.03520555 = weight(abstract_txt:from in 3986) [ClassicSimilarity], result of:
            0.03520555 = score(doc=3986,freq=2.0), product of:
              0.1441108 = queryWeight, product of:
                2.675015 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.019491745 = queryNorm
              0.24429502 = fieldWeight in 3986, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.17266974 = weight(abstract_txt:citations in 3986) [ClassicSimilarity], result of:
            0.17266974 = score(doc=3986,freq=3.0), product of:
              0.2987528 = queryWeight, product of:
                2.8707654 = boost
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.019491745 = queryNorm
              0.5779686 = fieldWeight in 3986, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.339045 = idf(docFreq=576, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
          0.16026682 = weight(abstract_txt:method in 3986) [ClassicSimilarity], result of:
            0.16026682 = score(doc=3986,freq=5.0), product of:
              0.25478533 = queryWeight, product of:
                2.9041533 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.019491745 = queryNorm
              0.6290269 = fieldWeight in 3986, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=3986)
        0.36 = coord(9/25)
    
  3. Riloff, E.: ¬An empirical study of automated dictionary construction for information extraction in three domains (1996) 0.23
    0.2301445 = sum of:
      0.2301445 = product of:
        0.71920156 = sum of:
          0.06125759 = weight(abstract_txt:training in 6752) [ClassicSimilarity], result of:
            0.06125759 = score(doc=6752,freq=1.0), product of:
              0.10955768 = queryWeight, product of:
                1.0994946 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.019491745 = queryNorm
              0.5591355 = fieldWeight in 6752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.109375 = fieldNorm(doc=6752)
          0.029046811 = weight(abstract_txt:results in 6752) [ClassicSimilarity], result of:
            0.029046811 = score(doc=6752,freq=1.0), product of:
              0.07626038 = queryWeight, product of:
                1.1234838 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.019491745 = queryNorm
              0.38088992 = fieldWeight in 6752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.109375 = fieldNorm(doc=6752)
          0.030843401 = weight(abstract_txt:knowledge in 6752) [ClassicSimilarity], result of:
            0.030843401 = score(doc=6752,freq=1.0), product of:
              0.07937337 = queryWeight, product of:
                1.146185 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.019491745 = queryNorm
              0.38858628 = fieldWeight in 6752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.109375 = fieldNorm(doc=6752)
          0.09821128 = weight(abstract_txt:experiments in 6752) [ClassicSimilarity], result of:
            0.09821128 = score(doc=6752,freq=2.0), product of:
              0.11911519 = queryWeight, product of:
                1.1464504 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.019491745 = queryNorm
              0.82450676 = fieldWeight in 6752, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.109375 = fieldNorm(doc=6752)
          0.07699124 = weight(abstract_txt:automatically in 6752) [ClassicSimilarity], result of:
            0.07699124 = score(doc=6752,freq=1.0), product of:
              0.12759405 = queryWeight, product of:
                1.1865524 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.019491745 = queryNorm
              0.60340774 = fieldWeight in 6752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.109375 = fieldNorm(doc=6752)
          0.07145924 = weight(abstract_txt:given in 6752) [ClassicSimilarity], result of:
            0.07145924 = score(doc=6752,freq=1.0), product of:
              0.13897572 = queryWeight, product of:
                1.516655 = boost
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.019491745 = queryNorm
              0.5141851 = fieldWeight in 6752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.701121 = idf(docFreq=1091, maxDocs=44218)
                0.109375 = fieldNorm(doc=6752)
          0.30782732 = weight(abstract_txt:extraction in 6752) [ClassicSimilarity], result of:
            0.30782732 = score(doc=6752,freq=2.0), product of:
              0.32142106 = queryWeight, product of:
                2.6633232 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.019491745 = queryNorm
              0.9577074 = fieldWeight in 6752, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.109375 = fieldNorm(doc=6752)
          0.043564647 = weight(abstract_txt:from in 6752) [ClassicSimilarity], result of:
            0.043564647 = score(doc=6752,freq=1.0), product of:
              0.1441108 = queryWeight, product of:
                2.675015 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.019491745 = queryNorm
              0.30229968 = fieldWeight in 6752, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.109375 = fieldNorm(doc=6752)
        0.32 = coord(8/25)
    
  4. Ru, C.; Tang, J.; Li, S.; Xie, S.; Wang, T.: Using semantic similarity to reduce wrong labels in distant supervision for relation extraction (2018) 0.23
    0.23003969 = sum of:
      0.23003969 = product of:
        0.6389991 = sum of:
          0.029191747 = weight(abstract_txt:data in 5055) [ClassicSimilarity], result of:
            0.029191747 = score(doc=5055,freq=4.0), product of:
              0.06999689 = queryWeight, product of:
                1.076358 = boost
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.019491745 = queryNorm
              0.41704348 = fieldWeight in 5055, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3363478 = idf(docFreq=4274, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.035004336 = weight(abstract_txt:training in 5055) [ClassicSimilarity], result of:
            0.035004336 = score(doc=5055,freq=1.0), product of:
              0.10955768 = queryWeight, product of:
                1.0994946 = boost
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.019491745 = queryNorm
              0.319506 = fieldWeight in 5055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.112096 = idf(docFreq=723, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.016598178 = weight(abstract_txt:results in 5055) [ClassicSimilarity], result of:
            0.016598178 = score(doc=5055,freq=1.0), product of:
              0.07626038 = queryWeight, product of:
                1.1234838 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.019491745 = queryNorm
              0.21765138 = fieldWeight in 5055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.017624801 = weight(abstract_txt:knowledge in 5055) [ClassicSimilarity], result of:
            0.017624801 = score(doc=5055,freq=1.0), product of:
              0.07937337 = queryWeight, product of:
                1.146185 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.019491745 = queryNorm
              0.2220493 = fieldWeight in 5055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.062218312 = weight(abstract_txt:automatically in 5055) [ClassicSimilarity], result of:
            0.062218312 = score(doc=5055,freq=2.0), product of:
              0.12759405 = queryWeight, product of:
                1.1865524 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.019491745 = queryNorm
              0.48762706 = fieldWeight in 5055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.07025204 = weight(abstract_txt:base in 5055) [ClassicSimilarity], result of:
            0.07025204 = score(doc=5055,freq=1.0), product of:
              0.19954062 = queryWeight, product of:
                1.8173265 = boost
                5.633102 = idf(docFreq=429, maxDocs=44218)
                0.019491745 = queryNorm
              0.35206887 = fieldWeight in 5055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.633102 = idf(docFreq=429, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.24876204 = weight(abstract_txt:extraction in 5055) [ClassicSimilarity], result of:
            0.24876204 = score(doc=5055,freq=4.0), product of:
              0.32142106 = queryWeight, product of:
                2.6633232 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.019491745 = queryNorm
              0.77394444 = fieldWeight in 5055, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.03520555 = weight(abstract_txt:from in 5055) [ClassicSimilarity], result of:
            0.03520555 = score(doc=5055,freq=2.0), product of:
              0.1441108 = queryWeight, product of:
                2.675015 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.019491745 = queryNorm
              0.24429502 = fieldWeight in 5055, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
          0.12414214 = weight(abstract_txt:method in 5055) [ClassicSimilarity], result of:
            0.12414214 = score(doc=5055,freq=3.0), product of:
              0.25478533 = queryWeight, product of:
                2.9041533 = boost
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.019491745 = queryNorm
              0.4872421 = fieldWeight in 5055, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.50095 = idf(docFreq=1333, maxDocs=44218)
                0.0625 = fieldNorm(doc=5055)
        0.36 = coord(9/25)
    
  5. Li, J.; Zhang, Z.; Li, X.; Chen, H.: Kernel-based learning for biomedical relation extraction (2008) 0.22
    0.21974808 = sum of:
      0.21974808 = product of:
        0.68671274 = sum of:
          0.020747723 = weight(abstract_txt:results in 1611) [ClassicSimilarity], result of:
            0.020747723 = score(doc=1611,freq=1.0), product of:
              0.07626038 = queryWeight, product of:
                1.1234838 = boost
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.019491745 = queryNorm
              0.27206424 = fieldWeight in 1611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.482422 = idf(docFreq=3693, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.022031 = weight(abstract_txt:knowledge in 1611) [ClassicSimilarity], result of:
            0.022031 = score(doc=1611,freq=1.0), product of:
              0.07937337 = queryWeight, product of:
                1.146185 = boost
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.019491745 = queryNorm
              0.2775616 = fieldWeight in 1611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5527887 = idf(docFreq=3442, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.04960419 = weight(abstract_txt:experiments in 1611) [ClassicSimilarity], result of:
            0.04960419 = score(doc=1611,freq=1.0), product of:
              0.11911519 = queryWeight, product of:
                1.1464504 = boost
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.019491745 = queryNorm
              0.41643882 = fieldWeight in 1611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3304167 = idf(docFreq=581, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.07777289 = weight(abstract_txt:automatically in 1611) [ClassicSimilarity], result of:
            0.07777289 = score(doc=1611,freq=2.0), product of:
              0.12759405 = queryWeight, product of:
                1.1865524 = boost
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.019491745 = queryNorm
              0.60953385 = fieldWeight in 1611, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5168705 = idf(docFreq=482, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.094032586 = weight(abstract_txt:extract in 1611) [ClassicSimilarity], result of:
            0.094032586 = score(doc=1611,freq=1.0), product of:
              0.18244858 = queryWeight, product of:
                1.4188678 = boost
                6.5970206 = idf(docFreq=163, maxDocs=44218)
                0.019491745 = queryNorm
              0.51539224 = fieldWeight in 1611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5970206 = idf(docFreq=163, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.109224595 = weight(abstract_txt:extracting in 1611) [ClassicSimilarity], result of:
            0.109224595 = score(doc=1611,freq=1.0), product of:
              0.20160526 = queryWeight, product of:
                1.4914979 = boost
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.019491745 = queryNorm
              0.5417745 = fieldWeight in 1611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9347134 = idf(docFreq=116, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.2692928 = weight(abstract_txt:extraction in 1611) [ClassicSimilarity], result of:
            0.2692928 = score(doc=1611,freq=3.0), product of:
              0.32142106 = queryWeight, product of:
                2.6633232 = boost
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.019491745 = queryNorm
              0.83781946 = fieldWeight in 1611, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1915555 = idf(docFreq=245, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
          0.04400694 = weight(abstract_txt:from in 1611) [ClassicSimilarity], result of:
            0.04400694 = score(doc=1611,freq=2.0), product of:
              0.1441108 = queryWeight, product of:
                2.675015 = boost
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.019491745 = queryNorm
              0.30536878 = fieldWeight in 1611, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.7638826 = idf(docFreq=7577, maxDocs=44218)
                0.078125 = fieldNorm(doc=1611)
        0.32 = coord(8/25)