Document (#43095)

Author
Díez Platas, M.L.
Muñoz, S.R.
González-Blanco, E.
Ruiz Fabo, P.
Álvarez Mellado, E.
Title
Medieval Spanish (12th-15th centuries) named entity recognition and attribute annotation system based on contextual information
Source
Journal of the Association for Information Science and Technology. 72(2021) no.2, S.224-238
Year
2021
Abstract
The recognition of named entities in Spanish medieval texts presents great complexity, involving specific challenges: First, the complex morphosyntactic characteristics in proper-noun use in medieval texts. Second, the lack of strict orthographic standards. Finally, diachronic and geographical variations in Spanish from the 12th to 15th century. In this period, named entities usually appear as complex text structure. For example, it was frequent to add nicknames and information about the persons role in society and geographic origin. To tackle this complexity, named entity recognition and classification system has been implemented. The system uses contextual cues based on semantics to detect entities and assign a type. Given the occurrence of entities with attached attributes, entity contexts are also parsed to determine entity-type-specific dependencies for these attributes. Moreover, it uses a variant generator to handle the diachronic evolution of Spanish medieval terms from a phonetic and morphosyntactic viewpoint. The tool iteratively enriches its proper lexica, dictionaries, and gazetteers. The system was evaluated on a corpus of over 3,000 manually annotated entities of different types and periods, obtaining F1 scores between 0.74 and 0.87. Attribute annotation was evaluated for a person and role name attributes with an overall F1 of 0.75.
Content
Vgl.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24399.
Theme
Formalerschließung
Form
Handschriften
Inkunabeln
Location
ES

Similar documents (author)

  1. Blanco, E. González- => González-Blanco, E.: 1.49
    1.492199 = sum of:
      1.492199 = product of:
        3.7304974 = sum of:
          1.4707742 = weight(author_txt:gonzález in 94) [ClassicSimilarity], result of:
            1.4707742 = score(doc=94,freq=2.0), product of:
              0.3473893 = queryWeight, product of:
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.04351442 = queryNorm
              4.2337923 = fieldWeight in 94, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.375 = fieldNorm(doc=94)
          2.2597232 = weight(author_txt:blanco in 94) [ClassicSimilarity], result of:
            2.2597232 = score(doc=94,freq=2.0), product of:
              0.46254712 = queryWeight, product of:
                1.1539042 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.04351442 = queryNorm
              4.8853903 = fieldWeight in 94, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.375 = fieldNorm(doc=94)
        0.4 = coord(2/5)
    
  2. Blanco, E. González- => González-Blanco, E.: 1.49
    1.492199 = sum of:
      1.492199 = product of:
        3.7304974 = sum of:
          1.4707742 = weight(author_txt:gonzález in 467) [ClassicSimilarity], result of:
            1.4707742 = score(doc=467,freq=2.0), product of:
              0.3473893 = queryWeight, product of:
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.04351442 = queryNorm
              4.2337923 = fieldWeight in 467, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.375 = fieldNorm(doc=467)
          2.2597232 = weight(author_txt:blanco in 467) [ClassicSimilarity], result of:
            2.2597232 = score(doc=467,freq=2.0), product of:
              0.46254712 = queryWeight, product of:
                1.1539042 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.04351442 = queryNorm
              4.8853903 = fieldWeight in 467, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.375 = fieldNorm(doc=467)
        0.4 = coord(2/5)
    
  3. Martínez-González, M.M.; Alvite-Díez, M.L.: Thesauri and Semantic Web : discussion of the evolution of thesauri toward their integration with the Semantic Web (2019) 1.21
    1.210548 = sum of:
      1.210548 = product of:
        3.02637 = sum of:
          1.0399944 = weight(author_txt:gonzález in 5997) [ClassicSimilarity], result of:
            1.0399944 = score(doc=5997,freq=1.0), product of:
              0.3473893 = queryWeight, product of:
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.04351442 = queryNorm
              2.9937432 = fieldWeight in 5997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.375 = fieldNorm(doc=5997)
          1.9863758 = weight(author_txt:díez in 5997) [ClassicSimilarity], result of:
            1.9863758 = score(doc=5997,freq=1.0), product of:
              0.53477377 = queryWeight, product of:
                1.2407286 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.04351442 = queryNorm
              3.7144227 = fieldWeight in 5997, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.375 = fieldNorm(doc=5997)
        0.4 = coord(2/5)
    
  4. Moreiro González, J.A.; Franco Álvarez, G.; Garcia Martul, D.: ¬Un vocabulario controlado para una hemerotecá : posibilidades y características de los topicsets (2007) 0.91
    0.9115414 = sum of:
      0.9115414 = product of:
        2.2788534 = sum of:
          0.866662 = weight(author_txt:gonzález in 1117) [ClassicSimilarity], result of:
            0.866662 = score(doc=1117,freq=1.0), product of:
              0.3473893 = queryWeight, product of:
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.04351442 = queryNorm
              2.494786 = fieldWeight in 1117, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.3125 = fieldNorm(doc=1117)
          1.4121915 = weight(author_txt:álvarez in 1117) [ClassicSimilarity], result of:
            1.4121915 = score(doc=1117,freq=1.0), product of:
              0.48103762 = queryWeight, product of:
                1.1767421 = boost
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.04351442 = queryNorm
              2.9357195 = fieldWeight in 1117, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.394302 = idf(docFreq=9, maxDocs=44218)
                0.3125 = fieldNorm(doc=1117)
        0.4 = coord(2/5)
    
  5. Pérez Pozo, Á.; Rosa, J. de la; Ros, S.; González-Blanco, E.; Hernández, L.; Sisto, M. de: ¬A bridge too far for artificial intelligence? : automatic classification of stanzas in Spanish poetry (2022) 0.62
    0.6155007 = sum of:
      0.6155007 = product of:
        1.5387517 = sum of:
          0.6066634 = weight(author_txt:gonzález in 468) [ClassicSimilarity], result of:
            0.6066634 = score(doc=468,freq=1.0), product of:
              0.3473893 = queryWeight, product of:
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.04351442 = queryNorm
              1.7463502 = fieldWeight in 468, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.983315 = idf(docFreq=40, maxDocs=44218)
                0.21875 = fieldNorm(doc=468)
          0.9320883 = weight(author_txt:blanco in 468) [ClassicSimilarity], result of:
            0.9320883 = score(doc=468,freq=1.0), product of:
              0.46254712 = queryWeight, product of:
                1.1539042 = boost
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.04351442 = queryNorm
              2.0151207 = fieldWeight in 468, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.211981 = idf(docFreq=11, maxDocs=44218)
                0.21875 = fieldNorm(doc=468)
        0.4 = coord(2/5)
    

Similar documents (content)

  1. Shaalan, K.; Raza, H.: NERA: Named Entity Recognition for Arabic (2009) 0.27
    0.27194077 = sum of:
      0.27194077 = product of:
        0.84981495 = sum of:
          0.07263395 = weight(abstract_txt:orthographic in 2953) [ClassicSimilarity], result of:
            0.07263395 = score(doc=2953,freq=1.0), product of:
              0.13809942 = queryWeight, product of:
                1.0440149 = boost
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.013753885 = queryNorm
              0.52595407 = fieldWeight in 2953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.617446 = idf(docFreq=7, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2953)
          0.030529745 = weight(abstract_txt:evaluated in 2953) [ClassicSimilarity], result of:
            0.030529745 = score(doc=2953,freq=1.0), product of:
              0.0976317 = queryWeight, product of:
                1.2414271 = boost
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.013753885 = queryNorm
              0.3127032 = fieldWeight in 2953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7180014 = idf(docFreq=394, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2953)
          0.033761967 = weight(abstract_txt:complexity in 2953) [ClassicSimilarity], result of:
            0.033761967 = score(doc=2953,freq=1.0), product of:
              0.10440643 = queryWeight, product of:
                1.2837765 = boost
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.013753885 = queryNorm
              0.32337058 = fieldWeight in 2953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.913062 = idf(docFreq=324, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2953)
          0.017714027 = weight(abstract_txt:system in 2953) [ClassicSimilarity], result of:
            0.017714027 = score(doc=2953,freq=2.0), product of:
              0.06791832 = queryWeight, product of:
                1.4643141 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.013753885 = queryNorm
              0.26081368 = fieldWeight in 2953, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2953)
          0.097296305 = weight(abstract_txt:recognition in 2953) [ClassicSimilarity], result of:
            0.097296305 = score(doc=2953,freq=3.0), product of:
              0.16781457 = queryWeight, product of:
                1.9933623 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.013753885 = queryNorm
              0.57978463 = fieldWeight in 2953, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2953)
          0.16149995 = weight(abstract_txt:entity in 2953) [ClassicSimilarity], result of:
            0.16149995 = score(doc=2953,freq=4.0), product of:
              0.23525941 = queryWeight, product of:
                2.7252998 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.013753885 = queryNorm
              0.68647605 = fieldWeight in 2953, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2953)
          0.27431238 = weight(abstract_txt:named in 2953) [ClassicSimilarity], result of:
            0.27431238 = score(doc=2953,freq=7.0), product of:
              0.2779178 = queryWeight, product of:
                2.9620948 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.013753885 = queryNorm
              0.98702705 = fieldWeight in 2953, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2953)
          0.16206662 = weight(abstract_txt:entities in 2953) [ClassicSimilarity], result of:
            0.16206662 = score(doc=2953,freq=4.0), product of:
              0.25401798 = queryWeight, product of:
                3.1661248 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.013753885 = queryNorm
              0.6380124 = fieldWeight in 2953, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2953)
        0.32 = coord(8/25)
    
  2. Vlachidis, A.; Tudhope, D.: ¬A knowledge-based approach to information extraction for semantic interoperability in the archaeology domain (2016) 0.15
    0.15173252 = sum of:
      0.15173252 = product of:
        0.5419018 = sum of:
          0.0483274 = weight(abstract_txt:contextual in 2895) [ClassicSimilarity], result of:
            0.0483274 = score(doc=2895,freq=1.0), product of:
              0.12131367 = queryWeight, product of:
                1.3838234 = boost
                6.373877 = idf(docFreq=204, maxDocs=44218)
                0.013753885 = queryNorm
              0.39836732 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.373877 = idf(docFreq=204, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.014315096 = weight(abstract_txt:system in 2895) [ClassicSimilarity], result of:
            0.014315096 = score(doc=2895,freq=1.0), product of:
              0.06791832 = queryWeight, product of:
                1.4643141 = boost
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.013753885 = queryNorm
              0.21076928 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3723085 = idf(docFreq=4123, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.06467653 = weight(abstract_txt:annotation in 2895) [ClassicSimilarity], result of:
            0.06467653 = score(doc=2895,freq=1.0), product of:
              0.14732572 = queryWeight, product of:
                1.5249833 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.013753885 = queryNorm
              0.43900365 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.11119578 = weight(abstract_txt:recognition in 2895) [ClassicSimilarity], result of:
            0.11119578 = score(doc=2895,freq=3.0), product of:
              0.16781457 = queryWeight, product of:
                1.9933623 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.013753885 = queryNorm
              0.662611 = fieldWeight in 2895, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.092285685 = weight(abstract_txt:entity in 2895) [ClassicSimilarity], result of:
            0.092285685 = score(doc=2895,freq=1.0), product of:
              0.23525941 = queryWeight, product of:
                2.7252998 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.013753885 = queryNorm
              0.39227203 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.11849182 = weight(abstract_txt:named in 2895) [ClassicSimilarity], result of:
            0.11849182 = score(doc=2895,freq=1.0), product of:
              0.2779178 = queryWeight, product of:
                2.9620948 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.013753885 = queryNorm
              0.42635563 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
          0.092609495 = weight(abstract_txt:entities in 2895) [ClassicSimilarity], result of:
            0.092609495 = score(doc=2895,freq=1.0), product of:
              0.25401798 = queryWeight, product of:
                3.1661248 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.013753885 = queryNorm
              0.36457852 = fieldWeight in 2895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=2895)
        0.28 = coord(7/25)
    
  3. Tan, X.; Luo, X.; Wang, X.; Wang, H.; Hou, X.: Representation and display of digital images of cultural heritage : a semantic enrichment approach (2021) 0.13
    0.13094538 = sum of:
      0.13094538 = product of:
        0.5456058 = sum of:
          0.0483274 = weight(abstract_txt:contextual in 455) [ClassicSimilarity], result of:
            0.0483274 = score(doc=455,freq=1.0), product of:
              0.12131367 = queryWeight, product of:
                1.3838234 = boost
                6.373877 = idf(docFreq=204, maxDocs=44218)
                0.013753885 = queryNorm
              0.39836732 = fieldWeight in 455, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.373877 = idf(docFreq=204, maxDocs=44218)
                0.0625 = fieldNorm(doc=455)
          0.09146643 = weight(abstract_txt:annotation in 455) [ClassicSimilarity], result of:
            0.09146643 = score(doc=455,freq=2.0), product of:
              0.14732572 = queryWeight, product of:
                1.5249833 = boost
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.013753885 = queryNorm
              0.6208449 = fieldWeight in 455, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0240583 = idf(docFreq=106, maxDocs=44218)
                0.0625 = fieldNorm(doc=455)
          0.06419891 = weight(abstract_txt:recognition in 455) [ClassicSimilarity], result of:
            0.06419891 = score(doc=455,freq=1.0), product of:
              0.16781457 = queryWeight, product of:
                1.9933623 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.013753885 = queryNorm
              0.38255864 = fieldWeight in 455, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.0625 = fieldNorm(doc=455)
          0.13051167 = weight(abstract_txt:entity in 455) [ClassicSimilarity], result of:
            0.13051167 = score(doc=455,freq=2.0), product of:
              0.23525941 = queryWeight, product of:
                2.7252998 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.013753885 = queryNorm
              0.5547564 = fieldWeight in 455, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0625 = fieldNorm(doc=455)
          0.11849182 = weight(abstract_txt:named in 455) [ClassicSimilarity], result of:
            0.11849182 = score(doc=455,freq=1.0), product of:
              0.2779178 = queryWeight, product of:
                2.9620948 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.013753885 = queryNorm
              0.42635563 = fieldWeight in 455, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.0625 = fieldNorm(doc=455)
          0.092609495 = weight(abstract_txt:entities in 455) [ClassicSimilarity], result of:
            0.092609495 = score(doc=455,freq=1.0), product of:
              0.25401798 = queryWeight, product of:
                3.1661248 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.013753885 = queryNorm
              0.36457852 = fieldWeight in 455, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=455)
        0.24 = coord(6/25)
    
  4. Alahmari, F.; Thom, J.A.; Magee, L.: ¬A model for ranking entity attributes using DBpedia (2014) 0.11
    0.108014524 = sum of:
      0.108014524 = product of:
        0.5400726 = sum of:
          0.02320651 = weight(abstract_txt:type in 1623) [ClassicSimilarity], result of:
            0.02320651 = score(doc=1623,freq=1.0), product of:
              0.074391045 = queryWeight, product of:
                1.0836427 = boost
                4.991248 = idf(docFreq=816, maxDocs=44218)
                0.013753885 = queryNorm
              0.311953 = fieldWeight in 1623, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.991248 = idf(docFreq=816, maxDocs=44218)
                0.0625 = fieldNorm(doc=1623)
          0.067719676 = weight(abstract_txt:attribute in 1623) [ClassicSimilarity], result of:
            0.067719676 = score(doc=1623,freq=1.0), product of:
              0.15191151 = queryWeight, product of:
                1.5485353 = boost
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.013753885 = queryNorm
              0.44578367 = fieldWeight in 1623, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.132539 = idf(docFreq=95, maxDocs=44218)
                0.0625 = fieldNorm(doc=1623)
          0.11181978 = weight(abstract_txt:attributes in 1623) [ClassicSimilarity], result of:
            0.11181978 = score(doc=1623,freq=3.0), product of:
              0.1684418 = queryWeight, product of:
                1.9970841 = boost
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.013753885 = queryNorm
              0.66384816 = fieldWeight in 1623, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.0625 = fieldNorm(doc=1623)
          0.20635706 = weight(abstract_txt:entity in 1623) [ClassicSimilarity], result of:
            0.20635706 = score(doc=1623,freq=5.0), product of:
              0.23525941 = queryWeight, product of:
                2.7252998 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.013753885 = queryNorm
              0.8771469 = fieldWeight in 1623, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0625 = fieldNorm(doc=1623)
          0.1309696 = weight(abstract_txt:entities in 1623) [ClassicSimilarity], result of:
            0.1309696 = score(doc=1623,freq=2.0), product of:
              0.25401798 = queryWeight, product of:
                3.1661248 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.013753885 = queryNorm
              0.51559186 = fieldWeight in 1623, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0625 = fieldNorm(doc=1623)
        0.2 = coord(5/25)
    
  5. Nagy T., I.: Detecting multiword expressions and named entities in natural language texts (2014) 0.11
    0.10592135 = sum of:
      0.10592135 = product of:
        0.44133896 = sum of:
          0.0471491 = weight(abstract_txt:texts in 1536) [ClassicSimilarity], result of:
            0.0471491 = score(doc=1536,freq=5.0), product of:
              0.09546695 = queryWeight, product of:
                1.2275871 = boost
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.013753885 = queryNorm
              0.49387878 = fieldWeight in 1536, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.6542544 = idf(docFreq=420, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
          0.03251031 = weight(abstract_txt:proper in 1536) [ClassicSimilarity], result of:
            0.03251031 = score(doc=1536,freq=1.0), product of:
              0.12741137 = queryWeight, product of:
                1.4181751 = boost
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.013753885 = queryNorm
              0.2551602 = fieldWeight in 1536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.532101 = idf(docFreq=174, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
          0.04012432 = weight(abstract_txt:recognition in 1536) [ClassicSimilarity], result of:
            0.04012432 = score(doc=1536,freq=1.0), product of:
              0.16781457 = queryWeight, product of:
                1.9933623 = boost
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.013753885 = queryNorm
              0.23909914 = fieldWeight in 1536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1209383 = idf(docFreq=263, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
          0.057678554 = weight(abstract_txt:entity in 1536) [ClassicSimilarity], result of:
            0.057678554 = score(doc=1536,freq=1.0), product of:
              0.23525941 = queryWeight, product of:
                2.7252998 = boost
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.013753885 = queryNorm
              0.24517001 = fieldWeight in 1536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2763524 = idf(docFreq=225, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
          0.14811479 = weight(abstract_txt:named in 1536) [ClassicSimilarity], result of:
            0.14811479 = score(doc=1536,freq=4.0), product of:
              0.2779178 = queryWeight, product of:
                2.9620948 = boost
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.013753885 = queryNorm
              0.53294456 = fieldWeight in 1536, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.82169 = idf(docFreq=130, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
          0.11576187 = weight(abstract_txt:entities in 1536) [ClassicSimilarity], result of:
            0.11576187 = score(doc=1536,freq=4.0), product of:
              0.25401798 = queryWeight, product of:
                3.1661248 = boost
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.013753885 = queryNorm
              0.45572314 = fieldWeight in 1536, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8332562 = idf(docFreq=351, maxDocs=44218)
                0.0390625 = fieldNorm(doc=1536)
        0.24 = coord(6/25)