Document (#27678)

Author
Jones, I.
Cunliffe, D.
Tudhope, D.
Title
Natural language processing and knowledge organization systems as an aid to retrieval
Source
Knowledge organization and the global information society: Proceedings of the 8th International ISKO Conference 13-16 July 2004, London, UK. Ed.: I.C. McIlwaine
Imprint
Würzburg : Ergon Verlag
Year
2004
Pages
S.351-356
Series
Advances in knowledge organization; vol.9
Abstract
This paper discusses research that employs methods from Natural Language Processing (NLP) in exploiting the intellectual resources of Knowledge Organization Systems (KOS), particularly in the retrieval of information. A technique for the disambiguation of homographs and nominal compounds in free text, where these are known ambiguous terms in the KOS itself, is described. The use of Roget's Thesaurus as an intermediary in the process is also reported. A short review of the relevant literature in the field is given. Design considerations, results and conclusions are presented from the implementation of a prototype system. The linguistic techniques are applied at two complementary levels, namely an a free text string used as an entry point to the KOS, and an the underlying controlled vocabulary itself.
Content
1. Introduction The need for research into the application of linguistic techniques in Information Retrieval (IR) in general, and a similar need in faceted Knowledge Organization Systems (KOS) has been indicated by various authors. Smeaton (1997) points out the inherent limitations of conventional approaches to IR based an "bags of words", mainly difficulties caused by lexical ambiguity in the words concerned, and goes an to suggest the possibility of using Natural Language Processing (NLP) in query formulation. Past experience with a faceted retrieval system highlighted the need for integrating the linguistic perspective in order to fully utilise the potential of a KOS (Tudhope et al." 2002). The present research seeks to address some of these needs in using NLP to improve the efficacy of KOS tools in query and retrieval systems. Syntactic parsing and part-of-speech tagging can substantially reduce lexical ambiguity through homograph disambiguation. Given the two strings "1 fable the motion" and "I put the motion an the fable", for instance, the parser used in this research clearly indicates that 'fable' in the first string is a verb, while 'table' in the second string is a noun, a distinction that would be missed in the "bag of words" approach. This syntactic disambiguation enables a more precise matching from free text to the controlled vocabulary of a KOS and vice versa. The use of a general linguistic resource, namely Roget's Thesaurus of English Words and Phrases (RTEWP), as an intermediary in this process, is investigated. The adaptation of the Link parser (Sleator & Temperley, 1993) to the purposes of the research is reported. The design and implementation of the early practical stages of the project are described, and the results of the initial experiments are presented and evaluated. Applications of the techniques developed are foreseen in the areas of query disambiguation, information retrieval and automatic indexing. In the first section of the paper a brief review of the literature and relevant current work in the field is presented. The second section includes reports an the development of algorithms, the construction of data sets and theoretical and experimental work undertaken to date. The third section evaluates the results obtained, and outlines directions for future research.
Theme
Computerlinguistik

Similar documents (author)

  1. Blocks, D.; Cunliffe, D.; Tudhope, D.: ¬A reference model for user-system interaction in thesaurus-based searching (2006) 2.86
    2.8561122 = sum of:
      2.8561122 = product of:
        4.2841682 = sum of:
          1.6344721 = weight(author_txt:tudhope in 327) [ClassicSimilarity], result of:
            1.6344721 = score(doc=327,freq=1.0), product of:
              0.5387264 = queryWeight, product of:
                1.1659038 = boost
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.057112023 = queryNorm
              3.033956 = fieldWeight in 327, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.375 = fieldNorm(doc=327)
          2.6496964 = weight(author_txt:cunliffe in 327) [ClassicSimilarity], result of:
            2.6496964 = score(doc=327,freq=1.0), product of:
              0.74344236 = queryWeight, product of:
                1.3696268 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.057112023 = queryNorm
              3.5640912 = fieldWeight in 327, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.375 = fieldNorm(doc=327)
        0.6666667 = coord(2/3)
    
  2. Tudhope, D.; Blocks, D.; Cunliffe, D.; Binding, C.: Query expansion via conceptual distance in thesaurus indexed collections (2006) 2.38
    2.3800936 = sum of:
      2.3800936 = product of:
        3.5701404 = sum of:
          1.3620602 = weight(author_txt:tudhope in 3215) [ClassicSimilarity], result of:
            1.3620602 = score(doc=3215,freq=1.0), product of:
              0.5387264 = queryWeight, product of:
                1.1659038 = boost
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.057112023 = queryNorm
              2.5282967 = fieldWeight in 3215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.3125 = fieldNorm(doc=3215)
          2.2080803 = weight(author_txt:cunliffe in 3215) [ClassicSimilarity], result of:
            2.2080803 = score(doc=3215,freq=1.0), product of:
              0.74344236 = queryWeight, product of:
                1.3696268 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.057112023 = queryNorm
              2.9700758 = fieldWeight in 3215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.3125 = fieldNorm(doc=3215)
        0.6666667 = coord(2/3)
    
  3. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: Compound descriptors in context : a matching function for classifications and thesauri (2002) 2.38
    2.3800936 = sum of:
      2.3800936 = product of:
        3.5701404 = sum of:
          1.3620602 = weight(author_txt:tudhope in 4179) [ClassicSimilarity], result of:
            1.3620602 = score(doc=4179,freq=1.0), product of:
              0.5387264 = queryWeight, product of:
                1.1659038 = boost
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.057112023 = queryNorm
              2.5282967 = fieldWeight in 4179, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.3125 = fieldNorm(doc=4179)
          2.2080803 = weight(author_txt:cunliffe in 4179) [ClassicSimilarity], result of:
            2.2080803 = score(doc=4179,freq=1.0), product of:
              0.74344236 = queryWeight, product of:
                1.3696268 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.057112023 = queryNorm
              2.9700758 = fieldWeight in 4179, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.3125 = fieldNorm(doc=4179)
        0.6666667 = coord(2/3)
    
  4. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 2.38
    2.3800936 = sum of:
      2.3800936 = product of:
        3.5701404 = sum of:
          1.3620602 = weight(author_txt:tudhope in 1175) [ClassicSimilarity], result of:
            1.3620602 = score(doc=1175,freq=1.0), product of:
              0.5387264 = queryWeight, product of:
                1.1659038 = boost
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.057112023 = queryNorm
              2.5282967 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.3125 = fieldNorm(doc=1175)
          2.2080803 = weight(author_txt:cunliffe in 1175) [ClassicSimilarity], result of:
            2.2080803 = score(doc=1175,freq=1.0), product of:
              0.74344236 = queryWeight, product of:
                1.3696268 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.057112023 = queryNorm
              2.9700758 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.3125 = fieldNorm(doc=1175)
        0.6666667 = coord(2/3)
    
  5. Tudhope, D.; Alani, H.; Jones, C.: Augmenting thesaurus relationships : possibilities for retrieval (2001) 1.78
    1.7771883 = sum of:
      1.7771883 = product of:
        2.6657825 = sum of:
          1.0313104 = weight(author_txt:jones in 2520) [ClassicSimilarity], result of:
            1.0313104 = score(doc=2520,freq=1.0), product of:
              0.39631712 = queryWeight, product of:
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.057112023 = queryNorm
              2.6022353 = fieldWeight in 2520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.939294 = idf(docFreq=116, maxDocs=44421)
                0.375 = fieldNorm(doc=2520)
          1.6344721 = weight(author_txt:tudhope in 2520) [ClassicSimilarity], result of:
            1.6344721 = score(doc=2520,freq=1.0), product of:
              0.5387264 = queryWeight, product of:
                1.1659038 = boost
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.057112023 = queryNorm
              3.033956 = fieldWeight in 2520, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.375 = fieldNorm(doc=2520)
        0.6666667 = coord(2/3)
    

Similar documents (content)

  1. Nagy T., I.: Detecting multiword expressions and named entities in natural language texts (2014) 0.26
    0.26262942 = sum of:
      0.26262942 = product of:
        0.72952616 = sum of:
          0.036958832 = weight(abstract_txt:namely in 2536) [ClassicSimilarity], result of:
            0.036958832 = score(doc=2536,freq=1.0), product of:
              0.149887 = queryWeight, product of:
                1.0655963 = boost
                6.312396 = idf(docFreq=218, maxDocs=44421)
                0.02228317 = queryNorm
              0.24657798 = fieldWeight in 2536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.312396 = idf(docFreq=218, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2536)
          0.012347874 = weight(abstract_txt:retrieval in 2536) [ClassicSimilarity], result of:
            0.012347874 = score(doc=2536,freq=1.0), product of:
              0.09092639 = queryWeight, product of:
                1.1737368 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.02228317 = queryNorm
              0.13580078 = fieldWeight in 2536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2536)
          0.013107684 = weight(abstract_txt:knowledge in 2536) [ClassicSimilarity], result of:
            0.013107684 = score(doc=2536,freq=1.0), product of:
              0.09461916 = queryWeight, product of:
                1.1973339 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.02228317 = queryNorm
              0.13853097 = fieldWeight in 2536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2536)
          0.019390725 = weight(abstract_txt:text in 2536) [ClassicSimilarity], result of:
            0.019390725 = score(doc=2536,freq=1.0), product of:
              0.1228451 = queryWeight, product of:
                1.3642837 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02228317 = queryNorm
              0.15784696 = fieldWeight in 2536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2536)
          0.04264933 = weight(abstract_txt:language in 2536) [ClassicSimilarity], result of:
            0.04264933 = score(doc=2536,freq=4.0), product of:
              0.13088301 = queryWeight, product of:
                1.4082099 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.02228317 = queryNorm
              0.3258584 = fieldWeight in 2536, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2536)
          0.21665004 = weight(abstract_txt:compounds in 2536) [ClassicSimilarity], result of:
            0.21665004 = score(doc=2536,freq=6.0), product of:
              0.26816815 = queryWeight, product of:
                1.4253265 = boost
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.02228317 = queryNorm
              0.80788875 = fieldWeight in 2536, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2536)
          0.27674216 = weight(abstract_txt:nominal in 2536) [ClassicSimilarity], result of:
            0.27674216 = score(doc=2536,freq=7.0), product of:
              0.29989508 = queryWeight, product of:
                1.5072851 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.02228317 = queryNorm
              0.9227966 = fieldWeight in 2536, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2536)
          0.03510539 = weight(abstract_txt:processing in 2536) [ClassicSimilarity], result of:
            0.03510539 = score(doc=2536,freq=1.0), product of:
              0.1824782 = queryWeight, product of:
                1.6627665 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.02228317 = queryNorm
              0.19238128 = fieldWeight in 2536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2536)
          0.076574124 = weight(abstract_txt:natural in 2536) [ClassicSimilarity], result of:
            0.076574124 = score(doc=2536,freq=4.0), product of:
              0.19334361 = queryWeight, product of:
                1.7115543 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.02228317 = queryNorm
              0.396052 = fieldWeight in 2536, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2536)
        0.36 = coord(9/25)
    
  2. Guglielmo, E.J.; Rowe, N.C.: Natural-language retrieval of images based on descriptive captions (1996) 0.22
    0.22336765 = sum of:
      0.22336765 = product of:
        0.6980239 = sum of:
          0.04887199 = weight(abstract_txt:prototype in 6692) [ClassicSimilarity], result of:
            0.04887199 = score(doc=6692,freq=1.0), product of:
              0.13200139 = queryWeight, product of:
                5.9238153 = idf(docFreq=322, maxDocs=44421)
                0.02228317 = queryNorm
              0.37023845 = fieldWeight in 6692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9238153 = idf(docFreq=322, maxDocs=44421)
                0.0625 = fieldNorm(doc=6692)
          0.019756598 = weight(abstract_txt:retrieval in 6692) [ClassicSimilarity], result of:
            0.019756598 = score(doc=6692,freq=1.0), product of:
              0.09092639 = queryWeight, product of:
                1.1737368 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.02228317 = queryNorm
              0.21728125 = fieldWeight in 6692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=6692)
          0.099652864 = weight(abstract_txt:ambiguous in 6692) [ClassicSimilarity], result of:
            0.099652864 = score(doc=6692,freq=1.0), product of:
              0.21225846 = queryWeight, product of:
                1.2680701 = boost
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.02228317 = queryNorm
              0.4694883 = fieldWeight in 6692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.0625 = fieldNorm(doc=6692)
          0.034119464 = weight(abstract_txt:language in 6692) [ClassicSimilarity], result of:
            0.034119464 = score(doc=6692,freq=1.0), product of:
              0.13088301 = queryWeight, product of:
                1.4082099 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.02228317 = queryNorm
              0.26068673 = fieldWeight in 6692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=6692)
          0.1415152 = weight(abstract_txt:compounds in 6692) [ClassicSimilarity], result of:
            0.1415152 = score(doc=6692,freq=1.0), product of:
              0.26816815 = queryWeight, product of:
                1.4253265 = boost
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.02228317 = queryNorm
              0.5277107 = fieldWeight in 6692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.0625 = fieldNorm(doc=6692)
          0.23667984 = weight(abstract_txt:nominal in 6692) [ClassicSimilarity], result of:
            0.23667984 = score(doc=6692,freq=2.0), product of:
              0.29989508 = queryWeight, product of:
                1.5072851 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.02228317 = queryNorm
              0.7892088 = fieldWeight in 6692, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.0625 = fieldNorm(doc=6692)
          0.056168623 = weight(abstract_txt:processing in 6692) [ClassicSimilarity], result of:
            0.056168623 = score(doc=6692,freq=1.0), product of:
              0.1824782 = queryWeight, product of:
                1.6627665 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.02228317 = queryNorm
              0.30781004 = fieldWeight in 6692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.0625 = fieldNorm(doc=6692)
          0.0612593 = weight(abstract_txt:natural in 6692) [ClassicSimilarity], result of:
            0.0612593 = score(doc=6692,freq=1.0), product of:
              0.19334361 = queryWeight, product of:
                1.7115543 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.02228317 = queryNorm
              0.3168416 = fieldWeight in 6692, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.0625 = fieldNorm(doc=6692)
        0.32 = coord(8/25)
    
  3. Taylor, S.L.: Integrating natural language understanding with document structure analysis (1994) 0.20
    0.1973407 = sum of:
      0.1973407 = product of:
        0.7047882 = sum of:
          0.07330798 = weight(abstract_txt:prototype in 1862) [ClassicSimilarity], result of:
            0.07330798 = score(doc=1862,freq=1.0), product of:
              0.13200139 = queryWeight, product of:
                5.9238153 = idf(docFreq=322, maxDocs=44421)
                0.02228317 = queryNorm
              0.5553577 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9238153 = idf(docFreq=322, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.0296349 = weight(abstract_txt:retrieval in 1862) [ClassicSimilarity], result of:
            0.0296349 = score(doc=1862,freq=1.0), product of:
              0.09092639 = queryWeight, product of:
                1.1737368 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.02228317 = queryNorm
              0.3259219 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.13051541 = weight(abstract_txt:employs in 1862) [ClassicSimilarity], result of:
            0.13051541 = score(doc=1862,freq=1.0), product of:
              0.19390345 = queryWeight, product of:
                1.2120025 = boost
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.02228317 = queryNorm
              0.67309487 = fieldWeight in 1862, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.08060573 = weight(abstract_txt:text in 1862) [ClassicSimilarity], result of:
            0.08060573 = score(doc=1862,freq=3.0), product of:
              0.1228451 = queryWeight, product of:
                1.3642837 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02228317 = queryNorm
              0.6561575 = fieldWeight in 1862, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.072378315 = weight(abstract_txt:language in 1862) [ClassicSimilarity], result of:
            0.072378315 = score(doc=1862,freq=2.0), product of:
              0.13088301 = queryWeight, product of:
                1.4082099 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.02228317 = queryNorm
              0.5530001 = fieldWeight in 1862, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.18839529 = weight(abstract_txt:processing in 1862) [ClassicSimilarity], result of:
            0.18839529 = score(doc=1862,freq=5.0), product of:
              0.1824782 = queryWeight, product of:
                1.6627665 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.02228317 = queryNorm
              1.0324262 = fieldWeight in 1862, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
          0.12995058 = weight(abstract_txt:natural in 1862) [ClassicSimilarity], result of:
            0.12995058 = score(doc=1862,freq=2.0), product of:
              0.19334361 = queryWeight, product of:
                1.7115543 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.02228317 = queryNorm
              0.6721225 = fieldWeight in 1862, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.09375 = fieldNorm(doc=1862)
        0.28 = coord(7/25)
    
  4. Köhler, J.; Philippi, S.; Specht, M.; Rüegg, A.: Ontology based text indexing and querying for the semantic web (2006) 0.20
    0.19601983 = sum of:
      0.19601983 = product of:
        0.5444995 = sum of:
          0.04887199 = weight(abstract_txt:prototype in 267) [ClassicSimilarity], result of:
            0.04887199 = score(doc=267,freq=1.0), product of:
              0.13200139 = queryWeight, product of:
                5.9238153 = idf(docFreq=322, maxDocs=44421)
                0.02228317 = queryNorm
              0.37023845 = fieldWeight in 267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9238153 = idf(docFreq=322, maxDocs=44421)
                0.0625 = fieldNorm(doc=267)
          0.02794005 = weight(abstract_txt:retrieval in 267) [ClassicSimilarity], result of:
            0.02794005 = score(doc=267,freq=2.0), product of:
              0.09092639 = queryWeight, product of:
                1.1737368 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.02228317 = queryNorm
              0.3072821 = fieldWeight in 267, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=267)
          0.08701028 = weight(abstract_txt:string in 267) [ClassicSimilarity], result of:
            0.08701028 = score(doc=267,freq=1.0), product of:
              0.19390345 = queryWeight, product of:
                1.2120025 = boost
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.02228317 = queryNorm
              0.44872993 = fieldWeight in 267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.179679 = idf(docFreq=91, maxDocs=44421)
                0.0625 = fieldNorm(doc=267)
          0.092667274 = weight(abstract_txt:disambiguation in 267) [ClassicSimilarity], result of:
            0.092667274 = score(doc=267,freq=1.0), product of:
              0.20221937 = queryWeight, product of:
                1.2377192 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.02228317 = queryNorm
              0.45825124 = fieldWeight in 267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0625 = fieldNorm(doc=267)
          0.053737152 = weight(abstract_txt:text in 267) [ClassicSimilarity], result of:
            0.053737152 = score(doc=267,freq=3.0), product of:
              0.1228451 = queryWeight, product of:
                1.3642837 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02228317 = queryNorm
              0.4374383 = fieldWeight in 267, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=267)
          0.034119464 = weight(abstract_txt:language in 267) [ClassicSimilarity], result of:
            0.034119464 = score(doc=267,freq=1.0), product of:
              0.13088301 = queryWeight, product of:
                1.4082099 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.02228317 = queryNorm
              0.26068673 = fieldWeight in 267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.0625 = fieldNorm(doc=267)
          0.056168623 = weight(abstract_txt:processing in 267) [ClassicSimilarity], result of:
            0.056168623 = score(doc=267,freq=1.0), product of:
              0.1824782 = queryWeight, product of:
                1.6627665 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.02228317 = queryNorm
              0.30781004 = fieldWeight in 267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.0625 = fieldNorm(doc=267)
          0.0612593 = weight(abstract_txt:natural in 267) [ClassicSimilarity], result of:
            0.0612593 = score(doc=267,freq=1.0), product of:
              0.19334361 = queryWeight, product of:
                1.7115543 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.02228317 = queryNorm
              0.3168416 = fieldWeight in 267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.0625 = fieldNorm(doc=267)
          0.0827254 = weight(abstract_txt:free in 267) [ClassicSimilarity], result of:
            0.0827254 = score(doc=267,freq=1.0), product of:
              0.23621513 = queryWeight, product of:
                1.8918191 = boost
                5.6033936 = idf(docFreq=444, maxDocs=44421)
                0.02228317 = queryNorm
              0.3502121 = fieldWeight in 267, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6033936 = idf(docFreq=444, maxDocs=44421)
                0.0625 = fieldNorm(doc=267)
        0.36 = coord(9/25)
    
  5. Chowdhury, G.G.: Natural language processing (2002) 0.18
    0.17935205 = sum of:
      0.17935205 = product of:
        0.5604752 = sum of:
          0.073917665 = weight(abstract_txt:namely in 5284) [ClassicSimilarity], result of:
            0.073917665 = score(doc=5284,freq=1.0), product of:
              0.149887 = queryWeight, product of:
                1.0655963 = boost
                6.312396 = idf(docFreq=218, maxDocs=44421)
                0.02228317 = queryNorm
              0.49315596 = fieldWeight in 5284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.312396 = idf(docFreq=218, maxDocs=44421)
                0.078125 = fieldNorm(doc=5284)
          0.032993056 = weight(abstract_txt:systems in 5284) [ClassicSimilarity], result of:
            0.032993056 = score(doc=5284,freq=2.0), product of:
              0.0875414 = queryWeight, product of:
                1.1516818 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.02228317 = queryNorm
              0.37688518 = fieldWeight in 5284, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.078125 = fieldNorm(doc=5284)
          0.024695748 = weight(abstract_txt:retrieval in 5284) [ClassicSimilarity], result of:
            0.024695748 = score(doc=5284,freq=1.0), product of:
              0.09092639 = queryWeight, product of:
                1.1737368 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.02228317 = queryNorm
              0.27160156 = fieldWeight in 5284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=5284)
          0.026215369 = weight(abstract_txt:knowledge in 5284) [ClassicSimilarity], result of:
            0.026215369 = score(doc=5284,freq=1.0), product of:
              0.09461916 = queryWeight, product of:
                1.1973339 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.02228317 = queryNorm
              0.27706194 = fieldWeight in 5284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.078125 = fieldNorm(doc=5284)
          0.05484525 = weight(abstract_txt:text in 5284) [ClassicSimilarity], result of:
            0.05484525 = score(doc=5284,freq=2.0), product of:
              0.1228451 = queryWeight, product of:
                1.3642837 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02228317 = queryNorm
              0.4464586 = fieldWeight in 5284, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=5284)
          0.09536679 = weight(abstract_txt:language in 5284) [ClassicSimilarity], result of:
            0.09536679 = score(doc=5284,freq=5.0), product of:
              0.13088301 = queryWeight, product of:
                1.4082099 = boost
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.02228317 = queryNorm
              0.7286415 = fieldWeight in 5284, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1709876 = idf(docFreq=1863, maxDocs=44421)
                0.078125 = fieldNorm(doc=5284)
          0.09929303 = weight(abstract_txt:processing in 5284) [ClassicSimilarity], result of:
            0.09929303 = score(doc=5284,freq=2.0), product of:
              0.1824782 = queryWeight, product of:
                1.6627665 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.02228317 = queryNorm
              0.5441364 = fieldWeight in 5284, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.078125 = fieldNorm(doc=5284)
          0.15314825 = weight(abstract_txt:natural in 5284) [ClassicSimilarity], result of:
            0.15314825 = score(doc=5284,freq=4.0), product of:
              0.19334361 = queryWeight, product of:
                1.7115543 = boost
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.02228317 = queryNorm
              0.792104 = fieldWeight in 5284, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.0694656 = idf(docFreq=758, maxDocs=44421)
                0.078125 = fieldNorm(doc=5284)
        0.32 = coord(8/25)