Document (#29395)

Author
Galvez, C.
Moya-Anegón, F. de
Solana, V.H.
Title
Term conflation methods in information retrieval : non-linguistic and linguistic approaches
Source
Journal of documentation. 61(2005) no.4, pp.520-547
Year
2005
Abstract
Purpose - To propose a categorization of the different conflation procedures into two basic approaches, non-linguistic and linguistic techniques, and to justify the application of normalization methods within the framework of linguistic techniques. Design/methodology/approach - Presents a range of term conflation methods that can be used in information retrieval. Uniterm and multiterm variants can be considered equivalent units for the purposes of automatic indexing. Stemming algorithms, segmentation rules, association measures and clustering techniques are well-evaluated non-linguistic methods, and experiments with these techniques show a wide variety of results. Alternatively, lemmatisation and the use of syntactic pattern-matching, through equivalence relations represented in finite-state transducers (FST), are emerging methods for the recognition and standardization of terms. Findings - The survey attempts to point out the positive and negative effects of the linguistic approach and its potential as a term conflation method. Originality/value - Outlines the importance of FSTs for the normalization of term variants.
Footnote
See also: http://www.emeraldinsight.com/10.1108/00220410510607507
Theme
Computational linguistics
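
The contrast the abstract draws between non-linguistic conflation (stemming) and linguistic conflation (lemmatisation) can be illustrated with a toy sketch. Everything below is an assumption made for illustration: the suffix list, the lemma table and the function names are hypothetical and are not taken from the paper, and a plain dictionary lookup stands in for the finite-state transducers the authors discuss.

```python
# Toy illustration only: not the method of the paper.
SUFFIXES = ("ations", "ation", "ing", "es", "s")  # hypothetical suffix list

def naive_stem(word):
    """Non-linguistic conflation: strip the first matching suffix."""
    w = word.lower()
    for suffix in SUFFIXES:
        if w.endswith(suffix) and len(w) - len(suffix) >= 3:
            return w[: -len(suffix)]
    return w

# Hypothetical lemma table; a real system would use a lexicon or an FST.
LEMMAS = {"indexes": "index", "indices": "index", "indexing": "index"}

def lemmatise(word):
    """Linguistic conflation: map a surface form to its dictionary lemma."""
    w = word.lower()
    return LEMMAS.get(w, w)

def conflate(terms, normalise):
    """Group term variants that share the same normalised form."""
    groups = {}
    for term in terms:
        groups.setdefault(normalise(term), []).append(term)
    return groups

variants = ["indexes", "indices", "indexing"]
stem_groups = conflate(variants, naive_stem)
lemma_groups = conflate(variants, lemmatise)
print(stem_groups)
print(lemma_groups)
```

In this sketch the suffix stripper splits the irregular plural "indices" from the other two variants, while the lemma lookup places all three in one group. That is the kind of difference between the two families of methods that the survey evaluates.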

Similar documents (author)

  1. Herrero-Solana, V.; Moya Anegón, F. de: Graphical Table of Contents (GTOC) for library collections : the application of UDC codes for the subject maps (2003) 5.65
    5.649338 = sum of:
      5.649338 = sum of:
        1.5363436 = weight(author_txt:moya in 3758) [ClassicSimilarity], result of:
          1.5363436 = score(doc=3758,freq=1.0), product of:
            0.50114524 = queryWeight, product of:
              8.175107 = idf(docFreq=33, maxDocs=44421)
              0.061301365 = queryNorm
            3.0656652 = fieldWeight in 3758, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.175107 = idf(docFreq=33, maxDocs=44421)
              0.375 = fieldNorm(doc=3758)
        1.6079949 = weight(author_txt:anegón in 3758) [ClassicSimilarity], result of:
          1.6079949 = score(doc=3758,freq=1.0), product of:
            0.51660806 = queryWeight, product of:
              1.0153103 = boost
              8.30027 = idf(docFreq=29, maxDocs=44421)
              0.061301365 = queryNorm
            3.1126013 = fieldWeight in 3758, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.30027 = idf(docFreq=29, maxDocs=44421)
              0.375 = fieldNorm(doc=3758)
        2.5049992 = weight(author_txt:solana in 3758) [ClassicSimilarity], result of:
          2.5049992 = score(doc=3758,freq=1.0), product of:
            0.6942402 = queryWeight, product of:
              1.1769909 = boost
              9.622026 = idf(docFreq=7, maxDocs=44421)
              0.061301365 = queryNorm
            3.60826 = fieldWeight in 3758, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.622026 = idf(docFreq=7, maxDocs=44421)
              0.375 = fieldNorm(doc=3758)
    
  2. Guerrero-Bote, V.P.; Moya Anegón, F. de; Herrero Solana, V.: Document organization using Kohonen's algorithm (2002) 4.71
    4.707781 = sum of:
      4.707781 = sum of:
        1.2802862 = weight(author_txt:moya in 3564) [ClassicSimilarity], result of:
          1.2802862 = score(doc=3564,freq=1.0), product of:
            0.50114524 = queryWeight, product of:
              8.175107 = idf(docFreq=33, maxDocs=44421)
              0.061301365 = queryNorm
            2.5547209 = fieldWeight in 3564, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.175107 = idf(docFreq=33, maxDocs=44421)
              0.3125 = fieldNorm(doc=3564)
        1.3399957 = weight(author_txt:anegón in 3564) [ClassicSimilarity], result of:
          1.3399957 = score(doc=3564,freq=1.0), product of:
            0.51660806 = queryWeight, product of:
              1.0153103 = boost
              8.30027 = idf(docFreq=29, maxDocs=44421)
              0.061301365 = queryNorm
            2.5938344 = fieldWeight in 3564, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.30027 = idf(docFreq=29, maxDocs=44421)
              0.3125 = fieldNorm(doc=3564)
        2.0874991 = weight(author_txt:solana in 3564) [ClassicSimilarity], result of:
          2.0874991 = score(doc=3564,freq=1.0), product of:
            0.6942402 = queryWeight, product of:
              1.1769909 = boost
              9.622026 = idf(docFreq=7, maxDocs=44421)
              0.061301365 = queryNorm
            3.0068831 = fieldWeight in 3564, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.622026 = idf(docFreq=7, maxDocs=44421)
              0.3125 = fieldNorm(doc=3564)
    
  3. Moya-Anegón, F. de; Vargas-Quesada, B.; Chinchilla-Rodríguez, Z.; Corera-Álvarez, E.; Muñoz-Fernández, F.J.; Herrero-Solana, V.; SCImago Group: Visualizing the marrow of science (2007) 2.82
    2.824669 = sum of:
      2.824669 = sum of:
        0.7681718 = weight(author_txt:moya in 2313) [ClassicSimilarity], result of:
          0.7681718 = score(doc=2313,freq=1.0), product of:
            0.50114524 = queryWeight, product of:
              8.175107 = idf(docFreq=33, maxDocs=44421)
              0.061301365 = queryNorm
            1.5328326 = fieldWeight in 2313, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.175107 = idf(docFreq=33, maxDocs=44421)
              0.1875 = fieldNorm(doc=2313)
        0.80399746 = weight(author_txt:anegón in 2313) [ClassicSimilarity], result of:
          0.80399746 = score(doc=2313,freq=1.0), product of:
            0.51660806 = queryWeight, product of:
              1.0153103 = boost
              8.30027 = idf(docFreq=29, maxDocs=44421)
              0.061301365 = queryNorm
            1.5563006 = fieldWeight in 2313, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.30027 = idf(docFreq=29, maxDocs=44421)
              0.1875 = fieldNorm(doc=2313)
        1.2524996 = weight(author_txt:solana in 2313) [ClassicSimilarity], result of:
          1.2524996 = score(doc=2313,freq=1.0), product of:
            0.6942402 = queryWeight, product of:
              1.1769909 = boost
              9.622026 = idf(docFreq=7, maxDocs=44421)
              0.061301365 = queryNorm
            1.80413 = fieldWeight in 2313, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.622026 = idf(docFreq=7, maxDocs=44421)
              0.1875 = fieldNorm(doc=2313)
    
  4. Herrero Solana, V.; Moya Anegon, F. de: Bibliographic displays of Web-based OPACs : multivariate analysis applied to Latin-American catalogues (2001) 2.69
    2.6942286 = sum of:
      2.6942286 = product of:
        4.0413427 = sum of:
          1.5363436 = weight(author_txt:moya in 143) [ClassicSimilarity], result of:
            1.5363436 = score(doc=143,freq=1.0), product of:
              0.50114524 = queryWeight, product of:
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.061301365 = queryNorm
              3.0656652 = fieldWeight in 143, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.375 = fieldNorm(doc=143)
          2.5049992 = weight(author_txt:solana in 143) [ClassicSimilarity], result of:
            2.5049992 = score(doc=143,freq=1.0), product of:
              0.6942402 = queryWeight, product of:
                1.1769909 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.061301365 = queryNorm
              3.60826 = fieldWeight in 143, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.375 = fieldNorm(doc=143)
        0.6666667 = coord(2/3)
    
  5. Anegón, F. de Moya -> Moya Anegón, F. de: 2.47
    2.4704256 = sum of:
      2.4704256 = product of:
        3.7056384 = sum of:
          1.8105981 = weight(author_txt:moya in 3523) [ClassicSimilarity], result of:
            1.8105981 = score(doc=3523,freq=2.0), product of:
              0.50114524 = queryWeight, product of:
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.061301365 = queryNorm
              3.612921 = fieldWeight in 3523, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.3125 = fieldNorm(doc=3523)
          1.8950402 = weight(author_txt:anegón in 3523) [ClassicSimilarity], result of:
            1.8950402 = score(doc=3523,freq=2.0), product of:
              0.51660806 = queryWeight, product of:
                1.0153103 = boost
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.061301365 = queryNorm
              3.6682358 = fieldWeight in 3523, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.30027 = idf(docFreq=29, maxDocs=44421)
                0.3125 = fieldNorm(doc=3523)
        0.6666667 = coord(2/3)
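
The per-term numbers in the score trees above can be reproduced by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity (TF-IDF) formulas idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq). The function names are illustrative and are not Lucene API calls. The inputs are copied from the listing for documents 3758 and 3523.

```python
import math

def idf(doc_freq, max_docs):
    """Inverse document frequency, as printed in the idf(...) lines."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def tf(freq):
    """Term-frequency factor, as printed in the tf(...) lines."""
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """score = queryWeight * fieldWeight for a single query term."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm       # "queryWeight" line
    field_weight = tf(freq) * term_idf * field_norm    # "fieldWeight" line
    return query_weight * field_weight

MAX_DOCS = 44421
QUERY_NORM = 0.061301365  # queryNorm of the author query, from the listing

# author_txt:moya in doc 3758 (no boost line, so boost = 1.0)
moya_3758 = term_score(1.0, 33, MAX_DOCS, QUERY_NORM, 0.375)
# author_txt:solana in doc 3758 (boost 1.1769909)
solana_3758 = term_score(1.0, 7, MAX_DOCS, QUERY_NORM, 0.375, boost=1.1769909)
# author_txt:moya in doc 3523 (freq = 2.0, fieldNorm 0.3125)
moya_3523 = term_score(2.0, 33, MAX_DOCS, QUERY_NORM, 0.3125)

# These should agree with 1.5363436, 2.5049992 and 1.8105981 in the
# listing up to single-precision rounding.
print(moya_3758, solana_3758, moya_3523)
```

Because idf enters both queryWeight and fieldWeight, a rare name such as "solana" (docFreq=7) contributes more to the total than the more common "moya" (docFreq=33), even though each occurs once in the field.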
    

Similar documents (content)

  1. Galvez, C.; Moya-Anegón, F. de: An evaluation of conflation accuracy using finite-state transducers (2006) 0.58
    0.584755 = sum of:
      0.584755 = product of:
        1.6243194 = sum of:
          0.02115322 = weight(abstract_txt:retrieval in 599) [ClassicSimilarity], result of:
            0.02115322 = score(doc=599,freq=3.0), product of:
              0.044965934 = queryWeight, product of:
                1.1176968 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.011572239 = queryNorm
              0.4704277 = fieldWeight in 599, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=599)
          0.015219572 = weight(abstract_txt:approach in 599) [ClassicSimilarity], result of:
            0.015219572 = score(doc=599,freq=1.0), product of:
              0.05207245 = queryWeight, product of:
                1.20278 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.011572239 = queryNorm
              0.29227686 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=599)
          0.10571424 = weight(abstract_txt:finite in 599) [ClassicSimilarity], result of:
            0.10571424 = score(doc=599,freq=1.0), product of:
              0.15045919 = queryWeight, product of:
                1.4456946 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.011572239 = queryNorm
              0.70261073 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.078125 = fieldNorm(doc=599)
          0.122465186 = weight(abstract_txt:variants in 599) [ClassicSimilarity], result of:
            0.122465186 = score(doc=599,freq=1.0), product of:
              0.20909716 = queryWeight, product of:
                2.4102175 = boost
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.011572239 = queryNorm
              0.58568555 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.078125 = fieldNorm(doc=599)
          0.17977588 = weight(abstract_txt:normalization in 599) [ClassicSimilarity], result of:
            0.17977588 = score(doc=599,freq=2.0), product of:
              0.21436341 = queryWeight, product of:
                2.4403803 = boost
                7.590594 = idf(docFreq=60, maxDocs=44421)
                0.011572239 = queryNorm
              0.83865 = fieldWeight in 599, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.590594 = idf(docFreq=60, maxDocs=44421)
                0.078125 = fieldNorm(doc=599)
          0.06407772 = weight(abstract_txt:term in 599) [ClassicSimilarity], result of:
            0.06407772 = score(doc=599,freq=1.0), product of:
              0.17106234 = queryWeight, product of:
                3.0830061 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.011572239 = queryNorm
              0.37458694 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.078125 = fieldNorm(doc=599)
          0.05169192 = weight(abstract_txt:methods in 599) [ClassicSimilarity], result of:
            0.05169192 = score(doc=599,freq=1.0), product of:
              0.15968648 = queryWeight, product of:
                3.3303223 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.011572239 = queryNorm
              0.3237088 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.078125 = fieldNorm(doc=599)
          0.86443514 = weight(abstract_txt:conflation in 599) [ClassicSimilarity], result of:
            0.86443514 = score(doc=599,freq=3.0), product of:
              0.6721469 = queryWeight, product of:
                6.1112394 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.011572239 = queryNorm
              1.2860806 = fieldWeight in 599, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.078125 = fieldNorm(doc=599)
          0.19978653 = weight(abstract_txt:linguistic in 599) [ClassicSimilarity], result of:
            0.19978653 = score(doc=599,freq=1.0), product of:
              0.43995324 = queryWeight, product of:
                6.540628 = boost
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.011572239 = queryNorm
              0.45410857 = fieldWeight in 599, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.078125 = fieldNorm(doc=599)
        0.36 = coord(9/25)
    
  2. Galvez, C.; Moya-Anegón, F.: Approximate personal name-matching through finite-state graphs (2007) 0.30
    0.30441353 = sum of:
      0.30441353 = product of:
        0.95129234 = sum of:
          0.009770255 = weight(abstract_txt:retrieval in 1614) [ClassicSimilarity], result of:
            0.009770255 = score(doc=1614,freq=1.0), product of:
              0.044965934 = queryWeight, product of:
                1.1176968 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.011572239 = queryNorm
              0.21728125 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
          0.012175659 = weight(abstract_txt:approach in 1614) [ClassicSimilarity], result of:
            0.012175659 = score(doc=1614,freq=1.0), product of:
              0.05207245 = queryWeight, product of:
                1.20278 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.011572239 = queryNorm
              0.2338215 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
          0.08587949 = weight(abstract_txt:equivalence in 1614) [ClassicSimilarity], result of:
            0.08587949 = score(doc=1614,freq=3.0), product of:
              0.10539555 = queryWeight, product of:
                1.2099804 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.011572239 = queryNorm
              0.81483036 = fieldWeight in 1614, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
          0.14648195 = weight(abstract_txt:finite in 1614) [ClassicSimilarity], result of:
            0.14648195 = score(doc=1614,freq=3.0), product of:
              0.15045919 = queryWeight, product of:
                1.4456946 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.011572239 = queryNorm
              0.973566 = fieldWeight in 1614, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
          0.1959443 = weight(abstract_txt:variants in 1614) [ClassicSimilarity], result of:
            0.1959443 = score(doc=1614,freq=4.0), product of:
              0.20909716 = queryWeight, product of:
                2.4102175 = boost
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.011572239 = queryNorm
              0.9370969 = fieldWeight in 1614, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
          0.043292496 = weight(abstract_txt:techniques in 1614) [ClassicSimilarity], result of:
            0.043292496 = score(doc=1614,freq=1.0), product of:
              0.15283804 = queryWeight, product of:
                2.9141567 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.011572239 = queryNorm
              0.28325734 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
          0.058482725 = weight(abstract_txt:methods in 1614) [ClassicSimilarity], result of:
            0.058482725 = score(doc=1614,freq=2.0), product of:
              0.15968648 = queryWeight, product of:
                3.3303223 = boost
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.011572239 = queryNorm
              0.3662347 = fieldWeight in 1614, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1434727 = idf(docFreq=1915, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
          0.39926547 = weight(abstract_txt:conflation in 1614) [ClassicSimilarity], result of:
            0.39926547 = score(doc=1614,freq=1.0), product of:
              0.6721469 = queryWeight, product of:
                6.1112394 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.011572239 = queryNorm
              0.5940152 = fieldWeight in 1614, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.0625 = fieldNorm(doc=1614)
        0.32 = coord(8/25)
    
  3. Mustafa, S.H.; Al-Radaideh, Q.A.: Using n-grams for Arabic text searching (2004) 0.23
    0.23199792 = sum of:
      0.23199792 = product of:
        1.1599896 = sum of:
          0.017271534 = weight(abstract_txt:retrieval in 3888) [ClassicSimilarity], result of:
            0.017271534 = score(doc=3888,freq=2.0), product of:
              0.044965934 = queryWeight, product of:
                1.1176968 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.011572239 = queryNorm
              0.3841026 = fieldWeight in 3888, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=3888)
          0.026361074 = weight(abstract_txt:approach in 3888) [ClassicSimilarity], result of:
            0.026361074 = score(doc=3888,freq=3.0), product of:
              0.05207245 = queryWeight, product of:
                1.20278 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.011572239 = queryNorm
              0.5062384 = fieldWeight in 3888, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=3888)
          0.05411562 = weight(abstract_txt:techniques in 3888) [ClassicSimilarity], result of:
            0.05411562 = score(doc=3888,freq=1.0), product of:
              0.15283804 = queryWeight, product of:
                2.9141567 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.011572239 = queryNorm
              0.35407168 = fieldWeight in 3888, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.078125 = fieldNorm(doc=3888)
          0.06407772 = weight(abstract_txt:term in 3888) [ClassicSimilarity], result of:
            0.06407772 = score(doc=3888,freq=1.0), product of:
              0.17106234 = queryWeight, product of:
                3.0830061 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.011572239 = queryNorm
              0.37458694 = fieldWeight in 3888, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.078125 = fieldNorm(doc=3888)
          0.99816364 = weight(abstract_txt:conflation in 3888) [ClassicSimilarity], result of:
            0.99816364 = score(doc=3888,freq=4.0), product of:
              0.6721469 = queryWeight, product of:
                6.1112394 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.011572239 = queryNorm
              1.4850379 = fieldWeight in 3888, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.078125 = fieldNorm(doc=3888)
        0.2 = coord(5/25)
    
  4. Willett, P.: Best-match text retrieval (1993) 0.16
    0.16114892 = sum of:
      0.16114892 = product of:
        1.0071808 = sum of:
          0.01954051 = weight(abstract_txt:retrieval in 7817) [ClassicSimilarity], result of:
            0.01954051 = score(doc=7817,freq=1.0), product of:
              0.044965934 = queryWeight, product of:
                1.1176968 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.011572239 = queryNorm
              0.4345625 = fieldWeight in 7817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.125 = fieldNorm(doc=7817)
          0.08658499 = weight(abstract_txt:techniques in 7817) [ClassicSimilarity], result of:
            0.08658499 = score(doc=7817,freq=1.0), product of:
              0.15283804 = queryWeight, product of:
                2.9141567 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.011572239 = queryNorm
              0.5665147 = fieldWeight in 7817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.125 = fieldNorm(doc=7817)
          0.10252435 = weight(abstract_txt:term in 7817) [ClassicSimilarity], result of:
            0.10252435 = score(doc=7817,freq=1.0), product of:
              0.17106234 = queryWeight, product of:
                3.0830061 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.011572239 = queryNorm
              0.5993391 = fieldWeight in 7817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.125 = fieldNorm(doc=7817)
          0.79853094 = weight(abstract_txt:conflation in 7817) [ClassicSimilarity], result of:
            0.79853094 = score(doc=7817,freq=1.0), product of:
              0.6721469 = queryWeight, product of:
                6.1112394 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.011572239 = queryNorm
              1.1880304 = fieldWeight in 7817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.125 = fieldNorm(doc=7817)
        0.16 = coord(4/25)
    
  5. Jacquemin, C.: What is the tree that we see through the window : a linguistic approach to windowing and term variation (1996) 0.15
    0.15053517 = sum of:
      0.15053517 = product of:
        0.62722987 = sum of:
          0.04858209 = weight(abstract_txt:syntactic in 5646) [ClassicSimilarity], result of:
            0.04858209 = score(doc=5646,freq=1.0), product of:
              0.07934623 = queryWeight, product of:
                1.0498575 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.011572239 = queryNorm
              0.6122797 = fieldWeight in 5646, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.09375 = fieldNorm(doc=5646)
          0.018263487 = weight(abstract_txt:approach in 5646) [ClassicSimilarity], result of:
            0.018263487 = score(doc=5646,freq=1.0), product of:
              0.05207245 = queryWeight, product of:
                1.20278 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.011572239 = queryNorm
              0.35073224 = fieldWeight in 5646, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.09375 = fieldNorm(doc=5646)
          0.14695823 = weight(abstract_txt:variants in 5646) [ClassicSimilarity], result of:
            0.14695823 = score(doc=5646,freq=1.0), product of:
              0.20909716 = queryWeight, product of:
                2.4102175 = boost
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.011572239 = queryNorm
              0.7028227 = fieldWeight in 5646, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.09375 = fieldNorm(doc=5646)
          0.06493874 = weight(abstract_txt:techniques in 5646) [ClassicSimilarity], result of:
            0.06493874 = score(doc=5646,freq=1.0), product of:
              0.15283804 = queryWeight, product of:
                2.9141567 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.011572239 = queryNorm
              0.424886 = fieldWeight in 5646, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.09375 = fieldNorm(doc=5646)
          0.10874349 = weight(abstract_txt:term in 5646) [ClassicSimilarity], result of:
            0.10874349 = score(doc=5646,freq=2.0), product of:
              0.17106234 = queryWeight, product of:
                3.0830061 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.011572239 = queryNorm
              0.6356951 = fieldWeight in 5646, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.09375 = fieldNorm(doc=5646)
          0.23974384 = weight(abstract_txt:linguistic in 5646) [ClassicSimilarity], result of:
            0.23974384 = score(doc=5646,freq=1.0), product of:
              0.43995324 = queryWeight, product of:
                6.540628 = boost
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.011572239 = queryNorm
              0.5449303 = fieldWeight in 5646, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.09375 = fieldNorm(doc=5646)
        0.24 = coord(6/25)
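
The top-level score of each hit is the sum of its matching term weights multiplied by the coordination factor coord(matched/total), which penalizes documents that match only some of the query terms. The sketch below recombines term weights copied from the listing. The function name `combine` is hypothetical, and the reading that trees without a coord line have a factor of 1 (all clauses matched) is an assumption.

```python
def combine(term_scores, matched, total):
    """Sum the per-term weights and apply the coord(matched/total) factor."""
    coord = matched / total
    return sum(term_scores) * coord

# Content hit 4 (doc 7817): 4 of the 25 query terms matched.
doc_7817 = combine([0.01954051, 0.08658499, 0.10252435, 0.79853094], 4, 25)

# Author hit 4 (doc 143): "moya" and "solana" matched, "anegón" did not.
doc_143 = combine([1.5363436, 2.5049992], 2, 3)

# Expected to agree with 0.16114892 and 2.6942286 in the listing,
# up to rounding of the printed inputs.
print(doc_7817, doc_143)
```

This also shows why document 143 ranks below documents 3758 and 3564 in the author list even though its two matching terms score as high as in document 3758: the missing third term reduces the sum by a factor of 2/3.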