Document (#40043)

Author
Savoy, J.
Title
Text representation strategies : an example with the State of the union addresses
Source
Journal of the Association for Information Science and Technology. 67(2016) no.8, S.1858-1870
Year
2016
Abstract
Based on State of the Union addresses from 1790 to 2014 (225 speeches delivered by 42 presidents), this paper describes and evaluates different text representation strategies. To determine the most important words of a given text, the term frequencies (tf) or the tf?idf weighting scheme can be applied. Recently, latent Dirichlet allocation (LDA) has been proposed to define the topics included in a corpus. As another strategy, this study proposes to apply a vocabulary specificity measure (Z?score) to determine the most significantly overused word-types or short sequences of them. Our experiments show that the simple term frequency measure is not able to discriminate between specific terms associated with a document or a set of texts. Using the tf idf or LDA approach, the selection requires some arbitrary decisions. Based on the term-specific measure (Z?score), the term selection has a clear theoretical basis. Moreover, the most significant sentences for each presidency can be determined. As another facet, we can visualize the dynamic evolution of usage of some terms associated with their specificity measures. Finally, this technique can be employed to define the most important lexical leaders introducing terms overused by the k following presidencies.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23510/abstract.
Theme
Computerlinguistik

Similar documents (author)

  1. Savoy, J.: Stemming of French words based on grammatical categories (1993) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:savoy in 4649) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 4649, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=4649)
    
  2. Savoy, J.: Effectiveness of information retrieval systems used in a hypertext environment (1993) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:savoy in 6510) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 6510, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=6510)
    
  3. Savoy, J.: ¬A learning scheme for information retrieval in hypertext (1994) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:savoy in 7291) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 7291, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=7291)
    
  4. Savoy, J.: Bayesian inference networks and spreading activation in hypertext systems (1992) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:savoy in 260) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 260, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=260)
    
  5. Savoy, J.: Searching information in legal hypertext systems (1993/94) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:savoy in 825) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 825, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=825)
    

Similar documents (content)

  1. Savoy, J.: Text clustering : an application with the 'State of the Union' addresses (2015) 0.44
    0.43559885 = sum of:
      0.43559885 = product of:
        0.9899974 = sum of:
          0.026186205 = weight(abstract_txt:important in 3128) [ClassicSimilarity], result of:
            0.026186205 = score(doc=3128,freq=1.0), product of:
              0.099511035 = queryWeight, product of:
                1.0635929 = boost
                4.21038 = idf(docFreq=1791, maxDocs=44421)
                0.022221558 = queryNorm
              0.26314875 = fieldWeight in 3128, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.21038 = idf(docFreq=1791, maxDocs=44421)
                0.0625 = fieldNorm(doc=3128)
          0.15060218 = weight(abstract_txt:speeches in 3128) [ClassicSimilarity], result of:
            0.15060218 = score(doc=3128,freq=1.0), product of:
              0.25353253 = queryWeight, product of:
                1.2004433 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.022221558 = queryNorm
              0.5940152 = fieldWeight in 3128, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.0625 = fieldNorm(doc=3128)
          0.039439745 = weight(abstract_txt:state in 3128) [ClassicSimilarity], result of:
            0.039439745 = score(doc=3128,freq=1.0), product of:
              0.13075118 = queryWeight, product of:
                1.2191653 = boost
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.022221558 = queryNorm
              0.3016397 = fieldWeight in 3128, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.0625 = fieldNorm(doc=3128)
          0.16286767 = weight(abstract_txt:1790 in 3128) [ClassicSimilarity], result of:
            0.16286767 = score(doc=3128,freq=1.0), product of:
              0.26711777 = queryWeight, product of:
                1.2321857 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.022221558 = queryNorm
              0.6097223 = fieldWeight in 3128, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0625 = fieldNorm(doc=3128)
          0.041939065 = weight(abstract_txt:representation in 3128) [ClassicSimilarity], result of:
            0.041939065 = score(doc=3128,freq=1.0), product of:
              0.13621826 = queryWeight, product of:
                1.2443928 = boost
                4.9261017 = idf(docFreq=875, maxDocs=44421)
                0.022221558 = queryNorm
              0.30788136 = fieldWeight in 3128, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9261017 = idf(docFreq=875, maxDocs=44421)
                0.0625 = fieldNorm(doc=3128)
          0.24142164 = weight(abstract_txt:presidents in 3128) [ClassicSimilarity], result of:
            0.24142164 = score(doc=3128,freq=2.0), product of:
              0.2756261 = queryWeight, product of:
                1.2516559 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.022221558 = queryNorm
              0.8759027 = fieldWeight in 3128, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0625 = fieldNorm(doc=3128)
          0.05763884 = weight(abstract_txt:another in 3128) [ClassicSimilarity], result of:
            0.05763884 = score(doc=3128,freq=1.0), product of:
              0.16838355 = queryWeight, product of:
                1.3835334 = boost
                5.476909 = idf(docFreq=504, maxDocs=44421)
                0.022221558 = queryNorm
              0.34230682 = fieldWeight in 3128, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.476909 = idf(docFreq=504, maxDocs=44421)
                0.0625 = fieldNorm(doc=3128)
          0.06841175 = weight(abstract_txt:addresses in 3128) [ClassicSimilarity], result of:
            0.06841175 = score(doc=3128,freq=1.0), product of:
              0.18876001 = queryWeight, product of:
                1.4648556 = boost
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.022221558 = queryNorm
              0.36242715 = fieldWeight in 3128, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.0625 = fieldNorm(doc=3128)
          0.07917812 = weight(abstract_txt:define in 3128) [ClassicSimilarity], result of:
            0.07917812 = score(doc=3128,freq=1.0), product of:
              0.20807807 = queryWeight, product of:
                1.5379881 = boost
                6.0883393 = idf(docFreq=273, maxDocs=44421)
                0.022221558 = queryNorm
              0.3805212 = fieldWeight in 3128, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0883393 = idf(docFreq=273, maxDocs=44421)
                0.0625 = fieldNorm(doc=3128)
          0.07932086 = weight(abstract_txt:union in 3128) [ClassicSimilarity], result of:
            0.07932086 = score(doc=3128,freq=1.0), product of:
              0.20832808 = queryWeight, product of:
                1.5389117 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.022221558 = queryNorm
              0.38074973 = fieldWeight in 3128, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.0625 = fieldNorm(doc=3128)
          0.042991344 = weight(abstract_txt:most in 3128) [ClassicSimilarity], result of:
            0.042991344 = score(doc=3128,freq=1.0), product of:
              0.17448317 = queryWeight, product of:
                1.9917351 = boost
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.022221558 = queryNorm
              0.2463925 = fieldWeight in 3128, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.0625 = fieldNorm(doc=3128)
        0.44 = coord(11/25)
    
  2. Savoy, J.: Estimating the probability of an authorship attribution (2016) 0.29
    0.29419658 = sum of:
      0.29419658 = product of:
        0.7354914 = sum of:
          0.026186205 = weight(abstract_txt:important in 3937) [ClassicSimilarity], result of:
            0.026186205 = score(doc=3937,freq=1.0), product of:
              0.099511035 = queryWeight, product of:
                1.0635929 = boost
                4.21038 = idf(docFreq=1791, maxDocs=44421)
                0.022221558 = queryNorm
              0.26314875 = fieldWeight in 3937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.21038 = idf(docFreq=1791, maxDocs=44421)
                0.0625 = fieldNorm(doc=3937)
          0.039439745 = weight(abstract_txt:state in 3937) [ClassicSimilarity], result of:
            0.039439745 = score(doc=3937,freq=1.0), product of:
              0.13075118 = queryWeight, product of:
                1.2191653 = boost
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.022221558 = queryNorm
              0.3016397 = fieldWeight in 3937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.0625 = fieldNorm(doc=3937)
          0.16286767 = weight(abstract_txt:1790 in 3937) [ClassicSimilarity], result of:
            0.16286767 = score(doc=3937,freq=1.0), product of:
              0.26711777 = queryWeight, product of:
                1.2321857 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.022221558 = queryNorm
              0.6097223 = fieldWeight in 3937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0625 = fieldNorm(doc=3937)
          0.17071088 = weight(abstract_txt:presidents in 3937) [ClassicSimilarity], result of:
            0.17071088 = score(doc=3937,freq=1.0), product of:
              0.2756261 = queryWeight, product of:
                1.2516559 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.022221558 = queryNorm
              0.61935675 = fieldWeight in 3937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0625 = fieldNorm(doc=3937)
          0.04606801 = weight(abstract_txt:associated in 3937) [ClassicSimilarity], result of:
            0.04606801 = score(doc=3937,freq=1.0), product of:
              0.14501819 = queryWeight, product of:
                1.2839587 = boost
                5.082729 = idf(docFreq=748, maxDocs=44421)
                0.022221558 = queryNorm
              0.31767055 = fieldWeight in 3937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.082729 = idf(docFreq=748, maxDocs=44421)
                0.0625 = fieldNorm(doc=3937)
          0.050387908 = weight(abstract_txt:determine in 3937) [ClassicSimilarity], result of:
            0.050387908 = score(doc=3937,freq=1.0), product of:
              0.15394789 = queryWeight, product of:
                1.322899 = boost
                5.2368793 = idf(docFreq=641, maxDocs=44421)
                0.022221558 = queryNorm
              0.32730496 = fieldWeight in 3937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2368793 = idf(docFreq=641, maxDocs=44421)
                0.0625 = fieldNorm(doc=3937)
          0.06841175 = weight(abstract_txt:addresses in 3937) [ClassicSimilarity], result of:
            0.06841175 = score(doc=3937,freq=1.0), product of:
              0.18876001 = queryWeight, product of:
                1.4648556 = boost
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.022221558 = queryNorm
              0.36242715 = fieldWeight in 3937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7988343 = idf(docFreq=365, maxDocs=44421)
                0.0625 = fieldNorm(doc=3937)
          0.049107004 = weight(abstract_txt:text in 3937) [ClassicSimilarity], result of:
            0.049107004 = score(doc=3937,freq=2.0), product of:
              0.13749036 = queryWeight, product of:
                1.5311635 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.022221558 = queryNorm
              0.3571669 = fieldWeight in 3937, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=3937)
          0.07932086 = weight(abstract_txt:union in 3937) [ClassicSimilarity], result of:
            0.07932086 = score(doc=3937,freq=1.0), product of:
              0.20832808 = queryWeight, product of:
                1.5389117 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.022221558 = queryNorm
              0.38074973 = fieldWeight in 3937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.0625 = fieldNorm(doc=3937)
          0.042991344 = weight(abstract_txt:most in 3937) [ClassicSimilarity], result of:
            0.042991344 = score(doc=3937,freq=1.0), product of:
              0.17448317 = queryWeight, product of:
                1.9917351 = boost
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.022221558 = queryNorm
              0.2463925 = fieldWeight in 3937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.0625 = fieldNorm(doc=3937)
        0.4 = coord(10/25)
    
  3. Kim, W.; Wilbur, W.J.: Corpus-based statistical screening for content-bearing terms (2001) 0.26
    0.25896344 = sum of:
      0.25896344 = product of:
        0.7193429 = sum of:
          0.021004634 = weight(abstract_txt:specific in 188) [ClassicSimilarity], result of:
            0.021004634 = score(doc=188,freq=1.0), product of:
              0.10406997 = queryWeight, product of:
                1.0876834 = boost
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.022221558 = queryNorm
              0.20183185 = fieldWeight in 188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.040867906 = weight(abstract_txt:selection in 188) [ClassicSimilarity], result of:
            0.040867906 = score(doc=188,freq=1.0), product of:
              0.1621948 = queryWeight, product of:
                1.3578702 = boost
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.022221558 = queryNorm
              0.25196803 = fieldWeight in 188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.026042921 = weight(abstract_txt:text in 188) [ClassicSimilarity], result of:
            0.026042921 = score(doc=188,freq=1.0), product of:
              0.13749036 = queryWeight, product of:
                1.5311635 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.022221558 = queryNorm
              0.18941635 = fieldWeight in 188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.05219535 = weight(abstract_txt:terms in 188) [ClassicSimilarity], result of:
            0.05219535 = score(doc=188,freq=4.0), product of:
              0.137683 = queryWeight, product of:
                1.5322357 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.022221558 = queryNorm
              0.379098 = fieldWeight in 188, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.08413248 = weight(abstract_txt:union in 188) [ClassicSimilarity], result of:
            0.08413248 = score(doc=188,freq=2.0), product of:
              0.20832808 = queryWeight, product of:
                1.5389117 = boost
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.022221558 = queryNorm
              0.40384609 = fieldWeight in 188, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.0919957 = idf(docFreq=272, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.17716539 = weight(abstract_txt:score in 188) [ClassicSimilarity], result of:
            0.17716539 = score(doc=188,freq=4.0), product of:
              0.27165306 = queryWeight, product of:
                1.7573049 = boost
                6.9565353 = idf(docFreq=114, maxDocs=44421)
                0.022221558 = queryNorm
              0.6521752 = fieldWeight in 188, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9565353 = idf(docFreq=114, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.15405415 = weight(abstract_txt:specificity in 188) [ClassicSimilarity], result of:
            0.15405415 = score(doc=188,freq=2.0), product of:
              0.31180826 = queryWeight, product of:
                1.8827108 = boost
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.022221558 = queryNorm
              0.49406692 = fieldWeight in 188, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.06340732 = weight(abstract_txt:measure in 188) [ClassicSimilarity], result of:
            0.06340732 = score(doc=188,freq=1.0), product of:
              0.2488315 = queryWeight, product of:
                2.059862 = boost
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.022221558 = queryNorm
              0.2548203 = fieldWeight in 188, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4361663 = idf(docFreq=525, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
          0.10047276 = weight(abstract_txt:term in 188) [ClassicSimilarity], result of:
            0.10047276 = score(doc=188,freq=3.0), product of:
              0.25809753 = queryWeight, product of:
                2.4224048 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.022221558 = queryNorm
              0.38928217 = fieldWeight in 188, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.046875 = fieldNorm(doc=188)
        0.36 = coord(9/25)
    
  4. Belbachir, F.; Boughanem, M.: Using language models to improve opinion detection (2018) 0.15
    0.14923783 = sum of:
      0.14923783 = product of:
        0.4145495 = sum of:
          0.08673737 = weight(abstract_txt:dirichlet in 44) [ClassicSimilarity], result of:
            0.08673737 = score(doc=44,freq=1.0), product of:
              0.19184257 = queryWeight, product of:
                1.0442327 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.022221558 = queryNorm
              0.45212787 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.022912929 = weight(abstract_txt:important in 44) [ClassicSimilarity], result of:
            0.022912929 = score(doc=44,freq=1.0), product of:
              0.099511035 = queryWeight, product of:
                1.0635929 = boost
                4.21038 = idf(docFreq=1791, maxDocs=44421)
                0.022221558 = queryNorm
              0.23025516 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.21038 = idf(docFreq=1791, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.024505407 = weight(abstract_txt:specific in 44) [ClassicSimilarity], result of:
            0.024505407 = score(doc=44,freq=1.0), product of:
              0.10406997 = queryWeight, product of:
                1.0876834 = boost
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.022221558 = queryNorm
              0.23547049 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.034509778 = weight(abstract_txt:state in 44) [ClassicSimilarity], result of:
            0.034509778 = score(doc=44,freq=1.0), product of:
              0.13075118 = queryWeight, product of:
                1.2191653 = boost
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.022221558 = queryNorm
              0.26393473 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8262353 = idf(docFreq=967, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.04408942 = weight(abstract_txt:determine in 44) [ClassicSimilarity], result of:
            0.04408942 = score(doc=44,freq=1.0), product of:
              0.15394789 = queryWeight, product of:
                1.322899 = boost
                5.2368793 = idf(docFreq=641, maxDocs=44421)
                0.022221558 = queryNorm
              0.28639185 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2368793 = idf(docFreq=641, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.03038341 = weight(abstract_txt:text in 44) [ClassicSimilarity], result of:
            0.03038341 = score(doc=44,freq=1.0), product of:
              0.13749036 = queryWeight, product of:
                1.5311635 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.022221558 = queryNorm
              0.22098574 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.03044729 = weight(abstract_txt:terms in 44) [ClassicSimilarity], result of:
            0.03044729 = score(doc=44,freq=1.0), product of:
              0.137683 = queryWeight, product of:
                1.5322357 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.022221558 = queryNorm
              0.2211405 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.103346474 = weight(abstract_txt:score in 44) [ClassicSimilarity], result of:
            0.103346474 = score(doc=44,freq=1.0), product of:
              0.27165306 = queryWeight, product of:
                1.7573049 = boost
                6.9565353 = idf(docFreq=114, maxDocs=44421)
                0.022221558 = queryNorm
              0.38043553 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9565353 = idf(docFreq=114, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
          0.037617426 = weight(abstract_txt:most in 44) [ClassicSimilarity], result of:
            0.037617426 = score(doc=44,freq=1.0), product of:
              0.17448317 = queryWeight, product of:
                1.9917351 = boost
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.022221558 = queryNorm
              0.21559344 = fieldWeight in 44, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.0546875 = fieldNorm(doc=44)
        0.36 = coord(9/25)
    
  5. Ruthven, I.; Lalmas, M.; Rijsbergen, K. van: Combining and selecting characteristics of information use (2002) 0.15
    0.14732781 = sum of:
      0.14732781 = product of:
        0.52617073 = sum of:
          0.03779093 = weight(abstract_txt:determine in 208) [ClassicSimilarity], result of:
            0.03779093 = score(doc=208,freq=1.0), product of:
              0.15394789 = queryWeight, product of:
                1.322899 = boost
                5.2368793 = idf(docFreq=641, maxDocs=44421)
                0.022221558 = queryNorm
              0.24547872 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2368793 = idf(docFreq=641, maxDocs=44421)
                0.046875 = fieldNorm(doc=208)
          0.04322913 = weight(abstract_txt:another in 208) [ClassicSimilarity], result of:
            0.04322913 = score(doc=208,freq=1.0), product of:
              0.16838355 = queryWeight, product of:
                1.3835334 = boost
                5.476909 = idf(docFreq=504, maxDocs=44421)
                0.022221558 = queryNorm
              0.2567301 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.476909 = idf(docFreq=504, maxDocs=44421)
                0.046875 = fieldNorm(doc=208)
          0.026042921 = weight(abstract_txt:text in 208) [ClassicSimilarity], result of:
            0.026042921 = score(doc=208,freq=1.0), product of:
              0.13749036 = queryWeight, product of:
                1.5311635 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.022221558 = queryNorm
              0.18941635 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.046875 = fieldNorm(doc=208)
          0.05219535 = weight(abstract_txt:terms in 208) [ClassicSimilarity], result of:
            0.05219535 = score(doc=208,freq=4.0), product of:
              0.137683 = queryWeight, product of:
                1.5322357 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.022221558 = queryNorm
              0.379098 = fieldWeight in 208, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.046875 = fieldNorm(doc=208)
          0.05938359 = weight(abstract_txt:define in 208) [ClassicSimilarity], result of:
            0.05938359 = score(doc=208,freq=1.0), product of:
              0.20807807 = queryWeight, product of:
                1.5379881 = boost
                6.0883393 = idf(docFreq=273, maxDocs=44421)
                0.022221558 = queryNorm
              0.2853909 = fieldWeight in 208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0883393 = idf(docFreq=273, maxDocs=44421)
                0.046875 = fieldNorm(doc=208)
          0.15405415 = weight(abstract_txt:specificity in 208) [ClassicSimilarity], result of:
            0.15405415 = score(doc=208,freq=2.0), product of:
              0.31180826 = queryWeight, product of:
                1.8827108 = boost
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.022221558 = queryNorm
              0.49406692 = fieldWeight in 208, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.046875 = fieldNorm(doc=208)
          0.15347469 = weight(abstract_txt:term in 208) [ClassicSimilarity], result of:
            0.15347469 = score(doc=208,freq=7.0), product of:
              0.25809753 = queryWeight, product of:
                2.4224048 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.022221558 = queryNorm
              0.59463835 = fieldWeight in 208, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.046875 = fieldNorm(doc=208)
        0.28 = coord(7/25)