Document (#39129)

Author
Savoy, J.
Title
Text clustering : an application with the 'State of the Union' addresses
Source
Journal of the Association for Information Science and Technology. 66(2015) no.8, S.1645-1654
Year
2015
Abstract
This paper describes a clustering and authorship attribution study over the State of the Union addresses from 1790 to 2014 (224 speeches delivered by 41 presidents). To define the style of each presidency, we have applied a principal component analysis (PCA) based on the part-of-speech (POS) frequencies. From Roosevelt (1934), each president tends to own a distinctive style whereas previous presidents tend usually to share some stylistic aspects with others. Applying an automatic classification based on the frequencies of all content-bearing word-types we show that chronology tends to play a central role in forming clusters, a factor that is more important than political affiliation. Using the 300 most frequent word-types, we generate another clustering representation based on the style of each president. This second view shares similarities with the first one, but usually with more numerous and smaller clusters. Finally, an authorship attribution approach for each speech can reach a success rate of around 95.7% under some constraints. When an incorrect assignment is detected, the proposed author often belongs to the same party and has lived during roughly the same time period as the presumed author. A deeper analysis of some incorrect assignments reveals interesting reasons justifying difficult attributions.
Content
Vgl.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23283/abstract.

Similar documents (author)

  1. Savoy, J.: Stemming of French words based on grammatical categories (1993) 5.21
    5.2059946 = sum of:
      5.2059946 = weight(author_txt:savoy in 4650) [ClassicSimilarity], result of:
        5.2059946 = fieldWeight in 4650, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.329592 = idf(docFreq=28, maxDocs=44218)
          0.625 = fieldNorm(doc=4650)
    
  2. Savoy, J.: Effectiveness of information retrieval systems used in a hypertext environment (1993) 5.21
    5.2059946 = sum of:
      5.2059946 = weight(author_txt:savoy in 6511) [ClassicSimilarity], result of:
        5.2059946 = fieldWeight in 6511, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.329592 = idf(docFreq=28, maxDocs=44218)
          0.625 = fieldNorm(doc=6511)
    
  3. Savoy, J.: ¬A learning scheme for information retrieval in hypertext (1994) 5.21
    5.2059946 = sum of:
      5.2059946 = weight(author_txt:savoy in 7292) [ClassicSimilarity], result of:
        5.2059946 = fieldWeight in 7292, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.329592 = idf(docFreq=28, maxDocs=44218)
          0.625 = fieldNorm(doc=7292)
    
  4. Savoy, J.: Bayesian inference networks and spreading activation in hypertext systems (1992) 5.21
    5.2059946 = sum of:
      5.2059946 = weight(author_txt:savoy in 192) [ClassicSimilarity], result of:
        5.2059946 = fieldWeight in 192, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.329592 = idf(docFreq=28, maxDocs=44218)
          0.625 = fieldNorm(doc=192)
    
  5. Savoy, J.: Searching information in legal hypertext systems (1993/94) 5.21
    5.2059946 = sum of:
      5.2059946 = weight(author_txt:savoy in 757) [ClassicSimilarity], result of:
        5.2059946 = fieldWeight in 757, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.329592 = idf(docFreq=28, maxDocs=44218)
          0.625 = fieldNorm(doc=757)
    

Similar documents (content)

  1. Savoy, J.: Estimating the probability of an authorship attribution (2016) 0.47
    0.47362462 = sum of:
      0.47362462 = product of:
        0.986718 = sum of:
          0.017366474 = weight(abstract_txt:based in 2937) [ClassicSimilarity], result of:
            0.017366474 = score(doc=2937,freq=2.0), product of:
              0.061632276 = queryWeight, product of:
                1.0067499 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019203402 = queryNorm
              0.28177565 = fieldWeight in 2937, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.028471412 = weight(abstract_txt:state in 2937) [ClassicSimilarity], result of:
            0.028471412 = score(doc=2937,freq=1.0), product of:
              0.09431613 = queryWeight, product of:
                1.0168686 = boost
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.019203402 = queryNorm
              0.30187213 = fieldWeight in 2937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.117137104 = weight(abstract_txt:1790 in 2937) [ClassicSimilarity], result of:
            0.117137104 = score(doc=2937,freq=1.0), product of:
              0.19220573 = queryWeight, product of:
                1.0264552 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.019203402 = queryNorm
              0.6094361 = fieldWeight in 2937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.062336408 = weight(abstract_txt:author in 2937) [ClassicSimilarity], result of:
            0.062336408 = score(doc=2937,freq=4.0), product of:
              0.10018157 = queryWeight, product of:
                1.0480108 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.019203402 = queryNorm
              0.6222343 = fieldWeight in 2937, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.007893891 = weight(abstract_txt:with in 2937) [ClassicSimilarity], result of:
            0.007893891 = score(doc=2937,freq=1.0), product of:
              0.05052629 = queryWeight, product of:
                1.0525568 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.019203402 = queryNorm
              0.15623334 = fieldWeight in 2937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.018857433 = weight(abstract_txt:some in 2937) [ClassicSimilarity], result of:
            0.018857433 = score(doc=2937,freq=1.0), product of:
              0.08203492 = queryWeight, product of:
                1.1614937 = boost
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.019203402 = queryNorm
              0.22987078 = fieldWeight in 2937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.049506463 = weight(abstract_txt:addresses in 2937) [ClassicSimilarity], result of:
            0.049506463 = score(doc=2937,freq=1.0), product of:
              0.13638122 = queryWeight, product of:
                1.2227823 = boost
                5.808009 = idf(docFreq=360, maxDocs=44218)
                0.019203402 = queryNorm
              0.36300057 = fieldWeight in 2937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.808009 = idf(docFreq=360, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.057000518 = weight(abstract_txt:union in 2937) [ClassicSimilarity], result of:
            0.057000518 = score(doc=2937,freq=1.0), product of:
              0.14981864 = queryWeight, product of:
                1.2816067 = boost
                6.087415 = idf(docFreq=272, maxDocs=44218)
                0.019203402 = queryNorm
              0.38046345 = fieldWeight in 2937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.087415 = idf(docFreq=272, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.17174156 = weight(abstract_txt:authorship in 2937) [ClassicSimilarity], result of:
            0.17174156 = score(doc=2937,freq=4.0), product of:
              0.19688392 = queryWeight, product of:
                1.4691865 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.019203402 = queryNorm
              0.87229854 = fieldWeight in 2937, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.17553477 = weight(abstract_txt:attribution in 2937) [ClassicSimilarity], result of:
            0.17553477 = score(doc=2937,freq=2.0), product of:
              0.25169742 = queryWeight, product of:
                1.661159 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.019203402 = queryNorm
              0.6974039 = fieldWeight in 2937, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.035310496 = weight(abstract_txt:each in 2937) [ClassicSimilarity], result of:
            0.035310496 = score(doc=2937,freq=1.0), product of:
              0.13717002 = queryWeight, product of:
                1.7342688 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.019203402 = queryNorm
              0.25742137 = fieldWeight in 2937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
          0.24556147 = weight(abstract_txt:presidents in 2937) [ClassicSimilarity], result of:
            0.24556147 = score(doc=2937,freq=1.0), product of:
              0.39666158 = queryWeight, product of:
                2.085364 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.019203402 = queryNorm
              0.6190705 = fieldWeight in 2937, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=2937)
        0.48 = coord(12/25)
    
  2. Savoy, J.: Text representation strategies : an example with the State of the union addresses (2016) 0.41
    0.406745 = sum of:
      0.406745 = product of:
        0.8473854 = sum of:
          0.1083115 = weight(abstract_txt:speeches in 3042) [ClassicSimilarity], result of:
            0.1083115 = score(doc=3042,freq=1.0), product of:
              0.18242584 = queryWeight, product of:
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.019203402 = queryNorm
              0.5937289 = fieldWeight in 3042, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.499662 = idf(docFreq=8, maxDocs=44218)
                0.0625 = fieldNorm(doc=3042)
          0.017366474 = weight(abstract_txt:based in 3042) [ClassicSimilarity], result of:
            0.017366474 = score(doc=3042,freq=2.0), product of:
              0.061632276 = queryWeight, product of:
                1.0067499 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019203402 = queryNorm
              0.28177565 = fieldWeight in 3042, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=3042)
          0.028471412 = weight(abstract_txt:state in 3042) [ClassicSimilarity], result of:
            0.028471412 = score(doc=3042,freq=1.0), product of:
              0.09431613 = queryWeight, product of:
                1.0168686 = boost
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.019203402 = queryNorm
              0.30187213 = fieldWeight in 3042, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.0625 = fieldNorm(doc=3042)
          0.117137104 = weight(abstract_txt:1790 in 3042) [ClassicSimilarity], result of:
            0.117137104 = score(doc=3042,freq=1.0), product of:
              0.19220573 = queryWeight, product of:
                1.0264552 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.019203402 = queryNorm
              0.6094361 = fieldWeight in 3042, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.0625 = fieldNorm(doc=3042)
          0.011163648 = weight(abstract_txt:with in 3042) [ClassicSimilarity], result of:
            0.011163648 = score(doc=3042,freq=2.0), product of:
              0.05052629 = queryWeight, product of:
                1.0525568 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.019203402 = queryNorm
              0.22094731 = fieldWeight in 3042, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=3042)
          0.040576402 = weight(abstract_txt:word in 3042) [ClassicSimilarity], result of:
            0.040576402 = score(doc=3042,freq=1.0), product of:
              0.119443454 = queryWeight, product of:
                1.1443346 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.019203402 = queryNorm
              0.33971223 = fieldWeight in 3042, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=3042)
          0.026668437 = weight(abstract_txt:some in 3042) [ClassicSimilarity], result of:
            0.026668437 = score(doc=3042,freq=2.0), product of:
              0.08203492 = queryWeight, product of:
                1.1614937 = boost
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.019203402 = queryNorm
              0.32508639 = fieldWeight in 3042, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6779325 = idf(docFreq=3037, maxDocs=44218)
                0.0625 = fieldNorm(doc=3042)
          0.049506463 = weight(abstract_txt:addresses in 3042) [ClassicSimilarity], result of:
            0.049506463 = score(doc=3042,freq=1.0), product of:
              0.13638122 = queryWeight, product of:
                1.2227823 = boost
                5.808009 = idf(docFreq=360, maxDocs=44218)
                0.019203402 = queryNorm
              0.36300057 = fieldWeight in 3042, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.808009 = idf(docFreq=360, maxDocs=44218)
                0.0625 = fieldNorm(doc=3042)
          0.057000518 = weight(abstract_txt:union in 3042) [ClassicSimilarity], result of:
            0.057000518 = score(doc=3042,freq=1.0), product of:
              0.14981864 = queryWeight, product of:
                1.2816067 = boost
                6.087415 = idf(docFreq=272, maxDocs=44218)
                0.019203402 = queryNorm
              0.38046345 = fieldWeight in 3042, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.087415 = idf(docFreq=272, maxDocs=44218)
                0.0625 = fieldNorm(doc=3042)
          0.11031151 = weight(abstract_txt:frequencies in 3042) [ClassicSimilarity], result of:
            0.11031151 = score(doc=3042,freq=1.0), product of:
              0.23266293 = queryWeight, product of:
                1.5971122 = boost
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.019203402 = queryNorm
              0.47412583 = fieldWeight in 3042, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5860133 = idf(docFreq=60, maxDocs=44218)
                0.0625 = fieldNorm(doc=3042)
          0.035310496 = weight(abstract_txt:each in 3042) [ClassicSimilarity], result of:
            0.035310496 = score(doc=3042,freq=1.0), product of:
              0.13717002 = queryWeight, product of:
                1.7342688 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.019203402 = queryNorm
              0.25742137 = fieldWeight in 3042, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.0625 = fieldNorm(doc=3042)
          0.24556147 = weight(abstract_txt:presidents in 3042) [ClassicSimilarity], result of:
            0.24556147 = score(doc=3042,freq=1.0), product of:
              0.39666158 = queryWeight, product of:
                2.085364 = boost
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.019203402 = queryNorm
              0.6190705 = fieldWeight in 3042, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.905128 = idf(docFreq=5, maxDocs=44218)
                0.0625 = fieldNorm(doc=3042)
        0.48 = coord(12/25)
    
  3. Stamatatos, E.: Masking topic-related information to enhance authorship attribution (2018) 0.16
    0.1612334 = sum of:
      0.1612334 = product of:
        0.6718058 = sum of:
          0.028471412 = weight(abstract_txt:state in 4124) [ClassicSimilarity], result of:
            0.028471412 = score(doc=4124,freq=1.0), product of:
              0.09431613 = queryWeight, product of:
                1.0168686 = boost
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.019203402 = queryNorm
              0.30187213 = fieldWeight in 4124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.829954 = idf(docFreq=959, maxDocs=44218)
                0.0625 = fieldNorm(doc=4124)
          0.031168204 = weight(abstract_txt:author in 4124) [ClassicSimilarity], result of:
            0.031168204 = score(doc=4124,freq=1.0), product of:
              0.10018157 = queryWeight, product of:
                1.0480108 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.019203402 = queryNorm
              0.31111714 = fieldWeight in 4124, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=4124)
          0.011163648 = weight(abstract_txt:with in 4124) [ClassicSimilarity], result of:
            0.011163648 = score(doc=4124,freq=2.0), product of:
              0.05052629 = queryWeight, product of:
                1.0525568 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.019203402 = queryNorm
              0.22094731 = fieldWeight in 4124, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=4124)
          0.14873254 = weight(abstract_txt:authorship in 4124) [ClassicSimilarity], result of:
            0.14873254 = score(doc=4124,freq=3.0), product of:
              0.19688392 = queryWeight, product of:
                1.4691865 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.019203402 = queryNorm
              0.75543267 = fieldWeight in 4124, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=4124)
          0.30403516 = weight(abstract_txt:attribution in 4124) [ClassicSimilarity], result of:
            0.30403516 = score(doc=4124,freq=6.0), product of:
              0.25169742 = queryWeight, product of:
                1.661159 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.019203402 = queryNorm
              1.2079391 = fieldWeight in 4124, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=4124)
          0.14823484 = weight(abstract_txt:style in 4124) [ClassicSimilarity], result of:
            0.14823484 = score(doc=4124,freq=2.0), product of:
              0.25741506 = queryWeight, product of:
                2.0574744 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.019203402 = queryNorm
              0.57585925 = fieldWeight in 4124, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=4124)
        0.24 = coord(6/25)
    
  4. Cai, X.; Li, W.: Enhancing sentence-level clustering with integrated and interactive frameworks for theme-based summarization (2011) 0.14
    0.13541238 = sum of:
      0.13541238 = product of:
        0.5642183 = sum of:
          0.017366474 = weight(abstract_txt:based in 4770) [ClassicSimilarity], result of:
            0.017366474 = score(doc=4770,freq=2.0), product of:
              0.061632276 = queryWeight, product of:
                1.0067499 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019203402 = queryNorm
              0.28177565 = fieldWeight in 4770, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=4770)
          0.007893891 = weight(abstract_txt:with in 4770) [ClassicSimilarity], result of:
            0.007893891 = score(doc=4770,freq=1.0), product of:
              0.05052629 = queryWeight, product of:
                1.0525568 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.019203402 = queryNorm
              0.15623334 = fieldWeight in 4770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=4770)
          0.081152804 = weight(abstract_txt:word in 4770) [ClassicSimilarity], result of:
            0.081152804 = score(doc=4770,freq=4.0), product of:
              0.119443454 = queryWeight, product of:
                1.1443346 = boost
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.019203402 = queryNorm
              0.67942446 = fieldWeight in 4770, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4353957 = idf(docFreq=523, maxDocs=44218)
                0.0625 = fieldNorm(doc=4770)
          0.06987857 = weight(abstract_txt:clusters in 4770) [ClassicSimilarity], result of:
            0.06987857 = score(doc=4770,freq=1.0), product of:
              0.17161003 = queryWeight, product of:
                1.3716495 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.019203402 = queryNorm
              0.407194 = fieldWeight in 4770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=4770)
          0.035310496 = weight(abstract_txt:each in 4770) [ClassicSimilarity], result of:
            0.035310496 = score(doc=4770,freq=1.0), product of:
              0.13717002 = queryWeight, product of:
                1.7342688 = boost
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.019203402 = queryNorm
              0.25742137 = fieldWeight in 4770, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.118742 = idf(docFreq=1954, maxDocs=44218)
                0.0625 = fieldNorm(doc=4770)
          0.35261604 = weight(abstract_txt:clustering in 4770) [ClassicSimilarity], result of:
            0.35261604 = score(doc=4770,freq=15.0), product of:
              0.23434086 = queryWeight, product of:
                1.9630957 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.019203402 = queryNorm
              1.5047143 = fieldWeight in 4770, product of:
                3.8729835 = tf(freq=15.0), with freq of:
                  15.0 = termFreq=15.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0625 = fieldNorm(doc=4770)
        0.24 = coord(6/25)
    
  5. Savoy, J.: Authorship of Pauline epistles revisited (2019) 0.13
    0.12696077 = sum of:
      0.12696077 = product of:
        0.4534313 = sum of:
          0.017366474 = weight(abstract_txt:based in 5386) [ClassicSimilarity], result of:
            0.017366474 = score(doc=5386,freq=2.0), product of:
              0.061632276 = queryWeight, product of:
                1.0067499 = boost
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.019203402 = queryNorm
              0.28177565 = fieldWeight in 5386, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1879277 = idf(docFreq=4958, maxDocs=44218)
                0.0625 = fieldNorm(doc=5386)
          0.053984914 = weight(abstract_txt:author in 5386) [ClassicSimilarity], result of:
            0.053984914 = score(doc=5386,freq=3.0), product of:
              0.10018157 = queryWeight, product of:
                1.0480108 = boost
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.019203402 = queryNorm
              0.5388707 = fieldWeight in 5386, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.9778743 = idf(docFreq=827, maxDocs=44218)
                0.0625 = fieldNorm(doc=5386)
          0.011163648 = weight(abstract_txt:with in 5386) [ClassicSimilarity], result of:
            0.011163648 = score(doc=5386,freq=2.0), product of:
              0.05052629 = queryWeight, product of:
                1.0525568 = boost
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.019203402 = queryNorm
              0.22094731 = fieldWeight in 5386, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4997334 = idf(docFreq=9868, maxDocs=44218)
                0.0625 = fieldNorm(doc=5386)
          0.06987857 = weight(abstract_txt:clusters in 5386) [ClassicSimilarity], result of:
            0.06987857 = score(doc=5386,freq=1.0), product of:
              0.17161003 = queryWeight, product of:
                1.3716495 = boost
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.019203402 = queryNorm
              0.407194 = fieldWeight in 5386, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.515104 = idf(docFreq=177, maxDocs=44218)
                0.0625 = fieldNorm(doc=5386)
          0.08587078 = weight(abstract_txt:authorship in 5386) [ClassicSimilarity], result of:
            0.08587078 = score(doc=5386,freq=1.0), product of:
              0.19688392 = queryWeight, product of:
                1.4691865 = boost
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.019203402 = queryNorm
              0.43614927 = fieldWeight in 5386, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9783883 = idf(docFreq=111, maxDocs=44218)
                0.0625 = fieldNorm(doc=5386)
          0.12412183 = weight(abstract_txt:attribution in 5386) [ClassicSimilarity], result of:
            0.12412183 = score(doc=5386,freq=1.0), product of:
              0.25169742 = queryWeight, product of:
                1.661159 = boost
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.019203402 = queryNorm
              0.49313906 = fieldWeight in 5386, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.890225 = idf(docFreq=44, maxDocs=44218)
                0.0625 = fieldNorm(doc=5386)
          0.09104507 = weight(abstract_txt:clustering in 5386) [ClassicSimilarity], result of:
            0.09104507 = score(doc=5386,freq=1.0), product of:
              0.23434086 = queryWeight, product of:
                1.9630957 = boost
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.019203402 = queryNorm
              0.38851553 = fieldWeight in 5386, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2162485 = idf(docFreq=239, maxDocs=44218)
                0.0625 = fieldNorm(doc=5386)
        0.28 = coord(7/25)