Document (#39773)

Author
Denning, J.
Pera, M.S.
Ng, Y.-K.
Title
¬A readability level prediction tool for K-12 books
Source
Journal of the Association for Information Science and Technology. 67(2016) no.3, pp.550-565
Year
2016
Abstract
The readability levels of books identify suitable reading materials. Unfortunately, the majority of published books are assigned a readability level range, which is not useful to readers who look for books at a particular grade level. Existing readability formulas/analysis tools require at least an excerpt of a book to estimate its readability level, which is a severe constraint, since copyright laws prohibit book contents from being made publicly accessible. To alleviate the constraint, we have developed TRoLL, which relies on publicly accessible online book metadata, in addition to a book's snippet if one is available, to predict its readability level. Based on a multi-dimensional regression analysis, TRoLL determines the grade level of any book instantly, even without a sample of its text, and considers its topical suitability, which is unique. Furthermore, TRoLL is a significant contribution to the educational community, since its computed book readability levels can enrich K-12 readers' book selections and aid parents, teachers, and librarians in locating reading materials suitable for their K-12 readers, which can be a time-consuming and frustrating task that does not always yield a quality outcome. Empirical studies have verified the prediction accuracy of TRoLL and demonstrated its superiority over well-known readability formulas/analysis tools.
Content
Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23417/abstract.

Similar documents (author)

  1. Pera, M. Soledad => Soledad Pera, M.: 5.04
    5.0403857 = sum of:
      5.0403857 = weight(author_txt:pera in 3875) [ClassicSimilarity], result of:
        5.0403857 = fieldWeight in 3875, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.375 = fieldNorm(doc=3875)
    
  2. Pera, M.S.; Ng, Y.-K.: SpamED : a spam E-mail detection approach based on phrase similarity (2009) 4.16
    4.1581063 = sum of:
      4.1581063 = weight(author_txt:pera in 3721) [ClassicSimilarity], result of:
        4.1581063 = fieldWeight in 3721, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.4375 = fieldNorm(doc=3721)
    
  3. Azpiazu, I.M.; Soledad Pera, M.: Is cross-lingual readability assessment possible? (2020) 4.16
    4.1581063 = sum of:
      4.1581063 = weight(author_txt:pera in 868) [ClassicSimilarity], result of:
        4.1581063 = fieldWeight in 868, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.4375 = fieldNorm(doc=868)
    
  4. Pera, M.S.; Lund, W.; Ng, Y.-K.: ¬A sophisticated library search strategy using folksonomies and similarity matching (2009) 3.56
    3.5640912 = sum of:
      3.5640912 = weight(author_txt:pera in 3939) [ClassicSimilarity], result of:
        3.5640912 = fieldWeight in 3939, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.375 = fieldNorm(doc=3939)
    
  5. Soledad Pera, M.; Ng, Y.-K.: Recommending books to be exchanged online in the absence of wish lists (2018) 3.56
    3.5640912 = sum of:
      3.5640912 = weight(author_txt:pera in 182) [ClassicSimilarity], result of:
        3.5640912 = fieldWeight in 182, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.375 = fieldNorm(doc=182)
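
The author scores above can be recomputed by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the function names are illustrative and not part of any Lucene API, and fieldNorm is taken as given from the listing rather than derived.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain output."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Entry 1 (doc 3875): "pera" occurs twice in the author field.
entry_1 = field_weight(freq=2.0, doc_freq=8, max_docs=44421, field_norm=0.375)

# Entry 2 (doc 3721): "pera" occurs once, with a larger fieldNorm.
entry_2 = field_weight(freq=1.0, doc_freq=8, max_docs=44421, field_norm=0.4375)

print(f"{entry_1:.4f} {entry_2:.4f}")
```

Under these assumptions the two values agree with the 5.0403857 and 4.1581063 shown in the listing, up to floating-point rounding. Because the query term and idf are the same for every entry, the ranking here is driven by term frequency and fieldNorm alone.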
    

Similar documents (content)

  1. Leroy, G.; Miller, T.; Rosemblat, G.; Browne, A.: ¬A balanced approach to health information evaluation : a vocabulary-based naïve Bayes classifier and readability formulas (2008) 0.41
    0.41071427 = sum of:
      0.41071427 = product of:
        1.4668367 = sum of:
          0.025725141 = weight(abstract_txt:since in 2998) [ClassicSimilarity], result of:
            0.025725141 = score(doc=2998,freq=1.0), product of:
              0.0673953 = queryWeight, product of:
                1.1158643 = boost
                4.8858275 = idf(docFreq=911, maxDocs=44421)
                0.012361754 = queryNorm
              0.38170528 = fieldWeight in 2998, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8858275 = idf(docFreq=911, maxDocs=44421)
                0.078125 = fieldNorm(doc=2998)
          0.04420968 = weight(abstract_txt:levels in 2998) [ClassicSimilarity], result of:
            0.04420968 = score(doc=2998,freq=2.0), product of:
              0.076746635 = queryWeight, product of:
                1.1907655 = boost
                5.2137837 = idf(docFreq=656, maxDocs=44421)
                0.012361754 = queryNorm
              0.5760472 = fieldWeight in 2998, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2137837 = idf(docFreq=656, maxDocs=44421)
                0.078125 = fieldNorm(doc=2998)
          0.013641365 = weight(abstract_txt:which in 2998) [ClassicSimilarity], result of:
            0.013641365 = score(doc=2998,freq=1.0), product of:
              0.05992522 = queryWeight, product of:
                1.6636862 = boost
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.012361754 = queryNorm
              0.2276398 = fieldWeight in 2998, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.078125 = fieldNorm(doc=2998)
          0.14975728 = weight(abstract_txt:grade in 2998) [ClassicSimilarity], result of:
            0.14975728 = score(doc=2998,freq=2.0), product of:
              0.17310372 = queryWeight, product of:
                1.7883387 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.012361754 = queryNorm
              0.86513036 = fieldWeight in 2998, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.078125 = fieldNorm(doc=2998)
          0.1762698 = weight(abstract_txt:formulas in 2998) [ClassicSimilarity], result of:
            0.1762698 = score(doc=2998,freq=2.0), product of:
              0.1929744 = queryWeight, product of:
                1.8881931 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.012361754 = queryNorm
              0.9134362 = fieldWeight in 2998, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.078125 = fieldNorm(doc=2998)
          0.060100753 = weight(abstract_txt:level in 2998) [ClassicSimilarity], result of:
            0.060100753 = score(doc=2998,freq=1.0), product of:
              0.17113997 = queryWeight, product of:
                3.0798738 = boost
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.012361754 = queryNorm
              0.35117894 = fieldWeight in 2998, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.078125 = fieldNorm(doc=2998)
          0.9971326 = weight(abstract_txt:readability in 2998) [ClassicSimilarity], result of:
            0.9971326 = score(doc=2998,freq=4.0), product of:
              0.7718976 = queryWeight, product of:
                7.5527725 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.012361754 = queryNorm
              1.2917938 = fieldWeight in 2998, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.078125 = fieldNorm(doc=2998)
        0.28 = coord(7/25)
    
  2. Azpiazu, I.M.; Soledad Pera, M.: Is cross-lingual readability assessment possible? (2020) 0.21
    0.20555821 = sum of:
      0.20555821 = product of:
        1.027791 = sum of:
          0.021882676 = weight(abstract_txt:levels in 868) [ClassicSimilarity], result of:
            0.021882676 = score(doc=868,freq=1.0), product of:
              0.076746635 = queryWeight, product of:
                1.1907655 = boost
                5.2137837 = idf(docFreq=656, maxDocs=44421)
                0.012361754 = queryNorm
              0.2851288 = fieldWeight in 868, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2137837 = idf(docFreq=656, maxDocs=44421)
                0.0546875 = fieldNorm(doc=868)
          0.09942574 = weight(abstract_txt:prediction in 868) [ClassicSimilarity], result of:
            0.09942574 = score(doc=868,freq=3.0), product of:
              0.14597704 = queryWeight, product of:
                1.6422484 = boost
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.012361754 = queryNorm
              0.6811054 = fieldWeight in 868, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.190608 = idf(docFreq=90, maxDocs=44421)
                0.0546875 = fieldNorm(doc=868)
          0.009548955 = weight(abstract_txt:which in 868) [ClassicSimilarity], result of:
            0.009548955 = score(doc=868,freq=1.0), product of:
              0.05992522 = queryWeight, product of:
                1.6636862 = boost
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.012361754 = queryNorm
              0.15934785 = fieldWeight in 868, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.0546875 = fieldNorm(doc=868)
          0.042070527 = weight(abstract_txt:level in 868) [ClassicSimilarity], result of:
            0.042070527 = score(doc=868,freq=1.0), product of:
              0.17113997 = queryWeight, product of:
                3.0798738 = boost
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.012361754 = queryNorm
              0.24582526 = fieldWeight in 868, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.0546875 = fieldNorm(doc=868)
          0.85486317 = weight(abstract_txt:readability in 868) [ClassicSimilarity], result of:
            0.85486317 = score(doc=868,freq=6.0), product of:
              0.7718976 = queryWeight, product of:
                7.5527725 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.012361754 = queryNorm
              1.1074826 = fieldWeight in 868, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0546875 = fieldNorm(doc=868)
        0.2 = coord(5/25)
    
  3. Collins-Thompson, K.; Callan, J.: Predicting reading difficulty with statistical language models (2005) 0.17
    0.1727257 = sum of:
      0.1727257 = product of:
        0.8636285 = sum of:
          0.035367746 = weight(abstract_txt:levels in 5579) [ClassicSimilarity], result of:
            0.035367746 = score(doc=5579,freq=2.0), product of:
              0.076746635 = queryWeight, product of:
                1.1907655 = boost
                5.2137837 = idf(docFreq=656, maxDocs=44421)
                0.012361754 = queryNorm
              0.46083772 = fieldWeight in 5579, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2137837 = idf(docFreq=656, maxDocs=44421)
                0.0625 = fieldNorm(doc=5579)
          0.07639528 = weight(abstract_txt:reading in 5579) [ClassicSimilarity], result of:
            0.07639528 = score(doc=5579,freq=4.0), product of:
              0.101786174 = queryWeight, product of:
                1.3713268 = boost
                6.004374 = idf(docFreq=297, maxDocs=44421)
                0.012361754 = queryNorm
              0.75054675 = fieldWeight in 5579, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.004374 = idf(docFreq=297, maxDocs=44421)
                0.0625 = fieldNorm(doc=5579)
          0.11980583 = weight(abstract_txt:grade in 5579) [ClassicSimilarity], result of:
            0.11980583 = score(doc=5579,freq=2.0), product of:
              0.17310372 = queryWeight, product of:
                1.7883387 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.012361754 = queryNorm
              0.6921043 = fieldWeight in 5579, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0625 = fieldNorm(doc=5579)
          0.06799624 = weight(abstract_txt:level in 5579) [ClassicSimilarity], result of:
            0.06799624 = score(doc=5579,freq=2.0), product of:
              0.17113997 = queryWeight, product of:
                3.0798738 = boost
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.012361754 = queryNorm
              0.39731362 = fieldWeight in 5579, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.0625 = fieldNorm(doc=5579)
          0.5640634 = weight(abstract_txt:readability in 5579) [ClassicSimilarity], result of:
            0.5640634 = score(doc=5579,freq=2.0), product of:
              0.7718976 = queryWeight, product of:
                7.5527725 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.012361754 = queryNorm
              0.73074895 = fieldWeight in 5579, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0625 = fieldNorm(doc=5579)
        0.2 = coord(5/25)
    
  4. Kauchak, D.; Leroy, G.; Hogue, A.: Measuring text difficulty using parse-tree frequency (2017) 0.17
    0.17027612 = sum of:
      0.17027612 = product of:
        0.8513806 = sum of:
          0.018137336 = weight(abstract_txt:analysis in 4786) [ClassicSimilarity], result of:
            0.018137336 = score(doc=4786,freq=2.0), product of:
              0.05628602 = queryWeight, product of:
                1.2489425 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.012361754 = queryNorm
              0.3222352 = fieldWeight in 4786, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=4786)
          0.14101584 = weight(abstract_txt:formulas in 4786) [ClassicSimilarity], result of:
            0.14101584 = score(doc=4786,freq=2.0), product of:
              0.1929744 = queryWeight, product of:
                1.8881931 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.012361754 = queryNorm
              0.73074895 = fieldWeight in 4786, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0625 = fieldNorm(doc=4786)
          0.060167834 = weight(abstract_txt:readers in 4786) [ClassicSimilarity], result of:
            0.060167834 = score(doc=4786,freq=1.0), product of:
              0.15773852 = queryWeight, product of:
                2.0907931 = boost
                6.1030455 = idf(docFreq=269, maxDocs=44421)
                0.012361754 = queryNorm
              0.38144034 = fieldWeight in 4786, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1030455 = idf(docFreq=269, maxDocs=44421)
                0.0625 = fieldNorm(doc=4786)
          0.06799624 = weight(abstract_txt:level in 4786) [ClassicSimilarity], result of:
            0.06799624 = score(doc=4786,freq=2.0), product of:
              0.17113997 = queryWeight, product of:
                3.0798738 = boost
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.012361754 = queryNorm
              0.39731362 = fieldWeight in 4786, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4950905 = idf(docFreq=1347, maxDocs=44421)
                0.0625 = fieldNorm(doc=4786)
          0.5640634 = weight(abstract_txt:readability in 4786) [ClassicSimilarity], result of:
            0.5640634 = score(doc=4786,freq=2.0), product of:
              0.7718976 = queryWeight, product of:
                7.5527725 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.012361754 = queryNorm
              0.73074895 = fieldWeight in 4786, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0625 = fieldNorm(doc=4786)
        0.2 = coord(5/25)
    
  5. Jiang, Z.; Gu, Q.; Yin, Y.; Wang, J.; Chen, D.: GRAW+ : a two-view graph propagation method with word coupling for readability assessment (2019) 0.14
    0.14274687 = sum of:
      0.14274687 = product of:
        0.8921679 = sum of:
          0.025008772 = weight(abstract_txt:levels in 218) [ClassicSimilarity], result of:
            0.025008772 = score(doc=218,freq=1.0), product of:
              0.076746635 = queryWeight, product of:
                1.1907655 = boost
                5.2137837 = idf(docFreq=656, maxDocs=44421)
                0.012361754 = queryNorm
              0.32586148 = fieldWeight in 218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2137837 = idf(docFreq=656, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.054019623 = weight(abstract_txt:reading in 218) [ClassicSimilarity], result of:
            0.054019623 = score(doc=218,freq=2.0), product of:
              0.101786174 = queryWeight, product of:
                1.3713268 = boost
                6.004374 = idf(docFreq=297, maxDocs=44421)
                0.012361754 = queryNorm
              0.5307167 = fieldWeight in 218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.004374 = idf(docFreq=297, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.015433443 = weight(abstract_txt:which in 218) [ClassicSimilarity], result of:
            0.015433443 = score(doc=218,freq=2.0), product of:
              0.05992522 = queryWeight, product of:
                1.6636862 = boost
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.012361754 = queryNorm
              0.25754502 = fieldWeight in 218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9137893 = idf(docFreq=6552, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
          0.79770607 = weight(abstract_txt:readability in 218) [ClassicSimilarity], result of:
            0.79770607 = score(doc=218,freq=4.0), product of:
              0.7718976 = queryWeight, product of:
                7.5527725 = boost
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.012361754 = queryNorm
              1.0334351 = fieldWeight in 218, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.267481 = idf(docFreq=30, maxDocs=44421)
                0.0625 = fieldNorm(doc=218)
        0.16 = coord(4/25)
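
The content scores combine several matching terms. As a rough sketch under the same ClassicSimilarity assumptions (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), each term contributes queryWeight * fieldWeight, and the sum is scaled by the coord factor, the fraction of query terms that matched. The helper names are hypothetical; boost, queryNorm, and fieldNorm are copied from the listing for entry 5 (doc 218).

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.012361754  # copied from the explain output


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    """Score of one term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (term, freq, docFreq, boost) for the four terms matched in doc 218.
matched_terms = [
    ("levels", 1.0, 656, 1.1907655),
    ("reading", 2.0, 297, 1.3713268),
    ("which", 2.0, 6552, 1.6636862),
    ("readability", 4.0, 30, 7.5527725),
]
FIELD_NORM = 0.0625  # fieldNorm(doc=218)

raw_sum = sum(term_score(f, df, b, FIELD_NORM) for _, f, df, b in matched_terms)
coord = 4 / 25  # 4 of the 25 query terms matched
final_score = raw_sum * coord

print(f"{raw_sum:.4f} {final_score:.4f}")
```

Under these assumptions the result agrees with the 0.8921679 sum and the 0.14274687 final score shown for entry 5. The coord factor explains why entry 1, which matches 7 of 25 terms, outranks documents whose raw sums are comparable but which match fewer terms.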