Document (#39773)

Author
Denning, J.
Pera, M.S.
Ng, Y.-K.
Title
¬A readability level prediction tool for K-12 books
Source
Journal of the Association for Information Science and Technology. 67(2016) no.3, pp.550-565
Year
2016
Abstract
The readability levels of books identify suitable reading materials. Unfortunately, the majority of published books are assigned a readability level range, which is not useful to readers who look for books at a particular grade level. Existing readability formulas/analysis tools require at least an excerpt of a book to estimate its readability level, which is a severe constraint, since copyright laws prohibit book contents from being made publicly accessible. To alleviate the constraint, we have developed TRoLL, which relies on publicly accessible online book metadata, in addition to using a book's snippet, if it is available, to predict its readability level. Based on a multi-dimensional regression analysis, TRoLL determines the grade level of any book instantly, even without a sample of its text, and considers its topical suitability, which is unique. Furthermore, TRoLL is a significant contribution to the educational community, since its computed book readability levels can enrich K-12 readers' book selections and aid parents, teachers, and librarians in locating reading materials suitable for their K-12 readers, which can be a time-consuming and frustrating task that does not always yield a quality outcome. Empirical studies have verified the prediction accuracy of TRoLL and demonstrated its superiority over well-known readability formulas/analysis tools.
Content
Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23417/abstract.

Similar documents (author)

  1. Pera, M. Soledad => Soledad Pera, M.: 5.04
    5.0379567 = sum of:
      5.0379567 = weight(author_txt:pera in 3876) [ClassicSimilarity], result of:
        5.0379567 = fieldWeight in 3876, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.375 = fieldNorm(doc=3876)
    
  2. Pera, M.S.; Ng, Y.-K.: SpamED : a spam E-mail detection approach based on phrase similarity (2009) 4.16
    4.156102 = sum of:
      4.156102 = weight(author_txt:pera in 2721) [ClassicSimilarity], result of:
        4.156102 = fieldWeight in 2721, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.4375 = fieldNorm(doc=2721)
    
  3. Azpiazu, I.M.; Soledad Pera, M.: Is cross-lingual readability assessment possible? (2020) 4.16
    4.156102 = sum of:
      4.156102 = weight(author_txt:pera in 5868) [ClassicSimilarity], result of:
        4.156102 = fieldWeight in 5868, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.4375 = fieldNorm(doc=5868)
    
  4. Pera, M.S.; Lund, W.; Ng, Y.-K.: ¬A sophisticated library search strategy using folksonomies and similarity matching (2009) 3.56
    3.5623734 = sum of:
      3.5623734 = weight(author_txt:pera in 2939) [ClassicSimilarity], result of:
        3.5623734 = fieldWeight in 2939, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.375 = fieldNorm(doc=2939)
    
  5. Soledad Pera, M.; Ng, Y.-K.: Recommending books to be exchanged online in the absence of wish lists (2018) 3.56
    3.5623734 = sum of:
      3.5623734 = weight(author_txt:pera in 4182) [ClassicSimilarity], result of:
        3.5623734 = fieldWeight in 4182, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.499662 = idf(docFreq=8, maxDocs=44218)
          0.375 = fieldNorm(doc=4182)
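
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming the listing follows the classic TF-IDF scheme that the `ClassicSimilarity` label suggests: tf is the square root of the term frequency, and idf is 1 + ln(maxDocs / (docFreq + 1)), a form that is consistent with the idf values printed in the listing. The function names are illustrative and are not the search engine's API; the fieldNorm values are taken from the listing as given rather than derived.

```python
import math

def tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term frequency."""
    return math.sqrt(freq)

def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, in the form the listing appears to use (assumption)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the 'product of' lines of the listing."""
    return tf(freq) * idf(doc_freq, max_docs) * field_norm

MAX_DOCS = 44218  # maxDocs as printed in the listing

# Entry 1: termFreq=2.0, docFreq=8, fieldNorm=0.375
entry_1 = field_weight(2.0, 8, MAX_DOCS, 0.375)
# Entries 2 and 3: termFreq=1.0, docFreq=8, fieldNorm=0.4375
entry_2 = field_weight(1.0, 8, MAX_DOCS, 0.4375)
# Entries 4 and 5: termFreq=1.0, docFreq=8, fieldNorm=0.375
entry_4 = field_weight(1.0, 8, MAX_DOCS, 0.375)

print(f"{entry_1:.4f} {entry_2:.4f} {entry_4:.4f}")
```

Under these assumptions the three values agree with the listed scores (5.0379567, 4.156102, 3.5623734) to roughly four decimal places; small differences in later digits are expected because the listing appears to use single-precision arithmetic.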
    

Similar documents (content)

  1. Leroy, G.; Miller, T.; Rosemblat, G.; Browne, A.: ¬A balanced approach to health information evaluation : a vocabulary-based naïve Bayes classifier and readability formulas (2008) 0.41
    0.41055045 = sum of:
      0.41055045 = product of:
        1.4662516 = sum of:
          0.025813987 = weight(abstract_txt:since in 1998) [ClassicSimilarity], result of:
            0.025813987 = score(doc=1998,freq=1.0), product of:
              0.067569554 = queryWeight, product of:
                1.117415 = boost
                4.890058 = idf(docFreq=903, maxDocs=44218)
                0.01236581 = queryNorm
              0.3820358 = fieldWeight in 1998, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.890058 = idf(docFreq=903, maxDocs=44218)
                0.078125 = fieldNorm(doc=1998)
          0.044403613 = weight(abstract_txt:levels in 1998) [ClassicSimilarity], result of:
            0.044403613 = score(doc=1998,freq=2.0), product of:
              0.07699276 = queryWeight, product of:
                1.1927897 = boost
                5.219915 = idf(docFreq=649, maxDocs=44218)
                0.01236581 = queryNorm
              0.5767245 = fieldWeight in 1998, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.219915 = idf(docFreq=649, maxDocs=44218)
                0.078125 = fieldNorm(doc=1998)
          0.013694144 = weight(abstract_txt:which in 1998) [ClassicSimilarity], result of:
            0.013694144 = score(doc=1998,freq=1.0), product of:
              0.06009674 = queryWeight, product of:
                1.6662279 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.01236581 = queryNorm
              0.22786833 = fieldWeight in 1998, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.078125 = fieldNorm(doc=1998)
          0.14962193 = weight(abstract_txt:grade in 1998) [ClassicSimilarity], result of:
            0.14962193 = score(doc=1998,freq=2.0), product of:
              0.1730485 = queryWeight, product of:
                1.7882279 = boost
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.01236581 = queryNorm
              0.8646243 = fieldWeight in 1998, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.078125 = fieldNorm(doc=1998)
          0.17612687 = weight(abstract_txt:formulas in 1998) [ClassicSimilarity], result of:
            0.17612687 = score(doc=1998,freq=2.0), product of:
              0.1929248 = queryWeight, product of:
                1.8881347 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.01236581 = queryNorm
              0.91293013 = fieldWeight in 1998, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.078125 = fieldNorm(doc=1998)
          0.06026704 = weight(abstract_txt:level in 1998) [ClassicSimilarity], result of:
            0.06026704 = score(doc=1998,freq=1.0), product of:
              0.17150415 = queryWeight, product of:
                3.08345 = boost
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.01236581 = queryNorm
              0.3514028 = fieldWeight in 1998, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.078125 = fieldNorm(doc=1998)
          0.996324 = weight(abstract_txt:readability in 1998) [ClassicSimilarity], result of:
            0.996324 = score(doc=1998,freq=4.0), product of:
              0.7716992 = queryWeight, product of:
                7.552539 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.01236581 = queryNorm
              1.2910782 = fieldWeight in 1998, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.078125 = fieldNorm(doc=1998)
        0.28 = coord(7/25)
    
  2. Azpiazu, I.M.; Soledad Pera, M.: Is cross-lingual readability assessment possible? (2020) 0.21
    0.20544834 = sum of:
      0.20544834 = product of:
        1.0272417 = sum of:
          0.021978669 = weight(abstract_txt:levels in 5868) [ClassicSimilarity], result of:
            0.021978669 = score(doc=5868,freq=1.0), product of:
              0.07699276 = queryWeight, product of:
                1.1927897 = boost
                5.219915 = idf(docFreq=649, maxDocs=44218)
                0.01236581 = queryNorm
              0.2854641 = fieldWeight in 5868, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.219915 = idf(docFreq=649, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5868)
          0.09932037 = weight(abstract_txt:prediction in 5868) [ClassicSimilarity], result of:
            0.09932037 = score(doc=5868,freq=3.0), product of:
              0.14591527 = queryWeight, product of:
                1.6420611 = boost
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.01236581 = queryNorm
              0.6806715 = fieldWeight in 5868, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1860275 = idf(docFreq=90, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5868)
          0.009585901 = weight(abstract_txt:which in 5868) [ClassicSimilarity], result of:
            0.009585901 = score(doc=5868,freq=1.0), product of:
              0.06009674 = queryWeight, product of:
                1.6662279 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.01236581 = queryNorm
              0.15950784 = fieldWeight in 5868, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5868)
          0.042186927 = weight(abstract_txt:level in 5868) [ClassicSimilarity], result of:
            0.042186927 = score(doc=5868,freq=1.0), product of:
              0.17150415 = queryWeight, product of:
                3.08345 = boost
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.01236581 = queryNorm
              0.24598196 = fieldWeight in 5868, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5868)
          0.8541699 = weight(abstract_txt:readability in 5868) [ClassicSimilarity], result of:
            0.8541699 = score(doc=5868,freq=6.0), product of:
              0.7716992 = queryWeight, product of:
                7.552539 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.01236581 = queryNorm
              1.106869 = fieldWeight in 5868, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0546875 = fieldNorm(doc=5868)
        0.2 = coord(5/25)
    
  3. Collins-Thompson, K.; Callan, J.: Predicting reading difficulty with statistical language models (2005) 0.17
    0.17265926 = sum of:
      0.17265926 = product of:
        0.8632963 = sum of:
          0.035522893 = weight(abstract_txt:levels in 4579) [ClassicSimilarity], result of:
            0.035522893 = score(doc=4579,freq=2.0), product of:
              0.07699276 = queryWeight, product of:
                1.1927897 = boost
                5.219915 = idf(docFreq=649, maxDocs=44218)
                0.01236581 = queryNorm
              0.46137965 = fieldWeight in 4579, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.219915 = idf(docFreq=649, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
          0.07628548 = weight(abstract_txt:reading in 4579) [ClassicSimilarity], result of:
            0.07628548 = score(doc=4579,freq=4.0), product of:
              0.10171748 = queryWeight, product of:
                1.3709978 = boost
                5.9997935 = idf(docFreq=297, maxDocs=44218)
                0.01236581 = queryNorm
              0.7499742 = fieldWeight in 4579, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.9997935 = idf(docFreq=297, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
          0.11969755 = weight(abstract_txt:grade in 4579) [ClassicSimilarity], result of:
            0.11969755 = score(doc=4579,freq=2.0), product of:
              0.1730485 = queryWeight, product of:
                1.7882279 = boost
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.01236581 = queryNorm
              0.69169945 = fieldWeight in 4579, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.825686 = idf(docFreq=47, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
          0.068184376 = weight(abstract_txt:level in 4579) [ClassicSimilarity], result of:
            0.068184376 = score(doc=4579,freq=2.0), product of:
              0.17150415 = queryWeight, product of:
                3.08345 = boost
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.01236581 = queryNorm
              0.39756688 = fieldWeight in 4579, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
          0.56360596 = weight(abstract_txt:readability in 4579) [ClassicSimilarity], result of:
            0.56360596 = score(doc=4579,freq=2.0), product of:
              0.7716992 = queryWeight, product of:
                7.552539 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.01236581 = queryNorm
              0.7303441 = fieldWeight in 4579, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0625 = fieldNorm(doc=4579)
        0.2 = coord(5/25)
    
  4. Kauchak, D.; Leroy, G.; Hogue, A.: Measuring text difficulty using parse-tree frequency (2017) 0.17
    0.17020921 = sum of:
      0.17020921 = product of:
        0.851046 = sum of:
          0.018270597 = weight(abstract_txt:analysis in 3786) [ClassicSimilarity], result of:
            0.018270597 = score(doc=3786,freq=2.0), product of:
              0.056577433 = queryWeight, product of:
                1.2522937 = boost
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.01236581 = queryNorm
              0.3229308 = fieldWeight in 3786, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6535451 = idf(docFreq=3112, maxDocs=44218)
                0.0625 = fieldNorm(doc=3786)
          0.14090149 = weight(abstract_txt:formulas in 3786) [ClassicSimilarity], result of:
            0.14090149 = score(doc=3786,freq=2.0), product of:
              0.1929248 = queryWeight, product of:
                1.8881347 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.01236581 = queryNorm
              0.7303441 = fieldWeight in 3786, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0625 = fieldNorm(doc=3786)
          0.060083605 = weight(abstract_txt:readers in 3786) [ClassicSimilarity], result of:
            0.060083605 = score(doc=3786,freq=1.0), product of:
              0.157636 = queryWeight, product of:
                2.0903177 = boost
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.01236581 = queryNorm
              0.3811541 = fieldWeight in 3786, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0984654 = idf(docFreq=269, maxDocs=44218)
                0.0625 = fieldNorm(doc=3786)
          0.068184376 = weight(abstract_txt:level in 3786) [ClassicSimilarity], result of:
            0.068184376 = score(doc=3786,freq=2.0), product of:
              0.17150415 = queryWeight, product of:
                3.08345 = boost
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.01236581 = queryNorm
              0.39756688 = fieldWeight in 3786, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.497956 = idf(docFreq=1337, maxDocs=44218)
                0.0625 = fieldNorm(doc=3786)
          0.56360596 = weight(abstract_txt:readability in 3786) [ClassicSimilarity], result of:
            0.56360596 = score(doc=3786,freq=2.0), product of:
              0.7716992 = queryWeight, product of:
                7.552539 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.01236581 = queryNorm
              0.7303441 = fieldWeight in 3786, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0625 = fieldNorm(doc=3786)
        0.2 = coord(5/25)
    
  5. Jiang, Z.; Gu, Q.; Yin, Y.; Wang, J.; Chen, D.: GRAW+ : a two-view graph propagation method with word coupling for readability assessment (2019) 0.14
    0.14265804 = sum of:
      0.14265804 = product of:
        0.89161277 = sum of:
          0.025118478 = weight(abstract_txt:levels in 5218) [ClassicSimilarity], result of:
            0.025118478 = score(doc=5218,freq=1.0), product of:
              0.07699276 = queryWeight, product of:
                1.1927897 = boost
                5.219915 = idf(docFreq=649, maxDocs=44218)
                0.01236581 = queryNorm
              0.32624468 = fieldWeight in 5218, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.219915 = idf(docFreq=649, maxDocs=44218)
                0.0625 = fieldNorm(doc=5218)
          0.053941984 = weight(abstract_txt:reading in 5218) [ClassicSimilarity], result of:
            0.053941984 = score(doc=5218,freq=2.0), product of:
              0.10171748 = queryWeight, product of:
                1.3709978 = boost
                5.9997935 = idf(docFreq=297, maxDocs=44218)
                0.01236581 = queryNorm
              0.5303118 = fieldWeight in 5218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9997935 = idf(docFreq=297, maxDocs=44218)
                0.0625 = fieldNorm(doc=5218)
          0.015493155 = weight(abstract_txt:which in 5218) [ClassicSimilarity], result of:
            0.015493155 = score(doc=5218,freq=2.0), product of:
              0.06009674 = queryWeight, product of:
                1.6662279 = boost
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.01236581 = queryNorm
              0.2578036 = fieldWeight in 5218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.9167147 = idf(docFreq=6503, maxDocs=44218)
                0.0625 = fieldNorm(doc=5218)
          0.7970592 = weight(abstract_txt:readability in 5218) [ClassicSimilarity], result of:
            0.7970592 = score(doc=5218,freq=4.0), product of:
              0.7716992 = queryWeight, product of:
                7.552539 = boost
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.01236581 = queryNorm
              1.0328625 = fieldWeight in 5218, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.2629 = idf(docFreq=30, maxDocs=44218)
                0.0625 = fieldNorm(doc=5218)
        0.16 = coord(4/25)
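
The content scores combine several query terms. As a hedged sketch of how one of them fits together, the code below recomputes the score of entry 5 (doc 5218) from the numbers printed in the listing, assuming each term contributes queryWeight × fieldWeight, with queryWeight = boost × idf × queryNorm and fieldWeight = tf × idf × fieldNorm, and that the sum is scaled by the coord factor (matched terms / query terms). The boost, queryNorm, and fieldNorm values are copied from the listing; the helper names are illustrative only.

```python
import math

MAX_DOCS = 44218        # maxDocs as printed in the listing
QUERY_NORM = 0.01236581 # queryNorm as printed in the listing

def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency, in the form the listing appears to use (assumption)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    """Score of one matched term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight

# (boost, docFreq, termFreq) for the four terms matched in doc 5218
matched_terms = {
    "levels":      (1.1927897, 649, 1.0),
    "reading":     (1.3709978, 297, 2.0),
    "which":       (1.6662279, 6503, 2.0),
    "readability": (7.552539, 30, 4.0),
}
FIELD_NORM = 0.0625   # fieldNorm(doc=5218)
QUERY_TERMS = 25      # denominator of coord(4/25)

term_scores = {
    term: term_score(boost, doc_freq, freq, FIELD_NORM)
    for term, (boost, doc_freq, freq) in matched_terms.items()
}
coord = len(matched_terms) / QUERY_TERMS
total = sum(term_scores.values()) * coord

print(f"coord={coord:.2f} total={total:.4f}")
```

The coord factor explains why a document that matches few of the 25 query terms scores low even when a single term, here "readability", contributes most of the sum.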