Document (#37505)

Author
McArthur, D.
Crompton, H.
Title
Understanding public-access cyberlearning projects using text mining and topic analysis
Source
Journal of the American Society for Information Science and Technology. 63(2012) no.11, S.2146-2152
Year
2012
Abstract
The federal government has encouraged open access to publicly funded federal science research results, but it is unclear what knowledge can be gleaned from them and how the knowledge can be used to improve scientific research and shape federal research policies. In this article, we present the results of a preliminary study of cyberlearning projects funded by the National Science Foundation (NSF) that address these issues. Our work demonstrates that text-mining tools can be used to partially automate the process of finding NSF's cyberlearning awards and characterizing the fine-grained topics implicit in award abstracts. The methodology we have established to assess NSF's cyberlearning investments should generalize to other areas of research and other repositories of public-access documents.

Similar documents (content)

  1. Zia, L.L.: ¬The NSF National Science, Technology, Engineering, and Mathematics Education Digital Library (NSDL) Program : new projects from fiscal year 2004 (2005) 0.19
    0.19184914 = sum of:
      0.19184914 = product of:
        0.5329143 = sum of:
          0.010465263 = weight(abstract_txt:other in 2221) [ClassicSimilarity], result of:
            0.010465263 = score(doc=2221,freq=1.0), product of:
              0.07614067 = queryWeight, product of:
                1.0114939 = boost
                3.5186288 = idf(docFreq=3578, maxDocs=44421)
                0.021393407 = queryNorm
              0.13744643 = fieldWeight in 2221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5186288 = idf(docFreq=3578, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2221)
          0.073815584 = weight(abstract_txt:encouraged in 2221) [ClassicSimilarity], result of:
            0.073815584 = score(doc=2221,freq=2.0), product of:
              0.17641221 = queryWeight, product of:
                1.08869 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.021393407 = queryNorm
              0.41842672 = fieldWeight in 2221, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2221)
          0.027430894 = weight(abstract_txt:science in 2221) [ClassicSimilarity], result of:
            0.027430894 = score(doc=2221,freq=4.0), product of:
              0.09118496 = queryWeight, product of:
                1.1069207 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.021393407 = queryNorm
              0.30082697 = fieldWeight in 2221, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2221)
          0.08907325 = weight(abstract_txt:award in 2221) [ClassicSimilarity], result of:
            0.08907325 = score(doc=2221,freq=2.0), product of:
              0.19995308 = queryWeight, product of:
                1.1590549 = boost
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.021393407 = queryNorm
              0.44547075 = fieldWeight in 2221, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2221)
          0.14809869 = weight(abstract_txt:awards in 2221) [ClassicSimilarity], result of:
            0.14809869 = score(doc=2221,freq=3.0), product of:
              0.24515098 = queryWeight, product of:
                1.2833844 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.021393407 = queryNorm
              0.60411215 = fieldWeight in 2221, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2221)
          0.023430355 = weight(abstract_txt:public in 2221) [ClassicSimilarity], result of:
            0.023430355 = score(doc=2221,freq=1.0), product of:
              0.13030742 = queryWeight, product of:
                1.3232427 = boost
                4.603092 = idf(docFreq=1209, maxDocs=44421)
                0.021393407 = queryNorm
              0.17980829 = fieldWeight in 2221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.603092 = idf(docFreq=1209, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2221)
          0.112752505 = weight(abstract_txt:projects in 2221) [ClassicSimilarity], result of:
            0.112752505 = score(doc=2221,freq=10.0), product of:
              0.17239872 = queryWeight, product of:
                1.5220256 = boost
                5.2945876 = idf(docFreq=605, maxDocs=44421)
                0.021393407 = queryNorm
              0.65402174 = fieldWeight in 2221, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.2945876 = idf(docFreq=605, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2221)
          0.017538251 = weight(abstract_txt:access in 2221) [ClassicSimilarity], result of:
            0.017538251 = score(doc=2221,freq=1.0), product of:
              0.12297151 = queryWeight, product of:
                1.5743556 = boost
                3.6510832 = idf(docFreq=3134, maxDocs=44421)
                0.021393407 = queryNorm
              0.14262044 = fieldWeight in 2221, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6510832 = idf(docFreq=3134, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2221)
          0.03030949 = weight(abstract_txt:research in 2221) [ClassicSimilarity], result of:
            0.03030949 = score(doc=2221,freq=4.0), product of:
              0.12278887 = queryWeight, product of:
                1.8165587 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.021393407 = queryNorm
              0.24684234 = fieldWeight in 2221, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2221)
        0.36 = coord(9/25)
    
  2. McKrell, L.; Green, A.; Harris, K.: Libraries and community development national survey (1997) 0.16
    0.16152184 = sum of:
      0.16152184 = product of:
        0.5768637 = sum of:
          0.024270102 = weight(abstract_txt:results in 3984) [ClassicSimilarity], result of:
            0.024270102 = score(doc=3984,freq=1.0), product of:
              0.07442008 = queryWeight, product of:
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.021393407 = queryNorm
              0.32612303 = fieldWeight in 3984, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.09375 = fieldNorm(doc=3984)
          0.025116634 = weight(abstract_txt:other in 3984) [ClassicSimilarity], result of:
            0.025116634 = score(doc=3984,freq=1.0), product of:
              0.07614067 = queryWeight, product of:
                1.0114939 = boost
                3.5186288 = idf(docFreq=3578, maxDocs=44421)
                0.021393407 = queryNorm
              0.32987145 = fieldWeight in 3984, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5186288 = idf(docFreq=3578, maxDocs=44421)
                0.09375 = fieldNorm(doc=3984)
          0.15116233 = weight(abstract_txt:award in 3984) [ClassicSimilarity], result of:
            0.15116233 = score(doc=3984,freq=1.0), product of:
              0.19995308 = queryWeight, product of:
                1.1590549 = boost
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.021393407 = queryNorm
              0.75598896 = fieldWeight in 3984, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.09375 = fieldNorm(doc=3984)
          0.07952526 = weight(abstract_txt:public in 3984) [ClassicSimilarity], result of:
            0.07952526 = score(doc=3984,freq=2.0), product of:
              0.13030742 = queryWeight, product of:
                1.3232427 = boost
                4.603092 = idf(docFreq=1209, maxDocs=44421)
                0.021393407 = queryNorm
              0.6102896 = fieldWeight in 3984, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.603092 = idf(docFreq=1209, maxDocs=44421)
                0.09375 = fieldNorm(doc=3984)
          0.08557313 = weight(abstract_txt:projects in 3984) [ClassicSimilarity], result of:
            0.08557313 = score(doc=3984,freq=1.0), product of:
              0.17239872 = queryWeight, product of:
                1.5220256 = boost
                5.2945876 = idf(docFreq=605, maxDocs=44421)
                0.021393407 = queryNorm
              0.49636757 = fieldWeight in 3984, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2945876 = idf(docFreq=605, maxDocs=44421)
                0.09375 = fieldNorm(doc=3984)
          0.051436912 = weight(abstract_txt:research in 3984) [ClassicSimilarity], result of:
            0.051436912 = score(doc=3984,freq=2.0), product of:
              0.12278887 = queryWeight, product of:
                1.8165587 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.021393407 = queryNorm
              0.41890535 = fieldWeight in 3984, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.09375 = fieldNorm(doc=3984)
          0.15977935 = weight(abstract_txt:funded in 3984) [ClassicSimilarity], result of:
            0.15977935 = score(doc=3984,freq=1.0), product of:
              0.2614104 = queryWeight, product of:
                1.8742018 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.021393407 = queryNorm
              0.61122036 = fieldWeight in 3984, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.09375 = fieldNorm(doc=3984)
        0.28 = coord(7/25)
    
  3. Miao, Q.; Li, Q.; Zeng, D.: Fine-grained opinion mining by integrating multiple review sources (2010) 0.13
    0.12854348 = sum of:
      0.12854348 = product of:
        0.53559786 = sum of:
          0.024270102 = weight(abstract_txt:results in 104) [ClassicSimilarity], result of:
            0.024270102 = score(doc=104,freq=1.0), product of:
              0.07442008 = queryWeight, product of:
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.021393407 = queryNorm
              0.32612303 = fieldWeight in 104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.09375 = fieldNorm(doc=104)
          0.036367755 = weight(abstract_txt:knowledge in 104) [ClassicSimilarity], result of:
            0.036367755 = score(doc=104,freq=2.0), product of:
              0.07734699 = queryWeight, product of:
                1.0194751 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.021393407 = queryNorm
              0.4701897 = fieldWeight in 104, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.09375 = fieldNorm(doc=104)
          0.10620936 = weight(abstract_txt:fine in 104) [ClassicSimilarity], result of:
            0.10620936 = score(doc=104,freq=1.0), product of:
              0.15803051 = queryWeight, product of:
                1.0304108 = boost
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.021393407 = queryNorm
              0.67208135 = fieldWeight in 104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.09375 = fieldNorm(doc=104)
          0.1406705 = weight(abstract_txt:grained in 104) [ClassicSimilarity], result of:
            0.1406705 = score(doc=104,freq=1.0), product of:
              0.19059043 = queryWeight, product of:
                1.1315936 = boost
                7.872826 = idf(docFreq=45, maxDocs=44421)
                0.021393407 = queryNorm
              0.73807746 = fieldWeight in 104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.872826 = idf(docFreq=45, maxDocs=44421)
                0.09375 = fieldNorm(doc=104)
          0.19170874 = weight(abstract_txt:mining in 104) [ClassicSimilarity], result of:
            0.19170874 = score(doc=104,freq=2.0), product of:
              0.23427558 = queryWeight, product of:
                1.7742648 = boost
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.021393407 = queryNorm
              0.8183044 = fieldWeight in 104, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.09375 = fieldNorm(doc=104)
          0.036371388 = weight(abstract_txt:research in 104) [ClassicSimilarity], result of:
            0.036371388 = score(doc=104,freq=1.0), product of:
              0.12278887 = queryWeight, product of:
                1.8165587 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.021393407 = queryNorm
              0.2962108 = fieldWeight in 104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.09375 = fieldNorm(doc=104)
        0.24 = coord(6/25)
    
  4. Rusch-Feja, D.; Becker, H.J.: Global Info : the German digital libraries project (1999) 0.13
    0.1279697 = sum of:
      0.1279697 = product of:
        0.45703465 = sum of:
          0.012135051 = weight(abstract_txt:results in 2242) [ClassicSimilarity], result of:
            0.012135051 = score(doc=2242,freq=1.0), product of:
              0.07442008 = queryWeight, product of:
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.021393407 = queryNorm
              0.16306151 = fieldWeight in 2242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.046875 = fieldNorm(doc=2242)
          0.01776014 = weight(abstract_txt:other in 2242) [ClassicSimilarity], result of:
            0.01776014 = score(doc=2242,freq=2.0), product of:
              0.07614067 = queryWeight, product of:
                1.0114939 = boost
                3.5186288 = idf(docFreq=3578, maxDocs=44421)
                0.021393407 = queryNorm
              0.23325433 = fieldWeight in 2242, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.5186288 = idf(docFreq=3578, maxDocs=44421)
                0.046875 = fieldNorm(doc=2242)
          0.016458536 = weight(abstract_txt:science in 2242) [ClassicSimilarity], result of:
            0.016458536 = score(doc=2242,freq=1.0), product of:
              0.09118496 = queryWeight, product of:
                1.1069207 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.021393407 = queryNorm
              0.18049617 = fieldWeight in 2242, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.046875 = fieldNorm(doc=2242)
          0.060509343 = weight(abstract_txt:projects in 2242) [ClassicSimilarity], result of:
            0.060509343 = score(doc=2242,freq=2.0), product of:
              0.17239872 = queryWeight, product of:
                1.5220256 = boost
                5.2945876 = idf(docFreq=605, maxDocs=44421)
                0.021393407 = queryNorm
              0.35098487 = fieldWeight in 2242, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2945876 = idf(docFreq=605, maxDocs=44421)
                0.046875 = fieldNorm(doc=2242)
          0.029763397 = weight(abstract_txt:access in 2242) [ClassicSimilarity], result of:
            0.029763397 = score(doc=2242,freq=2.0), product of:
              0.12297151 = queryWeight, product of:
                1.5743556 = boost
                3.6510832 = idf(docFreq=3134, maxDocs=44421)
                0.021393407 = queryNorm
              0.2420349 = fieldWeight in 2242, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6510832 = idf(docFreq=3134, maxDocs=44421)
                0.046875 = fieldNorm(doc=2242)
          0.04066445 = weight(abstract_txt:research in 2242) [ClassicSimilarity], result of:
            0.04066445 = score(doc=2242,freq=5.0), product of:
              0.12278887 = queryWeight, product of:
                1.8165587 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.021393407 = queryNorm
              0.33117375 = fieldWeight in 2242, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.046875 = fieldNorm(doc=2242)
          0.27974373 = weight(abstract_txt:federal in 2242) [ClassicSimilarity], result of:
            0.27974373 = score(doc=2242,freq=3.0), product of:
              0.47843838 = queryWeight, product of:
                3.1053715 = boost
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.021393407 = queryNorm
              0.5847017 = fieldWeight in 2242, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.046875 = fieldNorm(doc=2242)
        0.28 = coord(7/25)
    
  5. Jayroe, T.J.: ¬A humble servant : the work of Helen L. Brownson and the early years of information science research (2012) 0.12
    0.11862405 = sum of:
      0.11862405 = product of:
        0.5931202 = sum of:
          0.054861788 = weight(abstract_txt:science in 1458) [ClassicSimilarity], result of:
            0.054861788 = score(doc=1458,freq=4.0), product of:
              0.09118496 = queryWeight, product of:
                1.1069207 = boost
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.021393407 = queryNorm
              0.60165393 = fieldWeight in 1458, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.850585 = idf(docFreq=2567, maxDocs=44421)
                0.078125 = fieldNorm(doc=1458)
          0.100848906 = weight(abstract_txt:projects in 1458) [ClassicSimilarity], result of:
            0.100848906 = score(doc=1458,freq=2.0), product of:
              0.17239872 = queryWeight, product of:
                1.5220256 = boost
                5.2945876 = idf(docFreq=605, maxDocs=44421)
                0.021393407 = queryNorm
              0.5849748 = fieldWeight in 1458, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2945876 = idf(docFreq=605, maxDocs=44421)
                0.078125 = fieldNorm(doc=1458)
          0.035076503 = weight(abstract_txt:access in 1458) [ClassicSimilarity], result of:
            0.035076503 = score(doc=1458,freq=1.0), product of:
              0.12297151 = queryWeight, product of:
                1.5743556 = boost
                3.6510832 = idf(docFreq=3134, maxDocs=44421)
                0.021393407 = queryNorm
              0.2852409 = fieldWeight in 1458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6510832 = idf(docFreq=3134, maxDocs=44421)
                0.078125 = fieldNorm(doc=1458)
          0.13314946 = weight(abstract_txt:funded in 1458) [ClassicSimilarity], result of:
            0.13314946 = score(doc=1458,freq=1.0), product of:
              0.2614104 = queryWeight, product of:
                1.8742018 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.021393407 = queryNorm
              0.5093503 = fieldWeight in 1458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.078125 = fieldNorm(doc=1458)
          0.26918355 = weight(abstract_txt:federal in 1458) [ClassicSimilarity], result of:
            0.26918355 = score(doc=1458,freq=1.0), product of:
              0.47843838 = queryWeight, product of:
                3.1053715 = boost
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.021393407 = queryNorm
              0.5626295 = fieldWeight in 1458, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.078125 = fieldNorm(doc=1458)
        0.2 = coord(5/25)