Document (#40325)

Author
Hook, P.A.
Title
Using course-subject co-occurrence (CSCO) to reveal the structure of an academic discipline : a framework to evaluate different inputs of a domain map
Source
Journal of the Association for Information Science and Technology. 68(2017) no.1, p.182-196
Year
2017
Abstract
This article proposes, exemplifies, and validates the use of course-subject co-occurrence (CSCO) data to generate topic maps of an academic discipline. A CSCO event occurs when two course-subjects are taught in the same academic year by the same teacher. A total of 61,856 CSCO events were extracted from the 2010-11 directory of the American Association of Law Schools and used to visualize the structure of law school education in the United States. Different normalization, ordination (layout), and clustering algorithms were compared, and the best-performing algorithm of each type was used to generate the final map. Validation studies demonstrate that CSCO produces topic maps that are consistent with expert opinion and four other indicators of the topical similarity of law school course-subjects. This research is the first to use CSCO to produce a visualization of a domain. It is also the first to use an expanded, multi-part gold standard to evaluate the validity of domain maps and the intermediate steps in their creation. It is suggested that the framework used herein may be adopted for other studies that compare different inputs of a domain map in order to empirically derive the best maps as measured against extrinsic sources of topical similarity (gold standards).
Content
Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23630/full.
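
The CSCO event defined in the abstract (two course-subjects taught in the same academic year by the same teacher) can be illustrated with a short counting sketch. This is not the article's implementation: the record layout, the function name `count_csco`, and the sample teachers and subjects are hypothetical and chosen only to show the idea.

```python
from collections import Counter
from itertools import combinations


def count_csco(teaching_records):
    """Count course-subject co-occurrence (CSCO) events.

    teaching_records: iterable of (teacher, academic_year, course_subject)
    tuples. This layout is an assumption made for illustration.
    Returns a Counter keyed by alphabetically ordered subject pairs.
    """
    # Group the distinct subjects each teacher taught in each academic year.
    subjects_by_teacher_year = {}
    for teacher, year, subject in teaching_records:
        subjects_by_teacher_year.setdefault((teacher, year), set()).add(subject)

    # Every unordered pair of subjects within one group is one CSCO event.
    pair_counts = Counter()
    for subjects in subjects_by_teacher_year.values():
        for pair in combinations(sorted(subjects), 2):
            pair_counts[pair] += 1
    return pair_counts


# Hypothetical sample data, not taken from the directory used in the article.
records = [
    ("Teacher A", "2010-11", "Contracts"),
    ("Teacher A", "2010-11", "Commercial Law"),
    ("Teacher A", "2010-11", "Bankruptcy"),
    ("Teacher B", "2010-11", "Contracts"),
    ("Teacher B", "2010-11", "Commercial Law"),
]
csco = count_csco(records)
for pair, count in sorted(csco.items()):
    print(pair, count)
```

The resulting pair counts form a symmetric co-occurrence matrix, which is the kind of input that normalization, layout, and clustering steps such as those named in the abstract would then operate on.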

Similar documents (content)

  1. Ku, C.-H.; Leroy, G.: A crime reports analysis system to identify related crimes (2011) 0.24
    0.24025607 = sum of:
      0.24025607 = product of:
        0.66737795 = sum of:
          0.011462997 = weight(abstract_txt:that in 629) [ClassicSimilarity], result of:
            0.011462997 = score(doc=629,freq=2.0), product of:
              0.05483829 = queryWeight, product of:
                1.0594544 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.021886805 = queryNorm
              0.20903271 = fieldWeight in 629, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=629)
          0.03267376 = weight(abstract_txt:same in 629) [ClassicSimilarity], result of:
            0.03267376 = score(doc=629,freq=1.0), product of:
              0.110243045 = queryWeight, product of:
                1.062187 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.021886805 = queryNorm
              0.29637933 = fieldWeight in 629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=629)
          0.038689245 = weight(abstract_txt:best in 629) [ClassicSimilarity], result of:
            0.038689245 = score(doc=629,freq=1.0), product of:
              0.123389624 = queryWeight, product of:
                1.123737 = boost
                5.0168557 = idf(docFreq=799, maxDocs=44421)
                0.021886805 = queryNorm
              0.31355348 = fieldWeight in 629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0168557 = idf(docFreq=799, maxDocs=44421)
                0.0625 = fieldNorm(doc=629)
          0.017390752 = weight(abstract_txt:used in 629) [ClassicSimilarity], result of:
            0.017390752 = score(doc=629,freq=1.0), product of:
              0.08288217 = queryWeight, product of:
                1.1279804 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.021886805 = queryNorm
              0.20982501 = fieldWeight in 629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=629)
          0.079740316 = weight(abstract_txt:evaluate in 629) [ClassicSimilarity], result of:
            0.079740316 = score(doc=629,freq=3.0), product of:
              0.1385575 = queryWeight, product of:
                1.1908042 = boost
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.021886805 = queryNorm
              0.57550347 = fieldWeight in 629, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.0625 = fieldNorm(doc=629)
          0.031860296 = weight(abstract_txt:different in 629) [ClassicSimilarity], result of:
            0.031860296 = score(doc=629,freq=2.0), product of:
              0.09849301 = queryWeight, product of:
                1.2296278 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.021886805 = queryNorm
              0.32347775 = fieldWeight in 629, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=629)
          0.18103826 = weight(abstract_txt:similarity in 629) [ClassicSimilarity], result of:
            0.18103826 = score(doc=629,freq=9.0), product of:
              0.16595279 = queryWeight, product of:
                1.3032197 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.021886805 = queryNorm
              1.0909022 = fieldWeight in 629, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=629)
          0.20971932 = weight(abstract_txt:gold in 629) [ClassicSimilarity], result of:
            0.20971932 = score(doc=629,freq=2.0), product of:
              0.30220437 = queryWeight, product of:
                1.758635 = boost
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.021886805 = queryNorm
              0.6939652 = fieldWeight in 629, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.0625 = fieldNorm(doc=629)
          0.06480301 = weight(abstract_txt:domain in 629) [ClassicSimilarity], result of:
            0.06480301 = score(doc=629,freq=1.0), product of:
              0.21925959 = queryWeight, product of:
                2.1184568 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.021886805 = queryNorm
              0.29555383 = fieldWeight in 629, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0625 = fieldNorm(doc=629)
        0.36 = coord(9/25)
    
  2. Klavans, R.; Boyack, K.W.: Identifying a better measure of relatedness for mapping science (2006) 0.18
    0.1758737 = sum of:
      0.1758737 = product of:
        0.5496053 = sum of:
          0.02875789 = weight(abstract_txt:framework in 252) [ClassicSimilarity], result of:
            0.02875789 = score(doc=252,freq=1.0), product of:
              0.101248786 = queryWeight, product of:
                1.0179355 = boost
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.021886805 = queryNorm
              0.28403196 = fieldWeight in 252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.0625 = fieldNorm(doc=252)
          0.0140392445 = weight(abstract_txt:that in 252) [ClassicSimilarity], result of:
            0.0140392445 = score(doc=252,freq=3.0), product of:
              0.05483829 = queryWeight, product of:
                1.0594544 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.021886805 = queryNorm
              0.25601172 = fieldWeight in 252, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=252)
          0.067011744 = weight(abstract_txt:best in 252) [ClassicSimilarity], result of:
            0.067011744 = score(doc=252,freq=3.0), product of:
              0.123389624 = queryWeight, product of:
                1.123737 = boost
                5.0168557 = idf(docFreq=799, maxDocs=44421)
                0.021886805 = queryNorm
              0.5430906 = fieldWeight in 252, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.0168557 = idf(docFreq=799, maxDocs=44421)
                0.0625 = fieldNorm(doc=252)
          0.024594238 = weight(abstract_txt:used in 252) [ClassicSimilarity], result of:
            0.024594238 = score(doc=252,freq=2.0), product of:
              0.08288217 = queryWeight, product of:
                1.1279804 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.021886805 = queryNorm
              0.29673737 = fieldWeight in 252, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=252)
          0.046038095 = weight(abstract_txt:evaluate in 252) [ClassicSimilarity], result of:
            0.046038095 = score(doc=252,freq=1.0), product of:
              0.1385575 = queryWeight, product of:
                1.1908042 = boost
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.021886805 = queryNorm
              0.33226708 = fieldWeight in 252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.316273 = idf(docFreq=592, maxDocs=44421)
                0.0625 = fieldNorm(doc=252)
          0.022528632 = weight(abstract_txt:different in 252) [ClassicSimilarity], result of:
            0.022528632 = score(doc=252,freq=1.0), product of:
              0.09849301 = queryWeight, product of:
                1.2296278 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.021886805 = queryNorm
              0.2287333 = fieldWeight in 252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=252)
          0.1692485 = weight(abstract_txt:inputs in 252) [ClassicSimilarity], result of:
            0.1692485 = score(doc=252,freq=1.0), product of:
              0.33004135 = queryWeight, product of:
                1.8378477 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.021886805 = queryNorm
              0.51281 = fieldWeight in 252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.0625 = fieldNorm(doc=252)
          0.17738695 = weight(abstract_txt:maps in 252) [ClassicSimilarity], result of:
            0.17738695 = score(doc=252,freq=2.0), product of:
              0.34053853 = queryWeight, product of:
                2.6401188 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.021886805 = queryNorm
              0.52090126 = fieldWeight in 252, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.0625 = fieldNorm(doc=252)
        0.32 = coord(8/25)
    
  3. Wang, P.: An empirical study of knowledge structures of research topics (1999) 0.16
    0.15751834 = sum of:
      0.15751834 = product of:
        0.5625655 = sum of:
          0.008105562 = weight(abstract_txt:that in 667) [ClassicSimilarity], result of:
            0.008105562 = score(doc=667,freq=1.0), product of:
              0.05483829 = queryWeight, product of:
                1.0594544 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.021886805 = queryNorm
              0.14780845 = fieldWeight in 667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=667)
          0.03267376 = weight(abstract_txt:same in 667) [ClassicSimilarity], result of:
            0.03267376 = score(doc=667,freq=1.0), product of:
              0.110243045 = queryWeight, product of:
                1.062187 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.021886805 = queryNorm
              0.29637933 = fieldWeight in 667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=667)
          0.068502255 = weight(abstract_txt:topic in 667) [ClassicSimilarity], result of:
            0.068502255 = score(doc=667,freq=3.0), product of:
              0.12521258 = queryWeight, product of:
                1.1320076 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.021886805 = queryNorm
              0.5470876 = fieldWeight in 667, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=667)
          0.05883656 = weight(abstract_txt:discipline in 667) [ClassicSimilarity], result of:
            0.05883656 = score(doc=667,freq=1.0), product of:
              0.16317363 = queryWeight, product of:
                1.2922612 = boost
                5.7692223 = idf(docFreq=376, maxDocs=44421)
                0.021886805 = queryNorm
              0.3605764 = fieldWeight in 667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7692223 = idf(docFreq=376, maxDocs=44421)
                0.0625 = fieldNorm(doc=667)
          0.067119144 = weight(abstract_txt:generate in 667) [ClassicSimilarity], result of:
            0.067119144 = score(doc=667,freq=1.0), product of:
              0.17814873 = queryWeight, product of:
                1.3502579 = boost
                6.0281444 = idf(docFreq=290, maxDocs=44421)
                0.021886805 = queryNorm
              0.37675902 = fieldWeight in 667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0281444 = idf(docFreq=290, maxDocs=44421)
                0.0625 = fieldNorm(doc=667)
          0.046854816 = weight(abstract_txt:academic in 667) [ClassicSimilarity], result of:
            0.046854816 = score(doc=667,freq=1.0), product of:
              0.16047907 = queryWeight, product of:
                1.5695682 = boost
                4.6714945 = idf(docFreq=1129, maxDocs=44421)
                0.021886805 = queryNorm
              0.2919684 = fieldWeight in 667, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6714945 = idf(docFreq=1129, maxDocs=44421)
                0.0625 = fieldNorm(doc=667)
          0.2804734 = weight(abstract_txt:maps in 667) [ClassicSimilarity], result of:
            0.2804734 = score(doc=667,freq=5.0), product of:
              0.34053853 = queryWeight, product of:
                2.6401188 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.021886805 = queryNorm
              0.8236173 = fieldWeight in 667, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.0625 = fieldNorm(doc=667)
        0.28 = coord(7/25)
    
  4. Buchel, O.; Coleman, A.: How can classificatory structures be used to improve science education? (2003) 0.15
    0.1537505 = sum of:
      0.1537505 = product of:
        0.6406271 = sum of:
          0.014184734 = weight(abstract_txt:that in 280) [ClassicSimilarity], result of:
            0.014184734 = score(doc=280,freq=1.0), product of:
              0.05483829 = queryWeight, product of:
                1.0594544 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.021886805 = queryNorm
              0.2586648 = fieldWeight in 280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.109375 = fieldNorm(doc=280)
          0.030433817 = weight(abstract_txt:used in 280) [ClassicSimilarity], result of:
            0.030433817 = score(doc=280,freq=1.0), product of:
              0.08288217 = queryWeight, product of:
                1.1279804 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.021886805 = queryNorm
              0.36719376 = fieldWeight in 280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.109375 = fieldNorm(doc=280)
          0.06921214 = weight(abstract_txt:topic in 280) [ClassicSimilarity], result of:
            0.06921214 = score(doc=280,freq=1.0), product of:
              0.12521258 = queryWeight, product of:
                1.1320076 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.021886805 = queryNorm
              0.5527571 = fieldWeight in 280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.109375 = fieldNorm(doc=280)
          0.10296398 = weight(abstract_txt:discipline in 280) [ClassicSimilarity], result of:
            0.10296398 = score(doc=280,freq=1.0), product of:
              0.16317363 = queryWeight, product of:
                1.2922612 = boost
                5.7692223 = idf(docFreq=376, maxDocs=44421)
                0.021886805 = queryNorm
              0.6310087 = fieldWeight in 280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7692223 = idf(docFreq=376, maxDocs=44421)
                0.109375 = fieldNorm(doc=280)
          0.113405265 = weight(abstract_txt:domain in 280) [ClassicSimilarity], result of:
            0.113405265 = score(doc=280,freq=1.0), product of:
              0.21925959 = queryWeight, product of:
                2.1184568 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.021886805 = queryNorm
              0.5172192 = fieldWeight in 280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.109375 = fieldNorm(doc=280)
          0.31042716 = weight(abstract_txt:maps in 280) [ClassicSimilarity], result of:
            0.31042716 = score(doc=280,freq=2.0), product of:
              0.34053853 = queryWeight, product of:
                2.6401188 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.021886805 = queryNorm
              0.9115772 = fieldWeight in 280, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.109375 = fieldNorm(doc=280)
        0.24 = coord(6/25)
    
  5. Kim, J.-M.; Shin, H.; Kim, H.-J.: Schema and constraints-based matching and merging of Topic Maps (2007) 0.13
    0.1335699 = sum of:
      0.1335699 = product of:
        0.55654126 = sum of:
          0.008105562 = weight(abstract_txt:that in 1922) [ClassicSimilarity], result of:
            0.008105562 = score(doc=1922,freq=1.0), product of:
              0.05483829 = queryWeight, product of:
                1.0594544 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.021886805 = queryNorm
              0.14780845 = fieldWeight in 1922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=1922)
          0.017390752 = weight(abstract_txt:used in 1922) [ClassicSimilarity], result of:
            0.017390752 = score(doc=1922,freq=1.0), product of:
              0.08288217 = queryWeight, product of:
                1.1279804 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.021886805 = queryNorm
              0.20982501 = fieldWeight in 1922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=1922)
          0.11864938 = weight(abstract_txt:topic in 1922) [ClassicSimilarity], result of:
            0.11864938 = score(doc=1922,freq=9.0), product of:
              0.12521258 = queryWeight, product of:
                1.1320076 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.021886805 = queryNorm
              0.94758356 = fieldWeight in 1922, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=1922)
          0.067119144 = weight(abstract_txt:generate in 1922) [ClassicSimilarity], result of:
            0.067119144 = score(doc=1922,freq=1.0), product of:
              0.17814873 = queryWeight, product of:
                1.3502579 = boost
                6.0281444 = idf(docFreq=290, maxDocs=44421)
                0.021886805 = queryNorm
              0.37675902 = fieldWeight in 1922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0281444 = idf(docFreq=290, maxDocs=44421)
                0.0625 = fieldNorm(doc=1922)
          0.06480301 = weight(abstract_txt:domain in 1922) [ClassicSimilarity], result of:
            0.06480301 = score(doc=1922,freq=1.0), product of:
              0.21925959 = queryWeight, product of:
                2.1184568 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.021886805 = queryNorm
              0.29555383 = fieldWeight in 1922, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.0625 = fieldNorm(doc=1922)
          0.2804734 = weight(abstract_txt:maps in 1922) [ClassicSimilarity], result of:
            0.2804734 = score(doc=1922,freq=5.0), product of:
              0.34053853 = queryWeight, product of:
                2.6401188 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.021886805 = queryNorm
              0.8236173 = fieldWeight in 1922, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.0625 = fieldNorm(doc=1922)
        0.24 = coord(6/25)
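
The score trees above follow the TF-IDF arithmetic of Lucene's ClassicSimilarity: each term score is queryWeight times fieldWeight, the term scores are summed, and the sum is multiplied by the coord factor. As a minimal sketch, the Python below recomputes three values from the listing. The function names are illustrative and are not Lucene APIs; all numeric inputs are copied from the trees for results 1, 2, and 4. Lucene works in 32-bit floats, so small differences in the last digits are to be expected.

```python
import math


def classic_idf(doc_freq, max_docs):
    # ClassicSimilarity idf: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm            # "queryWeight" node
    field_weight = math.sqrt(freq) * idf * field_norm  # "fieldWeight" node, tf = sqrt(freq)
    return query_weight * field_weight


# Term "that" in doc 629 (result 1); the tree reports 0.011462997.
score_that = term_score(
    freq=2.0, doc_freq=11344, max_docs=44421,
    boost=1.0594544, query_norm=0.021886805, field_norm=0.0625,
)

# Document score for result 1: sum of its nine term scores times coord(9/25).
# The tree reports 0.24025607.
term_scores_629 = [
    0.011462997, 0.03267376, 0.038689245, 0.017390752, 0.079740316,
    0.031860296, 0.18103826, 0.20971932, 0.06480301,
]
doc_score_629 = sum(term_scores_629) * (9 / 25)

# Effect of fieldNorm: "maps" occurs twice in both doc 252 and doc 280, so the
# two term scores differ only by the ratio of the field norms.
maps_252 = 0.17738695
maps_280_predicted = maps_252 * 0.109375 / 0.0625  # the tree reports 0.31042716

print(score_that, doc_score_629, maps_280_predicted)
```

The last comparison shows why result 4 matches fewer query terms than result 2 (coord 6/25 versus 8/25) yet scores almost as high: its fieldNorm is 1.75 times larger, which in ClassicSimilarity generally indicates a shorter indexed field.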