Document (#40539)

Author
Leydesdorff, L.
Nerghes, A.
Title
Co-word maps and topic modeling : a comparison using small and medium-sized corpora (N < 1,000)
Source
Journal of the Association for Information Science and Technology. 68(2017) no.4, pp. 1024-1035
Year
2017
Abstract
Induced by "big data," "topic modeling" has become an attractive alternative to mapping co-words in terms of co-occurrences and co-absences using network techniques. Does topic modeling provide an alternative for co-word mapping in research practices using moderately sized document collections? We return to the word/document matrix using first a single text with a strong argument ("The Leiden Manifesto") and then upscale to a sample of moderate size (n = 687) to study the pros and cons of the two approaches in terms of the resulting possibilities for making semantic maps that can serve an argument. The results from co-word mapping (using two different routines) versus topic modeling are significantly uncorrelated. Whereas components in the co-word maps can easily be designated, the topic models provide sets of words that are very differently organized. In these samples, the topic models seem to reveal similarities other than semantic ones (e.g., linguistic ones). In other words, topic modeling does not replace co-word mapping in small and medium-sized sets; but the paper leaves open the possibility that topic modeling would work well for the semantic mapping of large sets.
Content
Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23740/full.
Theme
Informetrics

Similar documents (author)

  1. Leydesdorff, L.: The generation of aggregated journal-journal citation maps on the basis of the CD-ROM version of the Science Citation Index (1994) 4.52
    4.5150814 = sum of:
      4.5150814 = weight(author_txt:leydesdorff in 8280) [ClassicSimilarity], result of:
        4.5150814 = fieldWeight in 8280, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.2241306 = idf(docFreq=87, maxDocs=44421)
          0.625 = fieldNorm(doc=8280)
    
  2. Leydesdorff, L.: Why words and co-words cannot map the development of the sciences (1997) 4.52
    4.5150814 = sum of:
      4.5150814 = weight(author_txt:leydesdorff in 1147) [ClassicSimilarity], result of:
        4.5150814 = fieldWeight in 1147, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.2241306 = idf(docFreq=87, maxDocs=44421)
          0.625 = fieldNorm(doc=1147)
    
  3. Leydesdorff, L.: Theories of citation? (1999) 4.52
    4.5150814 = sum of:
      4.5150814 = weight(author_txt:leydesdorff in 6130) [ClassicSimilarity], result of:
        4.5150814 = fieldWeight in 6130, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.2241306 = idf(docFreq=87, maxDocs=44421)
          0.625 = fieldNorm(doc=6130)
    
  4. Leydesdorff, L.: A sociological theory of communication : the self-organization of the knowledge-based society (2001) 4.52
    4.5150814 = sum of:
      4.5150814 = weight(author_txt:leydesdorff in 1184) [ClassicSimilarity], result of:
        4.5150814 = fieldWeight in 1184, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.2241306 = idf(docFreq=87, maxDocs=44421)
          0.625 = fieldNorm(doc=1184)
    
  5. Leydesdorff, L.: Dynamic and evolutionary updates of classificatory schemes in scientific journal structures (2002) 4.52
    4.5150814 = sum of:
      4.5150814 = weight(author_txt:leydesdorff in 2249) [ClassicSimilarity], result of:
        4.5150814 = fieldWeight in 2249, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.2241306 = idf(docFreq=87, maxDocs=44421)
          0.625 = fieldNorm(doc=2249)
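
The five author matches above all score 4.5150814 because each explanation multiplies the same three factors: tf, idf, and fieldNorm. The arithmetic can be sketched in Python as follows. The tf and idf formulas are an assumption: they are the classic Lucene TF-IDF definitions, chosen here because they reproduce the figures shown. The function names are illustrative and not part of any library.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term count (assumed formula)."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1)) (assumed formula)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as laid out in the explanation tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the explanation of author_txt:leydesdorff.
author_idf = classic_idf(doc_freq=87, max_docs=44421)
author_score = field_weight(freq=1.0, doc_freq=87, max_docs=44421, field_norm=0.625)

print(f"idf   = {author_idf:.4f}")
print(f"score = {author_score:.4f}")
```

The result agrees with the listed 4.5150814 only up to rounding, since the search engine computes in single precision while this sketch uses Python's double-precision floats.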
    

Similar documents (content)

  1. Li, X.; Zhang, A.; Li, C.; Ouyang, J.; Cai, Y.: Exploring coherent topics by topic modeling with term weighting (2018) 0.29
    0.2870162 = sum of:
      0.2870162 = product of:
        1.0250579 = sum of:
          0.018715478 = weight(abstract_txt:document in 45) [ClassicSimilarity], result of:
            0.018715478 = score(doc=45,freq=1.0), product of:
              0.069733866 = queryWeight, product of:
                1.0505428 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.01545798 = queryNorm
              0.26838437 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=45)
          0.023354817 = weight(abstract_txt:models in 45) [ClassicSimilarity], result of:
            0.023354817 = score(doc=45,freq=1.0), product of:
              0.080827795 = queryWeight, product of:
                1.1310252 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.01545798 = queryNorm
              0.28894538 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.0625 = fieldNorm(doc=45)
          0.049343877 = weight(abstract_txt:sets in 45) [ClassicSimilarity], result of:
            0.049343877 = score(doc=45,freq=1.0), product of:
              0.15234528 = queryWeight, product of:
                1.901744 = boost
                5.18232 = idf(docFreq=677, maxDocs=44421)
                0.01545798 = queryNorm
              0.323895 = fieldWeight in 45, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.18232 = idf(docFreq=677, maxDocs=44421)
                0.0625 = fieldNorm(doc=45)
          0.17224284 = weight(abstract_txt:words in 45) [ClassicSimilarity], result of:
            0.17224284 = score(doc=45,freq=10.0), product of:
              0.1627175 = queryWeight, product of:
                1.9654169 = boost
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.01545798 = queryNorm
              1.058539 = fieldWeight in 45, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.355831 = idf(docFreq=569, maxDocs=44421)
                0.0625 = fieldNorm(doc=45)
          0.19750834 = weight(abstract_txt:word in 45) [ClassicSimilarity], result of:
            0.19750834 = score(doc=45,freq=3.0), product of:
              0.3355058 = queryWeight, product of:
                3.9911916 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.01545798 = queryNorm
              0.58868825 = fieldWeight in 45, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0625 = fieldNorm(doc=45)
          0.26497334 = weight(abstract_txt:modeling in 45) [ClassicSimilarity], result of:
            0.26497334 = score(doc=45,freq=3.0), product of:
              0.40811062 = queryWeight, product of:
                4.401913 = boost
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.01545798 = queryNorm
              0.64926845 = fieldWeight in 45, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.0625 = fieldNorm(doc=45)
          0.29891917 = weight(abstract_txt:topic in 45) [ClassicSimilarity], result of:
            0.29891917 = score(doc=45,freq=6.0), product of:
              0.3863508 = queryWeight, product of:
                4.9455295 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.01545798 = queryNorm
              0.7736988 = fieldWeight in 45, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=45)
        0.28 = coord(7/25)
    
  2. Lu, K.; Wolfram, D.: Measuring author research relatedness : a comparison of word-based, topic-based, and author cocitation approaches (2012) 0.17
    0.16572708 = sum of:
      0.16572708 = product of:
        0.8286354 = sum of:
          0.042278484 = weight(abstract_txt:using in 1453) [ClassicSimilarity], result of:
            0.042278484 = score(doc=1453,freq=3.0), product of:
              0.11297845 = queryWeight, product of:
                2.1142664 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.01545798 = queryNorm
              0.37421724 = fieldWeight in 1453, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=1453)
          0.16113102 = weight(abstract_txt:mapping in 1453) [ClassicSimilarity], result of:
            0.16113102 = score(doc=1453,freq=2.0), product of:
              0.31554833 = queryWeight, product of:
                3.5334165 = boost
                5.7772117 = idf(docFreq=373, maxDocs=44421)
                0.01545798 = queryNorm
              0.5106382 = fieldWeight in 1453, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7772117 = idf(docFreq=373, maxDocs=44421)
                0.0625 = fieldNorm(doc=1453)
          0.19750834 = weight(abstract_txt:word in 1453) [ClassicSimilarity], result of:
            0.19750834 = score(doc=1453,freq=3.0), product of:
              0.3355058 = queryWeight, product of:
                3.9911916 = boost
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.01545798 = queryNorm
              0.58868825 = fieldWeight in 1453, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4380693 = idf(docFreq=524, maxDocs=44421)
                0.0625 = fieldNorm(doc=1453)
          0.21634983 = weight(abstract_txt:modeling in 1453) [ClassicSimilarity], result of:
            0.21634983 = score(doc=1453,freq=2.0), product of:
              0.40811062 = queryWeight, product of:
                4.401913 = boost
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.01545798 = queryNorm
              0.53012544 = fieldWeight in 1453, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.0625 = fieldNorm(doc=1453)
          0.21136774 = weight(abstract_txt:topic in 1453) [ClassicSimilarity], result of:
            0.21136774 = score(doc=1453,freq=3.0), product of:
              0.3863508 = queryWeight, product of:
                4.9455295 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.01545798 = queryNorm
              0.5470876 = fieldWeight in 1453, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=1453)
        0.2 = coord(5/25)
    
  3. Siebers, Q.H.J.F.: Implementing inference rules in the Topic maps model (2006) 0.16
    0.15709676 = sum of:
      0.15709676 = product of:
        0.7854838 = sum of:
          0.030511867 = weight(abstract_txt:using in 730) [ClassicSimilarity], result of:
            0.030511867 = score(doc=730,freq=1.0), product of:
              0.11297845 = queryWeight, product of:
                2.1142664 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.01545798 = queryNorm
              0.27006802 = fieldWeight in 730, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.078125 = fieldNorm(doc=730)
          0.15711312 = weight(abstract_txt:maps in 730) [ClassicSimilarity], result of:
            0.15711312 = score(doc=730,freq=3.0), product of:
              0.19701596 = queryWeight, product of:
                2.16266 = boost
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.01545798 = queryNorm
              0.79746395 = fieldWeight in 730, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8933253 = idf(docFreq=332, maxDocs=44421)
                0.078125 = fieldNorm(doc=730)
          0.14242105 = weight(abstract_txt:mapping in 730) [ClassicSimilarity], result of:
            0.14242105 = score(doc=730,freq=1.0), product of:
              0.31554833 = queryWeight, product of:
                3.5334165 = boost
                5.7772117 = idf(docFreq=373, maxDocs=44421)
                0.01545798 = queryNorm
              0.45134467 = fieldWeight in 730, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7772117 = idf(docFreq=373, maxDocs=44421)
                0.078125 = fieldNorm(doc=730)
          0.19122803 = weight(abstract_txt:modeling in 730) [ClassicSimilarity], result of:
            0.19122803 = score(doc=730,freq=1.0), product of:
              0.40811062 = queryWeight, product of:
                4.401913 = boost
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.01545798 = queryNorm
              0.46856913 = fieldWeight in 730, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.078125 = fieldNorm(doc=730)
          0.2642097 = weight(abstract_txt:topic in 730) [ClassicSimilarity], result of:
            0.2642097 = score(doc=730,freq=3.0), product of:
              0.3863508 = queryWeight, product of:
                4.9455295 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.01545798 = queryNorm
              0.6838595 = fieldWeight in 730, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.078125 = fieldNorm(doc=730)
        0.2 = coord(5/25)
    
  4. Liu, Y.; Xu, S.; Blanchard, E.: A local context-aware LDA model for topic modeling in a document network (2017) 0.15
    0.14942518 = sum of:
      0.14942518 = product of:
        0.6226049 = sum of:
          0.05293537 = weight(abstract_txt:document in 4642) [ClassicSimilarity], result of:
            0.05293537 = score(doc=4642,freq=8.0), product of:
              0.069733866 = queryWeight, product of:
                1.0505428 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.01545798 = queryNorm
              0.7591056 = fieldWeight in 4642, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=4642)
          0.0330287 = weight(abstract_txt:models in 4642) [ClassicSimilarity], result of:
            0.0330287 = score(doc=4642,freq=2.0), product of:
              0.080827795 = queryWeight, product of:
                1.1310252 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.01545798 = queryNorm
              0.40863046 = fieldWeight in 4642, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.0625 = fieldNorm(doc=4642)
          0.05957943 = weight(abstract_txt:ones in 4642) [ClassicSimilarity], result of:
            0.05957943 = score(doc=4642,freq=1.0), product of:
              0.15090628 = queryWeight, product of:
                1.5454165 = boost
                6.3169727 = idf(docFreq=217, maxDocs=44421)
                0.01545798 = queryNorm
              0.3948108 = fieldWeight in 4642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3169727 = idf(docFreq=217, maxDocs=44421)
                0.0625 = fieldNorm(doc=4642)
          0.049343877 = weight(abstract_txt:sets in 4642) [ClassicSimilarity], result of:
            0.049343877 = score(doc=4642,freq=1.0), product of:
              0.15234528 = queryWeight, product of:
                1.901744 = boost
                5.18232 = idf(docFreq=677, maxDocs=44421)
                0.01545798 = queryNorm
              0.323895 = fieldWeight in 4642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.18232 = idf(docFreq=677, maxDocs=44421)
                0.0625 = fieldNorm(doc=4642)
          0.21634983 = weight(abstract_txt:modeling in 4642) [ClassicSimilarity], result of:
            0.21634983 = score(doc=4642,freq=2.0), product of:
              0.40811062 = queryWeight, product of:
                4.401913 = boost
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.01545798 = queryNorm
              0.53012544 = fieldWeight in 4642, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.0625 = fieldNorm(doc=4642)
          0.21136774 = weight(abstract_txt:topic in 4642) [ClassicSimilarity], result of:
            0.21136774 = score(doc=4642,freq=3.0), product of:
              0.3863508 = queryWeight, product of:
                4.9455295 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.01545798 = queryNorm
              0.5470876 = fieldWeight in 4642, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=4642)
        0.24 = coord(6/25)
    
  5. Potha, N.; Stamatatos, E.: Improving author verification based on topic modeling (2019) 0.14
    0.14388406 = sum of:
      0.14388406 = product of:
        0.5995169 = sum of:
          0.018715478 = weight(abstract_txt:document in 385) [ClassicSimilarity], result of:
            0.018715478 = score(doc=385,freq=1.0), product of:
              0.069733866 = queryWeight, product of:
                1.0505428 = boost
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.01545798 = queryNorm
              0.26838437 = fieldWeight in 385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.29415 = idf(docFreq=1647, maxDocs=44421)
                0.0625 = fieldNorm(doc=385)
          0.023354817 = weight(abstract_txt:models in 385) [ClassicSimilarity], result of:
            0.023354817 = score(doc=385,freq=1.0), product of:
              0.080827795 = queryWeight, product of:
                1.1310252 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.01545798 = queryNorm
              0.28894538 = fieldWeight in 385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.0625 = fieldNorm(doc=385)
          0.03176167 = weight(abstract_txt:semantic in 385) [ClassicSimilarity], result of:
            0.03176167 = score(doc=385,freq=1.0), product of:
              0.113573164 = queryWeight, product of:
                1.6420085 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.01545798 = queryNorm
              0.27965823 = fieldWeight in 385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=385)
          0.049343877 = weight(abstract_txt:sets in 385) [ClassicSimilarity], result of:
            0.049343877 = score(doc=385,freq=1.0), product of:
              0.15234528 = queryWeight, product of:
                1.901744 = boost
                5.18232 = idf(docFreq=677, maxDocs=44421)
                0.01545798 = queryNorm
              0.323895 = fieldWeight in 385, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.18232 = idf(docFreq=677, maxDocs=44421)
                0.0625 = fieldNorm(doc=385)
          0.26497334 = weight(abstract_txt:modeling in 385) [ClassicSimilarity], result of:
            0.26497334 = score(doc=385,freq=3.0), product of:
              0.40811062 = queryWeight, product of:
                4.401913 = boost
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.01545798 = queryNorm
              0.64926845 = fieldWeight in 385, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.997685 = idf(docFreq=299, maxDocs=44421)
                0.0625 = fieldNorm(doc=385)
          0.21136774 = weight(abstract_txt:topic in 385) [ClassicSimilarity], result of:
            0.21136774 = score(doc=385,freq=3.0), product of:
              0.3863508 = queryWeight, product of:
                4.9455295 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.01545798 = queryNorm
              0.5470876 = fieldWeight in 385, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=385)
        0.24 = coord(6/25)
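
Each content-based score above is a sum of per-term products (queryWeight times fieldWeight) scaled by a coordination factor, the share of query terms that matched. The sketch below recomputes the score of the first content match (doc 45) from the values in its explanation tree. It assumes the classic Lucene TF-IDF formulas, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the listed numbers. The helper names are hypothetical.

```python
import math

MAX_DOCS = 44421          # maxDocs reported in every idf line
QUERY_NORM = 0.01545798   # queryNorm reported in every queryWeight
FIELD_NORM = 0.0625       # fieldNorm(doc=45)


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Assumed classic idf: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, freq: float, doc_freq: int, field_norm: float) -> float:
    """score = queryWeight * fieldWeight for a single matching term."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (term, boost, termFreq, docFreq) copied from the explanation for doc 45.
matched_terms = [
    ("document", 1.0505428, 1.0, 1647),
    ("models", 1.1310252, 1.0, 1185),
    ("sets", 1.901744, 1.0, 677),
    ("words", 1.9654169, 10.0, 569),
    ("word", 3.9911916, 3.0, 524),
    ("modeling", 4.401913, 3.0, 299),
    ("topic", 4.9455295, 6.0, 770),
]

raw_sum = sum(term_score(b, f, df, FIELD_NORM) for _, b, f, df in matched_terms)
coord = len(matched_terms) / 25   # coord(7/25): 7 of 25 query terms matched
score = coord * raw_sum

print(f"sum   = {raw_sum:.4f}")
print(f"score = {score:.4f}")
```

The coordination factor explains why doc 45 ranks first: it matches seven of the 25 query terms, whereas the other four documents match five or six.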