Document (#38138)

Author
Darányi, S.
Wittek, P.
Title
Demonstrating conceptual dynamics in an evolving text collection
Source
Journal of the American Society for Information Science and Technology. 64(2013) no.12, pp. 2564-2572
Year
2013
Abstract
Based on real-world user demands, we demonstrate how animated visualization of evolving text corpora displays the underlying dynamics of semantic content. To interpret the results, one needs a dynamic theory of word meaning. We suggest that conceptual dynamics as the interaction between kinds of intellectual and emotional content and language is key for such a theory. We demonstrate our method by two-way seriation, which is a popular technique to analyze groups of similar instances and their features as well as the connections between the groups themselves. The two-way seriated data may be visualized as a two-dimensional heat map or as a three-dimensional landscape in which color codes or height correspond to the values in the matrix. In this article, we focus on two-way seriation of sparse data in the Reuters-21578 test collection. To achieve a meaningful visualization, we introduce a compactly supported convolution kernel similar to filter kernels used in image reconstruction and geostatistics. This filter populates the high-dimensional sparse space with values that interpolate nearby elements and provides insight into the clustering structure. We also extend two-way seriation to deal with online updates of both the row and column spaces and, combined with the convolution kernel, demonstrate a three-dimensional visualization of dynamics.
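As a rough illustration of the filtering idea in the abstract, the sketch below spreads each nonzero cell of a sparse matrix over its neighbours with a compactly supported kernel. The tent-shaped kernel, the `radius` parameter, and the function names `tent_kernel` and `smooth` are assumptions made for this example; the abstract does not specify the kernel the authors actually use, and no seriation step is shown.

```python
import math

def tent_kernel(radius):
    """Return {(di, dj): weight} for a tent-shaped kernel (hypothetical choice).

    The weight falls off linearly with Euclidean distance and is exactly zero
    outside the window of half-width `radius` (compact support). Weights are
    normalised to sum to 1.
    """
    weights = {}
    for di in range(-radius, radius + 1):
        for dj in range(-radius, radius + 1):
            w = max(0.0, 1.0 - math.hypot(di, dj) / (radius + 1))
            if w > 0.0:
                weights[(di, dj)] = w
    total = sum(weights.values())
    return {offset: w / total for offset, w in weights.items()}

def smooth(matrix, radius=1):
    """Convolve a (mostly zero) matrix with the tent kernel."""
    rows, cols = len(matrix), len(matrix[0])
    kernel = tent_kernel(radius)
    out = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            value = matrix[i][j]
            if value == 0:
                continue  # zero cells contribute nothing, so skip them
            for (di, dj), w in kernel.items():
                r, c = i + di, j + dj
                if 0 <= r < rows and 0 <= c < cols:
                    out[r][c] += w * value
    return out

# A 5x5 matrix with a single nonzero entry in the centre.
sparse = [[0.0] * 5 for _ in range(5)]
sparse[2][2] = 1.0
smoothed = smooth(sparse, radius=1)
```

After smoothing, the cells adjacent to the single nonzero entry hold interpolated values, while cells outside the kernel's support stay at zero. In a heat map or height landscape this turns isolated points into small bumps.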
Theme
Visualization
Semantic context in indexing and retrieval

Similar documents (content)

  1. Li, J.; Zhang, Z.; Li, X.; Chen, H.: Kernel-based learning for biomedical relation extraction (2008) 0.10
    0.102858596 = sum of:
      0.102858596 = product of:
        0.85715497 = sum of:
          0.030615667 = weight(abstract_txt:text in 2611) [ClassicSimilarity], result of:
            0.030615667 = score(doc=2611,freq=2.0), product of:
              0.06857448 = queryWeight, product of:
                1.0022175 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016932627 = queryNorm
              0.4464586 = fieldWeight in 2611, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
          0.2257708 = weight(abstract_txt:kernels in 2611) [ClassicSimilarity], result of:
            0.2257708 = score(doc=2611,freq=2.0), product of:
              0.20620628 = queryWeight, product of:
                1.2289004 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.016932627 = queryNorm
              1.0948784 = fieldWeight in 2611, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
          0.6007685 = weight(abstract_txt:kernel in 2611) [ClassicSimilarity], result of:
            0.6007685 = score(doc=2611,freq=9.0), product of:
              0.3021811 = queryWeight, product of:
                2.1038482 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.016932627 = queryNorm
              1.9881074 = fieldWeight in 2611, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
        0.12 = coord(3/25)
    
  2. Zhang, M.; Zhou, G.D.; Aw, A.: Exploring syntactic structured features over parse trees for relation extraction using kernel methods (2008) 0.08
    0.08299873 = sum of:
      0.08299873 = product of:
        0.6916561 = sum of:
          0.017318837 = weight(abstract_txt:text in 3055) [ClassicSimilarity], result of:
            0.017318837 = score(doc=3055,freq=1.0), product of:
              0.06857448 = queryWeight, product of:
                1.0022175 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.016932627 = queryNorm
              0.25255513 = fieldWeight in 3055, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=3055)
          0.2212093 = weight(abstract_txt:kernels in 3055) [ClassicSimilarity], result of:
            0.2212093 = score(doc=3055,freq=3.0), product of:
              0.20620628 = queryWeight, product of:
                1.2289004 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.016932627 = queryNorm
              1.0727574 = fieldWeight in 3055, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0625 = fieldNorm(doc=3055)
          0.45312795 = weight(abstract_txt:kernel in 3055) [ClassicSimilarity], result of:
            0.45312795 = score(doc=3055,freq=8.0), product of:
              0.3021811 = queryWeight, product of:
                2.1038482 = boost
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.016932627 = queryNorm
              1.4995245 = fieldWeight in 3055, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.0625 = fieldNorm(doc=3055)
        0.12 = coord(3/25)
    
  3. Oh, K.E.; Halpern, D.; Tremaine, M.; Chiang, J.; Silver, D.; Bemis, K.: Blocked: when the information is hidden by the visualization (2016) 0.08
    0.07689725 = sum of:
      0.07689725 = product of:
        0.4806078 = sum of:
          0.0316853 = weight(abstract_txt:three in 3888) [ClassicSimilarity], result of:
            0.0316853 = score(doc=3888,freq=2.0), product of:
              0.081416406 = queryWeight, product of:
                1.0920354 = boost
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.016932627 = queryNorm
              0.38917586 = fieldWeight in 3888, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.0625 = fieldNorm(doc=3888)
          0.024396665 = weight(abstract_txt:theory in 3888) [ClassicSimilarity], result of:
            0.024396665 = score(doc=3888,freq=1.0), product of:
              0.086172834 = queryWeight, product of:
                1.1234815 = boost
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.016932627 = queryNorm
              0.28311318 = fieldWeight in 3888, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.0625 = fieldNorm(doc=3888)
          0.1899461 = weight(abstract_txt:visualization in 3888) [ClassicSimilarity], result of:
            0.1899461 = score(doc=3888,freq=4.0), product of:
              0.24410728 = queryWeight, product of:
                2.3158836 = boost
                6.225004 = idf(docFreq=238, maxDocs=44421)
                0.016932627 = queryNorm
              0.7781255 = fieldWeight in 3888, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.225004 = idf(docFreq=238, maxDocs=44421)
                0.0625 = fieldNorm(doc=3888)
          0.23457976 = weight(abstract_txt:dimensional in 3888) [ClassicSimilarity], result of:
            0.23457976 = score(doc=3888,freq=2.0), product of:
              0.38965216 = queryWeight, product of:
                3.3785806 = boost
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.016932627 = queryNorm
              0.6020235 = fieldWeight in 3888, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.0625 = fieldNorm(doc=3888)
        0.16 = coord(4/25)
    
  4. Lin, N.; Li, D.; Ding, Y.; He, B.; Qin, Z.; Tang, J.; Li, J.; Dong, T.: The dynamic features of Delicious, Flickr, and YouTube (2012) 0.07
    0.06876488 = sum of:
      0.06876488 = product of:
        0.3438244 = sum of:
          0.03880641 = weight(abstract_txt:three in 970) [ClassicSimilarity], result of:
            0.03880641 = score(doc=970,freq=3.0), product of:
              0.081416406 = queryWeight, product of:
                1.0920354 = boost
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.016932627 = queryNorm
              0.47664115 = fieldWeight in 970, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4030223 = idf(docFreq=1477, maxDocs=44421)
                0.0625 = fieldNorm(doc=970)
          0.03479481 = weight(abstract_txt:groups in 970) [ClassicSimilarity], result of:
            0.03479481 = score(doc=970,freq=1.0), product of:
              0.109184176 = queryWeight, product of:
                1.2646216 = boost
                5.09888 = idf(docFreq=736, maxDocs=44421)
                0.016932627 = queryNorm
              0.31868 = fieldWeight in 970, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.09888 = idf(docFreq=736, maxDocs=44421)
                0.0625 = fieldNorm(doc=970)
          0.037038486 = weight(abstract_txt:similar in 970) [ClassicSimilarity], result of:
            0.037038486 = score(doc=970,freq=1.0), product of:
              0.113828816 = queryWeight, product of:
                1.2912396 = boost
                5.206202 = idf(docFreq=661, maxDocs=44421)
                0.016932627 = queryNorm
              0.32538763 = fieldWeight in 970, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.206202 = idf(docFreq=661, maxDocs=44421)
                0.0625 = fieldNorm(doc=970)
          0.072552085 = weight(abstract_txt:evolving in 970) [ClassicSimilarity], result of:
            0.072552085 = score(doc=970,freq=1.0), product of:
              0.17820367 = queryWeight, product of:
                1.6156194 = boost
                6.514082 = idf(docFreq=178, maxDocs=44421)
                0.016932627 = queryNorm
              0.40713012 = fieldWeight in 970, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.514082 = idf(docFreq=178, maxDocs=44421)
                0.0625 = fieldNorm(doc=970)
          0.1606326 = weight(abstract_txt:dynamics in 970) [ClassicSimilarity], result of:
            0.1606326 = score(doc=970,freq=1.0), product of:
              0.3814016 = queryWeight, product of:
                3.34262 = boost
                6.738623 = idf(docFreq=142, maxDocs=44421)
                0.016932627 = queryNorm
              0.42116395 = fieldWeight in 970, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.738623 = idf(docFreq=142, maxDocs=44421)
                0.0625 = fieldNorm(doc=970)
        0.2 = coord(5/25)
    
  5. Chen, C.; Kuljis, J.: The rising landscape : a visual exploration of superstring revolutions in physics (2003) 0.07
    0.06860712 = sum of:
      0.06860712 = product of:
        0.42879453 = sum of:
          0.10322478 = weight(abstract_txt:visualized in 2469) [ClassicSimilarity], result of:
            0.10322478 = score(doc=2469,freq=1.0), product of:
              0.13654271 = queryWeight, product of:
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.016932627 = queryNorm
              0.75598896 = fieldWeight in 2469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.063882 = idf(docFreq=37, maxDocs=44421)
                0.09375 = fieldNorm(doc=2469)
          0.036594998 = weight(abstract_txt:theory in 2469) [ClassicSimilarity], result of:
            0.036594998 = score(doc=2469,freq=1.0), product of:
              0.086172834 = queryWeight, product of:
                1.1234815 = boost
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.016932627 = queryNorm
              0.42466977 = fieldWeight in 2469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.09375 = fieldNorm(doc=2469)
          0.14651519 = weight(abstract_txt:animated in 2469) [ClassicSimilarity], result of:
            0.14651519 = score(doc=2469,freq=1.0), product of:
              0.17245176 = queryWeight, product of:
                1.1238272 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.016932627 = queryNorm
              0.849601 = fieldWeight in 2469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.09375 = fieldNorm(doc=2469)
          0.14245957 = weight(abstract_txt:visualization in 2469) [ClassicSimilarity], result of:
            0.14245957 = score(doc=2469,freq=1.0), product of:
              0.24410728 = queryWeight, product of:
                2.3158836 = boost
                6.225004 = idf(docFreq=238, maxDocs=44421)
                0.016932627 = queryNorm
              0.58359414 = fieldWeight in 2469, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.225004 = idf(docFreq=238, maxDocs=44421)
                0.09375 = fieldNorm(doc=2469)
        0.16 = coord(4/25)
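
The score explanations above follow Lucene's ClassicSimilarity (TF-IDF) arithmetic: each term score is queryWeight times fieldWeight, with tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), and the sum is scaled by the coord factor. As a minimal sketch, the Python below recomputes the score of the first similar document (doc 2611) from the figures shown in its explanation. The function and variable names are chosen for this example, and small deviations from the listed values are expected because Lucene works in single precision.

```python
import math

def idf(doc_freq, max_docs):
    """Inverse document frequency as used by ClassicSimilarity."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight

# Values copied from the explanation for doc 2611.
MAX_DOCS = 44421
QUERY_NORM = 0.016932627
FIELD_NORM = 0.078125
terms = [
    # (term, freq in doc, docFreq, boost)
    ("text", 2.0, 2122, 1.0022175),
    ("kernels", 2.0, 5, 1.2289004),
    ("kernel", 9.0, 24, 2.1038482),
]

term_sum = sum(
    term_score(freq, doc_freq, MAX_DOCS, boost, QUERY_NORM, FIELD_NORM)
    for _, freq, doc_freq, boost in terms
)
coord = 3 / 25  # 3 of the 25 query terms match this document
score = term_sum * coord
print(round(score, 4))
```

The same function reproduces the other entries once their freq, docFreq, boost, fieldNorm, and coord values are substituted.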