Document (#37509)

Author
Dunne, C.
Shneiderman, B.
Gove, R.
Klavans, J.
Dorr, B.
Title
Rapid understanding of scientific paper collections : integrating statistics, text analytics, and visualization
Source
Journal of the American Society for Information Science and Technology. 63(2012) no.12, S.2351-2369
Year
2012
Abstract
Keeping up with rapidly growing research fields, especially when there are multiple interdisciplinary sources, requires substantial effort for researchers, program managers, or venture capital investors. Current theories and tools are directed at finding a paper or website, not gaining an understanding of the key papers, authors, controversies, and hypotheses. This report presents an effort to integrate statistics, text analytics, and visualization in a multiple coordinated window environment that supports exploration. Our prototype system, Action Science Explorer (ASE), provides an environment for demonstrating principles of coordination and conducting iterative usability tests of them with interested and knowledgeable users. We developed an understanding of the value of reference management, statistics, citation text extraction, natural language summarization for single and multiple documents, filters to interactively select key papers, and network visualization to see citation patterns and identify clusters. A three-phase usability study guided our revisions to ASE and led us to improve the testing methods.

Similar documents (author)

  1. Dorr, B.J.: Large-scale dictionary construction for foreign language tutoring and interlingual machine translation (1997) 1.19
    1.1867265 = sum of:
      1.1867265 = product of:
        3.5601792 = sum of:
          3.5601792 = weight(author_txt:dorr in 4244) [ClassicSimilarity], result of:
            3.5601792 = score(doc=4244,freq=1.0), product of:
              0.61226875 = queryWeight, product of:
                1.0915313 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.060291506 = queryNorm
              5.814733 = fieldWeight in 4244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.625 = fieldNorm(doc=4244)
        0.33333334 = coord(1/3)
    
  2. Dorr, B.J.; Olsen, M.B.: Multilingual generation : the role of telicity in lexical choice and syntactic realization (1996) 0.95
    0.9493811 = sum of:
      0.9493811 = product of:
        2.8481433 = sum of:
          2.8481433 = weight(author_txt:dorr in 536) [ClassicSimilarity], result of:
            2.8481433 = score(doc=536,freq=1.0), product of:
              0.61226875 = queryWeight, product of:
                1.0915313 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.060291506 = queryNorm
              4.6517863 = fieldWeight in 536, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.5 = fieldNorm(doc=536)
        0.33333334 = coord(1/3)
    
  3. Oard, D.W.; Dorr, B.J.: Evaluating cross-laguage text filtering effectiveness (1998) 0.95
    0.9493811 = sum of:
      0.9493811 = product of:
        2.8481433 = sum of:
          2.8481433 = weight(author_txt:dorr in 214) [ClassicSimilarity], result of:
            2.8481433 = score(doc=214,freq=1.0), product of:
              0.61226875 = queryWeight, product of:
                1.0915313 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.060291506 = queryNorm
              4.6517863 = fieldWeight in 214, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.5 = fieldNorm(doc=214)
        0.33333334 = coord(1/3)
    
  4. Dorr, B.J.; Gaasterland, T.: Exploiting aspectual features and connecting words for summarization-inspired temporal-relation extraction (2007) 0.95
    0.9493811 = sum of:
      0.9493811 = product of:
        2.8481433 = sum of:
          2.8481433 = weight(author_txt:dorr in 1950) [ClassicSimilarity], result of:
            2.8481433 = score(doc=1950,freq=1.0), product of:
              0.61226875 = queryWeight, product of:
                1.0915313 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.060291506 = queryNorm
              4.6517863 = fieldWeight in 1950, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.5 = fieldNorm(doc=1950)
        0.33333334 = coord(1/3)
    
  5. Klavans, R.; Boyack, K.W.: Identifying a better measure of relatedness for mapping science (2006) 0.92
    0.92299235 = sum of:
      0.92299235 = product of:
        2.768977 = sum of:
          2.768977 = weight(author_txt:klavans in 252) [ClassicSimilarity], result of:
            2.768977 = score(doc=252,freq=1.0), product of:
              0.60086983 = queryWeight, product of:
                1.0813228 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.060291506 = queryNorm
              4.6082807 = fieldWeight in 252, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.5 = fieldNorm(doc=252)
        0.33333334 = coord(1/3)
    

Similar documents (content)

  1. Aris, A.; Shneiderman, B.; Qazvinian, V.; Radev, D.: Visual overviews for discovering key papers and influences across research fronts (2009) 0.11
    0.10693251 = sum of:
      0.10693251 = product of:
        0.53466254 = sum of:
          0.09686807 = weight(abstract_txt:gaining in 143) [ClassicSimilarity], result of:
            0.09686807 = score(doc=143,freq=1.0), product of:
              0.16152847 = queryWeight, product of:
                1.0299402 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.020431276 = queryNorm
              0.5996966 = fieldWeight in 143, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.078125 = fieldNorm(doc=143)
          0.07084138 = weight(abstract_txt:citation in 143) [ClassicSimilarity], result of:
            0.07084138 = score(doc=143,freq=2.0), product of:
              0.13111529 = queryWeight, product of:
                1.3122879 = boost
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.020431276 = queryNorm
              0.5402984 = fieldWeight in 143, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.078125 = fieldNorm(doc=143)
          0.12505172 = weight(abstract_txt:papers in 143) [ClassicSimilarity], result of:
            0.12505172 = score(doc=143,freq=4.0), product of:
              0.15200053 = queryWeight, product of:
                1.4129443 = boost
                5.2653174 = idf(docFreq=623, maxDocs=44421)
                0.020431276 = queryNorm
              0.82270586 = fieldWeight in 143, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.2653174 = idf(docFreq=623, maxDocs=44421)
                0.078125 = fieldNorm(doc=143)
          0.08691404 = weight(abstract_txt:multiple in 143) [ClassicSimilarity], result of:
            0.08691404 = score(doc=143,freq=1.0), product of:
              0.21671836 = queryWeight, product of:
                2.0663123 = boost
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.020431276 = queryNorm
              0.40104604 = fieldWeight in 143, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.078125 = fieldNorm(doc=143)
          0.15498735 = weight(abstract_txt:visualization in 143) [ClassicSimilarity], result of:
            0.15498735 = score(doc=143,freq=1.0), product of:
              0.31868863 = queryWeight, product of:
                2.5057135 = boost
                6.225004 = idf(docFreq=238, maxDocs=44421)
                0.020431276 = queryNorm
              0.48632845 = fieldWeight in 143, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.225004 = idf(docFreq=238, maxDocs=44421)
                0.078125 = fieldNorm(doc=143)
        0.2 = coord(5/25)
    
  2. Adler, R.; Ewing, J.; Taylor, P.: Citation statistics : A report from the International Mathematical Union (IMU) in cooperation with the International Council of Industrial and Applied Mathematics (ICIAM) and the Institute of Mathematical Statistics (IMS) (2008) 0.09
    0.09433218 = sum of:
      0.09433218 = product of:
        0.47166088 = sum of:
          0.038747225 = weight(abstract_txt:gaining in 3417) [ClassicSimilarity], result of:
            0.038747225 = score(doc=3417,freq=1.0), product of:
              0.16152847 = queryWeight, product of:
                1.0299402 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.020431276 = queryNorm
              0.23987862 = fieldWeight in 3417, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.03125 = fieldNorm(doc=3417)
          0.087339126 = weight(abstract_txt:citation in 3417) [ClassicSimilarity], result of:
            0.087339126 = score(doc=3417,freq=19.0), product of:
              0.13111529 = queryWeight, product of:
                1.3122879 = boost
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.020431276 = queryNorm
              0.66612464 = fieldWeight in 3417, product of:
                4.358899 = tf(freq=19.0), with freq of:
                  19.0 = termFreq=19.0
                4.890223 = idf(docFreq=907, maxDocs=44421)
                0.03125 = fieldNorm(doc=3417)
          0.05002069 = weight(abstract_txt:papers in 3417) [ClassicSimilarity], result of:
            0.05002069 = score(doc=3417,freq=4.0), product of:
              0.15200053 = queryWeight, product of:
                1.4129443 = boost
                5.2653174 = idf(docFreq=623, maxDocs=44421)
                0.020431276 = queryNorm
              0.32908234 = fieldWeight in 3417, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.2653174 = idf(docFreq=623, maxDocs=44421)
                0.03125 = fieldNorm(doc=3417)
          0.03354382 = weight(abstract_txt:understanding in 3417) [ClassicSimilarity], result of:
            0.03354382 = score(doc=3417,freq=2.0), product of:
              0.16795544 = queryWeight, product of:
                1.8190522 = boost
                4.5191154 = idf(docFreq=1315, maxDocs=44421)
                0.020431276 = queryNorm
              0.19971856 = fieldWeight in 3417, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5191154 = idf(docFreq=1315, maxDocs=44421)
                0.03125 = fieldNorm(doc=3417)
          0.26201 = weight(abstract_txt:statistics in 3417) [ClassicSimilarity], result of:
            0.26201 = score(doc=3417,freq=17.0), product of:
              0.32398483 = queryWeight, product of:
                2.5264485 = boost
                6.2765174 = idf(docFreq=226, maxDocs=44421)
                0.020431276 = queryNorm
              0.80871075 = fieldWeight in 3417, product of:
                4.1231055 = tf(freq=17.0), with freq of:
                  17.0 = termFreq=17.0
                6.2765174 = idf(docFreq=226, maxDocs=44421)
                0.03125 = fieldNorm(doc=3417)
        0.2 = coord(5/25)
    
  3. Zhu, B.; Chen, H.: Information visualization (2004) 0.09
    0.090689935 = sum of:
      0.090689935 = product of:
        0.45344967 = sum of:
          0.021302275 = weight(abstract_txt:environment in 5276) [ClassicSimilarity], result of:
            0.021302275 = score(doc=5276,freq=1.0), product of:
              0.11769987 = queryWeight, product of:
                1.2433417 = boost
                4.6332955 = idf(docFreq=1173, maxDocs=44421)
                0.020431276 = queryNorm
              0.1809881 = fieldWeight in 5276, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6332955 = idf(docFreq=1173, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5276)
          0.04024271 = weight(abstract_txt:effort in 5276) [ClassicSimilarity], result of:
            0.04024271 = score(doc=5276,freq=1.0), product of:
              0.17986643 = queryWeight, product of:
                1.537013 = boost
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.020431276 = queryNorm
              0.22373663 = fieldWeight in 5276, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.727658 = idf(docFreq=392, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5276)
          0.02119707 = weight(abstract_txt:text in 5276) [ClassicSimilarity], result of:
            0.02119707 = score(doc=5276,freq=1.0), product of:
              0.13428874 = queryWeight, product of:
                1.6265519 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.020431276 = queryNorm
              0.15784696 = fieldWeight in 5276, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5276)
          0.041929778 = weight(abstract_txt:understanding in 5276) [ClassicSimilarity], result of:
            0.041929778 = score(doc=5276,freq=2.0), product of:
              0.16795544 = queryWeight, product of:
                1.8190522 = boost
                4.5191154 = idf(docFreq=1315, maxDocs=44421)
                0.020431276 = queryNorm
              0.24964821 = fieldWeight in 5276, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5191154 = idf(docFreq=1315, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5276)
          0.32877782 = weight(abstract_txt:visualization in 5276) [ClassicSimilarity], result of:
            0.32877782 = score(doc=5276,freq=18.0), product of:
              0.31868863 = queryWeight, product of:
                2.5057135 = boost
                6.225004 = idf(docFreq=238, maxDocs=44421)
                0.020431276 = queryNorm
              1.0316584 = fieldWeight in 5276, product of:
                4.2426405 = tf(freq=18.0), with freq of:
                  18.0 = termFreq=18.0
                6.225004 = idf(docFreq=238, maxDocs=44421)
                0.0390625 = fieldNorm(doc=5276)
        0.2 = coord(5/25)
    
  4. Information visualization : human-centered issues and perspectives (2008) 0.08
    0.0830385 = sum of:
      0.0830385 = product of:
        0.6919875 = sum of:
          0.06252586 = weight(abstract_txt:papers in 272) [ClassicSimilarity], result of:
            0.06252586 = score(doc=272,freq=1.0), product of:
              0.15200053 = queryWeight, product of:
                1.4129443 = boost
                5.2653174 = idf(docFreq=623, maxDocs=44421)
                0.020431276 = queryNorm
              0.41135293 = fieldWeight in 272, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2653174 = idf(docFreq=623, maxDocs=44421)
                0.078125 = fieldNorm(doc=272)
          0.19109127 = weight(abstract_txt:analytics in 272) [ClassicSimilarity], result of:
            0.19109127 = score(doc=272,freq=1.0), product of:
              0.32010996 = queryWeight, product of:
                2.0504637 = boost
                7.6410246 = idf(docFreq=57, maxDocs=44421)
                0.020431276 = queryNorm
              0.59695506 = fieldWeight in 272, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6410246 = idf(docFreq=57, maxDocs=44421)
                0.078125 = fieldNorm(doc=272)
          0.4383704 = weight(abstract_txt:visualization in 272) [ClassicSimilarity], result of:
            0.4383704 = score(doc=272,freq=8.0), product of:
              0.31868863 = queryWeight, product of:
                2.5057135 = boost
                6.225004 = idf(docFreq=238, maxDocs=44421)
                0.020431276 = queryNorm
              1.3755445 = fieldWeight in 272, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.225004 = idf(docFreq=238, maxDocs=44421)
                0.078125 = fieldNorm(doc=272)
        0.12 = coord(3/25)
    
  5. Parsons, P.; Sedig, K.: Adjustable properties of visual representations : improving the quality of human-information interaction (2014) 0.08
    0.07769091 = sum of:
      0.07769091 = product of:
        0.4855682 = sum of:
          0.07093075 = weight(abstract_txt:coordination in 2214) [ClassicSimilarity], result of:
            0.07093075 = score(doc=2214,freq=1.0), product of:
              0.15227374 = queryWeight, product of:
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.020431276 = queryNorm
              0.46581078 = fieldWeight in 2214, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.0625 = fieldNorm(doc=2214)
          0.07445248 = weight(abstract_txt:coordinated in 2214) [ClassicSimilarity], result of:
            0.07445248 = score(doc=2214,freq=1.0), product of:
              0.15727322 = queryWeight, product of:
                1.0162835 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.020431276 = queryNorm
              0.47339582 = fieldWeight in 2214, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0625 = fieldNorm(doc=2214)
          0.21619508 = weight(abstract_txt:analytics in 2214) [ClassicSimilarity], result of:
            0.21619508 = score(doc=2214,freq=2.0), product of:
              0.32010996 = queryWeight, product of:
                2.0504637 = boost
                7.6410246 = idf(docFreq=57, maxDocs=44421)
                0.020431276 = queryNorm
              0.67537755 = fieldWeight in 2214, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.6410246 = idf(docFreq=57, maxDocs=44421)
                0.0625 = fieldNorm(doc=2214)
          0.12398988 = weight(abstract_txt:visualization in 2214) [ClassicSimilarity], result of:
            0.12398988 = score(doc=2214,freq=1.0), product of:
              0.31868863 = queryWeight, product of:
                2.5057135 = boost
                6.225004 = idf(docFreq=238, maxDocs=44421)
                0.020431276 = queryNorm
              0.38906276 = fieldWeight in 2214, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.225004 = idf(docFreq=238, maxDocs=44421)
                0.0625 = fieldNorm(doc=2214)
        0.16 = coord(4/25)