Document (#36227)

Author
Berendt, B.
Krause, B.
Kolbe-Nusser, S.
Title
Intelligent scientific authoring tools : interactive data mining for constructive uses of citation networks
Source
Information processing and management. 46(2010) no.1, pp. 1-10
Year
2010
Abstract
Many powerful methods and tools exist for extracting meaning from scientific publications, their texts, and their citation links. However, existing proposals often neglect a fundamental aspect of learning: that understanding and learning require an active and constructive exploration of a domain. In this paper, we describe a new method and a tool that use data mining and interactivity to turn the typical search and retrieve dialogue, in which the user asks questions and a system gives answers, into a dialogue that also involves sense-making, in which the user has to become active by constructing a bibliography and a domain model of the search term(s). This model starts from an automatically generated and annotated clustering solution that is iteratively modified by users. The tool is part of an integrated authoring system covering all phases from search through reading and sense-making to writing. Two evaluation studies demonstrate the usability of this interactive and constructive approach, and they show that clusters and groups represent identifiable sub-topics.
Theme
Data Mining

Similar documents (author)

  1. Krause, J.: Praxisorientierte natürlichsprachliche Frage-Antwort-Systeme : zur Entwicklung vor allem in der Bundesrepublik Deutschland (1983) 4.95
    4.9482985 = sum of:
      4.9482985 = weight(author_txt:krause in 5187) [ClassicSimilarity], result of:
        4.9482985 = fieldWeight in 5187, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.917278 = idf(docFreq=43, maxDocs=44421)
          0.625 = fieldNorm(doc=5187)
    
  2. Krause, J.: Mensch-Maschine-Interaktion in natürlicher Sprache : zur Bewertung eines natürlichsprachigen Frage-Antwort-Systems (1980) 4.95
    4.9482985 = sum of:
      4.9482985 = weight(author_txt:krause in 5497) [ClassicSimilarity], result of:
        4.9482985 = fieldWeight in 5497, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.917278 = idf(docFreq=43, maxDocs=44421)
          0.625 = fieldNorm(doc=5497)
    
  3. Krause, M.G.: Intellectual problems of indexing picture collections (1988) 4.95
    4.9482985 = sum of:
      4.9482985 = weight(author_txt:krause in 5637) [ClassicSimilarity], result of:
        4.9482985 = fieldWeight in 5637, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.917278 = idf(docFreq=43, maxDocs=44421)
          0.625 = fieldNorm(doc=5637)
    
  4. Krause, J.: Was leisten informationslinguistische Komponenten von Referenz-Retrievalsystemen für Massendaten? : Von der 'Pragmatik im Computer' zur Pragmatikanalyse als Designgrundlage (1986) 4.95
    4.9482985 = sum of:
      4.9482985 = weight(author_txt:krause in 7394) [ClassicSimilarity], result of:
        4.9482985 = fieldWeight in 7394, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.917278 = idf(docFreq=43, maxDocs=44421)
          0.625 = fieldNorm(doc=7394)
    
  5. Krause, J.: Mensch-Maschine-Interaktion in natürlicher Sprache : Evaluierungsstudien zu praxisorientierten Frage-Antwort-Systemen und ihre Methodik (1982) 4.95
    4.9482985 = sum of:
      4.9482985 = weight(author_txt:krause in 578) [ClassicSimilarity], result of:
        4.9482985 = fieldWeight in 578, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.917278 = idf(docFreq=43, maxDocs=44421)
          0.625 = fieldNorm(doc=578)
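
The author-match scores above all follow the same arithmetic, which can be sketched in a few lines. This is a minimal illustration, not the catalog's own code: the function names are hypothetical, and the idf formula 1 + ln(maxDocs / (docFreq + 1)) is an assumption chosen because it is consistent with the idf values printed in this listing.

```python
import math

# Hypothetical helper names; the formulas are assumptions that are
# consistent with the numbers printed in the explain trees above.

def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def classic_tf(freq: float) -> float:
    """Term frequency factor: square root of the raw term count."""
    return math.sqrt(freq)

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in each tree above."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Values taken from the entry for author_txt:krause (docFreq=43, maxDocs=44421).
idf = classic_idf(43, 44421)
score = field_weight(freq=1.0, doc_freq=43, max_docs=44421, field_norm=0.625)
print(f"idf={idf:.6f} score={score:.6f}")
```

Because every hit in this list matches the single term "krause" once in a field with the same fieldNorm, all five entries receive the same score. Small differences in the last digits relative to the listing are expected, since the listing appears to use single-precision arithmetic.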
    

Similar documents (content)

  1. Cooper, L.; Kuhlthau, C.C.: Imagery for constructing meaning in the information search process : a study of middle school students (1999) 0.15
    0.15301213 = sum of:
      0.15301213 = product of:
        0.63755053 = sum of:
          0.01478598 = weight(abstract_txt:user in 1280) [ClassicSimilarity], result of:
            0.01478598 = score(doc=1280,freq=1.0), product of:
              0.073453374 = queryWeight, product of:
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.019955447 = queryNorm
              0.20129749 = fieldWeight in 1280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1280)
          0.013214445 = weight(abstract_txt:from in 1280) [ClassicSimilarity], result of:
            0.013214445 = score(doc=1280,freq=2.0), product of:
              0.06192006 = queryWeight, product of:
                1.1244895 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.019955447 = queryNorm
              0.21341136 = fieldWeight in 1280, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1280)
          0.03161571 = weight(abstract_txt:learning in 1280) [ClassicSimilarity], result of:
            0.03161571 = score(doc=1280,freq=1.0), product of:
              0.12191215 = queryWeight, product of:
                1.2883018 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.019955447 = queryNorm
              0.2593319 = fieldWeight in 1280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1280)
          0.03691339 = weight(abstract_txt:making in 1280) [ClassicSimilarity], result of:
            0.03691339 = score(doc=1280,freq=1.0), product of:
              0.13517644 = queryWeight, product of:
                1.3565775 = boost
                4.9933834 = idf(docFreq=818, maxDocs=44421)
                0.019955447 = queryNorm
              0.27307564 = fieldWeight in 1280, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9933834 = idf(docFreq=818, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1280)
          0.019607715 = weight(abstract_txt:that in 1280) [ClassicSimilarity], result of:
            0.019607715 = score(doc=1280,freq=4.0), product of:
              0.07580357 = queryWeight, product of:
                1.6062346 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.019955447 = queryNorm
              0.2586648 = fieldWeight in 1280, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1280)
          0.52141327 = weight(abstract_txt:constructive in 1280) [ClassicSimilarity], result of:
            0.52141327 = score(doc=1280,freq=4.0), product of:
              0.569609 = queryWeight, product of:
                3.4105794 = boost
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.019955447 = queryNorm
              0.9153881 = fieldWeight in 1280, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1280)
        0.24 = coord(6/25)
    
  2. Lee, S.-C.: The utilization and selection of authoring software (1993) 0.14
    0.14265466 = sum of:
      0.14265466 = product of:
        0.89159167 = sum of:
          0.059665333 = weight(abstract_txt:tools in 4566) [ClassicSimilarity], result of:
            0.059665333 = score(doc=4566,freq=1.0), product of:
              0.10729474 = queryWeight, product of:
                1.2086021 = boost
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.019955447 = queryNorm
              0.55608815 = fieldWeight in 4566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.125 = fieldNorm(doc=4566)
          0.08215338 = weight(abstract_txt:tool in 4566) [ClassicSimilarity], result of:
            0.08215338 = score(doc=4566,freq=1.0), product of:
              0.13279468 = queryWeight, product of:
                1.3445733 = boost
                4.9491973 = idf(docFreq=855, maxDocs=44421)
                0.019955447 = queryNorm
              0.61864966 = fieldWeight in 4566, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9491973 = idf(docFreq=855, maxDocs=44421)
                0.125 = fieldNorm(doc=4566)
          0.16448392 = weight(abstract_txt:interactive in 4566) [ClassicSimilarity], result of:
            0.16448392 = score(doc=4566,freq=2.0), product of:
              0.16743106 = queryWeight, product of:
                1.5097747 = boost
                5.557282 = idf(docFreq=465, maxDocs=44421)
                0.019955447 = queryNorm
              0.9823979 = fieldWeight in 4566, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.557282 = idf(docFreq=465, maxDocs=44421)
                0.125 = fieldNorm(doc=4566)
          0.585289 = weight(abstract_txt:authoring in 4566) [ClassicSimilarity], result of:
            0.585289 = score(doc=4566,freq=5.0), product of:
              0.28753275 = queryWeight, product of:
                1.9785079 = boost
                7.282627 = idf(docFreq=82, maxDocs=44421)
                0.019955447 = queryNorm
              2.035556 = fieldWeight in 4566, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.282627 = idf(docFreq=82, maxDocs=44421)
                0.125 = fieldNorm(doc=4566)
        0.16 = coord(4/25)
    
  3. Kuo, J.-S.; Li, H.; Yang, Y.-K.: Active learning for constructing transliteration lexicons from the Web (2008) 0.14
    0.1418197 = sum of:
      0.1418197 = product of:
        0.59091544 = sum of:
          0.119194284 = weight(abstract_txt:starts in 2345) [ClassicSimilarity], result of:
            0.119194284 = score(doc=2345,freq=1.0), product of:
              0.16363762 = queryWeight, product of:
                1.0554088 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.019955447 = queryNorm
              0.7284039 = fieldWeight in 2345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.09375 = fieldNorm(doc=2345)
          0.016018325 = weight(abstract_txt:from in 2345) [ClassicSimilarity], result of:
            0.016018325 = score(doc=2345,freq=1.0), product of:
              0.06192006 = queryWeight, product of:
                1.1244895 = boost
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.019955447 = queryNorm
              0.25869364 = fieldWeight in 2345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.759399 = idf(docFreq=7646, maxDocs=44421)
                0.09375 = fieldNorm(doc=2345)
          0.16487183 = weight(abstract_txt:iteratively in 2345) [ClassicSimilarity], result of:
            0.16487183 = score(doc=2345,freq=1.0), product of:
              0.20314704 = queryWeight, product of:
                1.1759379 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.019955447 = queryNorm
              0.81158864 = fieldWeight in 2345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.09375 = fieldNorm(doc=2345)
          0.13275833 = weight(abstract_txt:learning in 2345) [ClassicSimilarity], result of:
            0.13275833 = score(doc=2345,freq=6.0), product of:
              0.12191215 = queryWeight, product of:
                1.2883018 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.019955447 = queryNorm
              1.0889672 = fieldWeight in 2345, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.09375 = fieldNorm(doc=2345)
          0.029109905 = weight(abstract_txt:that in 2345) [ClassicSimilarity], result of:
            0.029109905 = score(doc=2345,freq=3.0), product of:
              0.07580357 = queryWeight, product of:
                1.6062346 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.019955447 = queryNorm
              0.3840176 = fieldWeight in 2345, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=2345)
          0.12896276 = weight(abstract_txt:active in 2345) [ClassicSimilarity], result of:
            0.12896276 = score(doc=2345,freq=1.0), product of:
              0.21728633 = queryWeight, product of:
                1.7199283 = boost
                6.3308296 = idf(docFreq=214, maxDocs=44421)
                0.019955447 = queryNorm
              0.5935153 = fieldWeight in 2345, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3308296 = idf(docFreq=214, maxDocs=44421)
                0.09375 = fieldNorm(doc=2345)
        0.24 = coord(6/25)
    
  4. Patton, M.; Reynolds, D.; Choudhury, G.S.; DiLauro, T.: Toward a metadata generation framework : a case study at Johns Hopkins University (2004) 0.13
    0.13343957 = sum of:
      0.13343957 = product of:
        0.4765699 = sum of:
          0.031642318 = weight(abstract_txt:tools in 2192) [ClassicSimilarity], result of:
            0.031642318 = score(doc=2192,freq=2.0), product of:
              0.10729474 = queryWeight, product of:
                1.2086021 = boost
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.019955447 = queryNorm
              0.29491025 = fieldWeight in 2192, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.046875 = fieldNorm(doc=2192)
          0.025262894 = weight(abstract_txt:scientific in 2192) [ClassicSimilarity], result of:
            0.025262894 = score(doc=2192,freq=1.0), product of:
              0.11634068 = queryWeight, product of:
                1.2585194 = boost
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.019955447 = queryNorm
              0.21714583 = fieldWeight in 2192, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.046875 = fieldNorm(doc=2192)
          0.04356841 = weight(abstract_txt:tool in 2192) [ClassicSimilarity], result of:
            0.04356841 = score(doc=2192,freq=2.0), product of:
              0.13279468 = queryWeight, product of:
                1.3445733 = boost
                4.9491973 = idf(docFreq=855, maxDocs=44421)
                0.019955447 = queryNorm
              0.32808852 = fieldWeight in 2192, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9491973 = idf(docFreq=855, maxDocs=44421)
                0.046875 = fieldNorm(doc=2192)
          0.031640053 = weight(abstract_txt:making in 2192) [ClassicSimilarity], result of:
            0.031640053 = score(doc=2192,freq=1.0), product of:
              0.13517644 = queryWeight, product of:
                1.3565775 = boost
                4.9933834 = idf(docFreq=818, maxDocs=44421)
                0.019955447 = queryNorm
              0.23406485 = fieldWeight in 2192, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9933834 = idf(docFreq=818, maxDocs=44421)
                0.046875 = fieldNorm(doc=2192)
          0.014554952 = weight(abstract_txt:that in 2192) [ClassicSimilarity], result of:
            0.014554952 = score(doc=2192,freq=3.0), product of:
              0.07580357 = queryWeight, product of:
                1.6062346 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.019955447 = queryNorm
              0.1920088 = fieldWeight in 2192, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.046875 = fieldNorm(doc=2192)
          0.10643846 = weight(abstract_txt:dialogue in 2192) [ClassicSimilarity], result of:
            0.10643846 = score(doc=2192,freq=1.0), product of:
              0.30348828 = queryWeight, product of:
                2.0326617 = boost
                7.48196 = idf(docFreq=67, maxDocs=44421)
                0.019955447 = queryNorm
              0.35071686 = fieldWeight in 2192, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.48196 = idf(docFreq=67, maxDocs=44421)
                0.046875 = fieldNorm(doc=2192)
          0.22346283 = weight(abstract_txt:constructive in 2192) [ClassicSimilarity], result of:
            0.22346283 = score(doc=2192,freq=1.0), product of:
              0.569609 = queryWeight, product of:
                3.4105794 = boost
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.019955447 = queryNorm
              0.3923092 = fieldWeight in 2192, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.046875 = fieldNorm(doc=2192)
        0.28 = coord(7/25)
    
  5. Kuhlthau, C.C.; Tama, S.L.: Information search process of lawyers : a call for 'just for me' information services (2001) 0.12
    0.11728367 = sum of:
      0.11728367 = product of:
        0.48868197 = sum of:
          0.021406755 = weight(abstract_txt:model in 5492) [ClassicSimilarity], result of:
            0.021406755 = score(doc=5492,freq=1.0), product of:
              0.085997194 = queryWeight, product of:
                1.0820224 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.019955447 = queryNorm
              0.24892388 = fieldWeight in 5492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=5492)
          0.036132243 = weight(abstract_txt:learning in 5492) [ClassicSimilarity], result of:
            0.036132243 = score(doc=5492,freq=1.0), product of:
              0.12191215 = queryWeight, product of:
                1.2883018 = boost
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.019955447 = queryNorm
              0.29637933 = fieldWeight in 5492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7420692 = idf(docFreq=1052, maxDocs=44421)
                0.0625 = fieldNorm(doc=5492)
          0.024808543 = weight(abstract_txt:search in 5492) [ClassicSimilarity], result of:
            0.024808543 = score(doc=5492,freq=1.0), product of:
              0.108612955 = queryWeight, product of:
                1.4892944 = boost
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.019955447 = queryNorm
              0.22841237 = fieldWeight in 5492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.654598 = idf(docFreq=3123, maxDocs=44421)
                0.0625 = fieldNorm(doc=5492)
          0.022408817 = weight(abstract_txt:that in 5492) [ClassicSimilarity], result of:
            0.022408817 = score(doc=5492,freq=4.0), product of:
              0.07580357 = queryWeight, product of:
                1.6062346 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.019955447 = queryNorm
              0.2956169 = fieldWeight in 5492, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=5492)
          0.08597517 = weight(abstract_txt:active in 5492) [ClassicSimilarity], result of:
            0.08597517 = score(doc=5492,freq=1.0), product of:
              0.21728633 = queryWeight, product of:
                1.7199283 = boost
                6.3308296 = idf(docFreq=214, maxDocs=44421)
                0.019955447 = queryNorm
              0.39567685 = fieldWeight in 5492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3308296 = idf(docFreq=214, maxDocs=44421)
                0.0625 = fieldNorm(doc=5492)
          0.29795045 = weight(abstract_txt:constructive in 5492) [ClassicSimilarity], result of:
            0.29795045 = score(doc=5492,freq=1.0), product of:
              0.569609 = queryWeight, product of:
                3.4105794 = boost
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.019955447 = queryNorm
              0.5230789 = fieldWeight in 5492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.0625 = fieldNorm(doc=5492)
        0.24 = coord(6/25)
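
The content-based scores above combine several per-term weights with a coordination factor. The following sketch recomputes the first hit (doc=1280) from the parameters shown in its tree. It is an illustration under stated assumptions: the function and variable names are hypothetical, the idf formula is the one that reproduces the printed idf values, and a boost of 1.0 is assumed for the term "user", whose tree shows no boost line.

```python
import math

MAX_DOCS = 44421          # maxDocs, from the listing
QUERY_NORM = 0.019955447  # queryNorm, from the listing
FIELD_NORM = 0.0546875    # fieldNorm(doc=1280), from the listing

def classic_idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Assumed idf formula: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(freq: float, boost: float, doc_freq: int) -> float:
    """score = queryWeight * fieldWeight for one matching term."""
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM          # boost * idf * queryNorm
    field_weight = math.sqrt(freq) * idf * FIELD_NORM  # tf * idf * fieldNorm
    return query_weight * field_weight

# (term, freq, boost, docFreq) as printed for doc=1280.
matched_terms = [
    ("user",         1.0, 1.0,       3042),   # boost assumed to be 1.0
    ("from",         2.0, 1.1244895, 7646),
    ("learning",     1.0, 1.2883018, 1052),
    ("making",       1.0, 1.3565775, 818),
    ("that",         4.0, 1.6062346, 11344),
    ("constructive", 4.0, 3.4105794, 27),
]

per_term = {term: term_score(freq, boost, df) for term, freq, boost, df in matched_terms}
coord = len(matched_terms) / 25   # coord(6/25): 6 of 25 query terms matched
total = sum(per_term.values()) * coord

for term, value in per_term.items():
    print(f"{term:>12}: {value:.6f}")
print(f"total score: {total:.6f}")  # close to the 0.15301213 shown in the listing
```

The rare term "constructive" (docFreq=27) dominates the sum because idf enters the score twice, once through queryWeight and once through fieldWeight, while common words such as "that" and "from" contribute little despite higher term counts.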