Document (#35830)

Author
Dang, X.H.
Ong, K.-L.
Title
Knowledge discovery in data streams
Source
Encyclopedia of library and information sciences. 3rd ed. Ed.: M.J. Bates
Imprint
London : Taylor & Francis
Year
2009
Pages
pp. xx-xx
Abstract
Knowing what to do with the massive amount of data collected has long been an issue for many organizations. While data mining has been touted as the solution, it has failed to deliver the expected impact despite its successes in many areas. One reason is that data mining algorithms were not designed for the real world, i.e., they usually assume a static view of the data and a stable execution environment where resources are abundant. The reality, however, is that data are constantly changing and the execution environment is dynamic. Hence, it becomes difficult for data mining to truly deliver timely and relevant results. Recently, the processing of stream data has received much attention. What is interesting is that the methodology used to design stream-based algorithms may well be the solution to the above problem. In this entry, we discuss this issue and present an overview of recent works.
Footnote
See: http://www.tandfonline.com/doi/book/10.1081/E-ELIS3.
Theme
Data Mining

Similar documents (author)

  1. Over, P.; Dang, H.; Harman, D.: DUC in context (2007) 3.56
    3.5640912 = sum of:
      3.5640912 = weight(author_txt:dang in 1934) [ClassicSimilarity], result of:
        3.5640912 = fieldWeight in 1934, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.375 = fieldNorm(doc=1934)
    
  2. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: Beyond bag-of-words : bigram-enhanced context-dependent term weights (2014) 3.56
    3.5640912 = sum of:
      3.5640912 = weight(author_txt:dang in 2283) [ClassicSimilarity], result of:
        3.5640912 = fieldWeight in 2283, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.375 = fieldNorm(doc=2283)
    
  3. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: ¬A context-dependent relevance model (2016) 3.56
    3.5640912 = sum of:
      3.5640912 = weight(author_txt:dang in 3778) [ClassicSimilarity], result of:
        3.5640912 = fieldWeight in 3778, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.375 = fieldNorm(doc=3778)
    
  4. Dang, E.K.F.; Luk, R.W.P.; Allan, J.: ¬A retrieval model family based on the probability ranking principle for ad hoc retrieval (2022) 3.56
    3.5640912 = sum of:
      3.5640912 = weight(author_txt:dang in 1639) [ClassicSimilarity], result of:
        3.5640912 = fieldWeight in 1639, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.375 = fieldNorm(doc=1639)
    
  5. Dang, E.K.F.; Luk, R.W.P.; Ho, K.S.; Chan, S.C.F.; Lee, D.L.: ¬A new measure of clustering effectiveness : algorithms and experimental studies (2008) 2.97
    2.9700758 = sum of:
      2.9700758 = weight(author_txt:dang in 2367) [ClassicSimilarity], result of:
        2.9700758 = fieldWeight in 2367, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.3125 = fieldNorm(doc=2367)
    

Similar documents (content)

  1. Calvanese, D.; Kalayci, T.E.; Montali, M.; Santoso, A.: OBDA for log extraction in process mining (2017) 0.22
    0.22232859 = sum of:
      0.22232859 = product of:
        0.79403067 = sum of:
          0.016044155 = weight(abstract_txt:been in 4931) [ClassicSimilarity], result of:
            0.016044155 = score(doc=4931,freq=1.0), product of:
              0.071022436 = queryWeight, product of:
                1.0157921 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.01934414 = queryNorm
              0.22590263 = fieldWeight in 4931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.0625 = fieldNorm(doc=4931)
          0.04657593 = weight(abstract_txt:issue in 4931) [ClassicSimilarity], result of:
            0.04657593 = score(doc=4931,freq=1.0), product of:
              0.14453022 = queryWeight, product of:
                1.4490601 = boost
                5.156118 = idf(docFreq=695, maxDocs=44421)
                0.01934414 = queryNorm
              0.32225737 = fieldWeight in 4931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.156118 = idf(docFreq=695, maxDocs=44421)
                0.0625 = fieldNorm(doc=4931)
          0.07164701 = weight(abstract_txt:solution in 4931) [ClassicSimilarity], result of:
            0.07164701 = score(doc=4931,freq=1.0), product of:
              0.19259708 = queryWeight, product of:
                1.6727533 = boost
                5.9520745 = idf(docFreq=313, maxDocs=44421)
                0.01934414 = queryNorm
              0.37200466 = fieldWeight in 4931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9520745 = idf(docFreq=313, maxDocs=44421)
                0.0625 = fieldNorm(doc=4931)
          0.034609493 = weight(abstract_txt:many in 4931) [ClassicSimilarity], result of:
            0.034609493 = score(doc=4931,freq=1.0), product of:
              0.13573074 = queryWeight, product of:
                1.719855 = boost
                4.0797825 = idf(docFreq=2041, maxDocs=44421)
                0.01934414 = queryNorm
              0.2549864 = fieldWeight in 4931, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0797825 = idf(docFreq=2041, maxDocs=44421)
                0.0625 = fieldNorm(doc=4931)
          0.26253566 = weight(abstract_txt:execution in 4931) [ClassicSimilarity], result of:
            0.26253566 = score(doc=4931,freq=2.0), product of:
              0.3633288 = queryWeight, product of:
                2.2975078 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.01934414 = queryNorm
              0.7225842 = fieldWeight in 4931, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.0625 = fieldNorm(doc=4931)
          0.23966251 = weight(abstract_txt:mining in 4931) [ClassicSimilarity], result of:
            0.23966251 = score(doc=4931,freq=4.0), product of:
              0.31064293 = queryWeight, product of:
                2.601857 = boost
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.01934414 = queryNorm
              0.7715048 = fieldWeight in 4931, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.0625 = fieldNorm(doc=4931)
          0.12295594 = weight(abstract_txt:data in 4931) [ClassicSimilarity], result of:
            0.12295594 = score(doc=4931,freq=6.0), product of:
              0.2411683 = queryWeight, product of:
                3.7436666 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01934414 = queryNorm
              0.5098346 = fieldWeight in 4931, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=4931)
        0.28 = coord(7/25)
    
  2. Liu, B.: Web data mining : exploring hyperlinks, contents, and usage data (2011) 0.11
    0.11293622 = sum of:
      0.11293622 = product of:
        0.7058514 = sum of:
          0.10899089 = weight(abstract_txt:algorithms in 1354) [ClassicSimilarity], result of:
            0.10899089 = score(doc=1354,freq=3.0), product of:
              0.17663254 = queryWeight, product of:
                1.6019258 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.01934414 = queryNorm
              0.6170488 = fieldWeight in 1354, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.0625 = fieldNorm(doc=1354)
          0.04894522 = weight(abstract_txt:many in 1354) [ClassicSimilarity], result of:
            0.04894522 = score(doc=1354,freq=2.0), product of:
              0.13573074 = queryWeight, product of:
                1.719855 = boost
                4.0797825 = idf(docFreq=2041, maxDocs=44421)
                0.01934414 = queryNorm
              0.36060524 = fieldWeight in 1354, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0797825 = idf(docFreq=2041, maxDocs=44421)
                0.0625 = fieldNorm(doc=1354)
          0.41510764 = weight(abstract_txt:mining in 1354) [ClassicSimilarity], result of:
            0.41510764 = score(doc=1354,freq=12.0), product of:
              0.31064293 = queryWeight, product of:
                2.601857 = boost
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.01934414 = queryNorm
              1.3362855 = fieldWeight in 1354, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.0625 = fieldNorm(doc=1354)
          0.1328076 = weight(abstract_txt:data in 1354) [ClassicSimilarity], result of:
            0.1328076 = score(doc=1354,freq=7.0), product of:
              0.2411683 = queryWeight, product of:
                3.7436666 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01934414 = queryNorm
              0.5506843 = fieldWeight in 1354, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=1354)
        0.16 = coord(4/25)
    
  3. Haslhofer, B.: ¬A Web-based mapping technique for establishing metadata interoperability (2008) 0.10
    0.09937838 = sum of:
      0.09937838 = product of:
        0.41407657 = sum of:
          0.03658505 = weight(abstract_txt:environment in 160) [ClassicSimilarity], result of:
            0.03658505 = score(doc=160,freq=3.0), product of:
              0.11670595 = queryWeight, product of:
                1.3021276 = boost
                4.6332955 = idf(docFreq=1173, maxDocs=44421)
                0.01934414 = queryNorm
              0.31348062 = fieldWeight in 160, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6332955 = idf(docFreq=1173, maxDocs=44421)
                0.0390625 = fieldNorm(doc=160)
          0.0393287 = weight(abstract_txt:algorithms in 160) [ClassicSimilarity], result of:
            0.0393287 = score(doc=160,freq=1.0), product of:
              0.17663254 = queryWeight, product of:
                1.6019258 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.01934414 = queryNorm
              0.2226583 = fieldWeight in 160, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.0390625 = fieldNorm(doc=160)
          0.08955876 = weight(abstract_txt:solution in 160) [ClassicSimilarity], result of:
            0.08955876 = score(doc=160,freq=4.0), product of:
              0.19259708 = queryWeight, product of:
                1.6727533 = boost
                5.9520745 = idf(docFreq=313, maxDocs=44421)
                0.01934414 = queryNorm
              0.46500582 = fieldWeight in 160, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.9520745 = idf(docFreq=313, maxDocs=44421)
                0.0390625 = fieldNorm(doc=160)
          0.07823927 = weight(abstract_txt:deliver in 160) [ClassicSimilarity], result of:
            0.07823927 = score(doc=160,freq=1.0), product of:
              0.27939212 = queryWeight, product of:
                2.0147173 = boost
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.01934414 = queryNorm
              0.28003392 = fieldWeight in 160, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.0390625 = fieldNorm(doc=160)
          0.11602546 = weight(abstract_txt:execution in 160) [ClassicSimilarity], result of:
            0.11602546 = score(doc=160,freq=1.0), product of:
              0.3633288 = queryWeight, product of:
                2.2975078 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.01934414 = queryNorm
              0.3193401 = fieldWeight in 160, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.0390625 = fieldNorm(doc=160)
          0.054339364 = weight(abstract_txt:data in 160) [ClassicSimilarity], result of:
            0.054339364 = score(doc=160,freq=3.0), product of:
              0.2411683 = queryWeight, product of:
                3.7436666 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01934414 = queryNorm
              0.22531718 = fieldWeight in 160, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0390625 = fieldNorm(doc=160)
        0.24 = coord(6/25)
    
  4. Maron, M.E.: Theory and foundation of information retrieval : some introductory remarks (1978) 0.09
    0.09497065 = sum of:
      0.09497065 = product of:
        0.39571106 = sum of:
          0.024066234 = weight(abstract_txt:been in 7406) [ClassicSimilarity], result of:
            0.024066234 = score(doc=7406,freq=1.0), product of:
              0.071022436 = queryWeight, product of:
                1.0157921 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.01934414 = queryNorm
              0.33885396 = fieldWeight in 7406, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.09375 = fieldNorm(doc=7406)
          0.10489646 = weight(abstract_txt:truly in 7406) [ClassicSimilarity], result of:
            0.10489646 = score(doc=7406,freq=1.0), product of:
              0.15041369 = queryWeight, product of:
                1.0452875 = boost
                7.438788 = idf(docFreq=70, maxDocs=44421)
                0.01934414 = queryNorm
              0.6973864 = fieldWeight in 7406, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.438788 = idf(docFreq=70, maxDocs=44421)
                0.09375 = fieldNorm(doc=7406)
          0.040736824 = weight(abstract_txt:what in 7406) [ClassicSimilarity], result of:
            0.040736824 = score(doc=7406,freq=1.0), product of:
              0.10087454 = queryWeight, product of:
                1.2105922 = boost
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.01934414 = queryNorm
              0.40383652 = fieldWeight in 7406, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.09375 = fieldNorm(doc=7406)
          0.09880247 = weight(abstract_txt:issue in 7406) [ClassicSimilarity], result of:
            0.09880247 = score(doc=7406,freq=2.0), product of:
              0.14453022 = queryWeight, product of:
                1.4490601 = boost
                5.156118 = idf(docFreq=695, maxDocs=44421)
                0.01934414 = queryNorm
              0.68361115 = fieldWeight in 7406, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.156118 = idf(docFreq=695, maxDocs=44421)
                0.09375 = fieldNorm(doc=7406)
          0.05191424 = weight(abstract_txt:many in 7406) [ClassicSimilarity], result of:
            0.05191424 = score(doc=7406,freq=1.0), product of:
              0.13573074 = queryWeight, product of:
                1.719855 = boost
                4.0797825 = idf(docFreq=2041, maxDocs=44421)
                0.01934414 = queryNorm
              0.3824796 = fieldWeight in 7406, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.0797825 = idf(docFreq=2041, maxDocs=44421)
                0.09375 = fieldNorm(doc=7406)
          0.07529483 = weight(abstract_txt:data in 7406) [ClassicSimilarity], result of:
            0.07529483 = score(doc=7406,freq=1.0), product of:
              0.2411683 = queryWeight, product of:
                3.7436666 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01934414 = queryNorm
              0.31220865 = fieldWeight in 7406, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=7406)
        0.24 = coord(6/25)
    
  5. Mining text data (2012) 0.09
    0.089827165 = sum of:
      0.089827165 = product of:
        0.5614198 = sum of:
          0.016044155 = weight(abstract_txt:been in 1362) [ClassicSimilarity], result of:
            0.016044155 = score(doc=1362,freq=1.0), product of:
              0.071022436 = queryWeight, product of:
                1.0157921 = boost
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.01934414 = queryNorm
              0.22590263 = fieldWeight in 1362, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.614442 = idf(docFreq=3251, maxDocs=44421)
                0.0625 = fieldNorm(doc=1362)
          0.06292593 = weight(abstract_txt:algorithms in 1362) [ClassicSimilarity], result of:
            0.06292593 = score(doc=1362,freq=1.0), product of:
              0.17663254 = queryWeight, product of:
                1.6019258 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.01934414 = queryNorm
              0.3562533 = fieldWeight in 1362, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.0625 = fieldNorm(doc=1362)
          0.35949376 = weight(abstract_txt:mining in 1362) [ClassicSimilarity], result of:
            0.35949376 = score(doc=1362,freq=9.0), product of:
              0.31064293 = queryWeight, product of:
                2.601857 = boost
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.01934414 = queryNorm
              1.1572572 = fieldWeight in 1362, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.0625 = fieldNorm(doc=1362)
          0.12295594 = weight(abstract_txt:data in 1362) [ClassicSimilarity], result of:
            0.12295594 = score(doc=1362,freq=6.0), product of:
              0.2411683 = queryWeight, product of:
                3.7436666 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.01934414 = queryNorm
              0.5098346 = fieldWeight in 1362, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=1362)
        0.16 = coord(4/25)
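
Note on the score breakdowns: the figures listed under each similar document appear consistent with TF-IDF scoring in which tf is the square root of the term frequency and idf is 1 + ln(maxDocs / (docFreq + 1)). The following is a minimal sketch that reproduces two of the listed values under that assumption. The formulas are inferred from the numbers shown in this record, and the function names are hypothetical, not part of any library API.

```python
import math


def classic_tf(freq):
    # Assumed: tf is the square root of the raw term frequency.
    return math.sqrt(freq)


def classic_idf(doc_freq, max_docs):
    # Assumed: idf = 1 + ln(maxDocs / (docFreq + 1)), inferred from the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1.0))


def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


def term_score(freq, doc_freq, max_docs, field_norm, boost, query_norm):
    # score = queryWeight * fieldWeight, with queryWeight = boost * idf * queryNorm
    query_weight = boost * classic_idf(doc_freq, max_docs) * query_norm
    return query_weight * field_weight(freq, doc_freq, max_docs, field_norm)


# Author match "dang" in doc 1934: listed fieldWeight is 3.5640912.
author_weight = field_weight(1.0, 8, 44421, 0.375)

# Term "execution" in doc 4931: listed score is 0.26253566.
execution_score = term_score(2.0, 33, 44421, 0.0625, 2.2975078, 0.01934414)

print(round(author_weight, 4), round(execution_score, 4))
```

In the content-based listings, the per-term scores are summed and then multiplied by the coord factor (for example 7/25 when 7 of the 25 query terms match) to give the final document score.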