Document (#20911)

Author
Lingras, P.J.
Yao, Y.Y.
Title
Data mining using extensions of the rough set model
Source
Journal of the American Society for Information Science. 49(1998) no.5, S.415-422
Year
1998
Abstract
Examines basic issues of data mining using the theory of rough sets, which is a recent proposal for generalizing classical set theory. The Pawlak rough set model is based on the concept of an equivalence relation. A generalized rough set model need not be based on equivalence relation axioms. The Pawlak rough set model has been used for deriving deterministic as well as probabilistic rules froma complete database. Demonstrates that a generalised rough set model can be used for generating rules from incomplete databases. These rules are based on plausability functions proposed by Shafer. Discusses the importance of rule extraction from incomplete databases in data mining
Footnote
Contribution to a special issue devoted to knowledge discovery and data mining
Theme
Data Mining

Similar documents (content)

  1. Bell, D.A.; Guan, J.W.: Computational methods for rough classification and discovery (1998) 0.41
    0.41472152 = sum of:
      0.41472152 = product of:
        1.4811482 = sum of:
          0.013380828 = weight(abstract_txt:used in 3909) [ClassicSimilarity], result of:
            0.013380828 = score(doc=3909,freq=1.0), product of:
              0.04251425 = queryWeight, product of:
                1.031634 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.012275287 = queryNorm
              0.3147375 = fieldWeight in 3909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.09375 = fieldNorm(doc=3909)
          0.020659305 = weight(abstract_txt:using in 3909) [ClassicSimilarity], result of:
            0.020659305 = score(doc=3909,freq=2.0), product of:
              0.0450761 = queryWeight, product of:
                1.0622617 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.012275287 = queryNorm
              0.45832062 = fieldWeight in 3909, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.09375 = fieldNorm(doc=3909)
          0.030410312 = weight(abstract_txt:databases in 3909) [ClassicSimilarity], result of:
            0.030410312 = score(doc=3909,freq=1.0), product of:
              0.07348969 = queryWeight, product of:
                1.3563493 = boost
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.012275287 = queryNorm
              0.4138038 = fieldWeight in 3909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.09375 = fieldNorm(doc=3909)
          0.056931432 = weight(abstract_txt:theory in 3909) [ClassicSimilarity], result of:
            0.056931432 = score(doc=3909,freq=3.0), product of:
              0.07739986 = queryWeight, product of:
                1.3919654 = boost
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.012275287 = queryNorm
              0.73554957 = fieldWeight in 3909, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.09375 = fieldNorm(doc=3909)
          0.055255678 = weight(abstract_txt:relation in 3909) [ClassicSimilarity], result of:
            0.055255678 = score(doc=3909,freq=1.0), product of:
              0.109428495 = queryWeight, product of:
                1.6550975 = boost
                5.38611 = idf(docFreq=552, maxDocs=44421)
                0.012275287 = queryNorm
              0.5049478 = fieldWeight in 3909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.38611 = idf(docFreq=552, maxDocs=44421)
                0.09375 = fieldNorm(doc=3909)
          0.07625144 = weight(abstract_txt:rules in 3909) [ClassicSimilarity], result of:
            0.07625144 = score(doc=3909,freq=1.0), product of:
              0.1552655 = queryWeight, product of:
                2.4145794 = boost
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.012275287 = queryNorm
              0.4911036 = fieldWeight in 3909, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.09375 = fieldNorm(doc=3909)
          1.2282592 = weight(abstract_txt:rough in 3909) [ClassicSimilarity], result of:
            1.2282592 = score(doc=3909,freq=4.0), product of:
              0.78600675 = queryWeight, product of:
                7.683023 = boost
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.012275287 = queryNorm
              1.5626574 = fieldWeight in 3909, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.09375 = fieldNorm(doc=3909)
        0.28 = coord(7/25)
    
  2. Hassanien, A.-E.: Rough set approach for attribute reduction and rule generation : a case of patients with suspected breast cancer (2004) 0.34
    0.33766413 = sum of:
      0.33766413 = product of:
        1.2059433 = sum of:
          0.012615567 = weight(abstract_txt:used in 3883) [ClassicSimilarity], result of:
            0.012615567 = score(doc=3883,freq=2.0), product of:
              0.04251425 = queryWeight, product of:
                1.031634 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.012275287 = queryNorm
              0.29673737 = fieldWeight in 3883, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=3883)
          0.02027354 = weight(abstract_txt:databases in 3883) [ClassicSimilarity], result of:
            0.02027354 = score(doc=3883,freq=1.0), product of:
              0.07348969 = queryWeight, product of:
                1.3563493 = boost
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.012275287 = queryNorm
              0.2758692 = fieldWeight in 3883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.0625 = fieldNorm(doc=3883)
          0.030989548 = weight(abstract_txt:theory in 3883) [ClassicSimilarity], result of:
            0.030989548 = score(doc=3883,freq=2.0), product of:
              0.07739986 = queryWeight, product of:
                1.3919654 = boost
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.012275287 = queryNorm
              0.4003825 = fieldWeight in 3883, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.0625 = fieldNorm(doc=3883)
          0.011404913 = weight(abstract_txt:based in 3883) [ClassicSimilarity], result of:
            0.011404913 = score(doc=3883,freq=1.0), product of:
              0.057327773 = queryWeight, product of:
                1.4671906 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.012275287 = queryNorm
              0.1989422 = fieldWeight in 3883, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=3883)
          0.026121747 = weight(abstract_txt:data in 3883) [ClassicSimilarity], result of:
            0.026121747 = score(doc=3883,freq=4.0), product of:
              0.0627507 = queryWeight, product of:
                1.5350174 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.012275287 = queryNorm
              0.41627818 = fieldWeight in 3883, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=3883)
          0.10166859 = weight(abstract_txt:rules in 3883) [ClassicSimilarity], result of:
            0.10166859 = score(doc=3883,freq=4.0), product of:
              0.1552655 = queryWeight, product of:
                2.4145794 = boost
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.012275287 = queryNorm
              0.65480477 = fieldWeight in 3883, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.0625 = fieldNorm(doc=3883)
          1.0028695 = weight(abstract_txt:rough in 3883) [ClassicSimilarity], result of:
            1.0028695 = score(doc=3883,freq=6.0), product of:
              0.78600675 = queryWeight, product of:
                7.683023 = boost
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.012275287 = queryNorm
              1.2759044 = fieldWeight in 3883, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.0625 = fieldNorm(doc=3883)
        0.28 = coord(7/25)
    
  3. Yang, H.; King, I.; Lyu, M.R.: ¬The generalized dependency degree between attributes (2007) 0.29
    0.29486814 = sum of:
      0.29486814 = product of:
        0.92146295 = sum of:
          0.01377287 = weight(abstract_txt:using in 2322) [ClassicSimilarity], result of:
            0.01377287 = score(doc=2322,freq=2.0), product of:
              0.0450761 = queryWeight, product of:
                1.0622617 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.012275287 = queryNorm
              0.3055471 = fieldWeight in 2322, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=2322)
          0.058830447 = weight(abstract_txt:generalized in 2322) [ClassicSimilarity], result of:
            0.058830447 = score(doc=2322,freq=2.0), product of:
              0.09418638 = queryWeight, product of:
                1.0857689 = boost
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.012275287 = queryNorm
              0.62461734 = fieldWeight in 2322, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.0625 = fieldNorm(doc=2322)
          0.02191292 = weight(abstract_txt:theory in 2322) [ClassicSimilarity], result of:
            0.02191292 = score(doc=2322,freq=1.0), product of:
              0.07739986 = queryWeight, product of:
                1.3919654 = boost
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.012275287 = queryNorm
              0.28311318 = fieldWeight in 2322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.0625 = fieldNorm(doc=2322)
          0.12714148 = weight(abstract_txt:deterministic in 2322) [ClassicSimilarity], result of:
            0.12714148 = score(doc=2322,freq=2.0), product of:
              0.1574387 = queryWeight, product of:
                1.4037802 = boost
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.012275287 = queryNorm
              0.80756176 = fieldWeight in 2322, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.1365185 = idf(docFreq=12, maxDocs=44421)
                0.0625 = fieldNorm(doc=2322)
          0.100540206 = weight(abstract_txt:equivalence in 2322) [ClassicSimilarity], result of:
            0.100540206 = score(doc=2322,freq=1.0), product of:
              0.21371411 = queryWeight, product of:
                2.3129964 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.012275287 = queryNorm
              0.47044253 = fieldWeight in 2322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0625 = fieldNorm(doc=2322)
          0.10179773 = weight(abstract_txt:incomplete in 2322) [ClassicSimilarity], result of:
            0.10179773 = score(doc=2322,freq=1.0), product of:
              0.21549246 = queryWeight, product of:
                2.3226 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.012275287 = queryNorm
              0.4723958 = fieldWeight in 2322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.0625 = fieldNorm(doc=2322)
          0.08804758 = weight(abstract_txt:rules in 2322) [ClassicSimilarity], result of:
            0.08804758 = score(doc=2322,freq=3.0), product of:
              0.1552655 = queryWeight, product of:
                2.4145794 = boost
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.012275287 = queryNorm
              0.5670776 = fieldWeight in 2322, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.0625 = fieldNorm(doc=2322)
          0.40941972 = weight(abstract_txt:rough in 2322) [ClassicSimilarity], result of:
            0.40941972 = score(doc=2322,freq=1.0), product of:
              0.78600675 = queryWeight, product of:
                7.683023 = boost
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.012275287 = queryNorm
              0.52088577 = fieldWeight in 2322, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.0625 = fieldNorm(doc=2322)
        0.32 = coord(8/25)
    
  4. Miyamoto, S.: Application of rough sets to information retrieval (1998) 0.26
    0.25801894 = sum of:
      0.25801894 = product of:
        1.6126184 = sum of:
          0.03286938 = weight(abstract_txt:theory in 1559) [ClassicSimilarity], result of:
            0.03286938 = score(doc=1559,freq=1.0), product of:
              0.07739986 = queryWeight, product of:
                1.3919654 = boost
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.012275287 = queryNorm
              0.42466977 = fieldWeight in 1559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.529811 = idf(docFreq=1301, maxDocs=44421)
                0.09375 = fieldNorm(doc=1559)
          0.019591311 = weight(abstract_txt:data in 1559) [ClassicSimilarity], result of:
            0.019591311 = score(doc=1559,freq=1.0), product of:
              0.0627507 = queryWeight, product of:
                1.5350174 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.012275287 = queryNorm
              0.31220865 = fieldWeight in 1559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=1559)
          0.05585357 = weight(abstract_txt:model in 1559) [ClassicSimilarity], result of:
            0.05585357 = score(doc=1559,freq=1.0), product of:
              0.14958675 = queryWeight, product of:
                3.0596724 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.012275287 = queryNorm
              0.37338582 = fieldWeight in 1559, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.09375 = fieldNorm(doc=1559)
          1.5043042 = weight(abstract_txt:rough in 1559) [ClassicSimilarity], result of:
            1.5043042 = score(doc=1559,freq=6.0), product of:
              0.78600675 = queryWeight, product of:
                7.683023 = boost
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.012275287 = queryNorm
              1.9138566 = fieldWeight in 1559, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.09375 = fieldNorm(doc=1559)
        0.16 = coord(4/25)
    
  5. Methodologies for knowledge discovery and data mining : Third Pacific-Asia Conference, PAKDD'99, Beijing, China, April 26-28, 1999, Proceedings (1999) 0.21
    0.21435364 = sum of:
      0.21435364 = product of:
        1.0717682 = sum of:
          0.028225718 = weight(abstract_txt:based in 4821) [ClassicSimilarity], result of:
            0.028225718 = score(doc=4821,freq=2.0), product of:
              0.057327773 = queryWeight, product of:
                1.4671906 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.012275287 = queryNorm
              0.49235678 = fieldWeight in 4821, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.109375 = fieldNorm(doc=4821)
          0.032324012 = weight(abstract_txt:data in 4821) [ClassicSimilarity], result of:
            0.032324012 = score(doc=4821,freq=2.0), product of:
              0.0627507 = queryWeight, product of:
                1.5350174 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.012275287 = queryNorm
              0.515118 = fieldWeight in 4821, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.109375 = fieldNorm(doc=4821)
          0.088960014 = weight(abstract_txt:rules in 4821) [ClassicSimilarity], result of:
            0.088960014 = score(doc=4821,freq=1.0), product of:
              0.1552655 = queryWeight, product of:
                2.4145794 = boost
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.012275287 = queryNorm
              0.5729542 = fieldWeight in 4821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.109375 = fieldNorm(doc=4821)
          0.20577388 = weight(abstract_txt:mining in 4821) [ClassicSimilarity], result of:
            0.20577388 = score(doc=4821,freq=2.0), product of:
              0.21554033 = queryWeight, product of:
                2.8449082 = boost
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.012275287 = queryNorm
              0.9546885 = fieldWeight in 4821, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.109375 = fieldNorm(doc=4821)
          0.71648455 = weight(abstract_txt:rough in 4821) [ClassicSimilarity], result of:
            0.71648455 = score(doc=4821,freq=1.0), product of:
              0.78600675 = queryWeight, product of:
                7.683023 = boost
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.012275287 = queryNorm
              0.9115501 = fieldWeight in 4821, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.334172 = idf(docFreq=28, maxDocs=44421)
                0.109375 = fieldNorm(doc=4821)
        0.2 = coord(5/25)