Document (#33271)

Author
Amir, A.
Feldman, R.
Kashi, R.
Title
A new and versatile method for association generation
Source
Information systems. 22(1997) nos. 5/6, pp. 333-347
Year
1997
Abstract
Current algorithms for finding associations among the attributes describing data in a database have a number of shortcomings. Presents a novel method for association generation that answers all desiderata. The method differs from all existing algorithms and is especially suitable for textual databases with binary attributes. Uses subword trees for quick indexing into the required database statistics. Tests the algorithm on the Reuters-22173 database with satisfactory results.
Theme
Data Mining

Similar documents (author)

  1. Feldman, T.: Multimedia (1994) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:feldman in 7235) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 7235, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=7235)
    
  2. Feldman, S.E.: Searching natural language systems : searchers know the engine (1994) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:feldman in 1828) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 1828, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=1828)
    
  3. Feldman, T.: Multimedia : eine Einführung (1995) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:feldman in 2888) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 2888, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=2888)
    
  4. Feldman, T.: The emergence of the electronic book (1990) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:feldman in 2942) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 2942, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=2942)
    
  5. Feldman, S.: Testing natural language : comparing DIALOG, TARGET, and DR-LINK (1996) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:feldman in 532) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 532, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=532)
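
The author scores above all reduce to one product: tf times idf times fieldNorm. The following is a minimal sketch of that arithmetic, not the search engine's own code. It assumes the classic TF-IDF definitions tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), chosen here because they reproduce the values printed in the listing; the function names are hypothetical, and fieldNorm is copied from the listing rather than derived.

```python
import math


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Assumed idf form: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """Assumed tf form: square root of the term frequency."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as laid out in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:feldman entries in the listing.
idf_feldman = classic_idf(doc_freq=15, max_docs=44421)
weight_feldman = field_weight(freq=1.0, doc_freq=15, max_docs=44421, field_norm=0.625)
print(f"idf={idf_feldman:.3f} fieldWeight={weight_feldman:.2f}")
```

All five author matches share the same freq, docFreq and fieldNorm, which is why they tie at 5.58.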
    

Similar documents (content)

  1. Rodríguez, A.; Carazo, J.M.; Trelles-Salazar, O.: Mining association rules from biological databases (2005) 0.11
    0.11165814 = sum of:
      0.11165814 = product of:
        0.46524227 = sum of:
          0.03977454 = weight(abstract_txt:novel in 261) [ClassicSimilarity], result of:
            0.03977454 = score(doc=261,freq=1.0), product of:
              0.11525823 = queryWeight, product of:
                1.0124741 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.020617444 = queryNorm
              0.3450907 = fieldWeight in 261, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0625 = fieldNorm(doc=261)
          0.07599354 = weight(abstract_txt:algorithm in 261) [ClassicSimilarity], result of:
            0.07599354 = score(doc=261,freq=3.0), product of:
              0.12304931 = queryWeight, product of:
                1.0461345 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.020617444 = queryNorm
              0.6175861 = fieldWeight in 261, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.0625 = fieldNorm(doc=261)
          0.11872853 = weight(abstract_txt:association in 261) [ClassicSimilarity], result of:
            0.11872853 = score(doc=261,freq=2.0), product of:
              0.23894902 = queryWeight, product of:
                2.061653 = boost
                5.6215343 = idf(docFreq=436, maxDocs=44421)
                0.020617444 = queryNorm
              0.49687812 = fieldWeight in 261, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6215343 = idf(docFreq=436, maxDocs=44421)
                0.0625 = fieldNorm(doc=261)
          0.08752097 = weight(abstract_txt:algorithms in 261) [ClassicSimilarity], result of:
            0.08752097 = score(doc=261,freq=1.0), product of:
              0.24567063 = queryWeight, product of:
                2.0904489 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.020617444 = queryNorm
              0.3562533 = fieldWeight in 261, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.0625 = fieldNorm(doc=261)
          0.078680255 = weight(abstract_txt:database in 261) [ClassicSimilarity], result of:
            0.078680255 = score(doc=261,freq=2.0), product of:
              0.20791033 = queryWeight, product of:
                2.3553019 = boost
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.020617444 = queryNorm
              0.3784336 = fieldWeight in 261, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.0625 = fieldNorm(doc=261)
          0.064544424 = weight(abstract_txt:method in 261) [ClassicSimilarity], result of:
            0.064544424 = score(doc=261,freq=1.0), product of:
              0.22955216 = queryWeight, product of:
                2.474852 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.020617444 = queryNorm
              0.2811754 = fieldWeight in 261, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=261)
        0.24 = coord(6/25)
    
  2. Ciganik, M.: Inteligencne indexovanie a inteligencne klasifikacie (1994) 0.10
    0.09558708 = sum of:
      0.09558708 = product of:
        0.59741926 = sum of:
          0.10478003 = weight(abstract_txt:suitable in 1119) [ClassicSimilarity], result of:
            0.10478003 = score(doc=1119,freq=1.0), product of:
              0.13849503 = queryWeight, product of:
                1.1098518 = boost
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.020617444 = queryNorm
              0.7565617 = fieldWeight in 1119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.125 = fieldNorm(doc=1119)
          0.19564286 = weight(abstract_txt:shortcomings in 1119) [ClassicSimilarity], result of:
            0.19564286 = score(doc=1119,freq=1.0), product of:
              0.21000251 = queryWeight, product of:
                1.366659 = boost
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.020617444 = queryNorm
              0.93162155 = fieldWeight in 1119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4529724 = idf(docFreq=69, maxDocs=44421)
                0.125 = fieldNorm(doc=1119)
          0.1679075 = weight(abstract_txt:association in 1119) [ClassicSimilarity], result of:
            0.1679075 = score(doc=1119,freq=1.0), product of:
              0.23894902 = queryWeight, product of:
                2.061653 = boost
                5.6215343 = idf(docFreq=436, maxDocs=44421)
                0.020617444 = queryNorm
              0.7026918 = fieldWeight in 1119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6215343 = idf(docFreq=436, maxDocs=44421)
                0.125 = fieldNorm(doc=1119)
          0.12908885 = weight(abstract_txt:method in 1119) [ClassicSimilarity], result of:
            0.12908885 = score(doc=1119,freq=1.0), product of:
              0.22955216 = queryWeight, product of:
                2.474852 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.020617444 = queryNorm
              0.5623508 = fieldWeight in 1119, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.125 = fieldNorm(doc=1119)
        0.16 = coord(4/25)
    
  3. Gagliardi, I.; Schettini, R.: A method for the automatic indexing of colour images for effective image retrieval (1997) 0.09
    0.086960144 = sum of:
      0.086960144 = product of:
        0.5435009 = sum of:
          0.08555654 = weight(abstract_txt:describing in 3886) [ClassicSimilarity], result of:
            0.08555654 = score(doc=3886,freq=1.0), product of:
              0.13225494 = queryWeight, product of:
                1.0845608 = boost
                5.9145703 = idf(docFreq=325, maxDocs=44421)
                0.020617444 = queryNorm
              0.64690614 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9145703 = idf(docFreq=325, maxDocs=44421)
                0.109375 = fieldNorm(doc=3886)
          0.1605146 = weight(abstract_txt:satisfactory in 3886) [ClassicSimilarity], result of:
            0.1605146 = score(doc=3886,freq=1.0), product of:
              0.2011806 = queryWeight, product of:
                1.3376453 = boost
                7.2947483 = idf(docFreq=81, maxDocs=44421)
                0.020617444 = queryNorm
              0.7978631 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2947483 = idf(docFreq=81, maxDocs=44421)
                0.109375 = fieldNorm(doc=3886)
          0.13769044 = weight(abstract_txt:database in 3886) [ClassicSimilarity], result of:
            0.13769044 = score(doc=3886,freq=2.0), product of:
              0.20791033 = queryWeight, product of:
                2.3553019 = boost
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.020617444 = queryNorm
              0.66225874 = fieldWeight in 3886, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.109375 = fieldNorm(doc=3886)
          0.1597393 = weight(abstract_txt:method in 3886) [ClassicSimilarity], result of:
            0.1597393 = score(doc=3886,freq=2.0), product of:
              0.22955216 = queryWeight, product of:
                2.474852 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.020617444 = queryNorm
              0.6958736 = fieldWeight in 3886, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.109375 = fieldNorm(doc=3886)
        0.16 = coord(4/25)
    
  4. Wong, S.K.M.; Butz, C.J.; Xiang, X.: Automated database schema design using mined data dependencies (1998) 0.09
    0.08608274 = sum of:
      0.08608274 = product of:
        0.53801715 = sum of:
          0.065812334 = weight(abstract_txt:algorithm in 3897) [ClassicSimilarity], result of:
            0.065812334 = score(doc=3897,freq=1.0), product of:
              0.12304931 = queryWeight, product of:
                1.0461345 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.020617444 = queryNorm
              0.53484523 = fieldWeight in 3897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.09375 = fieldNorm(doc=3897)
          0.23084329 = weight(abstract_txt:attributes in 3897) [ClassicSimilarity], result of:
            0.23084329 = score(doc=3897,freq=2.0), product of:
              0.28406593 = queryWeight, product of:
                2.2478766 = boost
                6.1293135 = idf(docFreq=262, maxDocs=44421)
                0.020617444 = queryNorm
              0.81263983 = fieldWeight in 3897, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1293135 = idf(docFreq=262, maxDocs=44421)
                0.09375 = fieldNorm(doc=3897)
          0.14454485 = weight(abstract_txt:database in 3897) [ClassicSimilarity], result of:
            0.14454485 = score(doc=3897,freq=3.0), product of:
              0.20791033 = queryWeight, product of:
                2.3553019 = boost
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.020617444 = queryNorm
              0.6952269 = fieldWeight in 3897, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2814875 = idf(docFreq=1668, maxDocs=44421)
                0.09375 = fieldNorm(doc=3897)
          0.09681664 = weight(abstract_txt:method in 3897) [ClassicSimilarity], result of:
            0.09681664 = score(doc=3897,freq=1.0), product of:
              0.22955216 = queryWeight, product of:
                2.474852 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.020617444 = queryNorm
              0.42176312 = fieldWeight in 3897, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.09375 = fieldNorm(doc=3897)
        0.16 = coord(4/25)
    
  5. Li, Q.; Wu, Y.-f.B.: People search : searching people sharing similar interests from the Web (2008) 0.08
    0.0824332 = sum of:
      0.0824332 = product of:
        0.412166 = sum of:
          0.048419636 = weight(abstract_txt:finding in 2344) [ClassicSimilarity], result of:
            0.048419636 = score(doc=2344,freq=1.0), product of:
              0.11324251 = queryWeight, product of:
                1.0035815 = boost
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.020617444 = queryNorm
              0.42757475 = fieldWeight in 2344, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.078125 = fieldNorm(doc=2344)
          0.07756058 = weight(abstract_txt:algorithm in 2344) [ClassicSimilarity], result of:
            0.07756058 = score(doc=2344,freq=2.0), product of:
              0.12304931 = queryWeight, product of:
                1.0461345 = boost
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.020617444 = queryNorm
              0.63032115 = fieldWeight in 2344, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7050157 = idf(docFreq=401, maxDocs=44421)
                0.078125 = fieldNorm(doc=2344)
          0.06268506 = weight(abstract_txt:textual in 2344) [ClassicSimilarity], result of:
            0.06268506 = score(doc=2344,freq=1.0), product of:
              0.13451514 = queryWeight, product of:
                1.0937889 = boost
                5.9648952 = idf(docFreq=309, maxDocs=44421)
                0.020617444 = queryNorm
              0.46600744 = fieldWeight in 2344, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9648952 = idf(docFreq=309, maxDocs=44421)
                0.078125 = fieldNorm(doc=2344)
          0.10940121 = weight(abstract_txt:algorithms in 2344) [ClassicSimilarity], result of:
            0.10940121 = score(doc=2344,freq=1.0), product of:
              0.24567063 = queryWeight, product of:
                2.0904489 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.020617444 = queryNorm
              0.4453166 = fieldWeight in 2344, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.078125 = fieldNorm(doc=2344)
          0.1140995 = weight(abstract_txt:method in 2344) [ClassicSimilarity], result of:
            0.1140995 = score(doc=2344,freq=2.0), product of:
              0.22955216 = queryWeight, product of:
                2.474852 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.020617444 = queryNorm
              0.4970526 = fieldWeight in 2344, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.078125 = fieldNorm(doc=2344)
        0.2 = coord(5/25)
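
The content scores combine more factors than the author scores: each matching term contributes queryWeight times fieldWeight, and the sum is scaled by coord (matched query terms over total query terms). As a rough sketch under stated assumptions, the code below recomputes the score of the first content match (doc 261). The boost, freq, docFreq, fieldNorm, queryNorm and coord inputs are copied from the listing; the tf and idf formulas are the assumed classic forms tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); all function and variable names are hypothetical.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.020617444  # copied from the listing, not derived here


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Assumed idf form: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, freq: float, doc_freq: int, field_norm: float) -> float:
    """score = queryWeight * fieldWeight for one matching term."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm: float, matched: int, total: int) -> float:
    """Sum the per-term scores, then scale by coord = matched / total."""
    raw = sum(term_score(boost, freq, df, field_norm) for _, boost, freq, df in terms)
    return raw * (matched / total)


# (term, boost, freq, docFreq) for doc 261, copied from the listing.
TERMS_261 = [
    ("novel", 1.0124741, 1.0, 482),
    ("algorithm", 1.0461345, 3.0, 401),
    ("association", 2.061653, 2.0, 436),
    ("algorithms", 2.0904489, 1.0, 403),
    ("database", 2.3553019, 2.0, 1668),
    ("method", 2.474852, 1.0, 1342),
]

score_261 = document_score(TERMS_261, field_norm=0.0625, matched=6, total=25)
print(f"{score_261:.2f}")
```

Because idf enters both queryWeight and fieldWeight, rare terms such as "shortcomings" (docFreq=69) weigh far more per occurrence than common ones such as "database" (docFreq=1668), while the coord factor penalizes documents that match only a few of the 25 query terms.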