Document (#19518)

Author
Mock, K.J.
Vemuri, V.R.
Title
Information filtering via hill climbing, WordNet, and index patterns
Source
Information processing and management. 33(1997) no.5, S.633-644
Year
1997
Abstract
The INFOS (Intelligent News Filtering Organizational System) project is designed to reduce the user's search burden by automatically categorising data as relevant or irrelevant based upon user interests. These predictions are learned automatically based upon features taken from input articles and collaborative features derived from other users. The filtering is performed by a hybrid technique that combines elements of a keyword-based hill climbing method, knowledge-based conceptual representation via WordNet, and partial parsing via index patterns. The hybrid systems integrating all these approaches combines the benefits of each while maintaing robustness and acalability
Footnote
Contribution to a special issue devoted to electronic newspapers
Theme
Computerlinguistik
Object
WordNet

Similar documents (content)

  1. Chandrasekar, R.; Srinivas, B.: Automatic induction of rules for text simplification (1997) 0.13
    0.13171257 = sum of:
      0.13171257 = product of:
        0.5488024 = sum of:
          0.016900282 = weight(abstract_txt:these in 3873) [ClassicSimilarity], result of:
            0.016900282 = score(doc=3873,freq=1.0), product of:
              0.05663737 = queryWeight, product of:
                1.0177499 = boost
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.017484063 = queryNorm
              0.29839456 = fieldWeight in 3873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.09375 = fieldNorm(doc=3873)
          0.0856894 = weight(abstract_txt:partial in 3873) [ClassicSimilarity], result of:
            0.0856894 = score(doc=3873,freq=1.0), product of:
              0.13267277 = queryWeight, product of:
                1.1014516 = boost
                6.889283 = idf(docFreq=122, maxDocs=44421)
                0.017484063 = queryNorm
              0.6458703 = fieldWeight in 3873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.889283 = idf(docFreq=122, maxDocs=44421)
                0.09375 = fieldNorm(doc=3873)
          0.12199663 = weight(abstract_txt:parsing in 3873) [ClassicSimilarity], result of:
            0.12199663 = score(doc=3873,freq=1.0), product of:
              0.1679045 = queryWeight, product of:
                1.2390981 = boost
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.017484063 = queryNorm
              0.7265835 = fieldWeight in 3873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.09375 = fieldNorm(doc=3873)
          0.12476987 = weight(abstract_txt:automatically in 3873) [ClassicSimilarity], result of:
            0.12476987 = score(doc=3873,freq=2.0), product of:
              0.17043948 = queryWeight, product of:
                1.7655281 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.017484063 = queryNorm
              0.7320479 = fieldWeight in 3873, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.09375 = fieldNorm(doc=3873)
          0.04781022 = weight(abstract_txt:based in 3873) [ClassicSimilarity], result of:
            0.04781022 = score(doc=3873,freq=2.0), product of:
              0.11328896 = queryWeight, product of:
                2.0356276 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.017484063 = queryNorm
              0.4220201 = fieldWeight in 3873, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.09375 = fieldNorm(doc=3873)
          0.15163597 = weight(abstract_txt:combines in 3873) [ClassicSimilarity], result of:
            0.15163597 = score(doc=3873,freq=1.0), product of:
              0.24455425 = queryWeight, product of:
                2.1148381 = boost
                6.613871 = idf(docFreq=161, maxDocs=44421)
                0.017484063 = queryNorm
              0.62005043 = fieldWeight in 3873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.613871 = idf(docFreq=161, maxDocs=44421)
                0.09375 = fieldNorm(doc=3873)
        0.24 = coord(6/25)
    
  2. Cullen, C.: Verity agent technology : automatic filtering, matching and dissemination of information (1996) 0.12
    0.11703287 = sum of:
      0.11703287 = product of:
        0.73145545 = sum of:
          0.07481309 = weight(abstract_txt:intelligent in 3415) [ClassicSimilarity], result of:
            0.07481309 = score(doc=3415,freq=1.0), product of:
              0.1093581 = queryWeight, product of:
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.017484063 = queryNorm
              0.6841111 = fieldWeight in 3415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.109375 = fieldNorm(doc=3415)
          0.2975325 = weight(abstract_txt:categorising in 3415) [ClassicSimilarity], result of:
            0.2975325 = score(doc=3415,freq=1.0), product of:
              0.2745083 = queryWeight, product of:
                1.5843542 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.017484063 = queryNorm
              1.0838743 = fieldWeight in 3415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.109375 = fieldNorm(doc=3415)
          0.10292989 = weight(abstract_txt:automatically in 3415) [ClassicSimilarity], result of:
            0.10292989 = score(doc=3415,freq=1.0), product of:
              0.17043948 = queryWeight, product of:
                1.7655281 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.017484063 = queryNorm
              0.6039087 = fieldWeight in 3415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.109375 = fieldNorm(doc=3415)
          0.25617993 = weight(abstract_txt:filtering in 3415) [ClassicSimilarity], result of:
            0.25617993 = score(doc=3415,freq=1.0), product of:
              0.3583189 = queryWeight, product of:
                3.1352344 = boost
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.017484063 = queryNorm
              0.71494955 = fieldWeight in 3415, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.109375 = fieldNorm(doc=3415)
        0.16 = coord(4/25)
    
  3. Ma, W.-Y.; Manjunath, B.S.: ¬A texture thesaurus for browsing large aerial photographs (1998) 0.11
    0.11193583 = sum of:
      0.11193583 = product of:
        0.4663993 = sum of:
          0.0925226 = weight(abstract_txt:robustness in 1874) [ClassicSimilarity], result of:
            0.0925226 = score(doc=1874,freq=1.0), product of:
              0.18297417 = queryWeight, product of:
                1.2935089 = boost
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.017484063 = queryNorm
              0.50565934 = fieldWeight in 1874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.090549 = idf(docFreq=36, maxDocs=44421)
                0.0625 = fieldNorm(doc=1874)
          0.04625991 = weight(abstract_txt:features in 1874) [ClassicSimilarity], result of:
            0.04625991 = score(doc=1874,freq=2.0), product of:
              0.1152642 = queryWeight, product of:
                1.4519001 = boost
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.017484063 = queryNorm
              0.40133804 = fieldWeight in 1874, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.0625 = fieldNorm(doc=1874)
          0.051098898 = weight(abstract_txt:patterns in 1874) [ClassicSimilarity], result of:
            0.051098898 = score(doc=1874,freq=1.0), product of:
              0.15518233 = queryWeight, product of:
                1.6846538 = boost
                5.2685275 = idf(docFreq=621, maxDocs=44421)
                0.017484063 = queryNorm
              0.32928297 = fieldWeight in 1874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2685275 = idf(docFreq=621, maxDocs=44421)
                0.0625 = fieldNorm(doc=1874)
          0.022537956 = weight(abstract_txt:based in 1874) [ClassicSimilarity], result of:
            0.022537956 = score(doc=1874,freq=1.0), product of:
              0.11328896 = queryWeight, product of:
                2.0356276 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.017484063 = queryNorm
              0.1989422 = fieldWeight in 1874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=1874)
          0.10759141 = weight(abstract_txt:hybrid in 1874) [ClassicSimilarity], result of:
            0.10759141 = score(doc=1874,freq=1.0), product of:
              0.25492924 = queryWeight, product of:
                2.1592321 = boost
                6.7527075 = idf(docFreq=140, maxDocs=44421)
                0.017484063 = queryNorm
              0.42204422 = fieldWeight in 1874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7527075 = idf(docFreq=140, maxDocs=44421)
                0.0625 = fieldNorm(doc=1874)
          0.14638853 = weight(abstract_txt:filtering in 1874) [ClassicSimilarity], result of:
            0.14638853 = score(doc=1874,freq=1.0), product of:
              0.3583189 = queryWeight, product of:
                3.1352344 = boost
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.017484063 = queryNorm
              0.4085426 = fieldWeight in 1874, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.0625 = fieldNorm(doc=1874)
        0.24 = coord(6/25)
    
  4. Liu, D.-R.; Shih, M.-J.: Hybrid-patent classification based on patent-network analysis (2011) 0.09
    0.09123867 = sum of:
      0.09123867 = product of:
        0.45619336 = sum of:
          0.07640839 = weight(abstract_txt:predictions in 189) [ClassicSimilarity], result of:
            0.07640839 = score(doc=189,freq=1.0), product of:
              0.16105911 = queryWeight, product of:
                1.2135766 = boost
                7.590594 = idf(docFreq=60, maxDocs=44421)
                0.017484063 = queryNorm
              0.4744121 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.590594 = idf(docFreq=60, maxDocs=44421)
                0.0625 = fieldNorm(doc=189)
          0.032710698 = weight(abstract_txt:features in 189) [ClassicSimilarity], result of:
            0.032710698 = score(doc=189,freq=1.0), product of:
              0.1152642 = queryWeight, product of:
                1.4519001 = boost
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.017484063 = queryNorm
              0.28378886 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5406218 = idf(docFreq=1287, maxDocs=44421)
                0.0625 = fieldNorm(doc=189)
          0.059629824 = weight(abstract_txt:based in 189) [ClassicSimilarity], result of:
            0.059629824 = score(doc=189,freq=7.0), product of:
              0.11328896 = queryWeight, product of:
                2.0356276 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.017484063 = queryNorm
              0.5263516 = fieldWeight in 189, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=189)
          0.10109064 = weight(abstract_txt:combines in 189) [ClassicSimilarity], result of:
            0.10109064 = score(doc=189,freq=1.0), product of:
              0.24455425 = queryWeight, product of:
                2.1148381 = boost
                6.613871 = idf(docFreq=161, maxDocs=44421)
                0.017484063 = queryNorm
              0.41336694 = fieldWeight in 189, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.613871 = idf(docFreq=161, maxDocs=44421)
                0.0625 = fieldNorm(doc=189)
          0.1863538 = weight(abstract_txt:hybrid in 189) [ClassicSimilarity], result of:
            0.1863538 = score(doc=189,freq=3.0), product of:
              0.25492924 = queryWeight, product of:
                2.1592321 = boost
                6.7527075 = idf(docFreq=140, maxDocs=44421)
                0.017484063 = queryNorm
              0.73100203 = fieldWeight in 189, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.7527075 = idf(docFreq=140, maxDocs=44421)
                0.0625 = fieldNorm(doc=189)
        0.2 = coord(5/25)
    
  5. Mostafa, J.; Quiroga, L.M.; Palakal, M.: Filtering medical documents using automated and human classification methods (1998) 0.09
    0.090239815 = sum of:
      0.090239815 = product of:
        0.5639989 = sum of:
          0.01408357 = weight(abstract_txt:these in 3326) [ClassicSimilarity], result of:
            0.01408357 = score(doc=3326,freq=1.0), product of:
              0.05663737 = queryWeight, product of:
                1.0177499 = boost
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.017484063 = queryNorm
              0.24866214 = fieldWeight in 3326, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1828754 = idf(docFreq=5006, maxDocs=44421)
                0.078125 = fieldNorm(doc=3326)
          0.073521346 = weight(abstract_txt:automatically in 3326) [ClassicSimilarity], result of:
            0.073521346 = score(doc=3326,freq=1.0), product of:
              0.17043948 = queryWeight, product of:
                1.7655281 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.017484063 = queryNorm
              0.43136334 = fieldWeight in 3326, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.078125 = fieldNorm(doc=3326)
          0.028172443 = weight(abstract_txt:based in 3326) [ClassicSimilarity], result of:
            0.028172443 = score(doc=3326,freq=1.0), product of:
              0.11328896 = queryWeight, product of:
                2.0356276 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.017484063 = queryNorm
              0.24867775 = fieldWeight in 3326, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.078125 = fieldNorm(doc=3326)
          0.44822153 = weight(abstract_txt:filtering in 3326) [ClassicSimilarity], result of:
            0.44822153 = score(doc=3326,freq=6.0), product of:
              0.3583189 = queryWeight, product of:
                3.1352344 = boost
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.017484063 = queryNorm
              1.2509012 = fieldWeight in 3326, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.5366817 = idf(docFreq=174, maxDocs=44421)
                0.078125 = fieldNorm(doc=3326)
        0.16 = coord(4/25)