Document (#23442)

Author
Boley, D.
Gini, M.
Hastings, K.
Mobasher, B.
Moore, J.
Title
¬A client side Web agent for document categorization
Source
Internet research. Electronic networking applications and policy. 8(1998) no.5, S.387-399
Year
1998
Abstract
Proposes a client-side agent for exploring and categorizing documents on the World Wide Web. As the user browses the Web using a usual Web browser, this agent is designed to aid the user by classifying the documents the user finds most interesting into clusters. The agent carries out the task completely automatically and autonomously, with as little user intervention as the user desires. The principal novel components in this agent that make it possible are a scalable hierarchical clustering algorithm and a taxonomic label generator. Describes the overall architecture of this agent and discusses the details of the algorithms within its key components
Theme
Internet

Similar documents (author)

  1. Hastings, S.K.: ¬An exploratory study of intellectual access to digitized art images : the information industry and the role of the Internet (1995) 2.49
    2.485333 = sum of:
      2.485333 = product of:
        4.970666 = sum of:
          4.970666 = weight(author_txt:hastings in 3254) [ClassicSimilarity], result of:
            4.970666 = score(doc=3254,freq=1.0), product of:
              0.8265479 = queryWeight, product of:
                1.2118013 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.07088757 = queryNorm
              6.0137663 = fieldWeight in 3254, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.625 = fieldNorm(doc=3254)
        0.5 = coord(1/2)
    
  2. Hastings, S.K.: Evaluation of image retrieval systems : role of user feedback (1999) 2.49
    2.485333 = sum of:
      2.485333 = product of:
        4.970666 = sum of:
          4.970666 = weight(author_txt:hastings in 970) [ClassicSimilarity], result of:
            4.970666 = score(doc=970,freq=1.0), product of:
              0.8265479 = queryWeight, product of:
                1.2118013 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.07088757 = queryNorm
              6.0137663 = fieldWeight in 970, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.625 = fieldNorm(doc=970)
        0.5 = coord(1/2)
    
  3. Christian, E.J.; Hastings, M.: ¬The virtual library : a selective bibliography for exploration (1994) 1.99
    1.9882665 = sum of:
      1.9882665 = product of:
        3.976533 = sum of:
          3.976533 = weight(author_txt:hastings in 1470) [ClassicSimilarity], result of:
            3.976533 = score(doc=1470,freq=1.0), product of:
              0.8265479 = queryWeight, product of:
                1.2118013 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.07088757 = queryNorm
              4.811013 = fieldWeight in 1470, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.5 = fieldNorm(doc=1470)
        0.5 = coord(1/2)
    
  4. Lunin, L.F.; Martin, K.; Hastings, S.K.: Design: information technologies and creative practices (2009) 1.49
    1.4911999 = sum of:
      1.4911999 = product of:
        2.9823997 = sum of:
          2.9823997 = weight(author_txt:hastings in 5889) [ClassicSimilarity], result of:
            2.9823997 = score(doc=5889,freq=1.0), product of:
              0.8265479 = queryWeight, product of:
                1.2118013 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.07088757 = queryNorm
              3.60826 = fieldWeight in 5889, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.375 = fieldNorm(doc=5889)
        0.5 = coord(1/2)
    
  5. Chung, E.-K.; Miksa, S.; Hastings, S.K.: ¬A framework of automatic subject term assignment for text categorization : an indexing conception-based approach (2010) 1.49
    1.4911999 = sum of:
      1.4911999 = product of:
        2.9823997 = sum of:
          2.9823997 = weight(author_txt:hastings in 421) [ClassicSimilarity], result of:
            2.9823997 = score(doc=421,freq=1.0), product of:
              0.8265479 = queryWeight, product of:
                1.2118013 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.07088757 = queryNorm
              3.60826 = fieldWeight in 421, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.375 = fieldNorm(doc=421)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Park, J.S.; O'Brien, J.C.; Cai, C.J.; Ringel Morris, M.; Liang, P.; Bernstein, M.S.: Generative agents : interactive simulacra of human behavior (2023) 0.13
    0.13058005 = sum of:
      0.13058005 = product of:
        0.6529002 = sum of:
          0.0074446807 = weight(abstract_txt:this in 1974) [ClassicSimilarity], result of:
            0.0074446807 = score(doc=1974,freq=2.0), product of:
              0.040004145 = queryWeight, product of:
                1.107217 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.015015309 = queryNorm
              0.18609773 = fieldWeight in 1974, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1974)
          0.108130544 = weight(abstract_txt:autonomously in 1974) [ClassicSimilarity], result of:
            0.108130544 = score(doc=1974,freq=1.0), product of:
              0.20803806 = queryWeight, product of:
                1.4577767 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.015015309 = queryNorm
              0.5197633 = fieldWeight in 1974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1974)
          0.04400087 = weight(abstract_txt:components in 1974) [ClassicSimilarity], result of:
            0.04400087 = score(doc=1974,freq=1.0), product of:
              0.14393334 = queryWeight, product of:
                1.7148072 = boost
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.015015309 = queryNorm
              0.30570313 = fieldWeight in 1974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1974)
          0.031406295 = weight(abstract_txt:user in 1974) [ClassicSimilarity], result of:
            0.031406295 = score(doc=1974,freq=1.0), product of:
              0.1560193 = queryWeight, product of:
                2.8228889 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.015015309 = queryNorm
              0.20129749 = fieldWeight in 1974, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1974)
          0.46191785 = weight(abstract_txt:agent in 1974) [ClassicSimilarity], result of:
            0.46191785 = score(doc=1974,freq=3.0), product of:
              0.6900762 = queryWeight, product of:
                6.503449 = boost
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.015015309 = queryNorm
              0.66937226 = fieldWeight in 1974, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1974)
        0.2 = coord(5/25)
    
  2. Barrueco, J.M.; Inglada, V.J.: Reference linking in economics : the Citec project (2003) 0.12
    0.11636186 = sum of:
      0.11636186 = product of:
        0.96968216 = sum of:
          0.010528368 = weight(abstract_txt:this in 3718) [ClassicSimilarity], result of:
            0.010528368 = score(doc=3718,freq=1.0), product of:
              0.040004145 = queryWeight, product of:
                1.107217 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.015015309 = queryNorm
              0.26318192 = fieldWeight in 3718, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.109375 = fieldNorm(doc=3718)
          0.035318118 = weight(abstract_txt:documents in 3718) [ClassicSimilarity], result of:
            0.035318118 = score(doc=3718,freq=1.0), product of:
              0.0783127 = queryWeight, product of:
                1.264884 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.015015309 = queryNorm
              0.45098835 = fieldWeight in 3718, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.109375 = fieldNorm(doc=3718)
          0.9238357 = weight(abstract_txt:agent in 3718) [ClassicSimilarity], result of:
            0.9238357 = score(doc=3718,freq=3.0), product of:
              0.6900762 = queryWeight, product of:
                6.503449 = boost
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.015015309 = queryNorm
              1.3387445 = fieldWeight in 3718, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.109375 = fieldNorm(doc=3718)
        0.12 = coord(3/25)
    
  3. Cheung, D.W.; Kao, B.; Lee, J.: Discovering user access patterns on the World Wide Web (1998) 0.12
    0.11512148 = sum of:
      0.11512148 = product of:
        0.71950924 = sum of:
          0.035318118 = weight(abstract_txt:documents in 1332) [ClassicSimilarity], result of:
            0.035318118 = score(doc=1332,freq=1.0), product of:
              0.0783127 = queryWeight, product of:
                1.264884 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.015015309 = queryNorm
              0.45098835 = fieldWeight in 1332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.109375 = fieldNorm(doc=1332)
          0.08800174 = weight(abstract_txt:components in 1332) [ClassicSimilarity], result of:
            0.08800174 = score(doc=1332,freq=1.0), product of:
              0.14393334 = queryWeight, product of:
                1.7148072 = boost
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.015015309 = queryNorm
              0.61140627 = fieldWeight in 1332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.59 = idf(docFreq=450, maxDocs=44421)
                0.109375 = fieldNorm(doc=1332)
          0.06281259 = weight(abstract_txt:user in 1332) [ClassicSimilarity], result of:
            0.06281259 = score(doc=1332,freq=1.0), product of:
              0.1560193 = queryWeight, product of:
                2.8228889 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.015015309 = queryNorm
              0.40259498 = fieldWeight in 1332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.109375 = fieldNorm(doc=1332)
          0.5333768 = weight(abstract_txt:agent in 1332) [ClassicSimilarity], result of:
            0.5333768 = score(doc=1332,freq=1.0), product of:
              0.6900762 = queryWeight, product of:
                6.503449 = boost
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.015015309 = queryNorm
              0.77292454 = fieldWeight in 1332, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.109375 = fieldNorm(doc=1332)
        0.16 = coord(4/25)
    
  4. Shafique, M.; Chaudhry, A.S.: Intelligent agent-based online information retrieval (1995) 0.11
    0.11285591 = sum of:
      0.11285591 = product of:
        0.9404659 = sum of:
          0.043694835 = weight(abstract_txt:documents in 3919) [ClassicSimilarity], result of:
            0.043694835 = score(doc=3919,freq=3.0), product of:
              0.0783127 = queryWeight, product of:
                1.264884 = boost
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.015015309 = queryNorm
              0.55795336 = fieldWeight in 3919, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.123322 = idf(docFreq=1954, maxDocs=44421)
                0.078125 = fieldNorm(doc=3919)
          0.044866133 = weight(abstract_txt:user in 3919) [ClassicSimilarity], result of:
            0.044866133 = score(doc=3919,freq=1.0), product of:
              0.1560193 = queryWeight, product of:
                2.8228889 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.015015309 = queryNorm
              0.28756785 = fieldWeight in 3919, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.078125 = fieldNorm(doc=3919)
          0.8519049 = weight(abstract_txt:agent in 3919) [ClassicSimilarity], result of:
            0.8519049 = score(doc=3919,freq=5.0), product of:
              0.6900762 = queryWeight, product of:
                6.503449 = boost
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.015015309 = queryNorm
              1.2345085 = fieldWeight in 3919, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.0667386 = idf(docFreq=102, maxDocs=44421)
                0.078125 = fieldNorm(doc=3919)
        0.12 = coord(3/25)
    
  5. Fenstermacher, K.D.; Ginsburg, M.: Client-side monitoring for Web mining (2003) 0.11
    0.11021437 = sum of:
      0.11021437 = product of:
        0.5510718 = sum of:
          0.079357415 = weight(abstract_txt:browser in 2611) [ClassicSimilarity], result of:
            0.079357415 = score(doc=2611,freq=2.0), product of:
              0.105915025 = queryWeight, product of:
                1.0401558 = boost
                6.7814865 = idf(docFreq=136, maxDocs=44421)
                0.015015309 = queryNorm
              0.7492555 = fieldWeight in 2611, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7814865 = idf(docFreq=136, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
          0.0075202626 = weight(abstract_txt:this in 2611) [ClassicSimilarity], result of:
            0.0075202626 = score(doc=2611,freq=1.0), product of:
              0.040004145 = queryWeight, product of:
                1.107217 = boost
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.015015309 = queryNorm
              0.18798709 = fieldWeight in 2611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4062347 = idf(docFreq=10885, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
          0.1684221 = weight(abstract_txt:client in 2611) [ClassicSimilarity], result of:
            0.1684221 = score(doc=2611,freq=3.0), product of:
              0.19252104 = queryWeight, product of:
                1.9832329 = boost
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.015015309 = queryNorm
              0.8748244 = fieldWeight in 2611, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
          0.25090587 = weight(abstract_txt:side in 2611) [ClassicSimilarity], result of:
            0.25090587 = score(doc=2611,freq=4.0), product of:
              0.22816014 = queryWeight, product of:
                2.1590092 = boost
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.015015309 = queryNorm
              1.099692 = fieldWeight in 2611, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.0380287 = idf(docFreq=105, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
          0.044866133 = weight(abstract_txt:user in 2611) [ClassicSimilarity], result of:
            0.044866133 = score(doc=2611,freq=1.0), product of:
              0.1560193 = queryWeight, product of:
                2.8228889 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.015015309 = queryNorm
              0.28756785 = fieldWeight in 2611, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.078125 = fieldNorm(doc=2611)
        0.2 = coord(5/25)