Document (#29133)

Author
Shen, D.
Chen, Z.
Yang, Q.
Zeng, H.J.
Zhang, B.
Lu, Y.
Ma, W.Y.
Title
Web page classification through summarization
Source
SIGIR'04: Proceedings of the 27th Annual International ACM-SIGIR Conference an Research and Development in Information Retrieval. Ed.: K. Järvelin, u.a
Imprint
New York, NY : ACM Press
Year
2004
Pages
S.242-249
Theme
Automatisches Klassifizieren

Similar documents (author)

  1. Shen, D.; Yang, Q.; Chen, Z.: Noise reduction through summarization for Web-page classification (2007) 2.30
    2.3013923 = sum of:
      2.3013923 = product of:
        3.8356535 = sum of:
          0.72188 = weight(author_txt:chen in 1953) [ClassicSimilarity], result of:
            0.72188 = score(doc=1953,freq=1.0), product of:
              0.31367606 = queryWeight, product of:
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.05111272 = queryNorm
              2.3013551 = fieldWeight in 1953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.375 = fieldNorm(doc=1953)
          1.1506933 = weight(author_txt:yang in 1953) [ClassicSimilarity], result of:
            1.1506933 = score(doc=1953,freq=1.0), product of:
              0.42803347 = queryWeight, product of:
                1.1681489 = boost
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.05111272 = queryNorm
              2.6883254 = fieldWeight in 1953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.375 = fieldNorm(doc=1953)
          1.9630802 = weight(author_txt:shen in 1953) [ClassicSimilarity], result of:
            1.9630802 = score(doc=1953,freq=1.0), product of:
              0.611125 = queryWeight, product of:
                1.3958037 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.05111272 = queryNorm
              3.21224 = fieldWeight in 1953, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.375 = fieldNorm(doc=1953)
        0.6 = coord(3/5)
    
  2. Zhang, J.; Zeng, M.L.: ¬A new similarity measure for subject hierarchical structures (2014) 1.16
    1.1611435 = sum of:
      1.1611435 = product of:
        2.9028587 = sum of:
          1.0932704 = weight(author_txt:zhang in 2778) [ClassicSimilarity], result of:
            1.0932704 = score(doc=2778,freq=1.0), product of:
              0.34147894 = queryWeight, product of:
                1.043377 = boost
                6.40315 = idf(docFreq=199, maxDocs=44421)
                0.05111272 = queryNorm
              3.201575 = fieldWeight in 2778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.40315 = idf(docFreq=199, maxDocs=44421)
                0.5 = fieldNorm(doc=2778)
          1.8095884 = weight(author_txt:zeng in 2778) [ClassicSimilarity], result of:
            1.8095884 = score(doc=2778,freq=1.0), product of:
              0.47782117 = queryWeight, product of:
                1.2342184 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.05111272 = queryNorm
              3.7871666 = fieldWeight in 2778, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.5 = fieldNorm(doc=2778)
        0.4 = coord(2/5)
    
  3. Shen, X.-L.; Zhang, K.Z.K.; Zhao, S.J.: Herd behavior in consumers' adoption of online reviews (2016) 1.11
    1.1132132 = sum of:
      1.1132132 = product of:
        2.783033 = sum of:
          0.81995285 = weight(author_txt:zhang in 4157) [ClassicSimilarity], result of:
            0.81995285 = score(doc=4157,freq=1.0), product of:
              0.34147894 = queryWeight, product of:
                1.043377 = boost
                6.40315 = idf(docFreq=199, maxDocs=44421)
                0.05111272 = queryNorm
              2.4011812 = fieldWeight in 4157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.40315 = idf(docFreq=199, maxDocs=44421)
                0.375 = fieldNorm(doc=4157)
          1.9630802 = weight(author_txt:shen in 4157) [ClassicSimilarity], result of:
            1.9630802 = score(doc=4157,freq=1.0), product of:
              0.611125 = queryWeight, product of:
                1.3958037 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.05111272 = queryNorm
              3.21224 = fieldWeight in 4157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.375 = fieldNorm(doc=4157)
        0.4 = coord(2/5)
    
  4. Zeng, M.L.; Chen, Y.: Features of an integrated thesaurus management and search system for the networked environment (2003) 1.11
    1.1088381 = sum of:
      1.1088381 = product of:
        2.7720952 = sum of:
          0.9625067 = weight(author_txt:chen in 4817) [ClassicSimilarity], result of:
            0.9625067 = score(doc=4817,freq=1.0), product of:
              0.31367606 = queryWeight, product of:
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.05111272 = queryNorm
              3.0684736 = fieldWeight in 4817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.5 = fieldNorm(doc=4817)
          1.8095884 = weight(author_txt:zeng in 4817) [ClassicSimilarity], result of:
            1.8095884 = score(doc=4817,freq=1.0), product of:
              0.47782117 = queryWeight, product of:
                1.2342184 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.05111272 = queryNorm
              3.7871666 = fieldWeight in 4817, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.5 = fieldNorm(doc=4817)
        0.4 = coord(2/5)
    
  5. Zhang, M.; Yang, C.C.: Using content and network analysis to understand the social support exchange patterns and user behaviors of an online smoking cessation intervention program (2015) 1.05
    1.0510113 = sum of:
      1.0510113 = product of:
        2.6275282 = sum of:
          1.0932704 = weight(author_txt:zhang in 2668) [ClassicSimilarity], result of:
            1.0932704 = score(doc=2668,freq=1.0), product of:
              0.34147894 = queryWeight, product of:
                1.043377 = boost
                6.40315 = idf(docFreq=199, maxDocs=44421)
                0.05111272 = queryNorm
              3.201575 = fieldWeight in 2668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.40315 = idf(docFreq=199, maxDocs=44421)
                0.5 = fieldNorm(doc=2668)
          1.5342578 = weight(author_txt:yang in 2668) [ClassicSimilarity], result of:
            1.5342578 = score(doc=2668,freq=1.0), product of:
              0.42803347 = queryWeight, product of:
                1.1681489 = boost
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.05111272 = queryNorm
              3.584434 = fieldWeight in 2668, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.168868 = idf(docFreq=92, maxDocs=44421)
                0.5 = fieldNorm(doc=2668)
        0.4 = coord(2/5)
    

Similar documents (content)

  1. Shen, D.; Yang, Q.; Chen, Z.: Noise reduction through summarization for Web-page classification (2007) 1.93
    1.9280016 = sum of:
      1.9280016 = sum of:
        0.18407568 = weight(abstract_txt:classification in 1953) [ClassicSimilarity], result of:
          0.18407568 = score(doc=1953,freq=6.0), product of:
            0.24094774 = queryWeight, product of:
              3.9921594 = idf(docFreq=2228, maxDocs=44421)
              0.06035524 = queryNorm
            0.7639652 = fieldWeight in 1953, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              3.9921594 = idf(docFreq=2228, maxDocs=44421)
              0.078125 = fieldNorm(doc=1953)
        0.07558175 = weight(abstract_txt:through in 1953) [ClassicSimilarity], result of:
          0.07558175 = score(doc=1953,freq=1.0), product of:
            0.24187277 = queryWeight, product of:
              1.0019177 = boost
              3.9998152 = idf(docFreq=2211, maxDocs=44421)
              0.06035524 = queryNorm
            0.31248558 = fieldWeight in 1953, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              3.9998152 = idf(docFreq=2211, maxDocs=44421)
              0.078125 = fieldNorm(doc=1953)
        0.62109846 = weight(abstract_txt:page in 1953) [ClassicSimilarity], result of:
          0.62109846 = score(doc=1953,freq=6.0), product of:
            0.54204106 = queryWeight, product of:
              1.4998736 = boost
              5.987735 = idf(docFreq=302, maxDocs=44421)
              0.06035524 = queryNorm
            1.1458513 = fieldWeight in 1953, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              5.987735 = idf(docFreq=302, maxDocs=44421)
              0.078125 = fieldNorm(doc=1953)
        1.0472457 = weight(abstract_txt:summarization in 1953) [ClassicSimilarity], result of:
          1.0472457 = score(doc=1953,freq=6.0), product of:
            0.7678758 = queryWeight, product of:
              1.7851884 = boost
              7.1267567 = idf(docFreq=96, maxDocs=44421)
              0.06035524 = queryNorm
            1.3638217 = fieldWeight in 1953, product of:
              2.4494898 = tf(freq=6.0), with freq of:
                6.0 = termFreq=6.0
              7.1267567 = idf(docFreq=96, maxDocs=44421)
              0.078125 = fieldNorm(doc=1953)
    
  2. Huo, W.: Automatic multi-word term extraction and its application to Web-page summarization (2012) 0.88
    0.8806907 = sum of:
      0.8806907 = product of:
        1.1742543 = sum of:
          0.075148575 = weight(abstract_txt:classification in 1563) [ClassicSimilarity], result of:
            0.075148575 = score(doc=1563,freq=1.0), product of:
              0.24094774 = queryWeight, product of:
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.06035524 = queryNorm
              0.31188744 = fieldWeight in 1563, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
          0.3585913 = weight(abstract_txt:page in 1563) [ClassicSimilarity], result of:
            0.3585913 = score(doc=1563,freq=2.0), product of:
              0.54204106 = queryWeight, product of:
                1.4998736 = boost
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.06035524 = queryNorm
              0.66155744 = fieldWeight in 1563, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
          0.74051446 = weight(abstract_txt:summarization in 1563) [ClassicSimilarity], result of:
            0.74051446 = score(doc=1563,freq=3.0), product of:
              0.7678758 = queryWeight, product of:
                1.7851884 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.06035524 = queryNorm
              0.9643675 = fieldWeight in 1563, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.078125 = fieldNorm(doc=1563)
        0.75 = coord(3/4)
    
  3. Balas, J.: Dewey and the net (1996) 0.43
    0.43417305 = sum of:
      0.43417305 = product of:
        0.8683461 = sum of:
          0.1511635 = weight(abstract_txt:through in 4772) [ClassicSimilarity], result of:
            0.1511635 = score(doc=4772,freq=1.0), product of:
              0.24187277 = queryWeight, product of:
                1.0019177 = boost
                3.9998152 = idf(docFreq=2211, maxDocs=44421)
                0.06035524 = queryNorm
              0.62497115 = fieldWeight in 4772, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9998152 = idf(docFreq=2211, maxDocs=44421)
                0.15625 = fieldNorm(doc=4772)
          0.7171826 = weight(abstract_txt:page in 4772) [ClassicSimilarity], result of:
            0.7171826 = score(doc=4772,freq=2.0), product of:
              0.54204106 = queryWeight, product of:
                1.4998736 = boost
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.06035524 = queryNorm
              1.3231149 = fieldWeight in 4772, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.15625 = fieldNorm(doc=4772)
        0.5 = coord(2/4)
    
  4. Over, P.; Dang, H.; Harman, D.: DUC in context (2007) 0.35
    0.3521826 = sum of:
      0.3521826 = product of:
        0.7043652 = sum of:
          0.10581445 = weight(abstract_txt:through in 1934) [ClassicSimilarity], result of:
            0.10581445 = score(doc=1934,freq=1.0), product of:
              0.24187277 = queryWeight, product of:
                1.0019177 = boost
                3.9998152 = idf(docFreq=2211, maxDocs=44421)
                0.06035524 = queryNorm
              0.4374798 = fieldWeight in 1934, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9998152 = idf(docFreq=2211, maxDocs=44421)
                0.109375 = fieldNorm(doc=1934)
          0.59855074 = weight(abstract_txt:summarization in 1934) [ClassicSimilarity], result of:
            0.59855074 = score(doc=1934,freq=1.0), product of:
              0.7678758 = queryWeight, product of:
                1.7851884 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.06035524 = queryNorm
              0.77948904 = fieldWeight in 1934, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.109375 = fieldNorm(doc=1934)
        0.5 = coord(2/4)
    
  5. Harrow, J.; Wickersham, L.; Rotherham, S.; Ella Farnsworth, E.; McElhenny, G.: Contextual depth projection in Large Language Models through semantic lattice frameworks (2024) 0.35
    0.34695995 = sum of:
      0.34695995 = product of:
        0.46261328 = sum of:
          0.06011886 = weight(abstract_txt:classification in 2403) [ClassicSimilarity], result of:
            0.06011886 = score(doc=2403,freq=1.0), product of:
              0.24094774 = queryWeight, product of:
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.06035524 = queryNorm
              0.24950996 = fieldWeight in 2403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=2403)
          0.0604654 = weight(abstract_txt:through in 2403) [ClassicSimilarity], result of:
            0.0604654 = score(doc=2403,freq=1.0), product of:
              0.24187277 = queryWeight, product of:
                1.0019177 = boost
                3.9998152 = idf(docFreq=2211, maxDocs=44421)
                0.06035524 = queryNorm
              0.24998845 = fieldWeight in 2403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9998152 = idf(docFreq=2211, maxDocs=44421)
                0.0625 = fieldNorm(doc=2403)
          0.342029 = weight(abstract_txt:summarization in 2403) [ClassicSimilarity], result of:
            0.342029 = score(doc=2403,freq=1.0), product of:
              0.7678758 = queryWeight, product of:
                1.7851884 = boost
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.06035524 = queryNorm
              0.4454223 = fieldWeight in 2403, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.1267567 = idf(docFreq=96, maxDocs=44421)
                0.0625 = fieldNorm(doc=2403)
        0.75 = coord(3/4)