Document (#22388)

Author
Haas, S.W.
Grams, E.S.
Title
Readers, authors, and page structure : a discussion of four questions arising from a content analysis of Web pages
Source
Journal of the American Society for Information Science. 51(2000) no.2, S.181-192
Year
2000
Abstract
Previous research describing Web page and link classification systems resulting from a content analysis of over 75 Web pages left us with four unanswered questions: (1) What is the most useful apllication of page types: as descriptions of entire pages or as components that are combined to create pages? (2) Is there a kind of analysis that we can perform on isolated anchors, which can be text, icons, or both together, that is equivalent to the syntactic analysis for embedded and labeld anchors? (3) How explicitly are readers informed about what can be found by traversing a link, especially for the relatively broad categories of expansion and resource links? (4) Is there a relationship between the type of link and whther its target is a whole page or a fragment, or of its target is in the same site or a different site than its source? This article examines these questions
Theme
Internet

Similar documents (author)

  1. Haas, S.W.: ¬A feasibility study of the case hierarchy model for the construction and porting of natural language interfaces (1990) 5.51
    5.506935 = sum of:
      5.506935 = weight(author_txt:haas in 8070) [ClassicSimilarity], result of:
        5.506935 = fieldWeight in 8070, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.811096 = idf(docFreq=17, maxDocs=44421)
          0.625 = fieldNorm(doc=8070)
    
  2. Haas, S.W.: Disciplinary variation in automatic sublanguage term identification (1997) 5.51
    5.506935 = sum of:
      5.506935 = weight(author_txt:haas in 6568) [ClassicSimilarity], result of:
        5.506935 = fieldWeight in 6568, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.811096 = idf(docFreq=17, maxDocs=44421)
          0.625 = fieldNorm(doc=6568)
    
  3. Haas, S.W.: ¬A text filter for the automatic identification of empirical articles (1996) 5.51
    5.506935 = sum of:
      5.506935 = weight(author_txt:haas in 6866) [ClassicSimilarity], result of:
        5.506935 = fieldWeight in 6866, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.811096 = idf(docFreq=17, maxDocs=44421)
          0.625 = fieldNorm(doc=6866)
    
  4. Haas, S.W.: Natural language processing : toward large-scale, robust systems (1996) 5.51
    5.506935 = sum of:
      5.506935 = weight(author_txt:haas in 484) [ClassicSimilarity], result of:
        5.506935 = fieldWeight in 484, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.811096 = idf(docFreq=17, maxDocs=44421)
          0.625 = fieldNorm(doc=484)
    
  5. Haas, S.: Metadata mania : an overview (1998) 5.51
    5.506935 = sum of:
      5.506935 = weight(author_txt:haas in 3222) [ClassicSimilarity], result of:
        5.506935 = fieldWeight in 3222, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.811096 = idf(docFreq=17, maxDocs=44421)
          0.625 = fieldNorm(doc=3222)
    

Similar documents (content)

  1. Menczer, F.: Lexical and semantic clustering by Web links (2004) 0.22
    0.22031753 = sum of:
      0.22031753 = product of:
        0.7868483 = sum of:
          0.015835945 = weight(abstract_txt:that in 4090) [ClassicSimilarity], result of:
            0.015835945 = score(doc=4090,freq=4.0), product of:
              0.04285532 = queryWeight, product of:
                1.0686762 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.016956622 = queryNorm
              0.3695211 = fieldWeight in 4090, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=4090)
          0.027386466 = weight(abstract_txt:there in 4090) [ClassicSimilarity], result of:
            0.027386466 = score(doc=4090,freq=1.0), product of:
              0.08562271 = queryWeight, product of:
                1.2333679 = boost
                4.094086 = idf(docFreq=2012, maxDocs=44421)
                0.016956622 = queryNorm
              0.31985047 = fieldWeight in 4090, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.094086 = idf(docFreq=2012, maxDocs=44421)
                0.078125 = fieldNorm(doc=4090)
          0.041208494 = weight(abstract_txt:content in 4090) [ClassicSimilarity], result of:
            0.041208494 = score(doc=4090,freq=2.0), product of:
              0.08923724 = queryWeight, product of:
                1.2591319 = boost
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.016956622 = queryNorm
              0.46178582 = fieldWeight in 4090, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.078125 = fieldNorm(doc=4090)
          0.03867488 = weight(abstract_txt:analysis in 4090) [ClassicSimilarity], result of:
            0.03867488 = score(doc=4090,freq=1.0), product of:
              0.13578786 = queryWeight, product of:
                2.1965628 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.016956622 = queryNorm
              0.28481838 = fieldWeight in 4090, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.078125 = fieldNorm(doc=4090)
          0.24887453 = weight(abstract_txt:link in 4090) [ClassicSimilarity], result of:
            0.24887453 = score(doc=4090,freq=5.0), product of:
              0.24960831 = queryWeight, product of:
                2.5791304 = boost
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.016956622 = queryNorm
              0.9970603 = fieldWeight in 4090, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.078125 = fieldNorm(doc=4090)
          0.24351853 = weight(abstract_txt:pages in 4090) [ClassicSimilarity], result of:
            0.24351853 = score(doc=4090,freq=3.0), product of:
              0.3210376 = queryWeight, product of:
                3.377467 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.016956622 = queryNorm
              0.75853586 = fieldWeight in 4090, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.078125 = fieldNorm(doc=4090)
          0.17134944 = weight(abstract_txt:page in 4090) [ClassicSimilarity], result of:
            0.17134944 = score(doc=4090,freq=1.0), product of:
              0.36629423 = queryWeight, product of:
                3.6076818 = boost
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.016956622 = queryNorm
              0.4677918 = fieldWeight in 4090, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.078125 = fieldNorm(doc=4090)
        0.28 = coord(7/25)
    
  2. Liu, Y.; Zhang, M.; Cen, R.; Ru, L.; Ma, S.: Data cleansing for Web information retrieval using query independent features (2007) 0.21
    0.20540792 = sum of:
      0.20540792 = product of:
        0.7335997 = sum of:
          0.010971464 = weight(abstract_txt:that in 1607) [ClassicSimilarity], result of:
            0.010971464 = score(doc=1607,freq=3.0), product of:
              0.04285532 = queryWeight, product of:
                1.0686762 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.016956622 = queryNorm
              0.25601172 = fieldWeight in 1607, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=1607)
          0.021909174 = weight(abstract_txt:there in 1607) [ClassicSimilarity], result of:
            0.021909174 = score(doc=1607,freq=1.0), product of:
              0.08562271 = queryWeight, product of:
                1.2333679 = boost
                4.094086 = idf(docFreq=2012, maxDocs=44421)
                0.016956622 = queryNorm
              0.2558804 = fieldWeight in 1607, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.094086 = idf(docFreq=2012, maxDocs=44421)
                0.0625 = fieldNorm(doc=1607)
          0.025518665 = weight(abstract_txt:what in 1607) [ClassicSimilarity], result of:
            0.025518665 = score(doc=1607,freq=1.0), product of:
              0.09478588 = queryWeight, product of:
                1.297687 = boost
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.016956622 = queryNorm
              0.26922435 = fieldWeight in 1607, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.0625 = fieldNorm(doc=1607)
          0.1569361 = weight(abstract_txt:target in 1607) [ClassicSimilarity], result of:
            0.1569361 = score(doc=1607,freq=3.0), product of:
              0.22060387 = queryWeight, product of:
                1.9797244 = boost
                6.571569 = idf(docFreq=168, maxDocs=44421)
                0.016956622 = queryNorm
              0.7113932 = fieldWeight in 1607, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.571569 = idf(docFreq=168, maxDocs=44421)
                0.0625 = fieldNorm(doc=1607)
          0.04375563 = weight(abstract_txt:analysis in 1607) [ClassicSimilarity], result of:
            0.04375563 = score(doc=1607,freq=2.0), product of:
              0.13578786 = queryWeight, product of:
                2.1965628 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.016956622 = queryNorm
              0.3222352 = fieldWeight in 1607, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=1607)
          0.33742914 = weight(abstract_txt:pages in 1607) [ClassicSimilarity], result of:
            0.33742914 = score(doc=1607,freq=9.0), product of:
              0.3210376 = queryWeight, product of:
                3.377467 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.016956622 = queryNorm
              1.051058 = fieldWeight in 1607, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.0625 = fieldNorm(doc=1607)
          0.13707955 = weight(abstract_txt:page in 1607) [ClassicSimilarity], result of:
            0.13707955 = score(doc=1607,freq=1.0), product of:
              0.36629423 = queryWeight, product of:
                3.6076818 = boost
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.016956622 = queryNorm
              0.37423342 = fieldWeight in 1607, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.0625 = fieldNorm(doc=1607)
        0.28 = coord(7/25)
    
  3. Bar-Ilan, J.: Web links and search engine ranking : the case of Google and the query "Jew" (2006) 0.20
    0.20218267 = sum of:
      0.20218267 = product of:
        0.72208095 = sum of:
          0.010971464 = weight(abstract_txt:that in 104) [ClassicSimilarity], result of:
            0.010971464 = score(doc=104,freq=3.0), product of:
              0.04285532 = queryWeight, product of:
                1.0686762 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.016956622 = queryNorm
              0.25601172 = fieldWeight in 104, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=104)
          0.032966796 = weight(abstract_txt:content in 104) [ClassicSimilarity], result of:
            0.032966796 = score(doc=104,freq=2.0), product of:
              0.08923724 = queryWeight, product of:
                1.2591319 = boost
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.016956622 = queryNorm
              0.36942866 = fieldWeight in 104, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.0625 = fieldNorm(doc=104)
          0.059050877 = weight(abstract_txt:site in 104) [ClassicSimilarity], result of:
            0.059050877 = score(doc=104,freq=1.0), product of:
              0.16582724 = queryWeight, product of:
                1.7164301 = boost
                5.6975803 = idf(docFreq=404, maxDocs=44421)
                0.016956622 = queryNorm
              0.35609877 = fieldWeight in 104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6975803 = idf(docFreq=404, maxDocs=44421)
                0.0625 = fieldNorm(doc=104)
          0.030939901 = weight(abstract_txt:analysis in 104) [ClassicSimilarity], result of:
            0.030939901 = score(doc=104,freq=1.0), product of:
              0.13578786 = queryWeight, product of:
                2.1965628 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.016956622 = queryNorm
              0.2278547 = fieldWeight in 104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.0625 = fieldNorm(doc=104)
          0.08904006 = weight(abstract_txt:link in 104) [ClassicSimilarity], result of:
            0.08904006 = score(doc=104,freq=1.0), product of:
              0.24960831 = queryWeight, product of:
                2.5791304 = boost
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.016956622 = queryNorm
              0.35671914 = fieldWeight in 104, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.0625 = fieldNorm(doc=104)
          0.22495277 = weight(abstract_txt:pages in 104) [ClassicSimilarity], result of:
            0.22495277 = score(doc=104,freq=4.0), product of:
              0.3210376 = queryWeight, product of:
                3.377467 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.016956622 = queryNorm
              0.7007054 = fieldWeight in 104, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.0625 = fieldNorm(doc=104)
          0.2741591 = weight(abstract_txt:page in 104) [ClassicSimilarity], result of:
            0.2741591 = score(doc=104,freq=4.0), product of:
              0.36629423 = queryWeight, product of:
                3.6076818 = boost
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.016956622 = queryNorm
              0.74846685 = fieldWeight in 104, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.0625 = fieldNorm(doc=104)
        0.28 = coord(7/25)
    
  4. Breslin, J.G.: Social semantic information spaces (2009) 0.17
    0.17441657 = sum of:
      0.17441657 = product of:
        0.62291634 = sum of:
          0.074885815 = weight(abstract_txt:explicitly in 364) [ClassicSimilarity], result of:
            0.074885815 = score(doc=364,freq=2.0), product of:
              0.12239154 = queryWeight, product of:
                1.0426987 = boost
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.016956622 = queryNorm
              0.61185455 = fieldWeight in 364, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.922344 = idf(docFreq=118, maxDocs=44421)
                0.0625 = fieldNorm(doc=364)
          0.015515995 = weight(abstract_txt:that in 364) [ClassicSimilarity], result of:
            0.015515995 = score(doc=364,freq=6.0), product of:
              0.04285532 = queryWeight, product of:
                1.0686762 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.016956622 = queryNorm
              0.3620553 = fieldWeight in 364, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=364)
          0.0379478 = weight(abstract_txt:there in 364) [ClassicSimilarity], result of:
            0.0379478 = score(doc=364,freq=3.0), product of:
              0.08562271 = queryWeight, product of:
                1.2333679 = boost
                4.094086 = idf(docFreq=2012, maxDocs=44421)
                0.016956622 = queryNorm
              0.44319782 = fieldWeight in 364, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.094086 = idf(docFreq=2012, maxDocs=44421)
                0.0625 = fieldNorm(doc=364)
          0.044199623 = weight(abstract_txt:what in 364) [ClassicSimilarity], result of:
            0.044199623 = score(doc=364,freq=3.0), product of:
              0.09478588 = queryWeight, product of:
                1.297687 = boost
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.016956622 = queryNorm
              0.46631023 = fieldWeight in 364, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.0625 = fieldNorm(doc=364)
          0.1542219 = weight(abstract_txt:link in 364) [ClassicSimilarity], result of:
            0.1542219 = score(doc=364,freq=3.0), product of:
              0.24960831 = queryWeight, product of:
                2.5791304 = boost
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.016956622 = queryNorm
              0.61785567 = fieldWeight in 364, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.0625 = fieldNorm(doc=364)
          0.15906563 = weight(abstract_txt:pages in 364) [ClassicSimilarity], result of:
            0.15906563 = score(doc=364,freq=2.0), product of:
              0.3210376 = queryWeight, product of:
                3.377467 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.016956622 = queryNorm
              0.49547353 = fieldWeight in 364, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.0625 = fieldNorm(doc=364)
          0.13707955 = weight(abstract_txt:page in 364) [ClassicSimilarity], result of:
            0.13707955 = score(doc=364,freq=1.0), product of:
              0.36629423 = queryWeight, product of:
                3.6076818 = boost
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.016956622 = queryNorm
              0.37423342 = fieldWeight in 364, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.0625 = fieldNorm(doc=364)
        0.28 = coord(7/25)
    
  5. Thelwall, M.; Vaughan, L.: New versions of PageRank employing alternative Web document models (2004) 0.16
    0.16202286 = sum of:
      0.16202286 = product of:
        0.67509526 = sum of:
          0.012668756 = weight(abstract_txt:that in 799) [ClassicSimilarity], result of:
            0.012668756 = score(doc=799,freq=4.0), product of:
              0.04285532 = queryWeight, product of:
                1.0686762 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.016956622 = queryNorm
              0.2956169 = fieldWeight in 799, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=799)
          0.0445113 = weight(abstract_txt:four in 799) [ClassicSimilarity], result of:
            0.0445113 = score(doc=799,freq=1.0), product of:
              0.13734679 = queryWeight, product of:
                1.5620949 = boost
                5.1852746 = idf(docFreq=675, maxDocs=44421)
                0.016956622 = queryNorm
              0.32407966 = fieldWeight in 799, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1852746 = idf(docFreq=675, maxDocs=44421)
                0.0625 = fieldNorm(doc=799)
          0.08351055 = weight(abstract_txt:site in 799) [ClassicSimilarity], result of:
            0.08351055 = score(doc=799,freq=2.0), product of:
              0.16582724 = queryWeight, product of:
                1.7164301 = boost
                5.6975803 = idf(docFreq=404, maxDocs=44421)
                0.016956622 = queryNorm
              0.5035997 = fieldWeight in 799, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6975803 = idf(docFreq=404, maxDocs=44421)
                0.0625 = fieldNorm(doc=799)
          0.08904006 = weight(abstract_txt:link in 799) [ClassicSimilarity], result of:
            0.08904006 = score(doc=799,freq=1.0), product of:
              0.24960831 = queryWeight, product of:
                2.5791304 = boost
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.016956622 = queryNorm
              0.35671914 = fieldWeight in 799, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.0625 = fieldNorm(doc=799)
          0.25150484 = weight(abstract_txt:pages in 799) [ClassicSimilarity], result of:
            0.25150484 = score(doc=799,freq=5.0), product of:
              0.3210376 = queryWeight, product of:
                3.377467 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.016956622 = queryNorm
              0.78341246 = fieldWeight in 799, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.0625 = fieldNorm(doc=799)
          0.19385976 = weight(abstract_txt:page in 799) [ClassicSimilarity], result of:
            0.19385976 = score(doc=799,freq=2.0), product of:
              0.36629423 = queryWeight, product of:
                3.6076818 = boost
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.016956622 = queryNorm
              0.529246 = fieldWeight in 799, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.987735 = idf(docFreq=302, maxDocs=44421)
                0.0625 = fieldNorm(doc=799)
        0.24 = coord(6/25)