Document (#24286)

Author
Chakrabati, S.
Van den Berg, M.
Dom, B.
Title
Focused crawling : a new approach in topic-specific Web resource discovery
Source
Computer networks. 31(1999) no.11-16, S.1623-1640
Year
1999
Theme
Internet

Similar documents (author)

  1. Berg, O.: Current problems with MARC/ISBD formats in relation to online public access of bibliographic information (1991) 5.41
    5.4105906 = sum of:
      5.4105906 = weight(author_txt:berg in 468) [ClassicSimilarity], result of:
        5.4105906 = fieldWeight in 468, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.656945 = idf(docFreq=20, maxDocs=44421)
          0.625 = fieldNorm(doc=468)
    
  2. Berg, S.: Auf dem Weg : Fallbeispiel: Vorbereitungen für einen elektronischen Katalog (1995) 5.41
    5.4105906 = sum of:
      5.4105906 = weight(author_txt:berg in 716) [ClassicSimilarity], result of:
        5.4105906 = fieldWeight in 716, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.656945 = idf(docFreq=20, maxDocs=44421)
          0.625 = fieldNorm(doc=716)
    
  3. Berg, L.: Wie das Internet die Gesellschaft verändert : Google gründet ein Forschungsinstitut in Berlin (2011) 5.41
    5.4105906 = sum of:
      5.4105906 = weight(author_txt:berg in 552) [ClassicSimilarity], result of:
        5.4105906 = fieldWeight in 552, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.656945 = idf(docFreq=20, maxDocs=44421)
          0.625 = fieldNorm(doc=552)
    
  4. Berg, L.: Pablo will es wissen : Lernen mit Salman Khan (2012) 5.41
    5.4105906 = sum of:
      5.4105906 = weight(author_txt:berg in 1228) [ClassicSimilarity], result of:
        5.4105906 = fieldWeight in 1228, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.656945 = idf(docFreq=20, maxDocs=44421)
          0.625 = fieldNorm(doc=1228)
    
  5. Berg, J. van den: ¬The ICONCLASS browser user's guide (1992) 4.33
    4.3284726 = sum of:
      4.3284726 = weight(author_txt:berg in 3269) [ClassicSimilarity], result of:
        4.3284726 = fieldWeight in 3269, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.656945 = idf(docFreq=20, maxDocs=44421)
          0.5 = fieldNorm(doc=3269)
    

Similar documents (content)

  1. Kwiatkowski, M.; Höhfeld, S.: Thematisches Aufspüren von Web-Dokumenten : eine kritische Betrachtung von Focused Crawling-Strategien (2007) 0.46
    0.4580477 = sum of:
      0.4580477 = product of:
        1.0687779 = sum of:
          0.2589926 = weight(abstract_txt:focused in 1153) [ClassicSimilarity], result of:
            0.2589926 = score(doc=1153,freq=5.0), product of:
              0.33165202 = queryWeight, product of:
                1.4936033 = boost
                5.5877852 = idf(docFreq=451, maxDocs=44421)
                0.039738152 = queryNorm
              0.78091675 = fieldWeight in 1153, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.5877852 = idf(docFreq=451, maxDocs=44421)
                0.0625 = fieldNorm(doc=1153)
          0.11764925 = weight(abstract_txt:discovery in 1153) [ClassicSimilarity], result of:
            0.11764925 = score(doc=1153,freq=1.0), product of:
              0.33512527 = queryWeight, product of:
                1.5014039 = boost
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.039738152 = queryNorm
              0.3510605 = fieldWeight in 1153, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.0625 = fieldNorm(doc=1153)
          0.69213605 = weight(abstract_txt:crawling in 1153) [ClassicSimilarity], result of:
            0.69213605 = score(doc=1153,freq=3.0), product of:
              0.7572425 = queryWeight, product of:
                2.2568955 = boost
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.039738152 = queryNorm
              0.9140217 = fieldWeight in 1153, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.0625 = fieldNorm(doc=1153)
        0.42857143 = coord(3/7)
    
  2. Alqaraleh, S.; Ramadan, O.; Salamah, M.: Efficient watcher based web crawler design (2015) 0.38
    0.38012806 = sum of:
      0.38012806 = product of:
        0.88696545 = sum of:
          0.03476134 = weight(abstract_txt:approach in 2627) [ClassicSimilarity], result of:
            0.03476134 = score(doc=2627,freq=1.0), product of:
              0.14866614 = queryWeight, product of:
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.039738152 = queryNorm
              0.2338215 = fieldWeight in 2627, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=2627)
          0.052994218 = weight(abstract_txt:specific in 2627) [ClassicSimilarity], result of:
            0.052994218 = score(doc=2627,freq=1.0), product of:
              0.19692464 = queryWeight, product of:
                1.1509169 = boost
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.039738152 = queryNorm
              0.26910913 = fieldWeight in 2627, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.0625 = fieldNorm(doc=2627)
          0.7992099 = weight(abstract_txt:crawling in 2627) [ClassicSimilarity], result of:
            0.7992099 = score(doc=2627,freq=4.0), product of:
              0.7572425 = queryWeight, product of:
                2.2568955 = boost
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.039738152 = queryNorm
              1.0554214 = fieldWeight in 2627, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.0625 = fieldNorm(doc=2627)
        0.42857143 = coord(3/7)
    
  3. Simeoni, F.; Yakici, M.; Neely, S.; Crestani, F.: Metadata harvesting for content-based distributed information retrieval (2008) 0.31
    0.30728668 = sum of:
      0.30728668 = product of:
        0.7170023 = sum of:
          0.06952268 = weight(abstract_txt:approach in 2336) [ClassicSimilarity], result of:
            0.06952268 = score(doc=2336,freq=4.0), product of:
              0.14866614 = queryWeight, product of:
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.039738152 = queryNorm
              0.467643 = fieldWeight in 2336, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=2336)
          0.082352914 = weight(abstract_txt:resource in 2336) [ClassicSimilarity], result of:
            0.082352914 = score(doc=2336,freq=1.0), product of:
              0.26420054 = queryWeight, product of:
                1.3330941 = boost
                4.987297 = idf(docFreq=823, maxDocs=44421)
                0.039738152 = queryNorm
              0.31170607 = fieldWeight in 2336, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.987297 = idf(docFreq=823, maxDocs=44421)
                0.0625 = fieldNorm(doc=2336)
          0.5651267 = weight(abstract_txt:crawling in 2336) [ClassicSimilarity], result of:
            0.5651267 = score(doc=2336,freq=2.0), product of:
              0.7572425 = queryWeight, product of:
                2.2568955 = boost
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.039738152 = queryNorm
              0.7462956 = fieldWeight in 2336, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.0625 = fieldNorm(doc=2336)
        0.42857143 = coord(3/7)
    
  4. Slavic, A.: General library classification in learning material metadata : the application in IMS/LOM and CDMES metadata schemas (2003) 0.27
    0.27499998 = sum of:
      0.27499998 = product of:
        0.48124993 = sum of:
          0.061449945 = weight(abstract_txt:approach in 4961) [ClassicSimilarity], result of:
            0.061449945 = score(doc=4961,freq=2.0), product of:
              0.14866614 = queryWeight, product of:
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.039738152 = queryNorm
              0.41334188 = fieldWeight in 4961, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=4961)
          0.06624278 = weight(abstract_txt:specific in 4961) [ClassicSimilarity], result of:
            0.06624278 = score(doc=4961,freq=1.0), product of:
              0.19692464 = queryWeight, product of:
                1.1509169 = boost
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.039738152 = queryNorm
              0.3363864 = fieldWeight in 4961, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.078125 = fieldNorm(doc=4961)
          0.14558075 = weight(abstract_txt:resource in 4961) [ClassicSimilarity], result of:
            0.14558075 = score(doc=4961,freq=2.0), product of:
              0.26420054 = queryWeight, product of:
                1.3330941 = boost
                4.987297 = idf(docFreq=823, maxDocs=44421)
                0.039738152 = queryNorm
              0.55102366 = fieldWeight in 4961, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.987297 = idf(docFreq=823, maxDocs=44421)
                0.078125 = fieldNorm(doc=4961)
          0.20797646 = weight(abstract_txt:discovery in 4961) [ClassicSimilarity], result of:
            0.20797646 = score(doc=4961,freq=2.0), product of:
              0.33512527 = queryWeight, product of:
                1.5014039 = boost
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.039738152 = queryNorm
              0.6205932 = fieldWeight in 4961, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.078125 = fieldNorm(doc=4961)
        0.5714286 = coord(4/7)
    
  5. Otterbacher, J.; Radev, D.: Exploring fact-focused relevance and novelty detection (2008) 0.27
    0.26945817 = sum of:
      0.26945817 = product of:
        0.47155178 = sum of:
          0.06952268 = weight(abstract_txt:approach in 3210) [ClassicSimilarity], result of:
            0.06952268 = score(doc=3210,freq=4.0), product of:
              0.14866614 = queryWeight, product of:
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.039738152 = queryNorm
              0.467643 = fieldWeight in 3210, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=3210)
          0.052994218 = weight(abstract_txt:specific in 3210) [ClassicSimilarity], result of:
            0.052994218 = score(doc=3210,freq=1.0), product of:
              0.19692464 = queryWeight, product of:
                1.1509169 = boost
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.039738152 = queryNorm
              0.26910913 = fieldWeight in 3210, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.305746 = idf(docFreq=1628, maxDocs=44421)
                0.0625 = fieldNorm(doc=3210)
          0.14842007 = weight(abstract_txt:topic in 3210) [ClassicSimilarity], result of:
            0.14842007 = score(doc=3210,freq=3.0), product of:
              0.27129123 = queryWeight, product of:
                1.3508646 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.039738152 = queryNorm
              0.5470876 = fieldWeight in 3210, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=3210)
          0.20061481 = weight(abstract_txt:focused in 3210) [ClassicSimilarity], result of:
            0.20061481 = score(doc=3210,freq=3.0), product of:
              0.33165202 = queryWeight, product of:
                1.4936033 = boost
                5.5877852 = idf(docFreq=451, maxDocs=44421)
                0.039738152 = queryNorm
              0.6048955 = fieldWeight in 3210, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5877852 = idf(docFreq=451, maxDocs=44421)
                0.0625 = fieldNorm(doc=3210)
        0.5714286 = coord(4/7)