Document (#21065)

Author
Wätjen, H.-J.
Title
GERHARD : Automatisches Sammeln, Klassifizieren und Indexieren von wissenschaftlich relevanten Informationsressourcen im deutschen World Wide Web
Source
B.I.T.online. 1(1998) H.4, S.279-290
Year
1998
Abstract
Die intellektuelle Erschließung des Internet befindet sich in einer Krise. Yahoo und andere Dienste können mit dem Wachstum des Web nicht mithalten. GERHARD ist derzeit weltweit der einzige Such- und Navigationsdienst, der die mit einem Roboter gesammelten Internetressourcen mit computerlinguistischen und statistischen Verfahren auch automatisch vollständig klassifiziert. Weit über eine Million HTML-Dokumente von wissenschaftlich relevanten Servern in Deutschland können wie bei anderen Suchmaschinen in der Datenbank gesucht, aber auch über die Navigation in der dreisprachigen Universalen Dezimalklassifikation (ETH-Bibliothek Zürich) recherchiert werden
Footnote
Vgl. auch: http://www.gerhard.de/info/Dokumente/Bericht/bericht.pdf
Theme
Automatisches Klassifizieren
Internet
Klassifikationssysteme im Online-Retrieval
Object
GERHARD
DK
Harvest
UDC

Similar documents (author)

  1. Wätjen, H.-J.: ORBIS, der Oldenburger Online-Benutzerkatalog (1991) 4.61
    4.6059904 = sum of:
      4.6059904 = weight(author_txt:wätjen in 2068) [ClassicSimilarity], result of:
        4.6059904 = fieldWeight in 2068, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.5 = fieldNorm(doc=2068)
    
  2. Wätjen, H.-J.: Mensch oder Maschine? : Auswahl und Erschließung vonm Informationsressourcen im Internet (1996) 4.61
    4.6059904 = sum of:
      4.6059904 = weight(author_txt:wätjen in 3161) [ClassicSimilarity], result of:
        4.6059904 = fieldWeight in 3161, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.5 = fieldNorm(doc=3161)
    
  3. Wätjen, H.-J.: Hypertextbasierte OPACs im World-wide Web (1996) 4.61
    4.6059904 = sum of:
      4.6059904 = weight(author_txt:wätjen in 5456) [ClassicSimilarity], result of:
        4.6059904 = fieldWeight in 5456, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.5 = fieldNorm(doc=5456)
    
  4. Wätjen, H.-J.: Automatisches Sammeln, Klassifizieren und Indexieren von wissenschaftlich relevanten Informationsressourcen im deutschen World Wide Web : das DFG-Projekt GERHARD (1998) 4.61
    4.6059904 = sum of:
      4.6059904 = weight(author_txt:wätjen in 3066) [ClassicSimilarity], result of:
        4.6059904 = fieldWeight in 3066, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.5 = fieldNorm(doc=3066)
    
  5. Wätjen, H.-J.: Zur Realität virtueller Bibliotheken : Möglichkeiten, Aufgaben, Probleme (1999) 4.61
    4.6059904 = sum of:
      4.6059904 = weight(author_txt:wätjen in 4125) [ClassicSimilarity], result of:
        4.6059904 = fieldWeight in 4125, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.211981 = idf(docFreq=11, maxDocs=44218)
          0.5 = fieldNorm(doc=4125)
    

Similar documents (content)

  1. Krüger, C.: Evaluation des WWW-Suchdienstes GERHARD unter besonderer Beachtung automatischer Indexierung (1999) 0.35
    0.3456611 = sum of:
      0.3456611 = product of:
        1.234504 = sum of:
          0.07759602 = weight(abstract_txt:intellektuelle in 1777) [ClassicSimilarity], result of:
            0.07759602 = score(doc=1777,freq=1.0), product of:
              0.159068 = queryWeight, product of:
                1.0126057 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.020126387 = queryNorm
              0.4878167 = fieldWeight in 1777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.0625 = fieldNorm(doc=1777)
          0.078845575 = weight(abstract_txt:einzige in 1777) [ClassicSimilarity], result of:
            0.078845575 = score(doc=1777,freq=1.0), product of:
              0.16077113 = queryWeight, product of:
                1.0180122 = boost
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.020126387 = queryNorm
              0.49042124 = fieldWeight in 1777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.0625 = fieldNorm(doc=1777)
          0.10241011 = weight(abstract_txt:servern in 1777) [ClassicSimilarity], result of:
            0.10241011 = score(doc=1777,freq=1.0), product of:
              0.19138962 = queryWeight, product of:
                1.1107291 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.020126387 = queryNorm
              0.53508705 = fieldWeight in 1777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.0625 = fieldNorm(doc=1777)
          0.1520391 = weight(abstract_txt:internetressourcen in 1777) [ClassicSimilarity], result of:
            0.1520391 = score(doc=1777,freq=2.0), product of:
              0.1976894 = queryWeight, product of:
                1.1288614 = boost
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.020126387 = queryNorm
              0.7690807 = fieldWeight in 1777, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.701155 = idf(docFreq=19, maxDocs=44218)
                0.0625 = fieldNorm(doc=1777)
          0.10942038 = weight(abstract_txt:klassifiziert in 1777) [ClassicSimilarity], result of:
            0.10942038 = score(doc=1777,freq=1.0), product of:
              0.20002702 = queryWeight, product of:
                1.135516 = boost
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.020126387 = queryNorm
              0.547028 = fieldWeight in 1777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.752448 = idf(docFreq=18, maxDocs=44218)
                0.0625 = fieldNorm(doc=1777)
          0.13487367 = weight(abstract_txt:wissenschaftlich in 1777) [ClassicSimilarity], result of:
            0.13487367 = score(doc=1777,freq=1.0), product of:
              0.2897241 = queryWeight, product of:
                1.9326636 = boost
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.020126387 = queryNorm
              0.4655245 = fieldWeight in 1777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.448392 = idf(docFreq=69, maxDocs=44218)
                0.0625 = fieldNorm(doc=1777)
          0.57931906 = weight(abstract_txt:gerhard in 1777) [ClassicSimilarity], result of:
            0.57931906 = score(doc=1777,freq=8.0), product of:
              0.38277924 = queryWeight, product of:
                2.2214582 = boost
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.020126387 = queryNorm
              1.5134547 = fieldWeight in 1777, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.561393 = idf(docFreq=22, maxDocs=44218)
                0.0625 = fieldNorm(doc=1777)
        0.28 = coord(7/25)
    
  2. Lepsky, K.: Automatisches Indexieren (2023) 0.22
    0.21598837 = sum of:
      0.21598837 = product of:
        1.3499273 = sum of:
          0.11639404 = weight(abstract_txt:intellektuelle in 781) [ClassicSimilarity], result of:
            0.11639404 = score(doc=781,freq=1.0), product of:
              0.159068 = queryWeight, product of:
                1.0126057 = boost
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.020126387 = queryNorm
              0.73172504 = fieldWeight in 781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.805067 = idf(docFreq=48, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
          0.03050718 = weight(abstract_txt:über in 781) [ClassicSimilarity], result of:
            0.03050718 = score(doc=781,freq=1.0), product of:
              0.08208057 = queryWeight, product of:
                1.0286891 = boost
                3.964518 = idf(docFreq=2280, maxDocs=44218)
                0.020126387 = queryNorm
              0.37167358 = fieldWeight in 781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.964518 = idf(docFreq=2280, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
          0.04309605 = weight(abstract_txt:können in 781) [ClassicSimilarity], result of:
            0.04309605 = score(doc=781,freq=1.0), product of:
              0.103338934 = queryWeight, product of:
                1.1542395 = boost
                4.4483833 = idf(docFreq=1405, maxDocs=44218)
                0.020126387 = queryNorm
              0.41703594 = fieldWeight in 781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4483833 = idf(docFreq=1405, maxDocs=44218)
                0.09375 = fieldNorm(doc=781)
          1.15993 = weight(title_txt:automatisches in 781) [ClassicSimilarity], result of:
            1.15993 = score(doc=781,freq=1.0), product of:
              0.207959 = queryWeight, product of:
                1.1578114 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.020126387 = queryNorm
              5.5776863 = fieldWeight in 781, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.625 = fieldNorm(doc=781)
        0.16 = coord(4/25)
    
  3. Oberhauser, O.: Automatisches Klassifizieren : Entwicklungsstand - Methodik - Anwendungsbereiche (2005) 0.13
    0.13072713 = sum of:
      0.13072713 = product of:
        1.0893928 = sum of:
          0.81195104 = weight(title_txt:automatisches in 38) [ClassicSimilarity], result of:
            0.81195104 = score(doc=38,freq=1.0), product of:
              0.207959 = queryWeight, product of:
                1.1578114 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.020126387 = queryNorm
              3.9043806 = fieldWeight in 38, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.4375 = fieldNorm(doc=38)
          0.17963411 = weight(abstract_txt:klassifizieren in 38) [ClassicSimilarity], result of:
            0.17963411 = score(doc=38,freq=3.0), product of:
              0.21097772 = queryWeight, product of:
                1.1661844 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.020126387 = queryNorm
              0.8514364 = fieldWeight in 38, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
          0.097807646 = weight(abstract_txt:relevanten in 38) [ClassicSimilarity], result of:
            0.097807646 = score(doc=38,freq=1.0), product of:
              0.25562873 = queryWeight, product of:
                1.8153852 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.020126387 = queryNorm
              0.382616 = fieldWeight in 38, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0546875 = fieldNorm(doc=38)
        0.12 = coord(3/25)
    
  4. Gödert, W.; Lepsky, K.; Nagelschmidt, M.: Informationserschließung und Automatisches Indexieren : ein Lehr- und Arbeitsbuch (2011) 0.12
    0.120750025 = sum of:
      0.120750025 = product of:
        0.75468767 = sum of:
          0.035953052 = weight(abstract_txt:über in 2550) [ClassicSimilarity], result of:
            0.035953052 = score(doc=2550,freq=2.0), product of:
              0.08208057 = queryWeight, product of:
                1.0286891 = boost
                3.964518 = idf(docFreq=2280, maxDocs=44218)
                0.020126387 = queryNorm
              0.43802148 = fieldWeight in 2550, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.964518 = idf(docFreq=2280, maxDocs=44218)
                0.078125 = fieldNorm(doc=2550)
          0.102856256 = weight(abstract_txt:indexieren in 2550) [ClassicSimilarity], result of:
            0.102856256 = score(doc=2550,freq=1.0), product of:
              0.16541325 = queryWeight, product of:
                1.0326047 = boost
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.020126387 = queryNorm
              0.6218139 = fieldWeight in 2550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.9592175 = idf(docFreq=41, maxDocs=44218)
                0.078125 = fieldNorm(doc=2550)
          0.035913374 = weight(abstract_txt:können in 2550) [ClassicSimilarity], result of:
            0.035913374 = score(doc=2550,freq=1.0), product of:
              0.103338934 = queryWeight, product of:
                1.1542395 = boost
                4.4483833 = idf(docFreq=1405, maxDocs=44218)
                0.020126387 = queryNorm
              0.34752995 = fieldWeight in 2550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4483833 = idf(docFreq=1405, maxDocs=44218)
                0.078125 = fieldNorm(doc=2550)
          0.579965 = weight(title_txt:automatisches in 2550) [ClassicSimilarity], result of:
            0.579965 = score(doc=2550,freq=1.0), product of:
              0.207959 = queryWeight, product of:
                1.1578114 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.020126387 = queryNorm
              2.7888432 = fieldWeight in 2550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.3125 = fieldNorm(doc=2550)
        0.16 = coord(4/25)
    
  5. Oberhauser, O.: Automatisches Klassifizieren : Verfahren zur Erschließung elektronischer Dokumente (2004) 0.12
    0.116807975 = sum of:
      0.116807975 = product of:
        0.9733998 = sum of:
          0.6959581 = weight(title_txt:automatisches in 2487) [ClassicSimilarity], result of:
            0.6959581 = score(doc=2487,freq=1.0), product of:
              0.207959 = queryWeight, product of:
                1.1578114 = boost
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.020126387 = queryNorm
              3.346612 = fieldWeight in 2487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.924298 = idf(docFreq=15, maxDocs=44218)
                0.375 = fieldNorm(doc=2487)
          0.17963411 = weight(abstract_txt:klassifizieren in 2487) [ClassicSimilarity], result of:
            0.17963411 = score(doc=2487,freq=3.0), product of:
              0.21097772 = queryWeight, product of:
                1.1661844 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.020126387 = queryNorm
              0.8514364 = fieldWeight in 2487, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2487)
          0.097807646 = weight(abstract_txt:relevanten in 2487) [ClassicSimilarity], result of:
            0.097807646 = score(doc=2487,freq=1.0), product of:
              0.25562873 = queryWeight, product of:
                1.8153852 = boost
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.020126387 = queryNorm
              0.382616 = fieldWeight in 2487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.996407 = idf(docFreq=109, maxDocs=44218)
                0.0546875 = fieldNorm(doc=2487)
        0.12 = coord(3/25)