Document (#21065)

Author
Wätjen, H.-J.
Title
GERHARD : Automatisches Sammeln, Klassifizieren und Indexieren von wissenschaftlich relevanten Informationsressourcen im deutschen World Wide Web
Source
B.I.T.online. 1(1998) H.4, S.279-290
Year
1998
Abstract
Die intellektuelle Erschließung des Internet befindet sich in einer Krise. Yahoo und andere Dienste können mit dem Wachstum des Web nicht mithalten. GERHARD ist derzeit weltweit der einzige Such- und Navigationsdienst, der die mit einem Roboter gesammelten Internetressourcen mit computerlinguistischen und statistischen Verfahren auch automatisch vollständig klassifiziert. Weit über eine Million HTML-Dokumente von wissenschaftlich relevanten Servern in Deutschland können wie bei anderen Suchmaschinen in der Datenbank gesucht, aber auch über die Navigation in der dreisprachigen Universalen Dezimalklassifikation (ETH-Bibliothek Zürich) recherchiert werden
Footnote
Vgl. auch: http://www.gerhard.de/info/Dokumente/Bericht/bericht.pdf
Theme
Automatisches Klassifizieren
Internet
Klassifikationssysteme im Online-Retrieval
Object
GERHARD
DK
Harvest
UDC

Similar documents (author)

  1. Wätjen, H.-J.: ORBIS, der Oldenburger Online-Benutzerkatalog (1991) 4.61
    4.6082807 = sum of:
      4.6082807 = weight(author_txt:wätjen in 2067) [ClassicSimilarity], result of:
        4.6082807 = fieldWeight in 2067, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.5 = fieldNorm(doc=2067)
    
  2. Wätjen, H.-J.: Mensch oder Maschine? : Auswahl und Erschließung vonm Informationsressourcen im Internet (1996) 4.61
    4.6082807 = sum of:
      4.6082807 = weight(author_txt:wätjen in 3229) [ClassicSimilarity], result of:
        4.6082807 = fieldWeight in 3229, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.5 = fieldNorm(doc=3229)
    
  3. Wätjen, H.-J.: Hypertextbasierte OPACs im World-wide Web (1996) 4.61
    4.6082807 = sum of:
      4.6082807 = weight(author_txt:wätjen in 5524) [ClassicSimilarity], result of:
        4.6082807 = fieldWeight in 5524, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.5 = fieldNorm(doc=5524)
    
  4. Wätjen, H.-J.: Automatisches Sammeln, Klassifizieren und Indexieren von wissenschaftlich relevanten Informationsressourcen im deutschen World Wide Web : das DFG-Projekt GERHARD (1998) 4.61
    4.6082807 = sum of:
      4.6082807 = weight(author_txt:wätjen in 4066) [ClassicSimilarity], result of:
        4.6082807 = fieldWeight in 4066, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.5 = fieldNorm(doc=4066)
    
  5. Wätjen, H.-J.: Zur Realität virtueller Bibliotheken : Möglichkeiten, Aufgaben, Probleme (1999) 4.61
    4.6082807 = sum of:
      4.6082807 = weight(author_txt:wätjen in 5125) [ClassicSimilarity], result of:
        4.6082807 = fieldWeight in 5125, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.5 = fieldNorm(doc=5125)
    

Similar documents (content)

  1. Krüger, C.: Evaluation des WWW-Suchdienstes GERHARD unter besonderer Beachtung automatischer Indexierung (1999) 0.35
    0.34574223 = sum of:
      0.34574223 = product of:
        1.2347937 = sum of:
          0.07710796 = weight(abstract_txt:intellektuelle in 2777) [ClassicSimilarity], result of:
            0.07710796 = score(doc=2777,freq=1.0), product of:
              0.1583845 = queryWeight, product of:
                1.0123874 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.020084428 = queryNorm
              0.48684028 = fieldWeight in 2777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.0625 = fieldNorm(doc=2777)
          0.078960106 = weight(abstract_txt:einzige in 2777) [ClassicSimilarity], result of:
            0.078960106 = score(doc=2777,freq=1.0), product of:
              0.16091074 = queryWeight, product of:
                1.0204293 = boost
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.020084428 = queryNorm
              0.4907075 = fieldWeight in 2777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.0625 = fieldNorm(doc=2777)
          0.1025439 = weight(abstract_txt:servern in 2777) [ClassicSimilarity], result of:
            0.1025439 = score(doc=2777,freq=1.0), product of:
              0.19153719 = queryWeight, product of:
                1.1133121 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.020084428 = queryNorm
              0.53537333 = fieldWeight in 2777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.0625 = fieldNorm(doc=2777)
          0.15223378 = weight(abstract_txt:internetressourcen in 2777) [ClassicSimilarity], result of:
            0.15223378 = score(doc=2777,freq=2.0), product of:
              0.1978384 = queryWeight, product of:
                1.1314769 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.020084428 = queryNorm
              0.76948553 = fieldWeight in 2777, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.0625 = fieldNorm(doc=2777)
          0.10955949 = weight(abstract_txt:klassifiziert in 2777) [ClassicSimilarity], result of:
            0.10955949 = score(doc=2777,freq=1.0), product of:
              0.20017657 = queryWeight, product of:
                1.1381434 = boost
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.020084428 = queryNorm
              0.5473143 = fieldWeight in 2777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.757029 = idf(docFreq=18, maxDocs=44421)
                0.0625 = fieldNorm(doc=2777)
          0.13431245 = weight(abstract_txt:wissenschaftlich in 2777) [ClassicSimilarity], result of:
            0.13431245 = score(doc=2777,freq=1.0), product of:
              0.28889105 = queryWeight, product of:
                1.9336257 = boost
                7.438788 = idf(docFreq=70, maxDocs=44421)
                0.020084428 = queryNorm
              0.46492425 = fieldWeight in 2777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.438788 = idf(docFreq=70, maxDocs=44421)
                0.0625 = fieldNorm(doc=2777)
          0.5800759 = weight(abstract_txt:gerhard in 2777) [ClassicSimilarity], result of:
            0.5800759 = score(doc=2777,freq=8.0), product of:
              0.38307437 = queryWeight, product of:
                2.2266243 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.020084428 = queryNorm
              1.5142645 = fieldWeight in 2777, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.0625 = fieldNorm(doc=2777)
        0.28 = coord(7/25)
    
  2. Lepsky, K.: Automatisches Indexieren (2023) 0.22
    0.21609548 = sum of:
      0.21609548 = product of:
        1.3505968 = sum of:
          0.115661934 = weight(abstract_txt:intellektuelle in 1782) [ClassicSimilarity], result of:
            0.115661934 = score(doc=1782,freq=1.0), product of:
              0.1583845 = queryWeight, product of:
                1.0123874 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.020084428 = queryNorm
              0.73026043 = fieldWeight in 1782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
          0.030472616 = weight(abstract_txt:über in 1782) [ClassicSimilarity], result of:
            0.030472616 = score(doc=1782,freq=1.0), product of:
              0.08201039 = queryWeight, product of:
                1.0302434 = boost
                3.9634154 = idf(docFreq=2293, maxDocs=44421)
                0.020084428 = queryNorm
              0.3715702 = fieldWeight in 1782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9634154 = idf(docFreq=2293, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
          0.04309251 = weight(abstract_txt:können in 1782) [ClassicSimilarity], result of:
            0.04309251 = score(doc=1782,freq=1.0), product of:
              0.10332298 = queryWeight, product of:
                1.1563888 = boost
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.020084428 = queryNorm
              0.4170661 = fieldWeight in 1782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.09375 = fieldNorm(doc=1782)
          1.1613697 = weight(title_txt:automatisches in 1782) [ClassicSimilarity], result of:
            1.1613697 = score(doc=1782,freq=1.0), product of:
              0.20811029 = queryWeight, product of:
                1.1604787 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.020084428 = queryNorm
              5.5805492 = fieldWeight in 1782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.625 = fieldNorm(doc=1782)
        0.16 = coord(4/25)
    
  3. Oberhauser, O.: Automatisches Klassifizieren : Entwicklungsstand - Methodik - Anwendungsbereiche (2005) 0.13
    0.13089412 = sum of:
      0.13089412 = product of:
        1.0907844 = sum of:
          0.8129588 = weight(title_txt:automatisches in 163) [ClassicSimilarity], result of:
            0.8129588 = score(doc=163,freq=1.0), product of:
              0.20811029 = queryWeight, product of:
                1.1604787 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.020084428 = queryNorm
              3.9063845 = fieldWeight in 163, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.4375 = fieldNorm(doc=163)
          0.17985506 = weight(abstract_txt:klassifizieren in 163) [ClassicSimilarity], result of:
            0.17985506 = score(doc=163,freq=3.0), product of:
              0.21112965 = queryWeight, product of:
                1.1688668 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.020084428 = queryNorm
              0.85187024 = fieldWeight in 163, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0546875 = fieldNorm(doc=163)
          0.09797055 = weight(abstract_txt:relevanten in 163) [ClassicSimilarity], result of:
            0.09797055 = score(doc=163,freq=1.0), product of:
              0.255887 = queryWeight, product of:
                1.8198245 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.020084428 = queryNorm
              0.38286647 = fieldWeight in 163, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0546875 = fieldNorm(doc=163)
        0.12 = coord(3/25)
    
  4. Gödert, W.; Lepsky, K.; Nagelschmidt, M.: Informationserschließung und Automatisches Indexieren : ein Lehr- und Arbeitsbuch (2011) 0.12
    0.120881714 = sum of:
      0.120881714 = product of:
        0.75551075 = sum of:
          0.035912324 = weight(abstract_txt:über in 3550) [ClassicSimilarity], result of:
            0.035912324 = score(doc=3550,freq=2.0), product of:
              0.08201039 = queryWeight, product of:
                1.0302434 = boost
                3.9634154 = idf(docFreq=2293, maxDocs=44421)
                0.020084428 = queryNorm
              0.43789968 = fieldWeight in 3550, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9634154 = idf(docFreq=2293, maxDocs=44421)
                0.078125 = fieldNorm(doc=3550)
          0.10300314 = weight(abstract_txt:indexieren in 3550) [ClassicSimilarity], result of:
            0.10300314 = score(doc=3550,freq=1.0), product of:
              0.1655542 = queryWeight, product of:
                1.035048 = boost
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.020084428 = queryNorm
              0.6221717 = fieldWeight in 3550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.078125 = fieldNorm(doc=3550)
          0.03591043 = weight(abstract_txt:können in 3550) [ClassicSimilarity], result of:
            0.03591043 = score(doc=3550,freq=1.0), product of:
              0.10332298 = queryWeight, product of:
                1.1563888 = boost
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.020084428 = queryNorm
              0.3475551 = fieldWeight in 3550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.078125 = fieldNorm(doc=3550)
          0.58068484 = weight(title_txt:automatisches in 3550) [ClassicSimilarity], result of:
            0.58068484 = score(doc=3550,freq=1.0), product of:
              0.20811029 = queryWeight, product of:
                1.1604787 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.020084428 = queryNorm
              2.7902746 = fieldWeight in 3550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.3125 = fieldNorm(doc=3550)
        0.16 = coord(4/25)
    
  5. Oberhauser, O.: Automatisches Klassifizieren : Verfahren zur Erschließung elektronischer Dokumente (2004) 0.12
    0.11695769 = sum of:
      0.11695769 = product of:
        0.9746474 = sum of:
          0.6968218 = weight(title_txt:automatisches in 3487) [ClassicSimilarity], result of:
            0.6968218 = score(doc=3487,freq=1.0), product of:
              0.20811029 = queryWeight, product of:
                1.1604787 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.020084428 = queryNorm
              3.3483295 = fieldWeight in 3487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.375 = fieldNorm(doc=3487)
          0.17985506 = weight(abstract_txt:klassifizieren in 3487) [ClassicSimilarity], result of:
            0.17985506 = score(doc=3487,freq=3.0), product of:
              0.21112965 = queryWeight, product of:
                1.1688668 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.020084428 = queryNorm
              0.85187024 = fieldWeight in 3487, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3487)
          0.09797055 = weight(abstract_txt:relevanten in 3487) [ClassicSimilarity], result of:
            0.09797055 = score(doc=3487,freq=1.0), product of:
              0.255887 = queryWeight, product of:
                1.8198245 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.020084428 = queryNorm
              0.38286647 = fieldWeight in 3487, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3487)
        0.12 = coord(3/25)