Document (#2323)

Author
Panyr, J.
Title
Vektorraum-Modell und Clusteranalyse in Information-Retrieval-Systemen
Source
Nachrichten für Dokumentation. 38(1987) no. 1, pp. 13-20
Year
1987
Abstract
Starting from theoretical approaches to indexing, the classical vector space model for automatic indexing (together with the term discrimination value model) is explained. Clustering in information retrieval systems is treated as a natural logical consequence of this model and is covered in all its forms (i.e., as document classification, term classification, or combined document and term classification). The search strategies for pre-classified document collections (cluster search) are then described in detail. Finally, the sensible application of cluster analysis in information retrieval systems is briefly discussed.
Theme
Automatic indexing
Automatic classification

Similar documents (author)

  1. Panyr, J.: Thesaurus und wissensbasierte Systeme - Thesauri und Wissensbasen (1988) 5.47
    5.47028 = sum of:
      5.47028 = weight(author_txt:panyr in 22) [ClassicSimilarity], result of:
        5.47028 = fieldWeight in 22, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.752448 = idf(docFreq=18, maxDocs=44218)
          0.625 = fieldNorm(doc=22)
    
  2. Panyr, J.: Information-Retrieval-Methoden in regelbasierten Expertensystemen (1990) 5.47
    5.47028 = sum of:
      5.47028 = weight(author_txt:panyr in 260) [ClassicSimilarity], result of:
        5.47028 = fieldWeight in 260, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.752448 = idf(docFreq=18, maxDocs=44218)
          0.625 = fieldNorm(doc=260)
    
  3. Panyr, J.: Vom Wissen zur Information : Notwendigkeit der Kooperation der Fachleute aus dem Bereich der Informations-Retrieval-Systeme und der Systeme mit formaler Intelligenz (1988) 5.47
    5.47028 = sum of:
      5.47028 = weight(author_txt:panyr in 768) [ClassicSimilarity], result of:
        5.47028 = fieldWeight in 768, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.752448 = idf(docFreq=18, maxDocs=44218)
          0.625 = fieldNorm(doc=768)
    
  4. Panyr, J.: Die Theorie der Fuzzy-Mengen und Information-Retrieval-Systeme (1986) 5.47
    5.47028 = sum of:
      5.47028 = weight(author_txt:panyr in 788) [ClassicSimilarity], result of:
        5.47028 = fieldWeight in 788, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.752448 = idf(docFreq=18, maxDocs=44218)
          0.625 = fieldNorm(doc=788)
    
  5. Panyr, J.: Probabilistische Modelle in Information-Retrieval-Systemen (1986) 5.47
    5.47028 = sum of:
      5.47028 = weight(author_txt:panyr in 1460) [ClassicSimilarity], result of:
        5.47028 = fieldWeight in 1460, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.752448 = idf(docFreq=18, maxDocs=44218)
          0.625 = fieldNorm(doc=1460)
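
The author-match scores above all follow the same pattern: a single-term field weight computed as tf × idf × fieldNorm. The following is a minimal sketch of that arithmetic, assuming the square-root term frequency and the `1 + ln(maxDocs / (docFreq + 1))` inverse document frequency that reproduce the listed values; the function names are illustrative and are not part of any library API.

```python
import math


def tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency (assumed form; it matches the listing)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain tree above."""
    return tf(freq) * idf(doc_freq, max_docs) * field_norm


# Values taken from the author_txt:panyr entries in the listing.
score = field_weight(freq=1.0, doc_freq=18, max_docs=44218, field_norm=0.625)
print(round(score, 5))  # the listing reports 5.47028
```

Because the term, its document frequency, and the field norm are identical for all five records, the five scores are identical as well.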
    

Similar documents (content)

  1. Kaiser, A.: Computer-unterstütztes Indexieren in Intelligenten Information Retrieval Systemen : Ein Relevanz-Feedback orientierter Ansatz zur Informationserschließung in unformatierten Datenbanken (1993) 0.19
    0.18797262 = sum of:
      0.18797262 = product of:
        0.52214617 = sum of:
          0.022122268 = weight(abstract_txt:behandelt in 4284) [ClassicSimilarity], result of:
            0.022122268 = score(doc=4284,freq=1.0), product of:
              0.11403323 = queryWeight, product of:
                6.2079496 = idf(docFreq=241, maxDocs=44218)
                0.018368904 = queryNorm
              0.19399843 = fieldWeight in 4284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2079496 = idf(docFreq=241, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.069820665 = weight(abstract_txt:indexierung in 4284) [ClassicSimilarity], result of:
            0.069820665 = score(doc=4284,freq=6.0), product of:
              0.13502595 = queryWeight, product of:
                1.0881604 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.018368904 = queryNorm
              0.51709074 = fieldWeight in 4284, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.030611333 = weight(abstract_txt:automatische in 4284) [ClassicSimilarity], result of:
            0.030611333 = score(doc=4284,freq=1.0), product of:
              0.14160106 = queryWeight, product of:
                1.1143396 = boost
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.018368904 = queryNorm
              0.21618012 = fieldWeight in 4284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9177637 = idf(docFreq=118, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.014191671 = weight(abstract_txt:information in 4284) [ClassicSimilarity], result of:
            0.014191671 = score(doc=4284,freq=13.0), product of:
              0.052026745 = queryWeight, product of:
                1.1699256 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.018368904 = queryNorm
              0.27277645 = fieldWeight in 4284, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.0329282 = weight(abstract_txt:retrieval in 4284) [ClassicSimilarity], result of:
            0.0329282 = score(doc=4284,freq=8.0), product of:
              0.10720147 = queryWeight, product of:
                1.6793658 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.018368904 = queryNorm
              0.3071618 = fieldWeight in 4284, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.033320602 = weight(abstract_txt:wird in 4284) [ClassicSimilarity], result of:
            0.033320602 = score(doc=4284,freq=5.0), product of:
              0.12637775 = queryWeight, product of:
                1.8233927 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.018368904 = queryNorm
              0.26365876 = fieldWeight in 4284, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.11839232 = weight(abstract_txt:systemen in 4284) [ClassicSimilarity], result of:
            0.11839232 = score(doc=4284,freq=6.0), product of:
              0.24190988 = queryWeight, product of:
                2.059805 = boost
                6.3935823 = idf(docFreq=200, maxDocs=44218)
                0.018368904 = queryNorm
              0.4894067 = fieldWeight in 4284, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.3935823 = idf(docFreq=200, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.09900013 = weight(abstract_txt:dokumenten in 4284) [ClassicSimilarity], result of:
            0.09900013 = score(doc=4284,freq=4.0), product of:
              0.24578696 = queryWeight, product of:
                2.0762455 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.018368904 = queryNorm
              0.40278837 = fieldWeight in 4284, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
          0.10175898 = weight(abstract_txt:modell in 4284) [ClassicSimilarity], result of:
            0.10175898 = score(doc=4284,freq=1.0), product of:
              0.50066453 = queryWeight, product of:
                4.190711 = boost
                6.5039306 = idf(docFreq=179, maxDocs=44218)
                0.018368904 = queryNorm
              0.20324783 = fieldWeight in 4284, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5039306 = idf(docFreq=179, maxDocs=44218)
                0.03125 = fieldNorm(doc=4284)
        0.36 = coord(9/25)
    
  2. Panyr, J.: Probabilistische Modelle in Information-Retrieval-Systemen (1986) 0.19
    0.1860524 = sum of:
      0.1860524 = product of:
        0.7752183 = sum of:
          0.09165409 = weight(abstract_txt:kurz in 1460) [ClassicSimilarity], result of:
            0.09165409 = score(doc=1460,freq=1.0), product of:
              0.12760496 = queryWeight, product of:
                1.0578353 = boost
                6.5669885 = idf(docFreq=168, maxDocs=44218)
                0.018368904 = queryNorm
              0.71826434 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5669885 = idf(docFreq=168, maxDocs=44218)
                0.109375 = fieldNorm(doc=1460)
          0.09976458 = weight(abstract_txt:indexierung in 1460) [ClassicSimilarity], result of:
            0.09976458 = score(doc=1460,freq=1.0), product of:
              0.13502595 = queryWeight, product of:
                1.0881604 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.018368904 = queryNorm
              0.7388549 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.109375 = fieldNorm(doc=1460)
          0.10205446 = weight(abstract_txt:ausgehend in 1460) [ClassicSimilarity], result of:
            0.10205446 = score(doc=1460,freq=1.0), product of:
              0.13708428 = queryWeight, product of:
                1.0964229 = boost
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.018368904 = queryNorm
              0.7444651 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.109375 = fieldNorm(doc=1460)
          0.15635827 = weight(abstract_txt:detailliert in 1460) [ClassicSimilarity], result of:
            0.15635827 = score(doc=1460,freq=1.0), product of:
              0.1821854 = queryWeight, product of:
                1.2639825 = boost
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.018368904 = queryNorm
              0.85823715 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.109375 = fieldNorm(doc=1460)
          0.23505183 = weight(abstract_txt:schluß in 1460) [ClassicSimilarity], result of:
            0.23505183 = score(doc=1460,freq=1.0), product of:
              0.23907934 = queryWeight, product of:
                1.4479558 = boost
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.018368904 = queryNorm
              0.98315406 = fieldWeight in 1460, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.988837 = idf(docFreq=14, maxDocs=44218)
                0.109375 = fieldNorm(doc=1460)
          0.090335086 = weight(abstract_txt:wird in 1460) [ClassicSimilarity], result of:
            0.090335086 = score(doc=1460,freq=3.0), product of:
              0.12637775 = queryWeight, product of:
                1.8233927 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.018368904 = queryNorm
              0.71480215 = fieldWeight in 1460, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.109375 = fieldNorm(doc=1460)
        0.24 = coord(6/25)
    
  3. Fuhr, N.: Theorie des Information Retrieval I : Modelle (2004) 0.15
    0.14609274 = sum of:
      0.14609274 = product of:
        0.60871977 = sum of:
          0.057008334 = weight(abstract_txt:indexierung in 2912) [ClassicSimilarity], result of:
            0.057008334 = score(doc=2912,freq=1.0), product of:
              0.13502595 = queryWeight, product of:
                1.0881604 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.018368904 = queryNorm
              0.4222028 = fieldWeight in 2912, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.0625 = fieldNorm(doc=2912)
          0.007872122 = weight(abstract_txt:information in 2912) [ClassicSimilarity], result of:
            0.007872122 = score(doc=2912,freq=1.0), product of:
              0.052026745 = queryWeight, product of:
                1.1699256 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.018368904 = queryNorm
              0.15130915 = fieldWeight in 2912, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.0625 = fieldNorm(doc=2912)
          0.0329282 = weight(abstract_txt:retrieval in 2912) [ClassicSimilarity], result of:
            0.0329282 = score(doc=2912,freq=2.0), product of:
              0.10720147 = queryWeight, product of:
                1.6793658 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.018368904 = queryNorm
              0.3071618 = fieldWeight in 2912, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.0625 = fieldNorm(doc=2912)
          0.05162005 = weight(abstract_txt:wird in 2912) [ClassicSimilarity], result of:
            0.05162005 = score(doc=2912,freq=3.0), product of:
              0.12637775 = queryWeight, product of:
                1.8233927 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.018368904 = queryNorm
              0.40845838 = fieldWeight in 2912, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.0625 = fieldNorm(doc=2912)
          0.17147325 = weight(abstract_txt:dokumenten in 2912) [ClassicSimilarity], result of:
            0.17147325 = score(doc=2912,freq=3.0), product of:
              0.24578696 = queryWeight, product of:
                2.0762455 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.018368904 = queryNorm
              0.6976499 = fieldWeight in 2912, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.0625 = fieldNorm(doc=2912)
          0.28781784 = weight(abstract_txt:modell in 2912) [ClassicSimilarity], result of:
            0.28781784 = score(doc=2912,freq=2.0), product of:
              0.50066453 = queryWeight, product of:
                4.190711 = boost
                6.5039306 = idf(docFreq=179, maxDocs=44218)
                0.018368904 = queryNorm
              0.57487166 = fieldWeight in 2912, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5039306 = idf(docFreq=179, maxDocs=44218)
                0.0625 = fieldNorm(doc=2912)
        0.24 = coord(6/25)
    
  4. Siebenlist, T.: MEMOSE. Spezialsuchmaschine für emotional geladene Dokumente (2012) 0.13
    0.12652507 = sum of:
      0.12652507 = product of:
        0.5271878 = sum of:
          0.07856066 = weight(abstract_txt:kurz in 175) [ClassicSimilarity], result of:
            0.07856066 = score(doc=175,freq=1.0), product of:
              0.12760496 = queryWeight, product of:
                1.0578353 = boost
                6.5669885 = idf(docFreq=168, maxDocs=44218)
                0.018368904 = queryNorm
              0.6156552 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5669885 = idf(docFreq=168, maxDocs=44218)
                0.09375 = fieldNorm(doc=175)
          0.0855125 = weight(abstract_txt:indexierung in 175) [ClassicSimilarity], result of:
            0.0855125 = score(doc=175,freq=1.0), product of:
              0.13502595 = queryWeight, product of:
                1.0881604 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.018368904 = queryNorm
              0.6333042 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.09375 = fieldNorm(doc=175)
          0.011808184 = weight(abstract_txt:information in 175) [ClassicSimilarity], result of:
            0.011808184 = score(doc=175,freq=1.0), product of:
              0.052026745 = queryWeight, product of:
                1.1699256 = boost
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.018368904 = queryNorm
              0.22696373 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.4209464 = idf(docFreq=10677, maxDocs=44218)
                0.09375 = fieldNorm(doc=175)
          0.049392298 = weight(abstract_txt:retrieval in 175) [ClassicSimilarity], result of:
            0.049392298 = score(doc=175,freq=2.0), product of:
              0.10720147 = queryWeight, product of:
                1.6793658 = boost
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.018368904 = queryNorm
              0.4607427 = fieldWeight in 175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4751394 = idf(docFreq=3720, maxDocs=44218)
                0.09375 = fieldNorm(doc=175)
          0.044704273 = weight(abstract_txt:wird in 175) [ClassicSimilarity], result of:
            0.044704273 = score(doc=175,freq=1.0), product of:
              0.12637775 = queryWeight, product of:
                1.8233927 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.018368904 = queryNorm
              0.35373533 = fieldWeight in 175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.09375 = fieldNorm(doc=175)
          0.25720987 = weight(abstract_txt:dokumenten in 175) [ClassicSimilarity], result of:
            0.25720987 = score(doc=175,freq=3.0), product of:
              0.24578696 = queryWeight, product of:
                2.0762455 = boost
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.018368904 = queryNorm
              1.0464748 = fieldWeight in 175, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.444614 = idf(docFreq=190, maxDocs=44218)
                0.09375 = fieldNorm(doc=175)
        0.24 = coord(6/25)
    
  5. Markscheffel, B.: Eine Entwurfsmethodik für Hypermedia-Systeme auf Basis des Spatial-Satellite-Modells S**2M (1993) 0.12
    0.12218515 = sum of:
      0.12218515 = product of:
        0.7636572 = sum of:
          0.12370869 = weight(abstract_txt:ausgehend in 2244) [ClassicSimilarity], result of:
            0.12370869 = score(doc=2244,freq=2.0), product of:
              0.13708428 = queryWeight, product of:
                1.0964229 = boost
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.018368904 = queryNorm
              0.902428 = fieldWeight in 2244, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.806538 = idf(docFreq=132, maxDocs=44218)
                0.09375 = fieldNorm(doc=2244)
          0.06322139 = weight(abstract_txt:wird in 2244) [ClassicSimilarity], result of:
            0.06322139 = score(doc=2244,freq=2.0), product of:
              0.12637775 = queryWeight, product of:
                1.8233927 = boost
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.018368904 = queryNorm
              0.50025725 = fieldWeight in 2244, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.773177 = idf(docFreq=2761, maxDocs=44218)
                0.09375 = fieldNorm(doc=2244)
          0.14500038 = weight(abstract_txt:systemen in 2244) [ClassicSimilarity], result of:
            0.14500038 = score(doc=2244,freq=1.0), product of:
              0.24190988 = queryWeight, product of:
                2.059805 = boost
                6.3935823 = idf(docFreq=200, maxDocs=44218)
                0.018368904 = queryNorm
              0.5993984 = fieldWeight in 2244, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3935823 = idf(docFreq=200, maxDocs=44218)
                0.09375 = fieldNorm(doc=2244)
          0.43172678 = weight(abstract_txt:modell in 2244) [ClassicSimilarity], result of:
            0.43172678 = score(doc=2244,freq=2.0), product of:
              0.50066453 = queryWeight, product of:
                4.190711 = boost
                6.5039306 = idf(docFreq=179, maxDocs=44218)
                0.018368904 = queryNorm
              0.8623075 = fieldWeight in 2244, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5039306 = idf(docFreq=179, maxDocs=44218)
                0.09375 = fieldNorm(doc=2244)
        0.16 = coord(4/25)
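
The content-based scores combine several term matches: each term contributes queryWeight × fieldWeight, the contributions are summed, and the sum is scaled by the coordination factor (matching terms divided by the 25 query terms). The sketch below recomputes the score of the last record (doc 2244) from the values in its explain tree. It assumes the same tf and idf forms as the listing appears to use; `term_score` and `document_score` are hypothetical helper names chosen for illustration.

```python
import math

MAX_DOCS = 44218          # maxDocs from the listing
QUERY_NORM = 0.018368904  # queryNorm from the listing
QUERY_TERMS = 25          # denominator of coord(n/25)


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency (assumed form; it matches the listing)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


def document_score(terms, field_norm: float, query_terms: int = QUERY_TERMS) -> float:
    """Sum of term scores scaled by coord = matching terms / query terms."""
    total = sum(term_score(f, df, b, field_norm) for f, df, b in terms)
    coord = len(terms) / query_terms
    return coord * total


# (freq, docFreq, boost) for ausgehend, wird, systemen, modell in doc 2244.
DOC_2244 = [
    (2.0, 132, 1.0964229),
    (2.0, 2761, 1.8233927),
    (1.0, 200, 2.059805),
    (2.0, 179, 4.190711),
]

score = document_score(DOC_2244, field_norm=0.09375)
print(round(score, 4))  # the listing reports 0.12218515
```

The coordination factor explains why a record with a few strong matches (here 4 of 25 terms) can rank below one with more but weaker matches.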