Document (#34818)

Author
Puzicha, J.
Title
Informationen finden! : Intelligente Suchmaschinentechnologie & automatische Kategorisierung
Imprint
Rheinbach : recommind
Year
2007
Pages
15 S
Abstract
Wie in diesem Text erläutert wurde, ist die Effektivität von Such- und Klassifizierungssystemen durch folgendes bestimmt: 1) den Arbeitsauftrag, 2) die Genauigkeit des Systems, 3) den zu erreichenden Automatisierungsgrad, 4) die Einfachheit der Integration in bereits vorhandene Systeme. Diese Kriterien gehen davon aus, dass jedes System, unabhängig von der Technologie, in der Lage ist, Grundvoraussetzungen des Produkts in Bezug auf Funktionalität, Skalierbarkeit und Input-Methode zu erfüllen. Diese Produkteigenschaften sind in der Recommind Produktliteratur genauer erläutert. Von diesen Fähigkeiten ausgehend sollte die vorhergehende Diskussion jedoch einige klare Trends aufgezeigt haben. Es ist nicht überraschend, dass jüngere Entwicklungen im Maschine Learning und anderen Bereichen der Informatik einen theoretischen Ausgangspunkt für die Entwicklung von Suchmaschinen- und Klassifizierungstechnologie haben. Besonders jüngste Fortschritte bei den statistischen Methoden (PLSA) und anderen mathematischen Werkzeugen (SVMs) haben eine Ergebnisqualität auf Durchbruchsniveau erreicht. Dazu kommt noch die Flexibilität in der Anwendung durch Selbsttraining und Kategorienerkennen von PLSA-Systemen, wie auch eine neue Generation von vorher unerreichten Produktivitätsverbesserungen.
Content
Technical Whitepaper - Grundlagen der Informationsgewinnung
Footnote
Vgl. auch: http://www.recommind.de/?id=mindserver_categorization.
Theme
Automatisches Klassifizieren
Object
Latent Semantic Indexing

Similar documents (content)

  1. Dueck, G.: Wild duck : Empirische Philosophie der Mensch-Computer-Vernetzung (2004) 0.10
    0.1009424 = sum of:
      0.1009424 = product of:
        0.3605086 = sum of:
          0.061546203 = weight(abstract_txt:mathematischen in 1653) [ClassicSimilarity], result of:
            0.061546203 = score(doc=1653,freq=1.0), product of:
              0.1685596 = queryWeight, product of:
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.021639489 = queryNorm
              0.36513022 = fieldWeight in 1653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.046875 = fieldNorm(doc=1653)
          0.019939013 = weight(abstract_txt:durch in 1653) [ClassicSimilarity], result of:
            0.019939013 = score(doc=1653,freq=1.0), product of:
              0.10017632 = queryWeight, product of:
                1.0902367 = boost
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.021639489 = queryNorm
              0.19903918 = fieldWeight in 1653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.046875 = fieldNorm(doc=1653)
          0.03673301 = weight(abstract_txt:diese in 1653) [ClassicSimilarity], result of:
            0.03673301 = score(doc=1653,freq=3.0), product of:
              0.10438223 = queryWeight, product of:
                1.1128882 = boost
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.021639489 = queryNorm
              0.35190865 = fieldWeight in 1653, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.046875 = fieldNorm(doc=1653)
          0.04163252 = weight(abstract_txt:dass in 1653) [ClassicSimilarity], result of:
            0.04163252 = score(doc=1653,freq=3.0), product of:
              0.113469034 = queryWeight, product of:
                1.1603178 = boost
                4.5191154 = idf(docFreq=1315, maxDocs=44421)
                0.021639489 = queryNorm
              0.36690643 = fieldWeight in 1653, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5191154 = idf(docFreq=1315, maxDocs=44421)
                0.046875 = fieldNorm(doc=1653)
          0.0969199 = weight(abstract_txt:produkts in 1653) [ClassicSimilarity], result of:
            0.0969199 = score(doc=1653,freq=1.0), product of:
              0.22815393 = queryWeight, product of:
                1.1634219 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.021639489 = queryNorm
              0.4248005 = fieldWeight in 1653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.046875 = fieldNorm(doc=1653)
          0.06415886 = weight(abstract_txt:anderen in 1653) [ClassicSimilarity], result of:
            0.06415886 = score(doc=1653,freq=3.0), product of:
              0.15138865 = queryWeight, product of:
                1.3402472 = boost
                5.2198906 = idf(docFreq=652, maxDocs=44421)
                0.021639489 = queryNorm
              0.42380232 = fieldWeight in 1653, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.2198906 = idf(docFreq=652, maxDocs=44421)
                0.046875 = fieldNorm(doc=1653)
          0.0395791 = weight(abstract_txt:haben in 1653) [ClassicSimilarity], result of:
            0.0395791 = score(doc=1653,freq=1.0), product of:
              0.18112165 = queryWeight, product of:
                1.7954324 = boost
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.021639489 = queryNorm
              0.2185222 = fieldWeight in 1653, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.046875 = fieldNorm(doc=1653)
        0.28 = coord(7/25)
    
  2. Burblies, C.; Wolff, J.E.: Vascoda - Effiziente Vermittlung wissenschaftlicher information (2009) 0.09
    0.089356385 = sum of:
      0.089356385 = product of:
        0.44678193 = sum of:
          0.03289769 = weight(abstract_txt:durch in 3783) [ClassicSimilarity], result of:
            0.03289769 = score(doc=3783,freq=2.0), product of:
              0.10017632 = queryWeight, product of:
                1.0902367 = boost
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.021639489 = queryNorm
              0.32839787 = fieldWeight in 3783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3783)
          0.1371568 = weight(abstract_txt:suchmaschinentechnologie in 3783) [ClassicSimilarity], result of:
            0.1371568 = score(doc=3783,freq=2.0), product of:
              0.20596322 = queryWeight, product of:
                1.1053965 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.021639489 = queryNorm
              0.6659286 = fieldWeight in 3783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3783)
          0.16820942 = weight(abstract_txt:einfachheit in 3783) [ClassicSimilarity], result of:
            0.16820942 = score(doc=3783,freq=2.0), product of:
              0.23598172 = queryWeight, product of:
                1.1832117 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.021639489 = queryNorm
              0.712807 = fieldWeight in 3783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3783)
          0.043215822 = weight(abstract_txt:anderen in 3783) [ClassicSimilarity], result of:
            0.043215822 = score(doc=3783,freq=1.0), product of:
              0.15138865 = queryWeight, product of:
                1.3402472 = boost
                5.2198906 = idf(docFreq=652, maxDocs=44421)
                0.021639489 = queryNorm
              0.28546277 = fieldWeight in 3783, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2198906 = idf(docFreq=652, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3783)
          0.065302186 = weight(abstract_txt:haben in 3783) [ClassicSimilarity], result of:
            0.065302186 = score(doc=3783,freq=2.0), product of:
              0.18112165 = queryWeight, product of:
                1.7954324 = boost
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.021639489 = queryNorm
              0.36054325 = fieldWeight in 3783, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3783)
        0.2 = coord(5/25)
    
  3. Raicher, E.: Möglichkeiten und Grenzen von Primo bei der Einführung in deutschsprachigen Bibliotheken und Bibliotheksverbünden (2010) 0.08
    0.08476459 = sum of:
      0.08476459 = product of:
        0.30273068 = sum of:
          0.03759736 = weight(abstract_txt:durch in 311) [ClassicSimilarity], result of:
            0.03759736 = score(doc=311,freq=8.0), product of:
              0.10017632 = queryWeight, product of:
                1.0902367 = boost
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.021639489 = queryNorm
              0.37531185 = fieldWeight in 311, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.03125 = fieldNorm(doc=311)
          0.055419717 = weight(abstract_txt:suchmaschinentechnologie in 311) [ClassicSimilarity], result of:
            0.055419717 = score(doc=311,freq=1.0), product of:
              0.20596322 = queryWeight, product of:
                1.1053965 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.021639489 = queryNorm
              0.26907578 = fieldWeight in 311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.03125 = fieldNorm(doc=311)
          0.056322843 = weight(abstract_txt:überraschend in 311) [ClassicSimilarity], result of:
            0.056322843 = score(doc=311,freq=1.0), product of:
              0.20819479 = queryWeight, product of:
                1.1113688 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.021639489 = queryNorm
              0.27052954 = fieldWeight in 311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.03125 = fieldNorm(doc=311)
          0.019994918 = weight(abstract_txt:diese in 311) [ClassicSimilarity], result of:
            0.019994918 = score(doc=311,freq=2.0), product of:
              0.10438223 = queryWeight, product of:
                1.1128882 = boost
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.021639489 = queryNorm
              0.19155481 = fieldWeight in 311, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.03125 = fieldNorm(doc=311)
          0.042396482 = weight(abstract_txt:dass in 311) [ClassicSimilarity], result of:
            0.042396482 = score(doc=311,freq=7.0), product of:
              0.113469034 = queryWeight, product of:
                1.1603178 = boost
                4.5191154 = idf(docFreq=1315, maxDocs=44421)
                0.021639489 = queryNorm
              0.37363923 = fieldWeight in 311, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.5191154 = idf(docFreq=1315, maxDocs=44421)
                0.03125 = fieldNorm(doc=311)
          0.06461327 = weight(abstract_txt:produkts in 311) [ClassicSimilarity], result of:
            0.06461327 = score(doc=311,freq=1.0), product of:
              0.22815393 = queryWeight, product of:
                1.1634219 = boost
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.021639489 = queryNorm
              0.28320032 = fieldWeight in 311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.06241 = idf(docFreq=13, maxDocs=44421)
                0.03125 = fieldNorm(doc=311)
          0.026386067 = weight(abstract_txt:haben in 311) [ClassicSimilarity], result of:
            0.026386067 = score(doc=311,freq=1.0), product of:
              0.18112165 = queryWeight, product of:
                1.7954324 = boost
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.021639489 = queryNorm
              0.14568147 = fieldWeight in 311, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.03125 = fieldNorm(doc=311)
        0.28 = coord(7/25)
    
  4. Weishaupt, K.: Alephino : ein neues Bibliothekssystem für kleine und mittlere Bibliotheken (2004) 0.08
    0.084469736 = sum of:
      0.084469736 = product of:
        0.35195723 = sum of:
          0.07413453 = weight(abstract_txt:funktionalität in 3286) [ClassicSimilarity], result of:
            0.07413453 = score(doc=3286,freq=1.0), product of:
              0.17218757 = queryWeight, product of:
                1.0107044 = boost
                7.872826 = idf(docFreq=45, maxDocs=44421)
                0.021639489 = queryNorm
              0.43054518 = fieldWeight in 3286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.872826 = idf(docFreq=45, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3286)
          0.09022763 = weight(abstract_txt:vorher in 3286) [ClassicSimilarity], result of:
            0.09022763 = score(doc=3286,freq=1.0), product of:
              0.19628231 = queryWeight, product of:
                1.0791054 = boost
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.021639489 = queryNorm
              0.45968294 = fieldWeight in 3286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3286)
          0.02474245 = weight(abstract_txt:diese in 3286) [ClassicSimilarity], result of:
            0.02474245 = score(doc=3286,freq=1.0), product of:
              0.10438223 = queryWeight, product of:
                1.1128882 = boost
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.021639489 = queryNorm
              0.23703699 = fieldWeight in 3286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3286)
          0.03965828 = weight(abstract_txt:dass in 3286) [ClassicSimilarity], result of:
            0.03965828 = score(doc=3286,freq=2.0), product of:
              0.113469034 = queryWeight, product of:
                1.1603178 = boost
                4.5191154 = idf(docFreq=1315, maxDocs=44421)
                0.021639489 = queryNorm
              0.34950748 = fieldWeight in 3286, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5191154 = idf(docFreq=1315, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3286)
          0.043215822 = weight(abstract_txt:anderen in 3286) [ClassicSimilarity], result of:
            0.043215822 = score(doc=3286,freq=1.0), product of:
              0.15138865 = queryWeight, product of:
                1.3402472 = boost
                5.2198906 = idf(docFreq=652, maxDocs=44421)
                0.021639489 = queryNorm
              0.28546277 = fieldWeight in 3286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2198906 = idf(docFreq=652, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3286)
          0.07997852 = weight(abstract_txt:haben in 3286) [ClassicSimilarity], result of:
            0.07997852 = score(doc=3286,freq=3.0), product of:
              0.18112165 = queryWeight, product of:
                1.7954324 = boost
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.021639489 = queryNorm
              0.4415735 = fieldWeight in 3286, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3286)
        0.24 = coord(6/25)
    
  5. Donsbach, W.: Wahrheit in den Medien : über den Sinn eines methodischen Objektivitätsbegriffes (2001) 0.08
    0.07542853 = sum of:
      0.07542853 = product of:
        0.31428555 = sum of:
          0.10451247 = weight(abstract_txt:klare in 895) [ClassicSimilarity], result of:
            0.10451247 = score(doc=895,freq=1.0), product of:
              0.1980488 = queryWeight, product of:
                1.0839503 = boost
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.021639489 = queryNorm
              0.5277107 = fieldWeight in 895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.443371 = idf(docFreq=25, maxDocs=44421)
                0.0625 = fieldNorm(doc=895)
          0.02658535 = weight(abstract_txt:durch in 895) [ClassicSimilarity], result of:
            0.02658535 = score(doc=895,freq=1.0), product of:
              0.10017632 = queryWeight, product of:
                1.0902367 = boost
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.021639489 = queryNorm
              0.26538557 = fieldWeight in 895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.246169 = idf(docFreq=1728, maxDocs=44421)
                0.0625 = fieldNorm(doc=895)
          0.04897735 = weight(abstract_txt:diese in 895) [ClassicSimilarity], result of:
            0.04897735 = score(doc=895,freq=3.0), product of:
              0.10438223 = queryWeight, product of:
                1.1128882 = boost
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.021639489 = queryNorm
              0.46921155 = fieldWeight in 895, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.0625 = fieldNorm(doc=895)
          0.03204873 = weight(abstract_txt:dass in 895) [ClassicSimilarity], result of:
            0.03204873 = score(doc=895,freq=1.0), product of:
              0.113469034 = queryWeight, product of:
                1.1603178 = boost
                4.5191154 = idf(docFreq=1315, maxDocs=44421)
                0.021639489 = queryNorm
              0.28244472 = fieldWeight in 895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5191154 = idf(docFreq=1315, maxDocs=44421)
                0.0625 = fieldNorm(doc=895)
          0.04938951 = weight(abstract_txt:anderen in 895) [ClassicSimilarity], result of:
            0.04938951 = score(doc=895,freq=1.0), product of:
              0.15138865 = queryWeight, product of:
                1.3402472 = boost
                5.2198906 = idf(docFreq=652, maxDocs=44421)
                0.021639489 = queryNorm
              0.32624316 = fieldWeight in 895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2198906 = idf(docFreq=652, maxDocs=44421)
                0.0625 = fieldNorm(doc=895)
          0.052772135 = weight(abstract_txt:haben in 895) [ClassicSimilarity], result of:
            0.052772135 = score(doc=895,freq=1.0), product of:
              0.18112165 = queryWeight, product of:
                1.7954324 = boost
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.021639489 = queryNorm
              0.29136294 = fieldWeight in 895, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.0625 = fieldNorm(doc=895)
        0.24 = coord(6/25)