Document (#30981)

Author
Frobese, D.T.
Title
Klassifikationsaufgaben mit der SENTRAX : Konkreter Fall: Automatische Detektion von SPAM
Source
http://web1.bib.uni-hildesheim.de/edocs/2006/51992004X/doc/51992004X.pdf
Year
2006
Abstract
Die Suchfunktionen des SENTRAX-Verfahrens werden für die Klassifizierung von Mails und im Besonderen für die Detektion von SPAM eingesetzt. Die Eigenschaften einer kontextähnlichen Suche und die Fehlertoleranz sollen genutzt werden, um SPAM Nachrichten treffsicher aufzuspüren.
Footnote
Beitrag der Proceedings des Fünften Hildesheimer Evaluierungs- und Retrievalworkshop (HIER 2006), Hildesheim, xx.x.2006.
Theme
Computerlinguistik
Automatisches Klassifizieren
Object
SENTRAX

Similar documents (content)

  1. Goodman, J.; Heckerman, D.; Rounthwaite, R.: Schutzwälle gegen Spam (2005) 0.33
    0.3301693 = sum of:
      0.3301693 = product of:
        1.1886094 = sum of:
          0.010215961 = weight(abstract_txt:einer in 4696) [ClassicSimilarity], result of:
            0.010215961 = score(doc=4696,freq=1.0), product of:
              0.048113238 = queryWeight, product of:
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.012391903 = queryNorm
              0.21233161 = fieldWeight in 4696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4696)
          0.032623094 = weight(abstract_txt:sollen in 4696) [ClassicSimilarity], result of:
            0.032623094 = score(doc=4696,freq=1.0), product of:
              0.10433464 = queryWeight, product of:
                1.4725904 = boost
                5.717531 = idf(docFreq=396, maxDocs=44421)
                0.012391903 = queryNorm
              0.3126775 = fieldWeight in 4696, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.717531 = idf(docFreq=396, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4696)
          0.033691138 = weight(abstract_txt:werden in 4696) [ClassicSimilarity], result of:
            0.033691138 = score(doc=4696,freq=5.0), product of:
              0.07854325 = queryWeight, product of:
                1.8069125 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.012391903 = queryNorm
              0.42895013 = fieldWeight in 4696, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4696)
          0.19500068 = weight(abstract_txt:mails in 4696) [ClassicSimilarity], result of:
            0.19500068 = score(doc=4696,freq=4.0), product of:
              0.21647905 = queryWeight, product of:
                2.1211708 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.012391903 = queryNorm
              0.9007832 = fieldWeight in 4696, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4696)
          0.91707844 = weight(abstract_txt:spam in 4696) [ClassicSimilarity], result of:
            0.91707844 = score(doc=4696,freq=8.0), product of:
              0.6956004 = queryWeight, product of:
                6.5857954 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.012391903 = queryNorm
              1.3183984 = fieldWeight in 4696, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4696)
        0.2777778 = coord(5/18)
    
  2. Krüger, A.: Angriffe aus dem Netz : die neue Szene des digitalen Verbrechens (2006) 0.16
    0.15559804 = sum of:
      0.15559804 = product of:
        0.93358827 = sum of:
          0.037281487 = weight(abstract_txt:werden in 266) [ClassicSimilarity], result of:
            0.037281487 = score(doc=266,freq=3.0), product of:
              0.07854325 = queryWeight, product of:
                1.8069125 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.012391903 = queryNorm
              0.4746619 = fieldWeight in 266, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.078125 = fieldNorm(doc=266)
          0.24125078 = weight(abstract_txt:mails in 266) [ClassicSimilarity], result of:
            0.24125078 = score(doc=266,freq=3.0), product of:
              0.21647905 = queryWeight, product of:
                2.1211708 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.012391903 = queryNorm
              1.1144302 = fieldWeight in 266, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.078125 = fieldNorm(doc=266)
          0.655056 = weight(abstract_txt:spam in 266) [ClassicSimilarity], result of:
            0.655056 = score(doc=266,freq=2.0), product of:
              0.6956004 = queryWeight, product of:
                6.5857954 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.012391903 = queryNorm
              0.9417131 = fieldWeight in 266, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.078125 = fieldNorm(doc=266)
        0.16666667 = coord(3/18)
    
  3. Krüger, K.: Suchmaschinen-Spamming : Vergleichend-kritische Analysen zur Wirkung kommerzieller Strategien der Website-Optimierung auf das Ranking in www-Suchmaschinen (2004) 0.15
    0.15034735 = sum of:
      0.15034735 = product of:
        0.9020841 = sum of:
          0.079488665 = weight(abstract_txt:eingesetzt in 4700) [ClassicSimilarity], result of:
            0.079488665 = score(doc=4700,freq=1.0), product of:
              0.13189442 = queryWeight, product of:
                1.6556972 = boost
                6.428468 = idf(docFreq=194, maxDocs=44421)
                0.012391903 = queryNorm
              0.6026689 = fieldWeight in 4700, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.428468 = idf(docFreq=194, maxDocs=44421)
                0.09375 = fieldNorm(doc=4700)
          0.03652825 = weight(abstract_txt:werden in 4700) [ClassicSimilarity], result of:
            0.03652825 = score(doc=4700,freq=2.0), product of:
              0.07854325 = queryWeight, product of:
                1.8069125 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.012391903 = queryNorm
              0.46507174 = fieldWeight in 4700, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.09375 = fieldNorm(doc=4700)
          0.7860672 = weight(abstract_txt:spam in 4700) [ClassicSimilarity], result of:
            0.7860672 = score(doc=4700,freq=2.0), product of:
              0.6956004 = queryWeight, product of:
                6.5857954 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.012391903 = queryNorm
              1.1300557 = fieldWeight in 4700, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.09375 = fieldNorm(doc=4700)
        0.16666667 = coord(3/18)
    
  4. Terliesner, J.: Information Retrieval in Wikis : wie können klassische Methoden des Information Retrievals die Suchfunktion eines Wikis bereichern? (2010) 0.15
    0.14510047 = sum of:
      0.14510047 = product of:
        0.43530142 = sum of:
          0.01459423 = weight(abstract_txt:einer in 491) [ClassicSimilarity], result of:
            0.01459423 = score(doc=491,freq=1.0), product of:
              0.048113238 = queryWeight, product of:
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.012391903 = queryNorm
              0.30333087 = fieldWeight in 491, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.078125 = fieldNorm(doc=491)
          0.046604417 = weight(abstract_txt:sollen in 491) [ClassicSimilarity], result of:
            0.046604417 = score(doc=491,freq=1.0), product of:
              0.10433464 = queryWeight, product of:
                1.4725904 = boost
                5.717531 = idf(docFreq=396, maxDocs=44421)
                0.012391903 = queryNorm
              0.44668213 = fieldWeight in 491, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.717531 = idf(docFreq=396, maxDocs=44421)
                0.078125 = fieldNorm(doc=491)
          0.06515628 = weight(abstract_txt:genutzt in 491) [ClassicSimilarity], result of:
            0.06515628 = score(doc=491,freq=1.0), product of:
              0.13045117 = queryWeight, product of:
                1.6466136 = boost
                6.3932 = idf(docFreq=201, maxDocs=44421)
                0.012391903 = queryNorm
              0.49946874 = fieldWeight in 491, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3932 = idf(docFreq=201, maxDocs=44421)
                0.078125 = fieldNorm(doc=491)
          0.052723993 = weight(abstract_txt:werden in 491) [ClassicSimilarity], result of:
            0.052723993 = score(doc=491,freq=6.0), product of:
              0.07854325 = queryWeight, product of:
                1.8069125 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.012391903 = queryNorm
              0.67127335 = fieldWeight in 491, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.078125 = fieldNorm(doc=491)
          0.09445345 = weight(abstract_txt:besonderen in 491) [ClassicSimilarity], result of:
            0.09445345 = score(doc=491,freq=1.0), product of:
              0.16709201 = queryWeight, product of:
                1.8635693 = boost
                7.2355595 = idf(docFreq=86, maxDocs=44421)
                0.012391903 = queryNorm
              0.56527805 = fieldWeight in 491, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2355595 = idf(docFreq=86, maxDocs=44421)
                0.078125 = fieldNorm(doc=491)
          0.16176906 = weight(abstract_txt:suchfunktionen in 491) [ClassicSimilarity], result of:
            0.16176906 = score(doc=491,freq=1.0), product of:
              0.23918876 = queryWeight, product of:
                2.2296572 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.012391903 = queryNorm
              0.67632383 = fieldWeight in 491, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.078125 = fieldNorm(doc=491)
        0.33333334 = coord(6/18)
    
  5. Sixtus, M.: Schlüssel gegen Spam : Yahoo macht seine Technik öffentlich, die gefälschte Mails erkennt - in Hoffnung dass sie zum Standard wird (2004) 0.13
    0.12563233 = sum of:
      0.12563233 = product of:
        0.75379395 = sum of:
          0.15131326 = weight(abstract_txt:nachrichten in 3217) [ClassicSimilarity], result of:
            0.15131326 = score(doc=3217,freq=2.0), product of:
              0.18157321 = queryWeight, product of:
                1.9426457 = boost
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.012391903 = queryNorm
              0.8333457 = fieldWeight in 3217, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.078125 = fieldNorm(doc=3217)
          0.1392862 = weight(abstract_txt:mails in 3217) [ClassicSimilarity], result of:
            0.1392862 = score(doc=3217,freq=1.0), product of:
              0.21647905 = queryWeight, product of:
                2.1211708 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.012391903 = queryNorm
              0.6434166 = fieldWeight in 3217, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.078125 = fieldNorm(doc=3217)
          0.46319452 = weight(abstract_txt:spam in 3217) [ClassicSimilarity], result of:
            0.46319452 = score(doc=3217,freq=1.0), product of:
              0.6956004 = queryWeight, product of:
                6.5857954 = boost
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.012391903 = queryNorm
              0.6658917 = fieldWeight in 3217, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.523414 = idf(docFreq=23, maxDocs=44421)
                0.078125 = fieldNorm(doc=3217)
        0.16666667 = coord(3/18)