Document (#26508)

Author
Peters, G.
Gaese, V.
Title
¬Das DocCat-System in der Textdokumentation von G+J
Source
Medien-Informationsmanagement: Archivarische, dokumentarische, betriebswirtschaftliche, rechtliche und Berufsbild-Aspekte. Hrsg.: Marianne Englert u.a
Imprint
Münster : LIT Verlag
Year
2003
Pages
S.123-133
Series
Beiträge zur Mediendokumentation; Bd.6
Abstract
Wir werden einmal die Grundlagen des Text-Mining-Systems bei IBM darstellen, dann werden wir das Projekt etwas umfangreicher und deutlicher darstellen, da kennen wir uns aus. Von daher haben wir zwei Teile, einmal Heidelberg, einmal Hamburg. Noch einmal zur Technologie. Text-Mining ist eine von IBM entwickelte Technologie, die in einer besonderen Ausformung und Programmierung für uns zusammengestellt wurde. Das Projekt hieß bei uns lange Zeit DocText Miner und heißt seit einiger Zeit auf Vorschlag von IBM DocCat, das soll eine Abkürzung für Document-Categoriser sein, sie ist ja auch nett und anschaulich. Wir fangen an mit Text-Mining, das bei IBM in Heidelberg entwickelt wurde. Die verstehen darunter das automatische Indexieren als eine Instanz, also einen Teil von Text-Mining. Probleme werden dabei gezeigt, und das Text-Mining ist eben eine Methode zur Strukturierung von und der Suche in großen Dokumentenmengen, die Extraktion von Informationen und, das ist der hohe Anspruch, von impliziten Zusammenhängen. Das letztere sei dahingestellt. IBM macht das quantitativ, empirisch, approximativ und schnell. das muss man wirklich sagen. Das Ziel, und das ist ganz wichtig für unser Projekt gewesen, ist nicht, den Text zu verstehen, sondern das Ergebnis dieser Verfahren ist, was sie auf Neudeutsch a bundle of words, a bag of words nennen, also eine Menge von bedeutungstragenden Begriffen aus einem Text zu extrahieren, aufgrund von Algorithmen, also im Wesentlichen aufgrund von Rechenoperationen. Es gibt eine ganze Menge von linguistischen Vorstudien, ein wenig Linguistik ist auch dabei, aber nicht die Grundlage der ganzen Geschichte. Was sie für uns gemacht haben, ist also die Annotierung von Pressetexten für unsere Pressedatenbank. Für diejenigen, die es noch nicht kennen: Gruner + Jahr führt eine Textdokumentation, die eine Datenbank führt, seit Anfang der 70er Jahre, da sind z.Z. etwa 6,5 Millionen Dokumente darin, davon etwas über 1 Million Volltexte ab 1993. Das Prinzip war lange Zeit, dass wir die Dokumente, die in der Datenbank gespeichert waren und sind, verschlagworten und dieses Prinzip haben wir auch dann, als der Volltext eingeführt wurde, in abgespeckter Form weitergeführt. Zu diesen 6,5 Millionen Dokumenten gehören dann eben auch ungefähr 10 Millionen Faksimileseiten, weil wir die Faksimiles auch noch standardmäßig aufheben.
Theme
Data Mining
Dokumentenmanagement
Object
DocCat

Similar documents (author)

  1. Peters, C.M.: CD-ROM: its potential in libraries (1986) 4.76
    4.7649565 = sum of:
      4.7649565 = weight(author_txt:peters in 534) [ClassicSimilarity], result of:
        4.7649565 = fieldWeight in 534, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.62393 = idf(docFreq=58, maxDocs=44421)
          0.625 = fieldNorm(doc=534)
    
  2. Peters, T.A.: When smart people fail : an analysis of the transaction log of an online public access catalog (1989) 4.76
    4.7649565 = sum of:
      4.7649565 = weight(author_txt:peters in 2282) [ClassicSimilarity], result of:
        4.7649565 = fieldWeight in 2282, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.62393 = idf(docFreq=58, maxDocs=44421)
          0.625 = fieldNorm(doc=2282)
    
  3. Peters, C.M.: CD-ROM and optical technology : the user interface (1988) 4.76
    4.7649565 = sum of:
      4.7649565 = weight(author_txt:peters in 4012) [ClassicSimilarity], result of:
        4.7649565 = fieldWeight in 4012, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.62393 = idf(docFreq=58, maxDocs=44421)
          0.625 = fieldNorm(doc=4012)
    
  4. Peters, B.F.: Online searching using speech as a man / machine interface (1989) 4.76
    4.7649565 = sum of:
      4.7649565 = weight(author_txt:peters in 4636) [ClassicSimilarity], result of:
        4.7649565 = fieldWeight in 4636, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.62393 = idf(docFreq=58, maxDocs=44421)
          0.625 = fieldNorm(doc=4636)
    
  5. Peters, R.: Katalogisierung mit MIDAS (1991) 4.76
    4.7649565 = sum of:
      4.7649565 = weight(author_txt:peters in 4739) [ClassicSimilarity], result of:
        4.7649565 = fieldWeight in 4739, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          7.62393 = idf(docFreq=58, maxDocs=44421)
          0.625 = fieldNorm(doc=4739)
    

Similar documents (content)

  1. Jörn, F.: Wie Google für uns nach der ominösen Gluonenkraft stöbert : Software-Krabbler machen sich vor der Anfrage auf die Suche - Das Netz ist etwa fünfhundertmal größer als alles Durchforschte (2001) 0.26
    0.26230228 = sum of:
      0.26230228 = product of:
        0.4683969 = sum of:
          0.017761394 = weight(abstract_txt:verstehen in 671) [ClassicSimilarity], result of:
            0.017761394 = score(doc=671,freq=1.0), product of:
              0.14395873 = queryWeight, product of:
                6.3169727 = idf(docFreq=217, maxDocs=44421)
                0.022789197 = queryNorm
              0.12337837 = fieldWeight in 671, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3169727 = idf(docFreq=217, maxDocs=44421)
                0.01953125 = fieldNorm(doc=671)
          0.019182209 = weight(abstract_txt:etwas in 671) [ClassicSimilarity], result of:
            0.019182209 = score(doc=671,freq=1.0), product of:
              0.15153714 = queryWeight, product of:
                1.0259838 = boost
                6.481112 = idf(docFreq=184, maxDocs=44421)
                0.022789197 = queryNorm
              0.12658422 = fieldWeight in 671, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.481112 = idf(docFreq=184, maxDocs=44421)
                0.01953125 = fieldNorm(doc=671)
          0.021628164 = weight(abstract_txt:lange in 671) [ClassicSimilarity], result of:
            0.021628164 = score(doc=671,freq=1.0), product of:
              0.16415966 = queryWeight, product of:
                1.0678598 = boost
                6.7456408 = idf(docFreq=141, maxDocs=44421)
                0.022789197 = queryNorm
              0.13175079 = fieldWeight in 671, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7456408 = idf(docFreq=141, maxDocs=44421)
                0.01953125 = fieldNorm(doc=671)
          0.005514392 = weight(abstract_txt:also in 671) [ClassicSimilarity], result of:
            0.005514392 = score(doc=671,freq=1.0), product of:
              0.083162665 = queryWeight, product of:
                1.0748805 = boost
                3.3949955 = idf(docFreq=4049, maxDocs=44421)
                0.022789197 = queryNorm
              0.066308506 = fieldWeight in 671, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3949955 = idf(docFreq=4049, maxDocs=44421)
                0.01953125 = fieldNorm(doc=671)
          0.021415794 = weight(abstract_txt:haben in 671) [ClassicSimilarity], result of:
            0.021415794 = score(doc=671,freq=4.0), product of:
              0.11760339 = queryWeight, product of:
                1.1069717 = boost
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.022789197 = queryNorm
              0.18210185 = fieldWeight in 671, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.01953125 = fieldNorm(doc=671)
          0.031668797 = weight(abstract_txt:noch in 671) [ClassicSimilarity], result of:
            0.031668797 = score(doc=671,freq=8.0), product of:
              0.12115503 = queryWeight, product of:
                1.1235628 = boost
                4.731677 = idf(docFreq=1063, maxDocs=44421)
                0.022789197 = queryNorm
              0.2613907 = fieldWeight in 671, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.731677 = idf(docFreq=1063, maxDocs=44421)
                0.01953125 = fieldNorm(doc=671)
          0.011484951 = weight(abstract_txt:wurde in 671) [ClassicSimilarity], result of:
            0.011484951 = score(doc=671,freq=1.0), product of:
              0.12322623 = queryWeight, product of:
                1.133126 = boost
                4.7719507 = idf(docFreq=1021, maxDocs=44421)
                0.022789197 = queryNorm
              0.09320216 = fieldWeight in 671, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7719507 = idf(docFreq=1021, maxDocs=44421)
                0.01953125 = fieldNorm(doc=671)
          0.030010134 = weight(abstract_txt:zeit in 671) [ClassicSimilarity], result of:
            0.030010134 = score(doc=671,freq=3.0), product of:
              0.16208965 = queryWeight, product of:
                1.2995837 = boost
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.022789197 = queryNorm
              0.18514529 = fieldWeight in 671, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.01953125 = fieldNorm(doc=671)
          0.052531235 = weight(abstract_txt:dann in 671) [ClassicSimilarity], result of:
            0.052531235 = score(doc=671,freq=8.0), product of:
              0.16977178 = queryWeight, product of:
                1.3300236 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.022789197 = queryNorm
              0.30942267 = fieldWeight in 671, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.01953125 = fieldNorm(doc=671)
          0.024417179 = weight(abstract_txt:auch in 671) [ClassicSimilarity], result of:
            0.024417179 = score(doc=671,freq=7.0), product of:
              0.12627895 = queryWeight, product of:
                1.4808685 = boost
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.022789197 = queryNorm
              0.19335905 = fieldWeight in 671, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.01953125 = fieldNorm(doc=671)
          0.09214805 = weight(abstract_txt:millionen in 671) [ClassicSimilarity], result of:
            0.09214805 = score(doc=671,freq=10.0), product of:
              0.2292316 = queryWeight, product of:
                1.5454817 = boost
                6.5085106 = idf(docFreq=179, maxDocs=44421)
                0.022789197 = queryNorm
              0.4019867 = fieldWeight in 671, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                6.5085106 = idf(docFreq=179, maxDocs=44421)
                0.01953125 = fieldNorm(doc=671)
          0.07977137 = weight(abstract_txt:einmal in 671) [ClassicSimilarity], result of:
            0.07977137 = score(doc=671,freq=4.0), product of:
              0.3110341 = queryWeight, product of:
                2.0787392 = boost
                6.565669 = idf(docFreq=169, maxDocs=44421)
                0.022789197 = queryNorm
              0.25647146 = fieldWeight in 671, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.565669 = idf(docFreq=169, maxDocs=44421)
                0.01953125 = fieldNorm(doc=671)
          0.037850827 = weight(abstract_txt:eine in 671) [ClassicSimilarity], result of:
            0.037850827 = score(doc=671,freq=10.0), product of:
              0.17565353 = queryWeight, product of:
                2.2092223 = boost
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.022789197 = queryNorm
              0.2154857 = fieldWeight in 671, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.01953125 = fieldNorm(doc=671)
          0.023012385 = weight(abstract_txt:text in 671) [ClassicSimilarity], result of:
            0.023012385 = score(doc=671,freq=2.0), product of:
              0.2061771 = queryWeight, product of:
                2.2389028 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.022789197 = queryNorm
              0.11161465 = fieldWeight in 671, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.01953125 = fieldNorm(doc=671)
        0.56 = coord(14/25)
    
  2. Arns, C.: Fallstricke Online : Über die eigenen Worte gestolpert (2005) 0.20
    0.20150341 = sum of:
      0.20150341 = product of:
        0.62969816 = sum of:
          0.061383072 = weight(abstract_txt:etwas in 4502) [ClassicSimilarity], result of:
            0.061383072 = score(doc=4502,freq=1.0), product of:
              0.15153714 = queryWeight, product of:
                1.0259838 = boost
                6.481112 = idf(docFreq=184, maxDocs=44421)
                0.022789197 = queryNorm
              0.4050695 = fieldWeight in 4502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.481112 = idf(docFreq=184, maxDocs=44421)
                0.0625 = fieldNorm(doc=4502)
          0.03426527 = weight(abstract_txt:haben in 4502) [ClassicSimilarity], result of:
            0.03426527 = score(doc=4502,freq=1.0), product of:
              0.11760339 = queryWeight, product of:
                1.1069717 = boost
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.022789197 = queryNorm
              0.29136294 = fieldWeight in 4502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.0625 = fieldNorm(doc=4502)
          0.07798088 = weight(abstract_txt:menge in 4502) [ClassicSimilarity], result of:
            0.07798088 = score(doc=4502,freq=1.0), product of:
              0.17775102 = queryWeight, product of:
                1.1111867 = boost
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.022789197 = queryNorm
              0.4387085 = fieldWeight in 4502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.0625 = fieldNorm(doc=4502)
          0.035829157 = weight(abstract_txt:noch in 4502) [ClassicSimilarity], result of:
            0.035829157 = score(doc=4502,freq=1.0), product of:
              0.12115503 = queryWeight, product of:
                1.1235628 = boost
                4.731677 = idf(docFreq=1063, maxDocs=44421)
                0.022789197 = queryNorm
              0.29572982 = fieldWeight in 4502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.731677 = idf(docFreq=1063, maxDocs=44421)
                0.0625 = fieldNorm(doc=4502)
          0.102939785 = weight(abstract_txt:dann in 4502) [ClassicSimilarity], result of:
            0.102939785 = score(doc=4502,freq=3.0), product of:
              0.16977178 = queryWeight, product of:
                1.3300236 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.022789197 = queryNorm
              0.60634214 = fieldWeight in 4502, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.0625 = fieldNorm(doc=4502)
          0.05115135 = weight(abstract_txt:auch in 4502) [ClassicSimilarity], result of:
            0.05115135 = score(doc=4502,freq=3.0), product of:
              0.12627895 = queryWeight, product of:
                1.4808685 = boost
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.022789197 = queryNorm
              0.4050663 = fieldWeight in 4502, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.0625 = fieldNorm(doc=4502)
          0.18050201 = weight(abstract_txt:einmal in 4502) [ClassicSimilarity], result of:
            0.18050201 = score(doc=4502,freq=2.0), product of:
              0.3110341 = queryWeight, product of:
                2.0787392 = boost
                6.565669 = idf(docFreq=169, maxDocs=44421)
                0.022789197 = queryNorm
              0.58032864 = fieldWeight in 4502, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.565669 = idf(docFreq=169, maxDocs=44421)
                0.0625 = fieldNorm(doc=4502)
          0.085646644 = weight(abstract_txt:eine in 4502) [ClassicSimilarity], result of:
            0.085646644 = score(doc=4502,freq=5.0), product of:
              0.17565353 = queryWeight, product of:
                2.2092223 = boost
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.022789197 = queryNorm
              0.4875885 = fieldWeight in 4502, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.0625 = fieldNorm(doc=4502)
        0.32 = coord(8/25)
    
  3. Erben, K.M.: ¬Das Internet wird menschlich : Web-Guides sind die neuen Pfadfinder im Dschungel des Netzes (2001) 0.19
    0.19486861 = sum of:
      0.19486861 = product of:
        0.4871715 = sum of:
          0.043256328 = weight(abstract_txt:lange in 6735) [ClassicSimilarity], result of:
            0.043256328 = score(doc=6735,freq=1.0), product of:
              0.16415966 = queryWeight, product of:
                1.0678598 = boost
                6.7456408 = idf(docFreq=141, maxDocs=44421)
                0.022789197 = queryNorm
              0.26350158 = fieldWeight in 6735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.7456408 = idf(docFreq=141, maxDocs=44421)
                0.0390625 = fieldNorm(doc=6735)
          0.030286504 = weight(abstract_txt:haben in 6735) [ClassicSimilarity], result of:
            0.030286504 = score(doc=6735,freq=2.0), product of:
              0.11760339 = queryWeight, product of:
                1.1069717 = boost
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.022789197 = queryNorm
              0.25753087 = fieldWeight in 6735, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.661807 = idf(docFreq=1140, maxDocs=44421)
                0.0390625 = fieldNorm(doc=6735)
          0.038786195 = weight(abstract_txt:noch in 6735) [ClassicSimilarity], result of:
            0.038786195 = score(doc=6735,freq=3.0), product of:
              0.12115503 = queryWeight, product of:
                1.1235628 = boost
                4.731677 = idf(docFreq=1063, maxDocs=44421)
                0.022789197 = queryNorm
              0.3201369 = fieldWeight in 6735, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.731677 = idf(docFreq=1063, maxDocs=44421)
                0.0390625 = fieldNorm(doc=6735)
          0.055545762 = weight(abstract_txt:eben in 6735) [ClassicSimilarity], result of:
            0.055545762 = score(doc=6735,freq=1.0), product of:
              0.19393994 = queryWeight, product of:
                1.1606857 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.022789197 = queryNorm
              0.28640702 = fieldWeight in 6735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0390625 = fieldNorm(doc=6735)
          0.08499136 = weight(abstract_txt:kennen in 6735) [ClassicSimilarity], result of:
            0.08499136 = score(doc=6735,freq=2.0), product of:
              0.20439637 = queryWeight, product of:
                1.1915646 = boost
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.022789197 = queryNorm
              0.41581637 = fieldWeight in 6735, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5270805 = idf(docFreq=64, maxDocs=44421)
                0.0390625 = fieldNorm(doc=6735)
          0.03465272 = weight(abstract_txt:zeit in 6735) [ClassicSimilarity], result of:
            0.03465272 = score(doc=6735,freq=1.0), product of:
              0.16208965 = queryWeight, product of:
                1.2995837 = boost
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.022789197 = queryNorm
              0.21378738 = fieldWeight in 6735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.0390625 = fieldNorm(doc=6735)
          0.037145194 = weight(abstract_txt:dann in 6735) [ClassicSimilarity], result of:
            0.037145194 = score(doc=6735,freq=1.0), product of:
              0.16977178 = queryWeight, product of:
                1.3300236 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.022789197 = queryNorm
              0.21879487 = fieldWeight in 6735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.0390625 = fieldNorm(doc=6735)
          0.041272566 = weight(abstract_txt:auch in 6735) [ClassicSimilarity], result of:
            0.041272566 = score(doc=6735,freq=5.0), product of:
              0.12627895 = queryWeight, product of:
                1.4808685 = boost
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.022789197 = queryNorm
              0.32683647 = fieldWeight in 6735, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.0390625 = fieldNorm(doc=6735)
          0.07977137 = weight(abstract_txt:einmal in 6735) [ClassicSimilarity], result of:
            0.07977137 = score(doc=6735,freq=1.0), product of:
              0.3110341 = queryWeight, product of:
                2.0787392 = boost
                6.565669 = idf(docFreq=169, maxDocs=44421)
                0.022789197 = queryNorm
              0.25647146 = fieldWeight in 6735, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.565669 = idf(docFreq=169, maxDocs=44421)
                0.0390625 = fieldNorm(doc=6735)
          0.041463498 = weight(abstract_txt:eine in 6735) [ClassicSimilarity], result of:
            0.041463498 = score(doc=6735,freq=3.0), product of:
              0.17565353 = queryWeight, product of:
                2.2092223 = boost
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.022789197 = queryNorm
              0.23605275 = fieldWeight in 6735, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.0390625 = fieldNorm(doc=6735)
        0.4 = coord(10/25)
    
  4. Taglinger, H.: Ausgevogelt, jetzt wird es ernst (2018) 0.18
    0.17798145 = sum of:
      0.17798145 = product of:
        0.4943929 = sum of:
          0.049731907 = weight(abstract_txt:verstehen in 281) [ClassicSimilarity], result of:
            0.049731907 = score(doc=281,freq=1.0), product of:
              0.14395873 = queryWeight, product of:
                6.3169727 = idf(docFreq=217, maxDocs=44421)
                0.022789197 = queryNorm
              0.34545946 = fieldWeight in 281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3169727 = idf(docFreq=217, maxDocs=44421)
                0.0546875 = fieldNorm(doc=281)
          0.021835878 = weight(abstract_txt:also in 281) [ClassicSimilarity], result of:
            0.021835878 = score(doc=281,freq=2.0), product of:
              0.083162665 = queryWeight, product of:
                1.0748805 = boost
                3.3949955 = idf(docFreq=4049, maxDocs=44421)
                0.022789197 = queryNorm
              0.2625683 = fieldWeight in 281, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3949955 = idf(docFreq=4049, maxDocs=44421)
                0.0546875 = fieldNorm(doc=281)
          0.068233274 = weight(abstract_txt:menge in 281) [ClassicSimilarity], result of:
            0.068233274 = score(doc=281,freq=1.0), product of:
              0.17775102 = queryWeight, product of:
                1.1111867 = boost
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.022789197 = queryNorm
              0.38386995 = fieldWeight in 281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.0546875 = fieldNorm(doc=281)
          0.05430068 = weight(abstract_txt:noch in 281) [ClassicSimilarity], result of:
            0.05430068 = score(doc=281,freq=3.0), product of:
              0.12115503 = queryWeight, product of:
                1.1235628 = boost
                4.731677 = idf(docFreq=1063, maxDocs=44421)
                0.022789197 = queryNorm
              0.4481917 = fieldWeight in 281, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.731677 = idf(docFreq=1063, maxDocs=44421)
                0.0546875 = fieldNorm(doc=281)
          0.077764064 = weight(abstract_txt:eben in 281) [ClassicSimilarity], result of:
            0.077764064 = score(doc=281,freq=1.0), product of:
              0.19393994 = queryWeight, product of:
                1.1606857 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.022789197 = queryNorm
              0.40096983 = fieldWeight in 281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0546875 = fieldNorm(doc=281)
          0.048513804 = weight(abstract_txt:zeit in 281) [ClassicSimilarity], result of:
            0.048513804 = score(doc=281,freq=1.0), product of:
              0.16208965 = queryWeight, product of:
                1.2995837 = boost
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.022789197 = queryNorm
              0.2993023 = fieldWeight in 281, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4729567 = idf(docFreq=506, maxDocs=44421)
                0.0546875 = fieldNorm(doc=281)
          0.090072304 = weight(abstract_txt:dann in 281) [ClassicSimilarity], result of:
            0.090072304 = score(doc=281,freq=3.0), product of:
              0.16977178 = queryWeight, product of:
                1.3300236 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.022789197 = queryNorm
              0.53054935 = fieldWeight in 281, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.0546875 = fieldNorm(doc=281)
          0.036544286 = weight(abstract_txt:auch in 281) [ClassicSimilarity], result of:
            0.036544286 = score(doc=281,freq=2.0), product of:
              0.12627895 = queryWeight, product of:
                1.4808685 = boost
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.022789197 = queryNorm
              0.28939334 = fieldWeight in 281, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.0546875 = fieldNorm(doc=281)
          0.04739673 = weight(abstract_txt:eine in 281) [ClassicSimilarity], result of:
            0.04739673 = score(doc=281,freq=2.0), product of:
              0.17565353 = queryWeight, product of:
                2.2092223 = boost
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.022789197 = queryNorm
              0.2698308 = fieldWeight in 281, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.0546875 = fieldNorm(doc=281)
        0.36 = coord(9/25)
    
  5. Heyer, G.; Quasthoff, U.; Wittig, T.: Text Mining : Wissensrohstoff Text. Konzepte, Algorithmen, Ergebnisse (2006) 0.17
    0.16595046 = sum of:
      0.16595046 = product of:
        0.5926802 = sum of:
          0.07422769 = weight(abstract_txt:technologie in 218) [ClassicSimilarity], result of:
            0.07422769 = score(doc=218,freq=3.0), product of:
              0.16314366 = queryWeight, product of:
                1.06455 = boost
                6.724734 = idf(docFreq=144, maxDocs=44421)
                0.022789197 = queryNorm
              0.4549836 = fieldWeight in 218, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.724734 = idf(docFreq=144, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.06117369 = weight(abstract_txt:lange in 218) [ClassicSimilarity], result of:
            0.06117369 = score(doc=218,freq=2.0), product of:
              0.16415966 = queryWeight, product of:
                1.0678598 = boost
                6.7456408 = idf(docFreq=141, maxDocs=44421)
                0.022789197 = queryNorm
              0.37264752 = fieldWeight in 218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.7456408 = idf(docFreq=141, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.031668797 = weight(abstract_txt:noch in 218) [ClassicSimilarity], result of:
            0.031668797 = score(doc=218,freq=2.0), product of:
              0.12115503 = queryWeight, product of:
                1.1235628 = boost
                4.731677 = idf(docFreq=1063, maxDocs=44421)
                0.022789197 = queryNorm
              0.2613907 = fieldWeight in 218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.731677 = idf(docFreq=1063, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.02610306 = weight(abstract_txt:auch in 218) [ClassicSimilarity], result of:
            0.02610306 = score(doc=218,freq=2.0), product of:
              0.12627895 = queryWeight, product of:
                1.4808685 = boost
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.022789197 = queryNorm
              0.20670952 = fieldWeight in 218, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7418423 = idf(docFreq=2862, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.047877926 = weight(abstract_txt:eine in 218) [ClassicSimilarity], result of:
            0.047877926 = score(doc=218,freq=4.0), product of:
              0.17565353 = queryWeight, product of:
                2.2092223 = boost
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.022789197 = queryNorm
              0.27257025 = fieldWeight in 218, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4888992 = idf(docFreq=3686, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.1173406 = weight(abstract_txt:text in 218) [ClassicSimilarity], result of:
            0.1173406 = score(doc=218,freq=13.0), product of:
              0.2061771 = queryWeight, product of:
                2.2389028 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.022789197 = queryNorm
              0.5691253 = fieldWeight in 218, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
          0.23428848 = weight(abstract_txt:mining in 218) [ClassicSimilarity], result of:
            0.23428848 = score(doc=218,freq=8.0), product of:
              0.34357163 = queryWeight, product of:
                2.442641 = boost
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.022789197 = queryNorm
              0.68192035 = fieldWeight in 218, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.1720386 = idf(docFreq=251, maxDocs=44421)
                0.0390625 = fieldNorm(doc=218)
        0.28 = coord(7/25)