Document (#23563)

Author
Schirmer, K.
Haller, J.
Title
Zugang zu mehrsprachigen Nachrichten im Internet
Source
Sprachtechnologie für eine dynamische Wirtschaft im Medienzeitalter - Language technologies for dynamic business in the age of the media - L'ingénierie linguistique au service de la dynamisation économique à l'ère du multimédia: Tagungsakten der XXVI. Jahrestagung der Internationalen Vereinigung Sprache und Wirtschaft e.V., 23.-25.11.2000, Fachhochschule Köln. Hrsg.: K.-D. Schmitz
Imprint
Wien : Termnet
Year
2000
Pages
pp. 23-24
Abstract
In a cooperation between smart information and the IAI, about 20,000 current news items of the day (in German) are linguistically indexed every day. The news items are collected daily from a wide variety of Internet sources by newscan (http://www.newscan.de), the news search engine of smart information. Users can search with freely chosen terms. The result of such a keyword search is displayed as a table, ordered by frequency. For larger result sets (more than ten documents), the news items are automatically grouped (cluster analysis) and given a label (topic). These topics are displayed in a tree structure, so the user can go directly to a topic area. The cluster analysis is based on the automatic grouping of the documents and their keywords (descriptors), as generated by AUDESC, the automatic descriptor-assignment module of the IAI. The news items, compiled into one large file, are sent to the IAI every night. A version of the indexing module AUTINDEX specially adapted to these news items assigns subject headings to each individual news item.
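
The retrieval flow described in the abstract can be sketched roughly as follows. This is an illustration only, not the newscan or IAI implementation. The record fields (`id`, `text`, `descriptors`), the function names, and the grouping heuristic are assumptions. The abstract names a cluster analysis over documents and descriptors without specifying the algorithm, so the sketch substitutes a much simpler rule: each hit is labelled with the one of its descriptors that is most common in the result set. "Ordered by frequency" is taken here to mean occurrences of the search term.

```python
from collections import Counter, defaultdict

# From the abstract: results are grouped when more than ten documents match.
CLUSTER_THRESHOLD = 10


def search(docs, term):
    """Return matching documents, highest count of `term` first (assumed ordering)."""
    term = term.lower()
    hits = []
    for doc in docs:
        freq = doc["text"].lower().split().count(term)
        if freq:
            hits.append((freq, doc))
    hits.sort(key=lambda pair: pair[0], reverse=True)
    return [doc for _, doc in hits]


def group_by_topic(hits):
    """Hypothetical stand-in for the cluster analysis: label each hit with
    its descriptor that occurs most often across the whole result set."""
    counts = Counter(d for doc in hits for d in doc["descriptors"])
    groups = defaultdict(list)
    for doc in hits:
        label = max(doc["descriptors"], key=lambda d: counts[d], default="(unlabelled)")
        groups[label].append(doc["id"])
    return dict(groups)


def present(docs, term):
    """Small result sets become a flat table, larger ones a topic grouping."""
    hits = search(docs, term)
    if len(hits) > CLUSTER_THRESHOLD:
        return {"topics": group_by_topic(hits)}
    return {"table": [doc["id"] for doc in hits]}
```

A real system would derive the topic labels from the descriptors produced by the indexing module and arrange them in a tree; the flat dictionary above only shows where the ten-document threshold comes into play.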
Theme
Multilinguale Probleme
Internet

Similar documents (author)

  1. Haller, K.: ¬Das Katalogsystem der Bayerischen Staatsbibliothek (1991) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:haller in 525) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 525, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=525)
    
  2. Haller, K.: Regelwerke und Normdateien in Verbundbibliotheken (1988) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:haller in 701) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 701, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=701)
    
  3. Haller, K.: Kommunikation, Normung und Kataloge (1990) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:haller in 1146) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 1146, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=1146)
    
  4. Haller, K.: ¬Der Image-Katalog 1953-1981 der Bayerischen Staatsbibliothek (1997) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:haller in 458) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 458, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=458)
    
  5. Haller, K.: Katalogkunde : Formalkataloge und formale Ordnungsmethoden (1983) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:haller in 1704) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 1704, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=1704)
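
The five author matches above all reduce to the same arithmetic: fieldWeight = tf × idf × fieldNorm. The sketch below recomputes it in Python. The function names are illustrative, and the formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are the ones the printed numbers are consistent with; they are not read from the index configuration. Python uses double precision, so the last digits can differ slightly from the single-precision values in the listing.

```python
import math


def classic_tf(freq):
    """Term-frequency factor: square root of the raw count."""
    return math.sqrt(freq)


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency in the form that matches the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq, doc_freq, max_docs, field_norm):
    """fieldWeight = tf * idf * fieldNorm, as in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the author_txt:haller entries above.
w = field_weight(freq=1.0, doc_freq=15, max_docs=44421, field_norm=0.625)
print(round(w, 4))  # close to the 5.5805492 shown in the listing
```

Because each author query has a single term, no queryWeight or coord factor appears, and the fieldWeight is the final score.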
    

Similar documents (content)

  1. Deuble, M.; Niemann, J.: Elektronische Archivierung für die Lokalberichterstattung : Szenarien der Archivreorganisation am Beispiel der Cuxhavener Nachrichten (1997) 0.16
    0.15857697 = sum of:
      0.15857697 = product of:
        0.99110603 = sum of:
          0.15370953 = weight(abstract_txt:20.000 in 2483) [ClassicSimilarity], result of:
            0.15370953 = score(doc=2483,freq=1.0), product of:
              0.14124899 = queryWeight, product of:
                1.026306 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.01580895 = queryNorm
              1.0882169 = fieldWeight in 2483, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.125 = fieldNorm(doc=2483)
          0.115699224 = weight(abstract_txt:einer in 2483) [ClassicSimilarity], result of:
            0.115699224 = score(doc=2483,freq=2.0), product of:
              0.16856945 = queryWeight, product of:
                2.746308 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.01580895 = queryNorm
              0.6863594 = fieldWeight in 2483, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.125 = fieldNorm(doc=2483)
          0.12191138 = weight(abstract_txt:werden in 2483) [ClassicSimilarity], result of:
            0.12191138 = score(doc=2483,freq=3.0), product of:
              0.16052397 = queryWeight, product of:
                2.8946974 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.01580895 = queryNorm
              0.759459 = fieldWeight in 2483, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.125 = fieldNorm(doc=2483)
          0.59978586 = weight(abstract_txt:nachrichten in 2483) [ClassicSimilarity], result of:
            0.59978586 = score(doc=2483,freq=1.0), product of:
              0.63615954 = queryWeight, product of:
                5.335104 = boost
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.01580895 = queryNorm
              0.94282305 = fieldWeight in 2483, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.125 = fieldNorm(doc=2483)
        0.16 = coord(4/25)
    
  2. Zoller, P.: ¬Die wohldurchdachte Aufbereitung des Informationsangebotes : Teil einer computergestützten Sachbearbeitung (1992) 0.16
    0.15818821 = sum of:
      0.15818821 = product of:
        0.790941 = sum of:
          0.0331976 = weight(abstract_txt:diese in 1401) [ClassicSimilarity], result of:
            0.0331976 = score(doc=1401,freq=1.0), product of:
              0.070026204 = queryWeight, product of:
                1.0219496 = boost
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.01580895 = queryNorm
              0.47407398 = fieldWeight in 1401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.109375 = fieldNorm(doc=1401)
          0.099758126 = weight(abstract_txt:dokumente in 1401) [ClassicSimilarity], result of:
            0.099758126 = score(doc=1401,freq=1.0), product of:
              0.14582153 = queryWeight, product of:
                1.4747216 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.01580895 = queryNorm
              0.6841111 = fieldWeight in 1401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.109375 = fieldNorm(doc=1401)
          0.071585245 = weight(abstract_txt:einer in 1401) [ClassicSimilarity], result of:
            0.071585245 = score(doc=1401,freq=1.0), product of:
              0.16856945 = queryWeight, product of:
                2.746308 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.01580895 = queryNorm
              0.42466322 = fieldWeight in 1401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.109375 = fieldNorm(doc=1401)
          0.06158737 = weight(abstract_txt:werden in 1401) [ClassicSimilarity], result of:
            0.06158737 = score(doc=1401,freq=1.0), product of:
              0.16052397 = queryWeight, product of:
                2.8946974 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.01580895 = queryNorm
              0.38366464 = fieldWeight in 1401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.109375 = fieldNorm(doc=1401)
          0.52481264 = weight(abstract_txt:nachrichten in 1401) [ClassicSimilarity], result of:
            0.52481264 = score(doc=1401,freq=1.0), product of:
              0.63615954 = queryWeight, product of:
                5.335104 = boost
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.01580895 = queryNorm
              0.8249702 = fieldWeight in 1401, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.109375 = fieldNorm(doc=1401)
        0.2 = coord(5/25)
    
  3. Start von Wikinews (2005) 0.15
    0.15202652 = sum of:
      0.15202652 = product of:
        1.2668877 = sum of:
          0.053347386 = weight(abstract_txt:kann in 4300) [ClassicSimilarity], result of:
            0.053347386 = score(doc=4300,freq=1.0), product of:
              0.07574086 = queryWeight, product of:
                1.0628313 = boost
                4.507782 = idf(docFreq=1330, maxDocs=44421)
                0.01580895 = queryNorm
              0.70434093 = fieldWeight in 4300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.507782 = idf(docFreq=1330, maxDocs=44421)
                0.15625 = fieldNorm(doc=4300)
          0.15325868 = weight(abstract_txt:jeder in 4300) [ClassicSimilarity], result of:
            0.15325868 = score(doc=4300,freq=1.0), product of:
              0.15306346 = queryWeight, product of:
                1.5108974 = boost
                6.4081626 = idf(docFreq=198, maxDocs=44421)
                0.01580895 = queryNorm
              1.0012754 = fieldWeight in 4300, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4081626 = idf(docFreq=198, maxDocs=44421)
                0.15625 = fieldNorm(doc=4300)
          1.0602816 = weight(abstract_txt:nachrichten in 4300) [ClassicSimilarity], result of:
            1.0602816 = score(doc=4300,freq=2.0), product of:
              0.63615954 = queryWeight, product of:
                5.335104 = boost
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.01580895 = queryNorm
              1.6666914 = fieldWeight in 4300, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.15625 = fieldNorm(doc=4300)
        0.12 = coord(3/25)
    
  4. Albrecht, C.: ¬Die Entdeckung der Weitschweifigkeit : Über das Glück, mit Markow-Ketten zu rasseln: Die Schriften Claude E. Shannons (2001) 0.15
    0.15024947 = sum of:
      0.15024947 = product of:
        0.53660524 = sum of:
          0.009485029 = weight(abstract_txt:diese in 6643) [ClassicSimilarity], result of:
            0.009485029 = score(doc=6643,freq=1.0), product of:
              0.070026204 = queryWeight, product of:
                1.0219496 = boost
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.01580895 = queryNorm
              0.13544971 = fieldWeight in 6643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3343906 = idf(docFreq=1582, maxDocs=44421)
                0.03125 = fieldNorm(doc=6643)
          0.076854765 = weight(abstract_txt:nachricht in 6643) [ClassicSimilarity], result of:
            0.076854765 = score(doc=6643,freq=4.0), product of:
              0.14124899 = queryWeight, product of:
                1.026306 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.01580895 = queryNorm
              0.54410845 = fieldWeight in 6643, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.03125 = fieldNorm(doc=6643)
          0.01508892 = weight(abstract_txt:kann in 6643) [ClassicSimilarity], result of:
            0.01508892 = score(doc=6643,freq=2.0), product of:
              0.07574086 = queryWeight, product of:
                1.0628313 = boost
                4.507782 = idf(docFreq=1330, maxDocs=44421)
                0.01580895 = queryNorm
              0.19921769 = fieldWeight in 6643, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.507782 = idf(docFreq=1330, maxDocs=44421)
                0.03125 = fieldNorm(doc=6643)
          0.028502323 = weight(abstract_txt:dokumente in 6643) [ClassicSimilarity], result of:
            0.028502323 = score(doc=6643,freq=1.0), product of:
              0.14582153 = queryWeight, product of:
                1.4747216 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.01580895 = queryNorm
              0.19546032 = fieldWeight in 6643, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.03125 = fieldNorm(doc=6643)
          0.040905852 = weight(abstract_txt:einer in 6643) [ClassicSimilarity], result of:
            0.040905852 = score(doc=6643,freq=4.0), product of:
              0.16856945 = queryWeight, product of:
                2.746308 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.01580895 = queryNorm
              0.2426647 = fieldWeight in 6643, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.03125 = fieldNorm(doc=6643)
          0.030477844 = weight(abstract_txt:werden in 6643) [ClassicSimilarity], result of:
            0.030477844 = score(doc=6643,freq=3.0), product of:
              0.16052397 = queryWeight, product of:
                2.8946974 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.01580895 = queryNorm
              0.18986475 = fieldWeight in 6643, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.03125 = fieldNorm(doc=6643)
          0.33529052 = weight(abstract_txt:nachrichten in 6643) [ClassicSimilarity], result of:
            0.33529052 = score(doc=6643,freq=5.0), product of:
              0.63615954 = queryWeight, product of:
                5.335104 = boost
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.01580895 = queryNorm
              0.52705413 = fieldWeight in 6643, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.03125 = fieldNorm(doc=6643)
        0.28 = coord(7/25)
    
  5. Stock, M.: Neuigkeiten auf der Spur : Searches, Tracks und News Pages bei Factiva (2002) 0.14
    0.13582978 = sum of:
      0.13582978 = product of:
        0.84893614 = sum of:
          0.12441656 = weight(abstract_txt:geordnet in 1697) [ClassicSimilarity], result of:
            0.12441656 = score(doc=1697,freq=1.0), product of:
              0.13410085 = queryWeight, product of:
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.01580895 = queryNorm
              0.9277835 = fieldWeight in 1697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.482592 = idf(docFreq=24, maxDocs=44421)
                0.109375 = fieldNorm(doc=1697)
          0.1281217 = weight(abstract_txt:tages in 1697) [ClassicSimilarity], result of:
            0.1281217 = score(doc=1697,freq=1.0), product of:
              0.13675018 = queryWeight, product of:
                1.0098298 = boost
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.01580895 = queryNorm
              0.93690336 = fieldWeight in 1697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.565973 = idf(docFreq=22, maxDocs=44421)
                0.109375 = fieldNorm(doc=1697)
          0.071585245 = weight(abstract_txt:einer in 1697) [ClassicSimilarity], result of:
            0.071585245 = score(doc=1697,freq=1.0), product of:
              0.16856945 = queryWeight, product of:
                2.746308 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.01580895 = queryNorm
              0.42466322 = fieldWeight in 1697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.109375 = fieldNorm(doc=1697)
          0.52481264 = weight(abstract_txt:nachrichten in 1697) [ClassicSimilarity], result of:
            0.52481264 = score(doc=1697,freq=1.0), product of:
              0.63615954 = queryWeight, product of:
                5.335104 = boost
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.01580895 = queryNorm
              0.8249702 = fieldWeight in 1697, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5425844 = idf(docFreq=63, maxDocs=44421)
                0.109375 = fieldNorm(doc=1697)
        0.16 = coord(4/25)
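
The content scores above combine three factors per matched term and one per document: each term contributes queryWeight × fieldWeight, and the sum is scaled by coord (matched terms divided by the 25 query terms). As a cross-check, the sketch below recomputes the score of result 3 (doc 4300) from the numbers printed in its tree. The function names are illustrative, and the idf formula is the one the listing's values are consistent with, not one taken from the index configuration.

```python
import math

QUERY_NORM = 0.01580895  # queryNorm as printed in the listing
MAX_DOCS = 44421         # maxDocs as printed in the listing
QUERY_TERMS = 25         # denominator of the coord factor


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency in the form that matches the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, boost, field_norm):
    """Score of one matched term: queryWeight * fieldWeight."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight


# (freq, docFreq, boost, fieldNorm) for the three terms matched in doc 4300
matched = [
    (1.0, 1330, 1.0628313, 0.15625),  # kann
    (1.0, 198, 1.5108974, 0.15625),   # jeder
    (2.0, 63, 5.335104, 0.15625),     # nachrichten
]

coord = len(matched) / QUERY_TERMS  # 3 of 25 query terms matched
score = coord * sum(term_score(*m) for m in matched)
print(round(score, 3))  # 0.152, matching the 0.15202652 shown for result 3
```

The same function reproduces the other trees when their printed freq, docFreq, boost and fieldNorm values are substituted; a term with no boost line, such as "geordnet" in result 5, corresponds to a boost of 1.0.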