Document (#10430)

Author
Schwarz, C.
Title
THESYS: Thesaurus Syntax System : a fully automatic thesaurus building aid
Source
Wissensorganisation im Wandel: Dezimalklassifikation - Thesaurusfragen - Warenklassifikation. Proc. 11. Jahrestagung der Gesellschaft für Klassifikation, Aachen, 29.6.-1.7.1987. Ed.: H.-J. Hermes and J. Hölzl
Imprint
Frankfurt : Indeks
Year
1988
Pages
pp. 63-70
Series
Studien zur Klassifikation; Bd.18
Abstract
THESYS is based on the natural language processing of free-text databases. It yields statistically evaluated correlations between words of the database. These correlations correspond to traditional thesaurus relations. The person who has to build a thesaurus is thus assisted by the proposals made by THESYS. THESYS is being tested on commercial databases under real-world conditions. It is part of a text processing project at Siemens called TINA (Text-Inhalts-Analyse). Software from TINA is actually being applied and evaluated by the US Department of Commerce for patent search and indexing (REALIST: REtrieval Aids by Linguistics and STatistics).
Theme
Computational linguistics
Object
THESYS
TINA

Similar documents (author)

  1. Schwarz, C.: Natural language and information retrieval : Kommentierte Literaturliste zu Systemen, Verfahren und Tools (1986) 5.13
    5.1281 = sum of:
      5.1281 = weight(author_txt:schwarz in 407) [ClassicSimilarity], result of:
        5.1281 = fieldWeight in 407, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.20496 = idf(docFreq=32, maxDocs=44421)
          0.625 = fieldNorm(doc=407)
    
  2. Schwarz, C.: Linguistische Hilfsmittel beim Information Retrieval (1984) 5.13
    5.1281 = sum of:
      5.1281 = weight(author_txt:schwarz in 544) [ClassicSimilarity], result of:
        5.1281 = fieldWeight in 544, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.20496 = idf(docFreq=32, maxDocs=44421)
          0.625 = fieldNorm(doc=544)
    
  3. Schwarz, B.: Book House: ein OPAC für die Erschließung und Recherche Schöner Literatur (1991) 5.13
    5.1281 = sum of:
      5.1281 = weight(author_txt:schwarz in 1021) [ClassicSimilarity], result of:
        5.1281 = fieldWeight in 1021, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.20496 = idf(docFreq=32, maxDocs=44421)
          0.625 = fieldNorm(doc=1021)
    
  4. Schwarz, C.: Freitextrecherche: Grenzen und Möglichkeiten (1982) 5.13
    5.1281 = sum of:
      5.1281 = weight(author_txt:schwarz in 1348) [ClassicSimilarity], result of:
        5.1281 = fieldWeight in 1348, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.20496 = idf(docFreq=32, maxDocs=44421)
          0.625 = fieldNorm(doc=1348)
    
  5. Schwarz, R.: Buch und Bahn : Auskunftsdienst per CD-ROM (1995) 5.13
    5.1281 = sum of:
      5.1281 = weight(author_txt:schwarz in 4141) [ClassicSimilarity], result of:
        5.1281 = fieldWeight in 4141, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.20496 = idf(docFreq=32, maxDocs=44421)
          0.625 = fieldNorm(doc=4141)
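
The five author matches above all carry the same score because each explain tree multiplies the same three factors. The sketch below recomputes that value from the listed freq, docFreq, maxDocs and fieldNorm. It assumes the classic Lucene TF-IDF formulas, tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)); the function names are illustrative only and are not part of the catalog software.

```python
import math


def tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term count."""
    return math.sqrt(freq)


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency, assuming the classic TF-IDF form."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explain trees above."""
    return tf(freq) * idf(doc_freq, max_docs) * field_norm


# Values copied from the author_txt:schwarz explain output.
author_idf = idf(doc_freq=32, max_docs=44421)
score = field_weight(freq=1.0, doc_freq=32, max_docs=44421, field_norm=0.625)

print(f"idf   = {author_idf:.5f}")   # explain output lists 8.20496
print(f"score = {score:.4f}")        # explain output lists 5.1281
```

Since the query has a single author term, the fieldWeight is the whole score. Small differences in the last digits are to be expected, because the index stores these values in single precision.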
    

Similar documents (content)

  1. Ruge, G.; Schwarz, C.: Natural language access to free-text data bases (1989) 0.58
    0.57643133 = sum of:
      0.57643133 = product of:
        1.2008986 = sum of:
          0.06323684 = weight(abstract_txt:department in 3635) [ClassicSimilarity], result of:
            0.06323684 = score(doc=3635,freq=1.0), product of:
              0.13029017 = queryWeight, product of:
                6.2125297 = idf(docFreq=241, maxDocs=44421)
                0.02097216 = queryNorm
              0.4853539 = fieldWeight in 3635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2125297 = idf(docFreq=241, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.07484658 = weight(abstract_txt:syntax in 3635) [ClassicSimilarity], result of:
            0.07484658 = score(doc=3635,freq=1.0), product of:
              0.145785 = queryWeight, product of:
                1.0577928 = boost
                6.571569 = idf(docFreq=168, maxDocs=44421)
                0.02097216 = queryNorm
              0.51340383 = fieldWeight in 3635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.571569 = idf(docFreq=168, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.07566799 = weight(abstract_txt:actually in 3635) [ClassicSimilarity], result of:
            0.07566799 = score(doc=3635,freq=1.0), product of:
              0.14684969 = queryWeight, product of:
                1.0616484 = boost
                6.595522 = idf(docFreq=164, maxDocs=44421)
                0.02097216 = queryNorm
              0.5152751 = fieldWeight in 3635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.595522 = idf(docFreq=164, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.08780345 = weight(abstract_txt:patent in 3635) [ClassicSimilarity], result of:
            0.08780345 = score(doc=3635,freq=1.0), product of:
              0.16215833 = queryWeight, product of:
                1.1156136 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.02097216 = queryNorm
              0.5414674 = fieldWeight in 3635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.088785835 = weight(abstract_txt:commerce in 3635) [ClassicSimilarity], result of:
            0.088785835 = score(doc=3635,freq=1.0), product of:
              0.16336562 = queryWeight, product of:
                1.1197588 = boost
                6.9565353 = idf(docFreq=114, maxDocs=44421)
                0.02097216 = queryNorm
              0.5434793 = fieldWeight in 3635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9565353 = idf(docFreq=114, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.11387877 = weight(abstract_txt:yields in 3635) [ClassicSimilarity], result of:
            0.11387877 = score(doc=3635,freq=1.0), product of:
              0.19285315 = queryWeight, product of:
                1.2166272 = boost
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.02097216 = queryNorm
              0.59049475 = fieldWeight in 3635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.558333 = idf(docFreq=62, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.18394127 = weight(abstract_txt:inhalts in 3635) [ClassicSimilarity], result of:
            0.18394127 = score(doc=3635,freq=1.0), product of:
              0.26549172 = queryWeight, product of:
                1.4274788 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.02097216 = queryNorm
              0.6928324 = fieldWeight in 3635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.04833933 = weight(abstract_txt:being in 3635) [ClassicSimilarity], result of:
            0.04833933 = score(doc=3635,freq=1.0), product of:
              0.1372383 = queryWeight, product of:
                1.4514325 = boost
                4.5085335 = idf(docFreq=1329, maxDocs=44421)
                0.02097216 = queryNorm
              0.35222918 = fieldWeight in 3635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5085335 = idf(docFreq=1329, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.21238017 = weight(abstract_txt:siemens in 3635) [ClassicSimilarity], result of:
            0.21238017 = score(doc=3635,freq=1.0), product of:
              0.29219595 = queryWeight, product of:
                1.4975498 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.02097216 = queryNorm
              0.7268416 = fieldWeight in 3635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.06300906 = weight(abstract_txt:processing in 3635) [ClassicSimilarity], result of:
            0.06300906 = score(doc=3635,freq=1.0), product of:
              0.1637609 = queryWeight, product of:
                1.5854928 = boost
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.02097216 = queryNorm
              0.38476256 = fieldWeight in 3635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9249606 = idf(docFreq=876, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.098587155 = weight(abstract_txt:evaluated in 3635) [ClassicSimilarity], result of:
            0.098587155 = score(doc=3635,freq=1.0), product of:
              0.22070988 = queryWeight, product of:
                1.8406451 = boost
                5.717531 = idf(docFreq=396, maxDocs=44421)
                0.02097216 = queryNorm
              0.44668213 = fieldWeight in 3635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.717531 = idf(docFreq=396, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
          0.09042221 = weight(abstract_txt:text in 3635) [ClassicSimilarity], result of:
            0.09042221 = score(doc=3635,freq=3.0), product of:
              0.1653668 = queryWeight, product of:
                1.9513221 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02097216 = queryNorm
              0.5467979 = fieldWeight in 3635, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.078125 = fieldNorm(doc=3635)
        0.48 = coord(12/25)
    
  2. Ruge, G.; Schwarz, C.: Linguistically based term associations : a new semantic component for a hyperterm system (1990) 0.20
    0.19631611 = sum of:
      0.19631611 = product of:
        0.8179838 = sum of:
          0.10433762 = weight(abstract_txt:statistics in 5543) [ClassicSimilarity], result of:
            0.10433762 = score(doc=5543,freq=1.0), product of:
              0.13298792 = queryWeight, product of:
                1.0102998 = boost
                6.2765174 = idf(docFreq=226, maxDocs=44421)
                0.02097216 = queryNorm
              0.7845647 = fieldWeight in 5543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2765174 = idf(docFreq=226, maxDocs=44421)
                0.125 = fieldNorm(doc=5543)
          0.12382636 = weight(abstract_txt:linguistics in 5543) [ClassicSimilarity], result of:
            0.12382636 = score(doc=5543,freq=1.0), product of:
              0.14907116 = queryWeight, product of:
                1.0696483 = boost
                6.6452217 = idf(docFreq=156, maxDocs=44421)
                0.02097216 = queryNorm
              0.8306527 = fieldWeight in 5543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6452217 = idf(docFreq=156, maxDocs=44421)
                0.125 = fieldNorm(doc=5543)
          0.13333373 = weight(abstract_txt:aids in 5543) [ClassicSimilarity], result of:
            0.13333373 = score(doc=5543,freq=1.0), product of:
              0.15660715 = queryWeight, product of:
                1.0963519 = boost
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.02097216 = queryNorm
              0.8513898 = fieldWeight in 5543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.125 = fieldNorm(doc=5543)
          0.07257453 = weight(abstract_txt:databases in 5543) [ClassicSimilarity], result of:
            0.07257453 = score(doc=5543,freq=1.0), product of:
              0.13153794 = queryWeight, product of:
                1.4209694 = boost
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.02097216 = queryNorm
              0.5517384 = fieldWeight in 5543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.125 = fieldNorm(doc=5543)
          0.30038312 = weight(abstract_txt:realist in 5543) [ClassicSimilarity], result of:
            0.30038312 = score(doc=5543,freq=1.0), product of:
              0.269134 = queryWeight, product of:
                1.4372373 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.02097216 = queryNorm
              1.1161098 = fieldWeight in 5543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.125 = fieldNorm(doc=5543)
          0.08352847 = weight(abstract_txt:text in 5543) [ClassicSimilarity], result of:
            0.08352847 = score(doc=5543,freq=1.0), product of:
              0.1653668 = queryWeight, product of:
                1.9513221 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02097216 = queryNorm
              0.50511026 = fieldWeight in 5543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.125 = fieldNorm(doc=5543)
        0.24 = coord(6/25)
    
  3. Ruge, G.: Experiments on linguistically-based term associations (1992) 0.09
    0.08952068 = sum of:
      0.08952068 = product of:
        0.5595043 = sum of:
          0.07825322 = weight(abstract_txt:statistics in 1809) [ClassicSimilarity], result of:
            0.07825322 = score(doc=1809,freq=1.0), product of:
              0.13298792 = queryWeight, product of:
                1.0102998 = boost
                6.2765174 = idf(docFreq=226, maxDocs=44421)
                0.02097216 = queryNorm
              0.5884235 = fieldWeight in 1809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2765174 = idf(docFreq=226, maxDocs=44421)
                0.09375 = fieldNorm(doc=1809)
          0.1000003 = weight(abstract_txt:aids in 1809) [ClassicSimilarity], result of:
            0.1000003 = score(doc=1809,freq=1.0), product of:
              0.15660715 = queryWeight, product of:
                1.0963519 = boost
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.02097216 = queryNorm
              0.63854235 = fieldWeight in 1809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.09375 = fieldNorm(doc=1809)
          0.3186044 = weight(abstract_txt:realist in 1809) [ClassicSimilarity], result of:
            0.3186044 = score(doc=1809,freq=2.0), product of:
              0.269134 = queryWeight, product of:
                1.4372373 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.02097216 = queryNorm
              1.1838132 = fieldWeight in 1809, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.09375 = fieldNorm(doc=1809)
          0.06264635 = weight(abstract_txt:text in 1809) [ClassicSimilarity], result of:
            0.06264635 = score(doc=1809,freq=1.0), product of:
              0.1653668 = queryWeight, product of:
                1.9513221 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02097216 = queryNorm
              0.3788327 = fieldWeight in 1809, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=1809)
        0.16 = coord(4/25)
    
  4. Kousha, K.; Thelwall, M.: Patent citation analysis with Google (2017) 0.08
    0.08370053 = sum of:
      0.08370053 = product of:
        0.52312833 = sum of:
          0.2221271 = weight(abstract_txt:patent in 4317) [ClassicSimilarity], result of:
            0.2221271 = score(doc=4317,freq=10.0), product of:
              0.16215833 = queryWeight, product of:
                1.1156136 = boost
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.02097216 = queryNorm
              1.3698162 = fieldWeight in 4317, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                6.930783 = idf(docFreq=117, maxDocs=44421)
                0.0625 = fieldNorm(doc=4317)
          0.05131794 = weight(abstract_txt:databases in 4317) [ClassicSimilarity], result of:
            0.05131794 = score(doc=4317,freq=2.0), product of:
              0.13153794 = queryWeight, product of:
                1.4209694 = boost
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.02097216 = queryNorm
              0.39013794 = fieldWeight in 4317, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.413907 = idf(docFreq=1461, maxDocs=44421)
                0.0625 = fieldNorm(doc=4317)
          0.07886972 = weight(abstract_txt:evaluated in 4317) [ClassicSimilarity], result of:
            0.07886972 = score(doc=4317,freq=1.0), product of:
              0.22070988 = queryWeight, product of:
                1.8406451 = boost
                5.717531 = idf(docFreq=396, maxDocs=44421)
                0.02097216 = queryNorm
              0.3573457 = fieldWeight in 4317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.717531 = idf(docFreq=396, maxDocs=44421)
                0.0625 = fieldNorm(doc=4317)
          0.17081358 = weight(abstract_txt:correlations in 4317) [ClassicSimilarity], result of:
            0.17081358 = score(doc=4317,freq=1.0), product of:
              0.36945635 = queryWeight, product of:
                2.3814461 = boost
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.02097216 = queryNorm
              0.46233764 = fieldWeight in 4317, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3974023 = idf(docFreq=73, maxDocs=44421)
                0.0625 = fieldNorm(doc=4317)
        0.16 = coord(4/25)
    
  5. Larson, R.R.: Cheshire 2 : design and evaluation of a next-generation online catalog system (1995) 0.07
    0.07279759 = sum of:
      0.07279759 = product of:
        0.45498496 = sum of:
          0.10433762 = weight(abstract_txt:statistics in 3888) [ClassicSimilarity], result of:
            0.10433762 = score(doc=3888,freq=1.0), product of:
              0.13298792 = queryWeight, product of:
                1.0102998 = boost
                6.2765174 = idf(docFreq=226, maxDocs=44421)
                0.02097216 = queryNorm
              0.7845647 = fieldWeight in 3888, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2765174 = idf(docFreq=226, maxDocs=44421)
                0.125 = fieldNorm(doc=3888)
          0.10937942 = weight(abstract_txt:being in 3888) [ClassicSimilarity], result of:
            0.10937942 = score(doc=3888,freq=2.0), product of:
              0.1372383 = queryWeight, product of:
                1.4514325 = boost
                4.5085335 = idf(docFreq=1329, maxDocs=44421)
                0.02097216 = queryNorm
              0.7970036 = fieldWeight in 3888, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5085335 = idf(docFreq=1329, maxDocs=44421)
                0.125 = fieldNorm(doc=3888)
          0.15773945 = weight(abstract_txt:evaluated in 3888) [ClassicSimilarity], result of:
            0.15773945 = score(doc=3888,freq=1.0), product of:
              0.22070988 = queryWeight, product of:
                1.8406451 = boost
                5.717531 = idf(docFreq=396, maxDocs=44421)
                0.02097216 = queryNorm
              0.7146914 = fieldWeight in 3888, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.717531 = idf(docFreq=396, maxDocs=44421)
                0.125 = fieldNorm(doc=3888)
          0.08352847 = weight(abstract_txt:text in 3888) [ClassicSimilarity], result of:
            0.08352847 = score(doc=3888,freq=1.0), product of:
              0.1653668 = queryWeight, product of:
                1.9513221 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.02097216 = queryNorm
              0.50511026 = fieldWeight in 3888, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.125 = fieldNorm(doc=3888)
        0.16 = coord(4/25)
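
The content-based scores above combine a per-term product (queryWeight times fieldWeight) with a coordination factor for the share of query terms matched. The sketch below recomputes the score of the first hit (doc 3635) from the figures in its explain tree. It assumes the same classic TF-IDF formulas as the explain labels suggest, and it assumes a boost of 1.0 for "department", where no boost line is listed; all names in the code are illustrative.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.02097216
FIELD_NORM = 0.078125        # fieldNorm(doc=3635)
MATCHED, TOTAL = 12, 25      # coord(12/25)

# (term, boost, freq in doc 3635, docFreq), copied from the explain tree.
# The boost of 1.0 for "department" is an assumption (no boost line is shown).
TERMS = [
    ("department", 1.0,       1.0, 241),
    ("syntax",     1.0577928, 1.0, 168),
    ("actually",   1.0616484, 1.0, 164),
    ("patent",     1.1156136, 1.0, 117),
    ("commerce",   1.1197588, 1.0, 114),
    ("yields",     1.2166272, 1.0, 62),
    ("inhalts",    1.4274788, 1.0, 16),
    ("being",      1.4514325, 1.0, 1329),
    ("siemens",    1.4975498, 1.0, 10),
    ("processing", 1.5854928, 1.0, 876),
    ("evaluated",  1.8406451, 1.0, 396),
    ("text",       1.9513221, 3.0, 2122),
]


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Inverse document frequency, assuming the classic TF-IDF form."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, freq: float, doc_freq: int) -> float:
    """score = queryWeight * fieldWeight for a single matching term."""
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    field_weight = math.sqrt(freq) * idf(doc_freq) * FIELD_NORM
    return query_weight * field_weight


per_term = {term: term_score(boost, freq, df) for term, boost, freq, df in TERMS}
raw_sum = sum(per_term.values())
coord = MATCHED / TOTAL
final_score = raw_sum * coord

print(f"sum of term scores = {raw_sum:.7f}")      # explain output lists 1.2008986
print(f"coord              = {coord:.2f}")        # explain output lists 0.48
print(f"final score        = {final_score:.8f}")  # explain output lists 0.57643133
```

The same calculation explains why the later hits rank lower: they match only 4 to 6 of the 25 query terms, so the coord factor shrinks their summed term scores to between 0.16 and 0.24 of the raw value.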