Document (#1810)

Author
Ruge, G.
Title
Experiments on linguistically-based term associations
Source
Information processing and management. 28(1992) no.3, S.317-332
Year
1992
Abstract
Describes the hyperterm system REALIST (REtrieval Aids by LInguistic and STatistics) and describes its semantic component. The semantic component of REALIST generates semantic term relations such synonyms. It takes as input a free text data base and generates as output term pairs that are semantically related with respect to their meanings in the data base. In the 1st step an automatic syntactic analysis provides linguistical knowledge about the terms of the data base. In the 2nd step this knowledge is compared by statistical similarity computation. Various experiments with different similarity measures are described
Theme
Computerlinguistik
Object
REALIST

Similar documents (author)

  1. Ruge, G.: ¬A spreading activation network for automatic generation of thesaurus relationships (1991) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:ruge in 4505) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 4505, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=4505)
    
  2. Ruge, G.: Sprache und Computer : Wortbedeutung und Termassoziation. Methoden zur automatischen semantischen Klassifikation (1995) 5.94
    5.9401517 = sum of:
      5.9401517 = weight(author_txt:ruge in 2534) [ClassicSimilarity], result of:
        5.9401517 = fieldWeight in 2534, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.625 = fieldNorm(doc=2534)
    
  3. Ruge, G.; Schwarz, C.: Term association and computational linguistics (1991) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:ruge in 2309) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 2309, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=2309)
    
  4. Ruge, G.; Schwarz, C.: Linguistically based term associations : a new semantic component for a hyperterm system (1990) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:ruge in 5543) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 5543, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=5543)
    
  5. Ruge, G.; Schwarz, C.: Natural language access to free-text data bases (1989) 4.75
    4.7521214 = sum of:
      4.7521214 = weight(author_txt:ruge in 3635) [ClassicSimilarity], result of:
        4.7521214 = fieldWeight in 3635, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.504243 = idf(docFreq=8, maxDocs=44421)
          0.5 = fieldNorm(doc=3635)
    

Similar documents (content)

  1. Giacomini, L.: Ontologies and knowledge representation in terminology : present and future perspectives (2024) 0.15
    0.14561509 = sum of:
      0.14561509 = product of:
        0.6067295 = sum of:
          0.041098796 = weight(abstract_txt:respect in 2286) [ClassicSimilarity], result of:
            0.041098796 = score(doc=2286,freq=1.0), product of:
              0.10951695 = queryWeight, product of:
                6.004374 = idf(docFreq=297, maxDocs=44421)
                0.018239528 = queryNorm
              0.37527338 = fieldWeight in 2286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.004374 = idf(docFreq=297, maxDocs=44421)
                0.0625 = fieldNorm(doc=2286)
          0.016936217 = weight(abstract_txt:knowledge in 2286) [ClassicSimilarity], result of:
            0.016936217 = score(doc=2286,freq=1.0), product of:
              0.076409884 = queryWeight, product of:
                1.1812698 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.018239528 = queryNorm
              0.22164954 = fieldWeight in 2286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.0625 = fieldNorm(doc=2286)
          0.021036237 = weight(abstract_txt:data in 2286) [ClassicSimilarity], result of:
            0.021036237 = score(doc=2286,freq=1.0), product of:
              0.10106817 = queryWeight, product of:
                1.6638998 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.018239528 = queryNorm
              0.20813909 = fieldWeight in 2286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=2286)
          0.082613826 = weight(abstract_txt:component in 2286) [ClassicSimilarity], result of:
            0.082613826 = score(doc=2286,freq=1.0), product of:
              0.21977271 = queryWeight, product of:
                2.0033703 = boost
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.018239528 = queryNorm
              0.37590575 = fieldWeight in 2286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.014492 = idf(docFreq=294, maxDocs=44421)
                0.0625 = fieldNorm(doc=2286)
          0.06278198 = weight(abstract_txt:term in 2286) [ClassicSimilarity], result of:
            0.06278198 = score(doc=2286,freq=1.0), product of:
              0.20950402 = queryWeight, product of:
                2.39561 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.018239528 = queryNorm
              0.29966956 = fieldWeight in 2286, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0625 = fieldNorm(doc=2286)
          0.38226244 = weight(abstract_txt:realist in 2286) [ClassicSimilarity], result of:
            0.38226244 = score(doc=2286,freq=2.0), product of:
              0.4843616 = queryWeight, product of:
                2.9741247 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.018239528 = queryNorm
              0.7892088 = fieldWeight in 2286, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.0625 = fieldNorm(doc=2286)
        0.24 = coord(6/25)
    
  2. Ru, C.; Tang, J.; Li, S.; Xie, S.; Wang, T.: Using semantic similarity to reduce wrong labels in distant supervision for relation extraction (2018) 0.14
    0.14180204 = sum of:
      0.14180204 = product of:
        0.5064359 = sum of:
          0.07502615 = weight(abstract_txt:input in 55) [ClassicSimilarity], result of:
            0.07502615 = score(doc=55,freq=3.0), product of:
              0.11342182 = queryWeight, product of:
                1.0176716 = boost
                6.110481 = idf(docFreq=267, maxDocs=44421)
                0.018239528 = queryNorm
              0.66147894 = fieldWeight in 55, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.110481 = idf(docFreq=267, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.016936217 = weight(abstract_txt:knowledge in 55) [ClassicSimilarity], result of:
            0.016936217 = score(doc=55,freq=1.0), product of:
              0.076409884 = queryWeight, product of:
                1.1812698 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.018239528 = queryNorm
              0.22164954 = fieldWeight in 55, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.042072475 = weight(abstract_txt:data in 55) [ClassicSimilarity], result of:
            0.042072475 = score(doc=55,freq=4.0), product of:
              0.10106817 = queryWeight, product of:
                1.6638998 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.018239528 = queryNorm
              0.41627818 = fieldWeight in 55, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.10576075 = weight(abstract_txt:similarity in 55) [ClassicSimilarity], result of:
            0.10576075 = score(doc=55,freq=2.0), product of:
              0.20565769 = queryWeight, product of:
                1.937969 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.018239528 = queryNorm
              0.51425624 = fieldWeight in 55, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.102051616 = weight(abstract_txt:semantic in 55) [ClassicSimilarity], result of:
            0.102051616 = score(doc=55,freq=4.0), product of:
              0.18245775 = queryWeight, product of:
                2.2356362 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.018239528 = queryNorm
              0.55931646 = fieldWeight in 55, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.06278198 = weight(abstract_txt:term in 55) [ClassicSimilarity], result of:
            0.06278198 = score(doc=55,freq=1.0), product of:
              0.20950402 = queryWeight, product of:
                2.39561 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.018239528 = queryNorm
              0.29966956 = fieldWeight in 55, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
          0.10180664 = weight(abstract_txt:base in 55) [ClassicSimilarity], result of:
            0.10180664 = score(doc=55,freq=1.0), product of:
              0.2891699 = queryWeight, product of:
                2.8144693 = boost
                5.633042 = idf(docFreq=431, maxDocs=44421)
                0.018239528 = queryNorm
              0.35206512 = fieldWeight in 55, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.633042 = idf(docFreq=431, maxDocs=44421)
                0.0625 = fieldNorm(doc=55)
        0.28 = coord(7/25)
    
  3. Tudhope, D.; Taylor, C.: Navigation via similarity (1997) 0.14
    0.14080642 = sum of:
      0.14080642 = product of:
        0.5866934 = sum of:
          0.052618556 = weight(abstract_txt:takes in 1155) [ClassicSimilarity], result of:
            0.052618556 = score(doc=1155,freq=1.0), product of:
              0.11127934 = queryWeight, product of:
                1.0080141 = boost
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.018239528 = queryNorm
              0.47285107 = fieldWeight in 1155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0524936 = idf(docFreq=283, maxDocs=44421)
                0.078125 = fieldNorm(doc=1155)
          0.07732604 = weight(abstract_txt:semantically in 1155) [ClassicSimilarity], result of:
            0.07732604 = score(doc=1155,freq=1.0), product of:
              0.1438376 = queryWeight, product of:
                1.1460289 = boost
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.018239528 = queryNorm
              0.53759265 = fieldWeight in 1155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.881186 = idf(docFreq=123, maxDocs=44421)
                0.078125 = fieldNorm(doc=1155)
          0.026621798 = weight(abstract_txt:describes in 1155) [ClassicSimilarity], result of:
            0.026621798 = score(doc=1155,freq=1.0), product of:
              0.089020535 = queryWeight, product of:
                1.2750272 = boost
                3.82787 = idf(docFreq=2626, maxDocs=44421)
                0.018239528 = queryNorm
              0.29905233 = fieldWeight in 1155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.82787 = idf(docFreq=2626, maxDocs=44421)
                0.078125 = fieldNorm(doc=1155)
          0.20902805 = weight(abstract_txt:similarity in 1155) [ClassicSimilarity], result of:
            0.20902805 = score(doc=1155,freq=5.0), product of:
              0.20565769 = queryWeight, product of:
                1.937969 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.018239528 = queryNorm
              1.0163882 = fieldWeight in 1155, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.078125 = fieldNorm(doc=1155)
          0.14262147 = weight(abstract_txt:semantic in 1155) [ClassicSimilarity], result of:
            0.14262147 = score(doc=1155,freq=5.0), product of:
              0.18245775 = queryWeight, product of:
                2.2356362 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.018239528 = queryNorm
              0.7816685 = fieldWeight in 1155, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.078125 = fieldNorm(doc=1155)
          0.07847747 = weight(abstract_txt:term in 1155) [ClassicSimilarity], result of:
            0.07847747 = score(doc=1155,freq=1.0), product of:
              0.20950402 = queryWeight, product of:
                2.39561 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.018239528 = queryNorm
              0.37458694 = fieldWeight in 1155, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.078125 = fieldNorm(doc=1155)
        0.24 = coord(6/25)
    
  4. Ruge, G.; Schwarz, C.: Linguistically based term associations : a new semantic component for a hyperterm system (1990) 0.14
    0.1408054 = sum of:
      0.1408054 = product of:
        0.88003373 = sum of:
          0.09388843 = weight(abstract_txt:statistics in 5543) [ClassicSimilarity], result of:
            0.09388843 = score(doc=5543,freq=1.0), product of:
              0.11966946 = queryWeight, product of:
                1.0453242 = boost
                6.2765174 = idf(docFreq=226, maxDocs=44421)
                0.018239528 = queryNorm
              0.7845647 = fieldWeight in 5543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2765174 = idf(docFreq=226, maxDocs=44421)
                0.125 = fieldNorm(doc=5543)
          0.11998064 = weight(abstract_txt:aids in 5543) [ClassicSimilarity], result of:
            0.11998064 = score(doc=5543,freq=1.0), product of:
              0.14092328 = queryWeight, product of:
                1.1343595 = boost
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.018239528 = queryNorm
              0.8513898 = fieldWeight in 5543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.125 = fieldNorm(doc=5543)
          0.12556396 = weight(abstract_txt:term in 5543) [ClassicSimilarity], result of:
            0.12556396 = score(doc=5543,freq=1.0), product of:
              0.20950402 = queryWeight, product of:
                2.39561 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.018239528 = queryNorm
              0.5993391 = fieldWeight in 5543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.125 = fieldNorm(doc=5543)
          0.5406007 = weight(abstract_txt:realist in 5543) [ClassicSimilarity], result of:
            0.5406007 = score(doc=5543,freq=1.0), product of:
              0.4843616 = queryWeight, product of:
                2.9741247 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.018239528 = queryNorm
              1.1161098 = fieldWeight in 5543, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.125 = fieldNorm(doc=5543)
        0.16 = coord(4/25)
    
  5. Kantardzic, M.: Data mining : concepts, models, methods, and algorithms (2003) 0.14
    0.13703087 = sum of:
      0.13703087 = product of:
        0.42822146 = sum of:
          0.046944216 = weight(abstract_txt:statistics in 3291) [ClassicSimilarity], result of:
            0.046944216 = score(doc=3291,freq=1.0), product of:
              0.11966946 = queryWeight, product of:
                1.0453242 = boost
                6.2765174 = idf(docFreq=226, maxDocs=44421)
                0.018239528 = queryNorm
              0.39228234 = fieldWeight in 3291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.2765174 = idf(docFreq=226, maxDocs=44421)
                0.0625 = fieldNorm(doc=3291)
          0.05999032 = weight(abstract_txt:aids in 3291) [ClassicSimilarity], result of:
            0.05999032 = score(doc=3291,freq=1.0), product of:
              0.14092328 = queryWeight, product of:
                1.1343595 = boost
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.018239528 = queryNorm
              0.4256949 = fieldWeight in 3291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8111186 = idf(docFreq=132, maxDocs=44421)
                0.0625 = fieldNorm(doc=3291)
          0.016936217 = weight(abstract_txt:knowledge in 3291) [ClassicSimilarity], result of:
            0.016936217 = score(doc=3291,freq=1.0), product of:
              0.076409884 = queryWeight, product of:
                1.1812698 = boost
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.018239528 = queryNorm
              0.22164954 = fieldWeight in 3291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5463927 = idf(docFreq=3480, maxDocs=44421)
                0.0625 = fieldNorm(doc=3291)
          0.02129744 = weight(abstract_txt:describes in 3291) [ClassicSimilarity], result of:
            0.02129744 = score(doc=3291,freq=1.0), product of:
              0.089020535 = queryWeight, product of:
                1.2750272 = boost
                3.82787 = idf(docFreq=2626, maxDocs=44421)
                0.018239528 = queryNorm
              0.23924187 = fieldWeight in 3291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.82787 = idf(docFreq=2626, maxDocs=44421)
                0.0625 = fieldNorm(doc=3291)
          0.087098084 = weight(abstract_txt:computation in 3291) [ClassicSimilarity], result of:
            0.087098084 = score(doc=3291,freq=1.0), product of:
              0.1806901 = queryWeight, product of:
                1.2844775 = boost
                7.7124834 = idf(docFreq=53, maxDocs=44421)
                0.018239528 = queryNorm
              0.4820302 = fieldWeight in 3291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7124834 = idf(docFreq=53, maxDocs=44421)
                0.0625 = fieldNorm(doc=3291)
          0.07584723 = weight(abstract_txt:data in 3291) [ClassicSimilarity], result of:
            0.07584723 = score(doc=3291,freq=13.0), product of:
              0.10106817 = queryWeight, product of:
                1.6638998 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.018239528 = queryNorm
              0.75045615 = fieldWeight in 3291, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=3291)
          0.057325955 = weight(abstract_txt:experiments in 3291) [ClassicSimilarity], result of:
            0.057325955 = score(doc=3291,freq=1.0), product of:
              0.17225538 = queryWeight, product of:
                1.7736206 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.018239528 = queryNorm
              0.3327963 = fieldWeight in 3291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.0625 = fieldNorm(doc=3291)
          0.06278198 = weight(abstract_txt:term in 3291) [ClassicSimilarity], result of:
            0.06278198 = score(doc=3291,freq=1.0), product of:
              0.20950402 = queryWeight, product of:
                2.39561 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.018239528 = queryNorm
              0.29966956 = fieldWeight in 3291, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.0625 = fieldNorm(doc=3291)
        0.32 = coord(8/25)