Document (#40382)

Author
Li, N.
Sun, J.
Title
Improving Chinese term association from the linguistic perspective
Source
Knowledge organization. 44(2017) no.1, S.13-23
Year
2017
Abstract
The study aims to solve how to construct the semantic relations of specific domain terms by applying linguistic rules. The semantic structure analysis at the morpheme level was used for semantic measure, and a morpheme-based term association model was proposed by improving and combining the literal-based similarity algorithm and co-occurrence relatedness methods. This study provides a novel insight into the method of semantic analysis and calculation by morpheme parsing, and the proposed solution is feasible for the automatic association of compound terms. The results show that this approach could be used to construct appropriate term association and form a reasonable structural knowledge graph. However, due to linguistic differences, the viability and effectiveness of the use of our method in non-Chinese linguistic environments should be verified.
Theme
Semantisches Umfeld in Indexierung u. Retrieval
Computerlinguistik

Similar documents (content)

  1. Galvez, C.; Moya-Anegón, F. de; Solana, V.H.: Term conflation methods in information retrieval : non-linguistic and linguistic approaches (2005) 0.19
    0.19337204 = sum of:
      0.19337204 = product of:
        0.8057169 = sum of:
          0.017777158 = weight(abstract_txt:used in 5394) [ClassicSimilarity], result of:
            0.017777158 = score(doc=5394,freq=1.0), product of:
              0.06777898 = queryWeight, product of:
                1.0307516 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.019586815 = queryNorm
              0.26228127 = fieldWeight in 5394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.078125 = fieldNorm(doc=5394)
          0.031064996 = weight(abstract_txt:terms in 5394) [ClassicSimilarity], result of:
            0.031064996 = score(doc=5394,freq=1.0), product of:
              0.098333396 = queryWeight, product of:
                1.2415293 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.019586815 = queryNorm
              0.31591502 = fieldWeight in 5394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.078125 = fieldNorm(doc=5394)
          0.0427782 = weight(abstract_txt:method in 5394) [ClassicSimilarity], result of:
            0.0427782 = score(doc=5394,freq=1.0), product of:
              0.121712506 = queryWeight, product of:
                1.3812557 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.019586815 = queryNorm
              0.35146925 = fieldWeight in 5394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.078125 = fieldNorm(doc=5394)
          0.13454588 = weight(abstract_txt:term in 5394) [ClassicSimilarity], result of:
            0.13454588 = score(doc=5394,freq=3.0), product of:
              0.20737533 = queryWeight, product of:
                2.2081606 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.019586815 = queryNorm
              0.64880365 = fieldWeight in 5394, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.078125 = fieldNorm(doc=5394)
          0.16692649 = weight(abstract_txt:association in 5394) [ClassicSimilarity], result of:
            0.16692649 = score(doc=5394,freq=1.0), product of:
              0.38008466 = queryWeight, product of:
                3.4519274 = boost
                5.6215343 = idf(docFreq=436, maxDocs=44421)
                0.019586815 = queryNorm
              0.43918237 = fieldWeight in 5394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6215343 = idf(docFreq=436, maxDocs=44421)
                0.078125 = fieldNorm(doc=5394)
          0.41262415 = weight(abstract_txt:linguistic in 5394) [ClassicSimilarity], result of:
            0.41262415 = score(doc=5394,freq=5.0), product of:
              0.40635902 = queryWeight, product of:
                3.5692456 = boost
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.019586815 = queryNorm
              1.0154177 = fieldWeight in 5394, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.078125 = fieldNorm(doc=5394)
        0.24 = coord(6/25)
    
  2. Guo, L.; Wan, X.: Exploiting syntactic and semantic relationships between terms for opinion retrieval (2012) 0.17
    0.17369576 = sum of:
      0.17369576 = product of:
        0.54279923 = sum of:
          0.025140697 = weight(abstract_txt:used in 1492) [ClassicSimilarity], result of:
            0.025140697 = score(doc=1492,freq=2.0), product of:
              0.06777898 = queryWeight, product of:
                1.0307516 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.019586815 = queryNorm
              0.37092173 = fieldWeight in 1492, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.078125 = fieldNorm(doc=1492)
          0.018714504 = weight(abstract_txt:study in 1492) [ClassicSimilarity], result of:
            0.018714504 = score(doc=1492,freq=1.0), product of:
              0.07014107 = queryWeight, product of:
                1.0485585 = boost
                3.415198 = idf(docFreq=3968, maxDocs=44421)
                0.019586815 = queryNorm
              0.26681235 = fieldWeight in 1492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.415198 = idf(docFreq=3968, maxDocs=44421)
                0.078125 = fieldNorm(doc=1492)
          0.05380615 = weight(abstract_txt:terms in 1492) [ClassicSimilarity], result of:
            0.05380615 = score(doc=1492,freq=3.0), product of:
              0.098333396 = queryWeight, product of:
                1.2415293 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.019586815 = queryNorm
              0.54718083 = fieldWeight in 1492, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.078125 = fieldNorm(doc=1492)
          0.12834558 = weight(abstract_txt:relatedness in 1492) [ClassicSimilarity], result of:
            0.12834558 = score(doc=1492,freq=1.0), product of:
              0.20095438 = queryWeight, product of:
                1.2549899 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.019586815 = queryNorm
              0.6386802 = fieldWeight in 1492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.078125 = fieldNorm(doc=1492)
          0.07409403 = weight(abstract_txt:method in 1492) [ClassicSimilarity], result of:
            0.07409403 = score(doc=1492,freq=3.0), product of:
              0.121712506 = queryWeight, product of:
                1.3812557 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.019586815 = queryNorm
              0.6087626 = fieldWeight in 1492, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.078125 = fieldNorm(doc=1492)
          0.045971207 = weight(abstract_txt:proposed in 1492) [ClassicSimilarity], result of:
            0.045971207 = score(doc=1492,freq=1.0), product of:
              0.12769605 = queryWeight, product of:
                1.4148005 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.019586815 = queryNorm
              0.36000493 = fieldWeight in 1492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.078125 = fieldNorm(doc=1492)
          0.07768009 = weight(abstract_txt:term in 1492) [ClassicSimilarity], result of:
            0.07768009 = score(doc=1492,freq=1.0), product of:
              0.20737533 = queryWeight, product of:
                2.2081606 = boost
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.019586815 = queryNorm
              0.37458694 = fieldWeight in 1492, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.794713 = idf(docFreq=998, maxDocs=44421)
                0.078125 = fieldNorm(doc=1492)
          0.11904697 = weight(abstract_txt:semantic in 1492) [ClassicSimilarity], result of:
            0.11904697 = score(doc=1492,freq=2.0), product of:
              0.24080513 = queryWeight, product of:
                2.7476053 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.019586815 = queryNorm
              0.49437058 = fieldWeight in 1492, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.078125 = fieldNorm(doc=1492)
        0.32 = coord(8/25)
    
  3. Lee, C.-H.; Khoo, C.; Na, J.-C.: Automatic identification of treatment relations for medical ontology learning : an exploratory study (2004) 0.17
    0.16991031 = sum of:
      0.16991031 = product of:
        0.70795965 = sum of:
          0.020112557 = weight(abstract_txt:used in 3661) [ClassicSimilarity], result of:
            0.020112557 = score(doc=3661,freq=2.0), product of:
              0.06777898 = queryWeight, product of:
                1.0307516 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.019586815 = queryNorm
              0.29673737 = fieldWeight in 3661, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.0625 = fieldNorm(doc=3661)
          0.029943205 = weight(abstract_txt:study in 3661) [ClassicSimilarity], result of:
            0.029943205 = score(doc=3661,freq=4.0), product of:
              0.07014107 = queryWeight, product of:
                1.0485585 = boost
                3.415198 = idf(docFreq=3968, maxDocs=44421)
                0.019586815 = queryNorm
              0.42689976 = fieldWeight in 3661, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.415198 = idf(docFreq=3968, maxDocs=44421)
                0.0625 = fieldNorm(doc=3661)
          0.048398014 = weight(abstract_txt:method in 3661) [ClassicSimilarity], result of:
            0.048398014 = score(doc=3661,freq=2.0), product of:
              0.121712506 = queryWeight, product of:
                1.3812557 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.019586815 = queryNorm
              0.39764208 = fieldWeight in 3661, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.0625 = fieldNorm(doc=3661)
          0.16495633 = weight(abstract_txt:semantic in 3661) [ClassicSimilarity], result of:
            0.16495633 = score(doc=3661,freq=6.0), product of:
              0.24080513 = queryWeight, product of:
                2.7476053 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.019586815 = queryNorm
              0.68501997 = fieldWeight in 3661, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=3661)
          0.18885575 = weight(abstract_txt:association in 3661) [ClassicSimilarity], result of:
            0.18885575 = score(doc=3661,freq=2.0), product of:
              0.38008466 = queryWeight, product of:
                3.4519274 = boost
                5.6215343 = idf(docFreq=436, maxDocs=44421)
                0.019586815 = queryNorm
              0.49687812 = fieldWeight in 3661, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6215343 = idf(docFreq=436, maxDocs=44421)
                0.0625 = fieldNorm(doc=3661)
          0.2556938 = weight(abstract_txt:linguistic in 3661) [ClassicSimilarity], result of:
            0.2556938 = score(doc=3661,freq=3.0), product of:
              0.40635902 = queryWeight, product of:
                3.5692456 = boost
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.019586815 = queryNorm
              0.6292313 = fieldWeight in 3661, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.0625 = fieldNorm(doc=3661)
        0.24 = coord(6/25)
    
  4. Atlam, E.-S.; Morita, K.; Fuketa, M.; Aoe, J.-i.: ¬A new method for selecting English field association terms of compound words and its knowledge representation (2002) 0.16
    0.15960222 = sum of:
      0.15960222 = product of:
        0.66500926 = sum of:
          0.13997133 = weight(abstract_txt:compound in 3590) [ClassicSimilarity], result of:
            0.13997133 = score(doc=3590,freq=2.0), product of:
              0.16898943 = queryWeight, product of:
                1.1508567 = boost
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.019586815 = queryNorm
              0.8282845 = fieldWeight in 3590, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.496775 = idf(docFreq=66, maxDocs=44421)
                0.078125 = fieldNorm(doc=3590)
          0.062129993 = weight(abstract_txt:terms in 3590) [ClassicSimilarity], result of:
            0.062129993 = score(doc=3590,freq=4.0), product of:
              0.098333396 = queryWeight, product of:
                1.2415293 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.019586815 = queryNorm
              0.63183004 = fieldWeight in 3590, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.078125 = fieldNorm(doc=3590)
          0.0427782 = weight(abstract_txt:method in 3590) [ClassicSimilarity], result of:
            0.0427782 = score(doc=3590,freq=1.0), product of:
              0.121712506 = queryWeight, product of:
                1.3812557 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.019586815 = queryNorm
              0.35146925 = fieldWeight in 3590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.078125 = fieldNorm(doc=3590)
          0.06501311 = weight(abstract_txt:proposed in 3590) [ClassicSimilarity], result of:
            0.06501311 = score(doc=3590,freq=2.0), product of:
              0.12769605 = queryWeight, product of:
                1.4148005 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.019586815 = queryNorm
              0.50912386 = fieldWeight in 3590, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.078125 = fieldNorm(doc=3590)
          0.11904697 = weight(abstract_txt:semantic in 3590) [ClassicSimilarity], result of:
            0.11904697 = score(doc=3590,freq=2.0), product of:
              0.24080513 = queryWeight, product of:
                2.7476053 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.019586815 = queryNorm
              0.49437058 = fieldWeight in 3590, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.078125 = fieldNorm(doc=3590)
          0.23606968 = weight(abstract_txt:association in 3590) [ClassicSimilarity], result of:
            0.23606968 = score(doc=3590,freq=2.0), product of:
              0.38008466 = queryWeight, product of:
                3.4519274 = boost
                5.6215343 = idf(docFreq=436, maxDocs=44421)
                0.019586815 = queryNorm
              0.6210976 = fieldWeight in 3590, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6215343 = idf(docFreq=436, maxDocs=44421)
                0.078125 = fieldNorm(doc=3590)
        0.24 = coord(6/25)
    
  5. Fu, T.; Abbasi, A.; Chen, H.: ¬A hybrid approach to Web forum interactional coherence analysis (2008) 0.16
    0.15663084 = sum of:
      0.15663084 = product of:
        0.48947138 = sum of:
          0.017777158 = weight(abstract_txt:used in 2872) [ClassicSimilarity], result of:
            0.017777158 = score(doc=2872,freq=1.0), product of:
              0.06777898 = queryWeight, product of:
                1.0307516 = boost
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.019586815 = queryNorm
              0.26228127 = fieldWeight in 2872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3572001 = idf(docFreq=4205, maxDocs=44421)
                0.078125 = fieldNorm(doc=2872)
          0.018714504 = weight(abstract_txt:study in 2872) [ClassicSimilarity], result of:
            0.018714504 = score(doc=2872,freq=1.0), product of:
              0.07014107 = queryWeight, product of:
                1.0485585 = boost
                3.415198 = idf(docFreq=3968, maxDocs=44421)
                0.019586815 = queryNorm
              0.26681235 = fieldWeight in 2872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.415198 = idf(docFreq=3968, maxDocs=44421)
                0.078125 = fieldNorm(doc=2872)
          0.039429855 = weight(abstract_txt:analysis in 2872) [ClassicSimilarity], result of:
            0.039429855 = score(doc=2872,freq=3.0), product of:
              0.079927556 = queryWeight, product of:
                1.1193212 = boost
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.019586815 = queryNorm
              0.4933199 = fieldWeight in 2872, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.6456752 = idf(docFreq=3151, maxDocs=44421)
                0.078125 = fieldNorm(doc=2872)
          0.031064996 = weight(abstract_txt:terms in 2872) [ClassicSimilarity], result of:
            0.031064996 = score(doc=2872,freq=1.0), product of:
              0.098333396 = queryWeight, product of:
                1.2415293 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.019586815 = queryNorm
              0.31591502 = fieldWeight in 2872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.078125 = fieldNorm(doc=2872)
          0.0427782 = weight(abstract_txt:method in 2872) [ClassicSimilarity], result of:
            0.0427782 = score(doc=2872,freq=1.0), product of:
              0.121712506 = queryWeight, product of:
                1.3812557 = boost
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.019586815 = queryNorm
              0.35146925 = fieldWeight in 2872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4988065 = idf(docFreq=1342, maxDocs=44421)
                0.078125 = fieldNorm(doc=2872)
          0.045971207 = weight(abstract_txt:proposed in 2872) [ClassicSimilarity], result of:
            0.045971207 = score(doc=2872,freq=1.0), product of:
              0.12769605 = queryWeight, product of:
                1.4148005 = boost
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.019586815 = queryNorm
              0.36000493 = fieldWeight in 2872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.608063 = idf(docFreq=1203, maxDocs=44421)
                0.078125 = fieldNorm(doc=2872)
          0.10920433 = weight(abstract_txt:construct in 2872) [ClassicSimilarity], result of:
            0.10920433 = score(doc=2872,freq=1.0), product of:
              0.2273422 = queryWeight, product of:
                1.8877589 = boost
                6.148508 = idf(docFreq=257, maxDocs=44421)
                0.019586815 = queryNorm
              0.4803522 = fieldWeight in 2872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.148508 = idf(docFreq=257, maxDocs=44421)
                0.078125 = fieldNorm(doc=2872)
          0.18453111 = weight(abstract_txt:linguistic in 2872) [ClassicSimilarity], result of:
            0.18453111 = score(doc=2872,freq=1.0), product of:
              0.40635902 = queryWeight, product of:
                3.5692456 = boost
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.019586815 = queryNorm
              0.45410857 = fieldWeight in 2872, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8125896 = idf(docFreq=360, maxDocs=44421)
                0.078125 = fieldNorm(doc=2872)
        0.32 = coord(8/25)