Document (#3909)

Author
Kim, Y.W.
Kim, J.H.
Title
¬A model of knowledge based information retrieval with hierarchical concept graph
Source
Journal of documentation. 46(1990) no.2, S.113-136
Year
1990
Abstract
This paper discusses a knowledge based information retrieval model with hierarchical thesaurus. The model computes the conceptual distance between a query and an object and both are indexed with weighted terms from a hierarchical thesaurus. The hierarchical thesaurus is represented by a hierarchical-concept graph (HCG) in which nodes represent concepts and directed edges represent generalised relationships. Rada et al. have developed a similar model. However, their model considered only a binary indexing schemes and revealed some counter-intuitive results. Our proposed model extends theirs by allowing the index term and the edge of the HCG to be weighted. A new concept mapping method is devised to overcome Rada's counter-intuitive results. In addition, a scheme for allowing Boolean operators in user queries is provided with a formula for computing conceptual destance from negated index terms. Experimental results have shown that our model simulates human performance more closely than Rada's model

Similar documents (content)

  1. Tang, X.; Chen, L.; Cui, J.; Wei, B.: Knowledge representation learning with entity descriptions, hierarchical types, and textual relations (2019) 0.19
    0.19491187 = sum of:
      0.19491187 = product of:
        0.8121328 = sum of:
          0.020713205 = weight(abstract_txt:with in 101) [ClassicSimilarity], result of:
            0.020713205 = score(doc=101,freq=3.0), product of:
              0.061323613 = queryWeight, product of:
                1.3777277 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.01783181 = queryNorm
              0.33776882 = fieldWeight in 101, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.078125 = fieldNorm(doc=101)
          0.024275703 = weight(abstract_txt:results in 101) [ClassicSimilarity], result of:
            0.024275703 = score(doc=101,freq=1.0), product of:
              0.089324705 = queryWeight, product of:
                1.4400115 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.01783181 = queryNorm
              0.2717692 = fieldWeight in 101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.078125 = fieldNorm(doc=101)
          0.10823407 = weight(abstract_txt:graph in 101) [ClassicSimilarity], result of:
            0.10823407 = score(doc=101,freq=1.0), product of:
              0.21138263 = queryWeight, product of:
                1.808711 = boost
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.01783181 = queryNorm
              0.5120292 = fieldWeight in 101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.078125 = fieldNorm(doc=101)
          0.12894557 = weight(abstract_txt:weighted in 101) [ClassicSimilarity], result of:
            0.12894557 = score(doc=101,freq=1.0), product of:
              0.23755504 = queryWeight, product of:
                1.9174174 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.01783181 = queryNorm
              0.54280293 = fieldWeight in 101, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.078125 = fieldNorm(doc=101)
          0.36168507 = weight(abstract_txt:hierarchical in 101) [ClassicSimilarity], result of:
            0.36168507 = score(doc=101,freq=4.0), product of:
              0.40396184 = queryWeight, product of:
                3.9534361 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.01783181 = queryNorm
              0.8953446 = fieldWeight in 101, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.078125 = fieldNorm(doc=101)
          0.16827913 = weight(abstract_txt:model in 101) [ClassicSimilarity], result of:
            0.16827913 = score(doc=101,freq=3.0), product of:
              0.31224322 = queryWeight, product of:
                4.39654 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.01783181 = queryNorm
              0.538936 = fieldWeight in 101, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.078125 = fieldNorm(doc=101)
        0.24 = coord(6/25)
    
  2. Yang, C.C.; Liu, N.: Web site topic-hierarchy generation based on link structure (2009) 0.19
    0.19047396 = sum of:
      0.19047396 = product of:
        0.6802641 = sum of:
          0.117067516 = weight(abstract_txt:directed in 3738) [ClassicSimilarity], result of:
            0.117067516 = score(doc=3738,freq=4.0), product of:
              0.12922928 = queryWeight, product of:
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.01783181 = queryNorm
              0.90589005 = fieldWeight in 3738, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                7.2471204 = idf(docFreq=85, maxDocs=44421)
                0.0625 = fieldNorm(doc=3738)
          0.076319866 = weight(abstract_txt:edge in 3738) [ClassicSimilarity], result of:
            0.076319866 = score(doc=3738,freq=1.0), product of:
              0.15423456 = queryWeight, product of:
                1.0924722 = boost
                7.917278 = idf(docFreq=43, maxDocs=44421)
                0.01783181 = queryNorm
              0.49482986 = fieldWeight in 3738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.917278 = idf(docFreq=43, maxDocs=44421)
                0.0625 = fieldNorm(doc=3738)
          0.1072567 = weight(abstract_txt:edges in 3738) [ClassicSimilarity], result of:
            0.1072567 = score(doc=3738,freq=1.0), product of:
              0.19351128 = queryWeight, product of:
                1.2236936 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.01783181 = queryNorm
              0.5542659 = fieldWeight in 3738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.0625 = fieldNorm(doc=3738)
          0.016570564 = weight(abstract_txt:with in 3738) [ClassicSimilarity], result of:
            0.016570564 = score(doc=3738,freq=3.0), product of:
              0.061323613 = queryWeight, product of:
                1.3777277 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.01783181 = queryNorm
              0.27021506 = fieldWeight in 3738, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=3738)
          0.14997353 = weight(abstract_txt:graph in 3738) [ClassicSimilarity], result of:
            0.14997353 = score(doc=3738,freq=3.0), product of:
              0.21138263 = queryWeight, product of:
                1.808711 = boost
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.01783181 = queryNorm
              0.7094884 = fieldWeight in 3738, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.553973 = idf(docFreq=171, maxDocs=44421)
                0.0625 = fieldNorm(doc=3738)
          0.103156455 = weight(abstract_txt:weighted in 3738) [ClassicSimilarity], result of:
            0.103156455 = score(doc=3738,freq=1.0), product of:
              0.23755504 = queryWeight, product of:
                1.9174174 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.01783181 = queryNorm
              0.43424234 = fieldWeight in 3738, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.0625 = fieldNorm(doc=3738)
          0.10991946 = weight(abstract_txt:model in 3738) [ClassicSimilarity], result of:
            0.10991946 = score(doc=3738,freq=2.0), product of:
              0.31224322 = queryWeight, product of:
                4.39654 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.01783181 = queryNorm
              0.35203153 = fieldWeight in 3738, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=3738)
        0.28 = coord(7/25)
    
  3. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.18
    0.18166275 = sum of:
      0.18166275 = product of:
        0.5676961 = sum of:
          0.03978998 = weight(abstract_txt:terms in 1175) [ClassicSimilarity], result of:
            0.03978998 = score(doc=1175,freq=5.0), product of:
              0.08046748 = queryWeight, product of:
                1.11595 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.01783181 = queryNorm
              0.4944852 = fieldWeight in 1175, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.031324662 = weight(abstract_txt:conceptual in 1175) [ClassicSimilarity], result of:
            0.031324662 = score(doc=1175,freq=1.0), product of:
              0.11731464 = queryWeight, product of:
                1.3474437 = boost
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.01783181 = queryNorm
              0.2670141 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.018718442 = weight(abstract_txt:with in 1175) [ClassicSimilarity], result of:
            0.018718442 = score(doc=1175,freq=5.0), product of:
              0.061323613 = queryWeight, product of:
                1.3777277 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.01783181 = queryNorm
              0.30524036 = fieldWeight in 1175, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.024031717 = weight(abstract_txt:results in 1175) [ClassicSimilarity], result of:
            0.024031717 = score(doc=1175,freq=2.0), product of:
              0.089324705 = queryWeight, product of:
                1.4400115 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.01783181 = queryNorm
              0.26903775 = fieldWeight in 1175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.03690289 = weight(abstract_txt:concept in 1175) [ClassicSimilarity], result of:
            0.03690289 = score(doc=1175,freq=1.0), product of:
              0.14979544 = queryWeight, product of:
                1.864788 = boost
                4.5047812 = idf(docFreq=1334, maxDocs=44421)
                0.01783181 = queryNorm
              0.24635522 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5047812 = idf(docFreq=1334, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.0902619 = weight(abstract_txt:weighted in 1175) [ClassicSimilarity], result of:
            0.0902619 = score(doc=1175,freq=1.0), product of:
              0.23755504 = queryWeight, product of:
                1.9174174 = boost
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.01783181 = queryNorm
              0.37996206 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9478774 = idf(docFreq=115, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.14764155 = weight(abstract_txt:thesaurus in 1175) [ClassicSimilarity], result of:
            0.14764155 = score(doc=1175,freq=7.0), product of:
              0.1973474 = queryWeight, product of:
                2.1404047 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.01783181 = queryNorm
              0.7481302 = fieldWeight in 1175, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.17902496 = weight(abstract_txt:hierarchical in 1175) [ClassicSimilarity], result of:
            0.17902496 = score(doc=1175,freq=2.0), product of:
              0.40396184 = queryWeight, product of:
                3.9534361 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.01783181 = queryNorm
              0.44317296 = fieldWeight in 1175, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
        0.32 = coord(8/25)
    
  4. Buizza, G.: Subject analysis and indexing : an "Italian version" of the analytico-synthetic model (2011) 0.16
    0.15825368 = sum of:
      0.15825368 = product of:
        0.56519175 = sum of:
          0.089995705 = weight(abstract_txt:binary in 2812) [ClassicSimilarity], result of:
            0.089995705 = score(doc=2812,freq=1.0), product of:
              0.13137427 = queryWeight, product of:
                1.008265 = boost
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.01783181 = queryNorm
              0.68503296 = fieldWeight in 2812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.09375 = fieldNorm(doc=2812)
          0.053699423 = weight(abstract_txt:conceptual in 2812) [ClassicSimilarity], result of:
            0.053699423 = score(doc=2812,freq=1.0), product of:
              0.11731464 = queryWeight, product of:
                1.3474437 = boost
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.01783181 = queryNorm
              0.45773846 = fieldWeight in 2812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8825436 = idf(docFreq=914, maxDocs=44421)
                0.09375 = fieldNorm(doc=2812)
          0.020294711 = weight(abstract_txt:with in 2812) [ClassicSimilarity], result of:
            0.020294711 = score(doc=2812,freq=2.0), product of:
              0.061323613 = queryWeight, product of:
                1.3777277 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.01783181 = queryNorm
              0.33094448 = fieldWeight in 2812, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.09375 = fieldNorm(doc=2812)
          0.077397846 = weight(abstract_txt:represent in 2812) [ClassicSimilarity], result of:
            0.077397846 = score(doc=2812,freq=1.0), product of:
              0.14968963 = queryWeight, product of:
                1.5220553 = boost
                5.515259 = idf(docFreq=485, maxDocs=44421)
                0.01783181 = queryNorm
              0.5170555 = fieldWeight in 2812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.515259 = idf(docFreq=485, maxDocs=44421)
                0.09375 = fieldNorm(doc=2812)
          0.0632621 = weight(abstract_txt:concept in 2812) [ClassicSimilarity], result of:
            0.0632621 = score(doc=2812,freq=1.0), product of:
              0.14979544 = queryWeight, product of:
                1.864788 = boost
                4.5047812 = idf(docFreq=1334, maxDocs=44421)
                0.01783181 = queryNorm
              0.42232323 = fieldWeight in 2812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5047812 = idf(docFreq=1334, maxDocs=44421)
                0.09375 = fieldNorm(doc=2812)
          0.095662735 = weight(abstract_txt:thesaurus in 2812) [ClassicSimilarity], result of:
            0.095662735 = score(doc=2812,freq=1.0), product of:
              0.1973474 = queryWeight, product of:
                2.1404047 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.01783181 = queryNorm
              0.48474282 = fieldWeight in 2812, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.09375 = fieldNorm(doc=2812)
          0.1648792 = weight(abstract_txt:model in 2812) [ClassicSimilarity], result of:
            0.1648792 = score(doc=2812,freq=2.0), product of:
              0.31224322 = queryWeight, product of:
                4.39654 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.01783181 = queryNorm
              0.5280473 = fieldWeight in 2812, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.09375 = fieldNorm(doc=2812)
        0.28 = coord(7/25)
    
  5. Ma, X.; Carranza, E.J.M.; Wu, C.; Meer, F.D. van der; Liu, G.: ¬A SKOS-based multilingual thesaurus of geological time scale for interoperability of online geological maps (2011) 0.14
    0.14206253 = sum of:
      0.14206253 = product of:
        0.5073662 = sum of:
          0.040673416 = weight(abstract_txt:terms in 800) [ClassicSimilarity], result of:
            0.040673416 = score(doc=800,freq=4.0), product of:
              0.08046748 = queryWeight, product of:
                1.11595 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.01783181 = queryNorm
              0.505464 = fieldWeight in 800, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.0625 = fieldNorm(doc=800)
          0.013529808 = weight(abstract_txt:with in 800) [ClassicSimilarity], result of:
            0.013529808 = score(doc=800,freq=2.0), product of:
              0.061323613 = queryWeight, product of:
                1.3777277 = boost
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.01783181 = queryNorm
              0.22062966 = fieldWeight in 800, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4961398 = idf(docFreq=9949, maxDocs=44421)
                0.0625 = fieldNorm(doc=800)
          0.019420562 = weight(abstract_txt:results in 800) [ClassicSimilarity], result of:
            0.019420562 = score(doc=800,freq=1.0), product of:
              0.089324705 = queryWeight, product of:
                1.4400115 = boost
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.01783181 = queryNorm
              0.21741535 = fieldWeight in 800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4786456 = idf(docFreq=3724, maxDocs=44421)
                0.0625 = fieldNorm(doc=800)
          0.051598564 = weight(abstract_txt:represent in 800) [ClassicSimilarity], result of:
            0.051598564 = score(doc=800,freq=1.0), product of:
              0.14968963 = queryWeight, product of:
                1.5220553 = boost
                5.515259 = idf(docFreq=485, maxDocs=44421)
                0.01783181 = queryNorm
              0.34470367 = fieldWeight in 800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.515259 = idf(docFreq=485, maxDocs=44421)
                0.0625 = fieldNorm(doc=800)
          0.12755032 = weight(abstract_txt:thesaurus in 800) [ClassicSimilarity], result of:
            0.12755032 = score(doc=800,freq=4.0), product of:
              0.1973474 = queryWeight, product of:
                2.1404047 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.01783181 = queryNorm
              0.64632374 = fieldWeight in 800, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.0625 = fieldNorm(doc=800)
          0.14467402 = weight(abstract_txt:hierarchical in 800) [ClassicSimilarity], result of:
            0.14467402 = score(doc=800,freq=1.0), product of:
              0.40396184 = queryWeight, product of:
                3.9534361 = boost
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.01783181 = queryNorm
              0.35813785 = fieldWeight in 800, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7302055 = idf(docFreq=391, maxDocs=44421)
                0.0625 = fieldNorm(doc=800)
          0.10991946 = weight(abstract_txt:model in 800) [ClassicSimilarity], result of:
            0.10991946 = score(doc=800,freq=2.0), product of:
              0.31224322 = queryWeight, product of:
                4.39654 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.01783181 = queryNorm
              0.35203153 = fieldWeight in 800, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=800)
        0.28 = coord(7/25)