Document (#38106)

Author
Xamena, E.
Brignole, N.B.
Maguitman, A.G.
Title
¬A study of relevance propagation in large topic ontologies
Source
Journal of the American Society for Information Science and Technology. 64(2013) no.11, S.2238-2255
Year
2013
Abstract
Topic ontologies or web directories consist of large collections of links to websites, arranged by topic in different categories. The structure of these ontologies is typically not flat because there are hierarchical and nonhierarchical relationships among topics. As a consequence, websites classified under a certain topic may be relevant to other topics. Although some of these relevance relations are explicit, most of them must be discovered by an analysis of the structure of the ontologies. This article proposes a family of models of relevance propagation in topic ontologies. An efficient computational framework is described and used to compute nine different models for a portion of the Open Directory Project graph consisting of more than half a million nodes and approximately 1.5 million edges of different types. After performing a quantitative analysis, a user study was carried out to compare the most promising models. It was found that some general difficulties rule out the possibility of defining flawless models of relevance propagation that only take into account structural aspects of an ontology. However, there is a clear indication that including transitive relations induced by the nonhierarchical components of the ontology results in relevance propagation models that are superior to more basic approaches.
Theme
Semantisches Umfeld in Indexierung u. Retrieval

Similar documents (content)

  1. Call, A.; Gottlob, G.; Pieris, A.: ¬The return of the entity-relationship model : ontological query answering (2012) 0.18
    0.18389042 = sum of:
      0.18389042 = product of:
        0.57465756 = sum of:
          0.06025172 = weight(abstract_txt:compute in 1434) [ClassicSimilarity], result of:
            0.06025172 = score(doc=1434,freq=1.0), product of:
              0.12587819 = queryWeight, product of:
                1.0371666 = boost
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.01584758 = queryNorm
              0.47865102 = fieldWeight in 1434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.6584163 = idf(docFreq=56, maxDocs=44421)
                0.0625 = fieldNorm(doc=1434)
          0.022205295 = weight(abstract_txt:structure in 1434) [ClassicSimilarity], result of:
            0.022205295 = score(doc=1434,freq=1.0), product of:
              0.08152395 = queryWeight, product of:
                1.1804045 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.01584758 = queryNorm
              0.27237758 = fieldWeight in 1434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.0625 = fieldNorm(doc=1434)
          0.023519129 = weight(abstract_txt:large in 1434) [ClassicSimilarity], result of:
            0.023519129 = score(doc=1434,freq=1.0), product of:
              0.08470876 = queryWeight, product of:
                1.2032405 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.01584758 = queryNorm
              0.27764696 = fieldWeight in 1434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0625 = fieldNorm(doc=1434)
          0.01419385 = weight(abstract_txt:that in 1434) [ClassicSimilarity], result of:
            0.01419385 = score(doc=1434,freq=4.0), product of:
              0.04801434 = queryWeight, product of:
                1.2811168 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01584758 = queryNorm
              0.2956169 = fieldWeight in 1434, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=1434)
          0.0783054 = weight(abstract_txt:ontology in 1434) [ClassicSimilarity], result of:
            0.0783054 = score(doc=1434,freq=3.0), product of:
              0.13095886 = queryWeight, product of:
                1.4960831 = boost
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.01584758 = queryNorm
              0.59793895 = fieldWeight in 1434, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.0625 = fieldNorm(doc=1434)
          0.045726392 = weight(abstract_txt:relations in 1434) [ClassicSimilarity], result of:
            0.045726392 = score(doc=1434,freq=1.0), product of:
              0.13195488 = queryWeight, product of:
                1.5017617 = boost
                5.5444884 = idf(docFreq=471, maxDocs=44421)
                0.01584758 = queryNorm
              0.34653053 = fieldWeight in 1434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5444884 = idf(docFreq=471, maxDocs=44421)
                0.0625 = fieldNorm(doc=1434)
          0.066271946 = weight(abstract_txt:models in 1434) [ClassicSimilarity], result of:
            0.066271946 = score(doc=1434,freq=1.0), product of:
              0.22935803 = queryWeight, product of:
                3.1305113 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.01584758 = queryNorm
              0.28894538 = fieldWeight in 1434, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.0625 = fieldNorm(doc=1434)
          0.26418382 = weight(abstract_txt:ontologies in 1434) [ClassicSimilarity], result of:
            0.26418382 = score(doc=1434,freq=4.0), product of:
              0.36325502 = queryWeight, product of:
                3.9397087 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.01584758 = queryNorm
              0.72726816 = fieldWeight in 1434, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=1434)
        0.32 = coord(8/25)
    
  2. Solskinnsbakk, G.; Gulla, J.A.; Haderlein, V.; Myrseth, P.; Cerrato, O.: Quality of hierarchies in ontologies and folksonomies (2012) 0.18
    0.18211238 = sum of:
      0.18211238 = product of:
        0.65040135 = sum of:
          0.022205295 = weight(abstract_txt:structure in 2034) [ClassicSimilarity], result of:
            0.022205295 = score(doc=2034,freq=1.0), product of:
              0.08152395 = queryWeight, product of:
                1.1804045 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.01584758 = queryNorm
              0.27237758 = fieldWeight in 2034, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.0625 = fieldNorm(doc=2034)
          0.007096925 = weight(abstract_txt:that in 2034) [ClassicSimilarity], result of:
            0.007096925 = score(doc=2034,freq=1.0), product of:
              0.04801434 = queryWeight, product of:
                1.2811168 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01584758 = queryNorm
              0.14780845 = fieldWeight in 2034, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2034)
          0.027895678 = weight(abstract_txt:different in 2034) [ClassicSimilarity], result of:
            0.027895678 = score(doc=2034,freq=2.0), product of:
              0.086236775 = queryWeight, product of:
                1.4868945 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.01584758 = queryNorm
              0.32347775 = fieldWeight in 2034, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=2034)
          0.11074056 = weight(abstract_txt:ontology in 2034) [ClassicSimilarity], result of:
            0.11074056 = score(doc=2034,freq=6.0), product of:
              0.13095886 = queryWeight, product of:
                1.4960831 = boost
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.01584758 = queryNorm
              0.84561336 = fieldWeight in 2034, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.0625 = fieldNorm(doc=2034)
          0.06466688 = weight(abstract_txt:relations in 2034) [ClassicSimilarity], result of:
            0.06466688 = score(doc=2034,freq=2.0), product of:
              0.13195488 = queryWeight, product of:
                1.5017617 = boost
                5.5444884 = idf(docFreq=471, maxDocs=44421)
                0.01584758 = queryNorm
              0.49006817 = fieldWeight in 2034, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5444884 = idf(docFreq=471, maxDocs=44421)
                0.0625 = fieldNorm(doc=2034)
          0.12242955 = weight(abstract_txt:topic in 2034) [ClassicSimilarity], result of:
            0.12242955 = score(doc=2034,freq=2.0), product of:
              0.27407852 = queryWeight, product of:
                3.4221244 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.01584758 = queryNorm
              0.44669518 = fieldWeight in 2034, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=2034)
          0.2953665 = weight(abstract_txt:ontologies in 2034) [ClassicSimilarity], result of:
            0.2953665 = score(doc=2034,freq=5.0), product of:
              0.36325502 = queryWeight, product of:
                3.9397087 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.01584758 = queryNorm
              0.81311053 = fieldWeight in 2034, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0625 = fieldNorm(doc=2034)
        0.28 = coord(7/25)
    
  3. Silvello, G.: Theory and practice of data citation (2018) 0.18
    0.18132457 = sum of:
      0.18132457 = product of:
        0.5666393 = sum of:
          0.01643709 = weight(abstract_txt:most in 6) [ClassicSimilarity], result of:
            0.01643709 = score(doc=6,freq=1.0), product of:
              0.06671099 = queryWeight, product of:
                1.0677928 = boost
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.01584758 = queryNorm
              0.2463925 = fieldWeight in 6, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.0625 = fieldNorm(doc=6)
          0.018409984 = weight(abstract_txt:there in 6) [ClassicSimilarity], result of:
            0.018409984 = score(doc=6,freq=1.0), product of:
              0.07194762 = queryWeight, product of:
                1.1089106 = boost
                4.094086 = idf(docFreq=2012, maxDocs=44421)
                0.01584758 = queryNorm
              0.2558804 = fieldWeight in 6, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.094086 = idf(docFreq=2012, maxDocs=44421)
                0.0625 = fieldNorm(doc=6)
          0.023519129 = weight(abstract_txt:large in 6) [ClassicSimilarity], result of:
            0.023519129 = score(doc=6,freq=1.0), product of:
              0.08470876 = queryWeight, product of:
                1.2032405 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.01584758 = queryNorm
              0.27764696 = fieldWeight in 6, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0625 = fieldNorm(doc=6)
          0.007096925 = weight(abstract_txt:that in 6) [ClassicSimilarity], result of:
            0.007096925 = score(doc=6,freq=1.0), product of:
              0.04801434 = queryWeight, product of:
                1.2811168 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01584758 = queryNorm
              0.14780845 = fieldWeight in 6, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=6)
          0.019725222 = weight(abstract_txt:different in 6) [ClassicSimilarity], result of:
            0.019725222 = score(doc=6,freq=1.0), product of:
              0.086236775 = queryWeight, product of:
                1.4868945 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.01584758 = queryNorm
              0.2287333 = fieldWeight in 6, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=6)
          0.080341436 = weight(abstract_txt:relevance in 6) [ClassicSimilarity], result of:
            0.080341436 = score(doc=6,freq=1.0), product of:
              0.26076776 = queryWeight, product of:
                3.3379915 = boost
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.01584758 = queryNorm
              0.30809575 = fieldWeight in 6, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.929532 = idf(docFreq=872, maxDocs=44421)
                0.0625 = fieldNorm(doc=6)
          0.08657077 = weight(abstract_txt:topic in 6) [ClassicSimilarity], result of:
            0.08657077 = score(doc=6,freq=1.0), product of:
              0.27407852 = queryWeight, product of:
                3.4221244 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.01584758 = queryNorm
              0.3158612 = fieldWeight in 6, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=6)
          0.31453872 = weight(abstract_txt:propagation in 6) [ClassicSimilarity], result of:
            0.31453872 = score(doc=6,freq=1.0), product of:
              0.6013217 = queryWeight, product of:
                4.533741 = boost
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.01584758 = queryNorm
              0.5230789 = fieldWeight in 6, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.0625 = fieldNorm(doc=6)
        0.32 = coord(8/25)
    
  4. Alkhodair, S.A.; Fung, B.C.M.; Patrick, O.R.; Hung, C.K.: Improving interpretations of topic modeling in microblogs (2018) 0.15
    0.1472921 = sum of:
      0.1472921 = product of:
        0.52604324 = sum of:
          0.022205295 = weight(abstract_txt:structure in 181) [ClassicSimilarity], result of:
            0.022205295 = score(doc=181,freq=1.0), product of:
              0.08152395 = queryWeight, product of:
                1.1804045 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.01584758 = queryNorm
              0.27237758 = fieldWeight in 181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.0625 = fieldNorm(doc=181)
          0.023519129 = weight(abstract_txt:large in 181) [ClassicSimilarity], result of:
            0.023519129 = score(doc=181,freq=1.0), product of:
              0.08470876 = queryWeight, product of:
                1.2032405 = boost
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.01584758 = queryNorm
              0.27764696 = fieldWeight in 181, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4423513 = idf(docFreq=1420, maxDocs=44421)
                0.0625 = fieldNorm(doc=181)
          0.01419385 = weight(abstract_txt:that in 181) [ClassicSimilarity], result of:
            0.01419385 = score(doc=181,freq=4.0), product of:
              0.04801434 = queryWeight, product of:
                1.2811168 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.01584758 = queryNorm
              0.2956169 = fieldWeight in 181, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=181)
          0.07858383 = weight(abstract_txt:topics in 181) [ClassicSimilarity], result of:
            0.07858383 = score(doc=181,freq=5.0), product of:
              0.110716656 = queryWeight, product of:
                1.3756082 = boost
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.01584758 = queryNorm
              0.70977426 = fieldWeight in 181, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.0625 = fieldNorm(doc=181)
          0.027895678 = weight(abstract_txt:different in 181) [ClassicSimilarity], result of:
            0.027895678 = score(doc=181,freq=2.0), product of:
              0.086236775 = queryWeight, product of:
                1.4868945 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.01584758 = queryNorm
              0.32347775 = fieldWeight in 181, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0625 = fieldNorm(doc=181)
          0.11478637 = weight(abstract_txt:models in 181) [ClassicSimilarity], result of:
            0.11478637 = score(doc=181,freq=3.0), product of:
              0.22935803 = queryWeight, product of:
                3.1305113 = boost
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.01584758 = queryNorm
              0.5004681 = fieldWeight in 181, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.623126 = idf(docFreq=1185, maxDocs=44421)
                0.0625 = fieldNorm(doc=181)
          0.2448591 = weight(abstract_txt:topic in 181) [ClassicSimilarity], result of:
            0.2448591 = score(doc=181,freq=8.0), product of:
              0.27407852 = queryWeight, product of:
                3.4221244 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.01584758 = queryNorm
              0.89339036 = fieldWeight in 181, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0625 = fieldNorm(doc=181)
        0.28 = coord(7/25)
    
  5. King, B.E.; Reinold, K.: Finding the concept, not just the word : a librarian's guide to ontologies and semantics (2008) 0.14
    0.14110069 = sum of:
      0.14110069 = product of:
        0.50393105 = sum of:
          0.010273181 = weight(abstract_txt:most in 3863) [ClassicSimilarity], result of:
            0.010273181 = score(doc=3863,freq=1.0), product of:
              0.06671099 = queryWeight, product of:
                1.0677928 = boost
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.01584758 = queryNorm
              0.15399532 = fieldWeight in 3863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.94228 = idf(docFreq=2342, maxDocs=44421)
                0.0390625 = fieldNorm(doc=3863)
          0.01387831 = weight(abstract_txt:structure in 3863) [ClassicSimilarity], result of:
            0.01387831 = score(doc=3863,freq=1.0), product of:
              0.08152395 = queryWeight, product of:
                1.1804045 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.01584758 = queryNorm
              0.17023599 = fieldWeight in 3863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.0390625 = fieldNorm(doc=3863)
          0.021964848 = weight(abstract_txt:topics in 3863) [ClassicSimilarity], result of:
            0.021964848 = score(doc=3863,freq=1.0), product of:
              0.110716656 = queryWeight, product of:
                1.3756082 = boost
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.01584758 = queryNorm
              0.19838794 = fieldWeight in 3863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.078731 = idf(docFreq=751, maxDocs=44421)
                0.0390625 = fieldNorm(doc=3863)
          0.012328264 = weight(abstract_txt:different in 3863) [ClassicSimilarity], result of:
            0.012328264 = score(doc=3863,freq=1.0), product of:
              0.086236775 = queryWeight, product of:
                1.4868945 = boost
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.01584758 = queryNorm
              0.14295831 = fieldWeight in 3863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6597328 = idf(docFreq=3107, maxDocs=44421)
                0.0390625 = fieldNorm(doc=3863)
          0.09371464 = weight(abstract_txt:ontology in 3863) [ClassicSimilarity], result of:
            0.09371464 = score(doc=3863,freq=11.0), product of:
              0.13095886 = queryWeight, product of:
                1.4960831 = boost
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.01584758 = queryNorm
              0.7156037 = fieldWeight in 3863, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.5235233 = idf(docFreq=481, maxDocs=44421)
                0.0390625 = fieldNorm(doc=3863)
          0.05410673 = weight(abstract_txt:topic in 3863) [ClassicSimilarity], result of:
            0.05410673 = score(doc=3863,freq=1.0), product of:
              0.27407852 = queryWeight, product of:
                3.4221244 = boost
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.01584758 = queryNorm
              0.19741325 = fieldWeight in 3863, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.053779 = idf(docFreq=770, maxDocs=44421)
                0.0390625 = fieldNorm(doc=3863)
          0.2976651 = weight(abstract_txt:ontologies in 3863) [ClassicSimilarity], result of:
            0.2976651 = score(doc=3863,freq=13.0), product of:
              0.36325502 = queryWeight, product of:
                3.9397087 = boost
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.01584758 = queryNorm
              0.81943834 = fieldWeight in 3863, product of:
                3.6055512 = tf(freq=13.0), with freq of:
                  13.0 = termFreq=13.0
                5.8181453 = idf(docFreq=358, maxDocs=44421)
                0.0390625 = fieldNorm(doc=3863)
        0.28 = coord(7/25)