Document (#36173)

Author
Fripp, D.
Title
Using linked data to classify web documents
Source
Aslib proceedings. 62(2010) no.6, pp. 585-595
Year
2010
Abstract
Purpose - The purpose of this paper is to find a relationship between traditional faceted classification schemes and semantic web document annotators, particularly in the linked data environment.
Design/methodology/approach - A consideration of the conceptual ideas behind faceted classification and linked data architecture is made. Analysis of selected web documents is performed using Calais' Semantic Proxy to support the considerations.
Findings - Technical language aside, the principles of both approaches are very similar. Modern classification techniques have the potential to automatically generate metadata to drive more precise information recall by including a semantic layer.
Originality/value - Linked data have not been explicitly considered in this context before in the published literature.
Theme
Semantic Web
Classification theory: elements / structure

Similar documents (content)

  1. Bianchini, C.; Bargioni, S.: Automated classification using linked open data : a case study on faceted classification and Wikidata (2021) 0.18
    0.18096057 = sum of:
      0.18096057 = product of:
        0.7540024 = sum of:
          0.095617905 = weight(abstract_txt:classify in 1725) [ClassicSimilarity], result of:
            0.095617905 = score(doc=1725,freq=1.0), product of:
              0.15616703 = queryWeight, product of:
                1.147265 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.020842368 = queryNorm
              0.6122797 = fieldWeight in 1725, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.09375 = fieldNorm(doc=1725)
          0.028358478 = weight(abstract_txt:using in 1725) [ClassicSimilarity], result of:
            0.028358478 = score(doc=1725,freq=1.0), product of:
              0.08750412 = queryWeight, product of:
                1.2145022 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.020842368 = queryNorm
              0.32408163 = fieldWeight in 1725, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.09375 = fieldNorm(doc=1725)
          0.14688887 = weight(abstract_txt:faceted in 1725) [ClassicSimilarity], result of:
            0.14688887 = score(doc=1725,freq=1.0), product of:
              0.26195848 = queryWeight, product of:
                2.1013591 = boost
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.020842368 = queryNorm
              0.5607334 = fieldWeight in 1725, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.09375 = fieldNorm(doc=1725)
          0.16048114 = weight(abstract_txt:classification in 1725) [ClassicSimilarity], result of:
            0.16048114 = score(doc=1725,freq=6.0), product of:
              0.17505287 = queryWeight, product of:
                2.1038477 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.020842368 = queryNorm
              0.9167582 = fieldWeight in 1725, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.09375 = fieldNorm(doc=1725)
          0.08783043 = weight(abstract_txt:data in 1725) [ClassicSimilarity], result of:
            0.08783043 = score(doc=1725,freq=3.0), product of:
              0.16241999 = queryWeight, product of:
                2.340016 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.020842368 = queryNorm
              0.54076123 = fieldWeight in 1725, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=1725)
          0.23482557 = weight(abstract_txt:linked in 1725) [ClassicSimilarity], result of:
            0.23482557 = score(doc=1725,freq=1.0), product of:
              0.4512461 = queryWeight, product of:
                3.90037 = boost
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.020842368 = queryNorm
              0.52039355 = fieldWeight in 1725, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.09375 = fieldNorm(doc=1725)
        0.24 = coord(6/25)
    
  2. Jia, J.: From data to knowledge : the relationships between vocabularies, linked data and knowledge graphs (2021) 0.15
    0.14715268 = sum of:
      0.14715268 = product of:
        0.7357634 = sum of:
          0.16600415 = weight(abstract_txt:layer in 1107) [ClassicSimilarity], result of:
            0.16600415 = score(doc=1107,freq=3.0), product of:
              0.2049571 = queryWeight, product of:
                1.3143183 = boost
                7.48196 = idf(docFreq=67, maxDocs=44421)
                0.020842368 = queryNorm
              0.8099459 = fieldWeight in 1107, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.48196 = idf(docFreq=67, maxDocs=44421)
                0.0625 = fieldNorm(doc=1107)
          0.057562426 = weight(abstract_txt:purpose in 1107) [ClassicSimilarity], result of:
            0.057562426 = score(doc=1107,freq=2.0), product of:
              0.14589827 = queryWeight, product of:
                1.5682276 = boost
                4.4636893 = idf(docFreq=1390, maxDocs=44421)
                0.020842368 = queryNorm
              0.3945381 = fieldWeight in 1107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4636893 = idf(docFreq=1390, maxDocs=44421)
                0.0625 = fieldNorm(doc=1107)
          0.11212166 = weight(abstract_txt:data in 1107) [ClassicSimilarity], result of:
            0.11212166 = score(doc=1107,freq=11.0), product of:
              0.16241999 = queryWeight, product of:
                2.340016 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.020842368 = queryNorm
              0.6903193 = fieldWeight in 1107, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=1107)
          0.08697437 = weight(abstract_txt:semantic in 1107) [ClassicSimilarity], result of:
            0.08697437 = score(doc=1107,freq=2.0), product of:
              0.21991187 = queryWeight, product of:
                2.3580556 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.020842368 = queryNorm
              0.39549646 = fieldWeight in 1107, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=1107)
          0.31310076 = weight(abstract_txt:linked in 1107) [ClassicSimilarity], result of:
            0.31310076 = score(doc=1107,freq=4.0), product of:
              0.4512461 = queryWeight, product of:
                3.90037 = boost
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.020842368 = queryNorm
              0.6938581 = fieldWeight in 1107, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.0625 = fieldNorm(doc=1107)
        0.2 = coord(5/25)
    
  3. Smith, D.A.; Shadbolt, N.R.: FacetOntology : expressive descriptions of facets in the Semantic Web (2012) 0.13
    0.13312794 = sum of:
      0.13312794 = product of:
        0.5546998 = sum of:
          0.014987977 = weight(abstract_txt:have in 3208) [ClassicSimilarity], result of:
            0.014987977 = score(doc=3208,freq=1.0), product of:
              0.07495422 = queryWeight, product of:
                1.1240408 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.020842368 = queryNorm
              0.19996175 = fieldWeight in 3208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0625 = fieldNorm(doc=3208)
          0.018905653 = weight(abstract_txt:using in 3208) [ClassicSimilarity], result of:
            0.018905653 = score(doc=3208,freq=1.0), product of:
              0.08750412 = queryWeight, product of:
                1.2145022 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.020842368 = queryNorm
              0.21605442 = fieldWeight in 3208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=3208)
          0.19585182 = weight(abstract_txt:faceted in 3208) [ClassicSimilarity], result of:
            0.19585182 = score(doc=3208,freq=4.0), product of:
              0.26195848 = queryWeight, product of:
                2.1013591 = boost
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.020842368 = queryNorm
              0.7476445 = fieldWeight in 3208, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.0625 = fieldNorm(doc=3208)
          0.1069038 = weight(abstract_txt:data in 3208) [ClassicSimilarity], result of:
            0.1069038 = score(doc=3208,freq=10.0), product of:
              0.16241999 = queryWeight, product of:
                2.340016 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.020842368 = queryNorm
              0.6581936 = fieldWeight in 3208, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=3208)
          0.061500166 = weight(abstract_txt:semantic in 3208) [ClassicSimilarity], result of:
            0.061500166 = score(doc=3208,freq=1.0), product of:
              0.21991187 = queryWeight, product of:
                2.3580556 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.020842368 = queryNorm
              0.27965823 = fieldWeight in 3208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=3208)
          0.15655038 = weight(abstract_txt:linked in 3208) [ClassicSimilarity], result of:
            0.15655038 = score(doc=3208,freq=1.0), product of:
              0.4512461 = queryWeight, product of:
                3.90037 = boost
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.020842368 = queryNorm
              0.34692904 = fieldWeight in 3208, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.0625 = fieldNorm(doc=3208)
        0.24 = coord(6/25)
    
  4. Bianchini, C.; Willer, M.: ISBD resource and Its description in the context of the Semantic Web (2014) 0.13
    0.13307126 = sum of:
      0.13307126 = product of:
        0.6653563 = sum of:
          0.028358478 = weight(abstract_txt:using in 2998) [ClassicSimilarity], result of:
            0.028358478 = score(doc=2998,freq=1.0), product of:
              0.08750412 = queryWeight, product of:
                1.2145022 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.020842368 = queryNorm
              0.32408163 = fieldWeight in 2998, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.09375 = fieldNorm(doc=2998)
          0.061054174 = weight(abstract_txt:purpose in 2998) [ClassicSimilarity], result of:
            0.061054174 = score(doc=2998,freq=1.0), product of:
              0.14589827 = queryWeight, product of:
                1.5682276 = boost
                4.4636893 = idf(docFreq=1390, maxDocs=44421)
                0.020842368 = queryNorm
              0.41847086 = fieldWeight in 2998, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4636893 = idf(docFreq=1390, maxDocs=44421)
                0.09375 = fieldNorm(doc=2998)
          0.113388605 = weight(abstract_txt:data in 2998) [ClassicSimilarity], result of:
            0.113388605 = score(doc=2998,freq=5.0), product of:
              0.16241999 = queryWeight, product of:
                2.340016 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.020842368 = queryNorm
              0.69811976 = fieldWeight in 2998, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.09375 = fieldNorm(doc=2998)
          0.13046154 = weight(abstract_txt:semantic in 2998) [ClassicSimilarity], result of:
            0.13046154 = score(doc=2998,freq=2.0), product of:
              0.21991187 = queryWeight, product of:
                2.3580556 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.020842368 = queryNorm
              0.5932447 = fieldWeight in 2998, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.09375 = fieldNorm(doc=2998)
          0.3320935 = weight(abstract_txt:linked in 2998) [ClassicSimilarity], result of:
            0.3320935 = score(doc=2998,freq=2.0), product of:
              0.4512461 = queryWeight, product of:
                3.90037 = boost
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.020842368 = queryNorm
              0.7359476 = fieldWeight in 2998, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5508647 = idf(docFreq=468, maxDocs=44421)
                0.09375 = fieldNorm(doc=2998)
        0.2 = coord(5/25)
    
  5. Tudhope, D.; Blocks, D.; Cunliffe, D.; Binding, C.: Query expansion via conceptual distance in thesaurus indexed collections (2006) 0.13
    0.12785381 = sum of:
      0.12785381 = product of:
        0.45662075 = sum of:
          0.059699647 = weight(abstract_txt:architecture in 3215) [ClassicSimilarity], result of:
            0.059699647 = score(doc=3215,freq=2.0), product of:
              0.1186484 = queryWeight, product of:
                5.6926546 = idf(docFreq=406, maxDocs=44421)
                0.020842368 = queryNorm
              0.50316435 = fieldWeight in 3215, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.6926546 = idf(docFreq=406, maxDocs=44421)
                0.0625 = fieldNorm(doc=3215)
          0.014987977 = weight(abstract_txt:have in 3215) [ClassicSimilarity], result of:
            0.014987977 = score(doc=3215,freq=1.0), product of:
              0.07495422 = queryWeight, product of:
                1.1240408 = boost
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.020842368 = queryNorm
              0.19996175 = fieldWeight in 3215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.199388 = idf(docFreq=4924, maxDocs=44421)
                0.0625 = fieldNorm(doc=3215)
          0.070020586 = weight(abstract_txt:precise in 3215) [ClassicSimilarity], result of:
            0.070020586 = score(doc=3215,freq=1.0), product of:
              0.16625494 = queryWeight, product of:
                1.18374 = boost
                6.738623 = idf(docFreq=142, maxDocs=44421)
                0.020842368 = queryNorm
              0.42116395 = fieldWeight in 3215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.738623 = idf(docFreq=142, maxDocs=44421)
                0.0625 = fieldNorm(doc=3215)
          0.018905653 = weight(abstract_txt:using in 3215) [ClassicSimilarity], result of:
            0.018905653 = score(doc=3215,freq=1.0), product of:
              0.08750412 = queryWeight, product of:
                1.2145022 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.020842368 = queryNorm
              0.21605442 = fieldWeight in 3215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0625 = fieldNorm(doc=3215)
          0.057562426 = weight(abstract_txt:purpose in 3215) [ClassicSimilarity], result of:
            0.057562426 = score(doc=3215,freq=2.0), product of:
              0.14589827 = queryWeight, product of:
                1.5682276 = boost
                4.4636893 = idf(docFreq=1390, maxDocs=44421)
                0.020842368 = queryNorm
              0.3945381 = fieldWeight in 3215, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4636893 = idf(docFreq=1390, maxDocs=44421)
                0.0625 = fieldNorm(doc=3215)
          0.09792591 = weight(abstract_txt:faceted in 3215) [ClassicSimilarity], result of:
            0.09792591 = score(doc=3215,freq=1.0), product of:
              0.26195848 = queryWeight, product of:
                2.1013591 = boost
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.020842368 = queryNorm
              0.37382224 = fieldWeight in 3215, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.981156 = idf(docFreq=304, maxDocs=44421)
                0.0625 = fieldNorm(doc=3215)
          0.13751854 = weight(abstract_txt:semantic in 3215) [ClassicSimilarity], result of:
            0.13751854 = score(doc=3215,freq=5.0), product of:
              0.21991187 = queryWeight, product of:
                2.3580556 = boost
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.020842368 = queryNorm
              0.6253348 = fieldWeight in 3215, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.4745317 = idf(docFreq=1375, maxDocs=44421)
                0.0625 = fieldNorm(doc=3215)
        0.28 = coord(7/25)
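
How the figures in the score trees above fit together can be sketched in a few lines of Python. This is a minimal illustration, assuming the classic Lucene TF-IDF formulas that the labels in the trees suggest: tf is the square root of the term frequency, idf is 1 + ln(maxDocs / (docFreq + 1)), and the summed term scores are scaled by the coord factor. The function names are hypothetical and are not part of any Lucene API. The fieldNorm and queryNorm values are copied from the listing rather than derived.

```python
import math

def idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency as printed in the trees: 1 + ln(maxDocs / (docFreq + 1)).
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def tf(freq: float) -> float:
    # Term frequency factor: square root of the raw frequency.
    return math.sqrt(freq)

def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # score = queryWeight * fieldWeight, mirroring one "weight(...)" node.
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight

def document_score(term_scores, total_query_terms):
    # Sum of the matching term scores, scaled by coord(matching / total).
    coord = len(term_scores) / total_query_terms
    return sum(term_scores) * coord

# Values copied from the tree for result 1 (doc 1725).
MAX_DOCS = 44421
QUERY_NORM = 0.020842368
FIELD_NORM = 0.09375

# (freq, docFreq, boost) for the six matching terms.
terms_1725 = [
    (1.0, 175, 1.147265),    # classify
    (1.0, 3806, 1.2145022),  # using
    (1.0, 304, 2.1013591),   # faceted
    (6.0, 2228, 2.1038477),  # classification
    (3.0, 4320, 2.340016),   # data
    (1.0, 468, 3.90037),     # linked
]

scores = [
    term_score(freq, doc_freq, MAX_DOCS, boost, QUERY_NORM, FIELD_NORM)
    for freq, doc_freq, boost in terms_1725
]
total = document_score(scores, total_query_terms=25)
print(f"classify term score: {scores[0]:.6f}")
print(f"document score:      {total:.6f}")
```

Run against the values for result 1, the total should land very close to the 0.18096057 shown in the listing. Small differences in the last digits are expected, because the listing's values were presumably computed in single precision. A term listed without a boost line (such as "architecture" in result 5) can be treated as having a boost of 1.0.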