Document (#36830)

Author
Alexander, F.
Heather, A.
Title
Transformation of a legacy UDC-based classification system : exploiting and remodelling semantic relationships
Source
Classification and ontology: formal approaches and access to knowledge: proceedings of the International UDC Seminar, 19-20 September 2011, The Hague, The Netherlands. Eds.: A. Slavic and E. Civallero
Imprint
Würzburg : Ergon Verlag
Year
2011
Pages
pp. 251-267
Abstract
This paper reviews a project to remodel and unify diverse BBC Archive classification schemes, including the large Universal Decimal Classification (UDC)-based classification, Lonclass, as part of the BBC's Digital Media Initiative (DMI). The aims of the remodelling included migrating classification data from legacy systems and using the faceted structure of the classifications as a basis for proto-ontological relationship building. The processes of analysis and development of a methodology to decompose and reassemble the classifications raised such challenges as how to adapt bibliographic classifications for use as digital asset management tools and how to preserve the legacy intellectual property to enable continuing use of taxonomic classification as an access route to multimedia content. These objectives required the sophisticated semantics of the UDC-based classification to be retained during migration to an off-the-shelf taxonomy management product that could be integrated with diverse systems to form the basis of an enterprise-wide framework. The decomposition and reclassification process informed ways of preserving the high-precision semantics of bibliographic classifications for use as a foundation for natural language-based retrieval and for translation into ontologically expressive formats, such as Resource Description Framework (RDF).
Object
UDC
BBC
Lonclass
Area
Media archives (radio/television)

Similar documents (author)

  1. Alexander, M.: Automatic indexing of document images using Excalibur EFS (1995) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:alexander in 1979) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 1979, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=1979)
    
  2. Alexander, M.: Retrieving digital data with fuzzy matching (1996) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:alexander in 30) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 30, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=30)
    
  3. Alexander, M.: Retrieving digital data with fuzzy matching (1997) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:alexander in 151) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 151, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=151)
    
  4. Alexander, J.: Customs and excise process 2.5 million documents (1997) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:alexander in 3427) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 3427, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=3427)
    
  5. Alexander, M.: Digitising books, manuscripts and scholarly materials : preparation, handling, scanning, recognition, compression, storage formats (1998) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:alexander in 4686) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 4686, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=4686)
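
Each author match above follows the same arithmetic: fieldWeight = tf * idf * fieldNorm. The sketch below recomputes that product from the numbers in the listing. It assumes tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)), a form of the classic TF-IDF formula that reproduces the listed values; the function names are illustrative and are not taken from the catalog software.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw frequency (assumed form)."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1)) (assumed form)."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as laid out in the explain tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values copied from the listing: author_txt:alexander, freq=1.0,
# docFreq=15, maxDocs=44421, fieldNorm=0.625.
idf_value = classic_idf(doc_freq=15, max_docs=44421)
score = field_weight(freq=1.0, doc_freq=15, max_docs=44421, field_norm=0.625)

# Small differences in the last digits are expected, because the listing
# was produced with 32-bit floats and Python uses 64-bit floats.
print(f"idf = {idf_value:.6f}, fieldWeight = {score:.6f}")
```

All five author matches share freq, docFreq and fieldNorm, which is why they receive the same score of 5.58.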
    

Similar documents (content)

  1. Quick Guide to Publishing a Classification Scheme on the Semantic Web (2008) 0.13
    0.13211708 = sum of:
      0.13211708 = product of:
        0.66058534 = sum of:
          0.16739033 = weight(abstract_txt:unify in 48) [ClassicSimilarity], result of:
            0.16739033 = score(doc=48,freq=1.0), product of:
              0.19853374 = queryWeight, product of:
                1.2043636 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.018329557 = queryNorm
              0.8431329 = fieldWeight in 48, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.09375 = fieldNorm(doc=48)
          0.07481839 = weight(abstract_txt:framework in 48) [ClassicSimilarity], result of:
            0.07481839 = score(doc=48,freq=3.0), product of:
              0.101388626 = queryWeight, product of:
                1.2171667 = boost
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.018329557 = queryNorm
              0.7379367 = fieldWeight in 48, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.09375 = fieldNorm(doc=48)
          0.029686349 = weight(abstract_txt:based in 48) [ClassicSimilarity], result of:
            0.029686349 = score(doc=48,freq=1.0), product of:
              0.099480644 = queryWeight, product of:
                1.7050602 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.018329557 = queryNorm
              0.2984133 = fieldWeight in 48, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.09375 = fieldNorm(doc=48)
          0.2111739 = weight(abstract_txt:classifications in 48) [ClassicSimilarity], result of:
            0.2111739 = score(doc=48,freq=1.0), product of:
              0.3679546 = queryWeight, product of:
                3.2791975 = boost
                6.121738 = idf(docFreq=264, maxDocs=44421)
                0.018329557 = queryNorm
              0.5739129 = fieldWeight in 48, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.121738 = idf(docFreq=264, maxDocs=44421)
                0.09375 = fieldNorm(doc=48)
          0.17751636 = weight(abstract_txt:classification in 48) [ClassicSimilarity], result of:
            0.17751636 = score(doc=48,freq=3.0), product of:
              0.27384108 = queryWeight, product of:
                3.7423015 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.018329557 = queryNorm
              0.6482459 = fieldWeight in 48, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.09375 = fieldNorm(doc=48)
        0.2 = coord(5/25)
    
  2. Smiraglia, R.P.: The progress of theory in knowledge organization (2002) 0.13
    0.125948 = sum of:
      0.125948 = product of:
        0.44981426 = sum of:
          0.056911375 = weight(abstract_txt:bibliographic in 936) [ClassicSimilarity], result of:
            0.056911375 = score(doc=936,freq=6.0), product of:
              0.08786864 = queryWeight, product of:
                1.1331109 = boost
                4.230674 = idf(docFreq=1755, maxDocs=44421)
                0.018329557 = queryNorm
              0.647687 = fieldWeight in 936, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.230674 = idf(docFreq=1755, maxDocs=44421)
                0.0625 = fieldNorm(doc=936)
          0.02879761 = weight(abstract_txt:framework in 936) [ClassicSimilarity], result of:
            0.02879761 = score(doc=936,freq=1.0), product of:
              0.101388626 = queryWeight, product of:
                1.2171667 = boost
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.018329557 = queryNorm
              0.28403196 = fieldWeight in 936, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5445113 = idf(docFreq=1282, maxDocs=44421)
                0.0625 = fieldNorm(doc=936)
          0.031494636 = weight(abstract_txt:basis in 936) [ClassicSimilarity], result of:
            0.031494636 = score(doc=936,freq=1.0), product of:
              0.10762405 = queryWeight, product of:
                1.2540363 = boost
                4.682171 = idf(docFreq=1117, maxDocs=44421)
                0.018329557 = queryNorm
              0.29263568 = fieldWeight in 936, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.682171 = idf(docFreq=1117, maxDocs=44421)
                0.0625 = fieldNorm(doc=936)
          0.06721184 = weight(abstract_txt:semantics in 936) [ClassicSimilarity], result of:
            0.06721184 = score(doc=936,freq=1.0), product of:
              0.17839476 = queryWeight, product of:
                1.6145314 = boost
                6.0281444 = idf(docFreq=290, maxDocs=44421)
                0.018329557 = queryNorm
              0.37675902 = fieldWeight in 936, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0281444 = idf(docFreq=290, maxDocs=44421)
                0.0625 = fieldNorm(doc=936)
          0.027988555 = weight(abstract_txt:based in 936) [ClassicSimilarity], result of:
            0.027988555 = score(doc=936,freq=2.0), product of:
              0.099480644 = queryWeight, product of:
                1.7050602 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.018329557 = queryNorm
              0.28134674 = fieldWeight in 936, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=936)
          0.14078261 = weight(abstract_txt:classifications in 936) [ClassicSimilarity], result of:
            0.14078261 = score(doc=936,freq=1.0), product of:
              0.3679546 = queryWeight, product of:
                3.2791975 = boost
                6.121738 = idf(docFreq=264, maxDocs=44421)
                0.018329557 = queryNorm
              0.38260862 = fieldWeight in 936, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.121738 = idf(docFreq=264, maxDocs=44421)
                0.0625 = fieldNorm(doc=936)
          0.09662766 = weight(abstract_txt:classification in 936) [ClassicSimilarity], result of:
            0.09662766 = score(doc=936,freq=2.0), product of:
              0.27384108 = queryWeight, product of:
                3.7423015 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.018329557 = queryNorm
              0.35286036 = fieldWeight in 936, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=936)
        0.28 = coord(7/25)
    
  3. Slavic, A.; Cordeiro, M.I.: Core requirements for automation of analytico-synthetic classifications (2004) 0.12
    0.12249451 = sum of:
      0.12249451 = product of:
        0.61247253 = sum of:
          0.029042466 = weight(abstract_txt:bibliographic in 3651) [ClassicSimilarity], result of:
            0.029042466 = score(doc=3651,freq=1.0), product of:
              0.08786864 = queryWeight, product of:
                1.1331109 = boost
                4.230674 = idf(docFreq=1755, maxDocs=44421)
                0.018329557 = queryNorm
              0.3305214 = fieldWeight in 3651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.230674 = idf(docFreq=1755, maxDocs=44421)
                0.078125 = fieldNorm(doc=3651)
          0.041339412 = weight(abstract_txt:management in 3651) [ClassicSimilarity], result of:
            0.041339412 = score(doc=3651,freq=2.0), product of:
              0.08824927 = queryWeight, product of:
                1.1355624 = boost
                4.239827 = idf(docFreq=1739, maxDocs=44421)
                0.018329557 = queryNorm
              0.46843913 = fieldWeight in 3651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.239827 = idf(docFreq=1739, maxDocs=44421)
                0.078125 = fieldNorm(doc=3651)
          0.084014796 = weight(abstract_txt:semantics in 3651) [ClassicSimilarity], result of:
            0.084014796 = score(doc=3651,freq=1.0), product of:
              0.17839476 = queryWeight, product of:
                1.6145314 = boost
                6.0281444 = idf(docFreq=290, maxDocs=44421)
                0.018329557 = queryNorm
              0.4709488 = fieldWeight in 3651, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0281444 = idf(docFreq=290, maxDocs=44421)
                0.078125 = fieldNorm(doc=3651)
          0.24887083 = weight(abstract_txt:classifications in 3651) [ClassicSimilarity], result of:
            0.24887083 = score(doc=3651,freq=2.0), product of:
              0.3679546 = queryWeight, product of:
                3.2791975 = boost
                6.121738 = idf(docFreq=264, maxDocs=44421)
                0.018329557 = queryNorm
              0.6763629 = fieldWeight in 3651, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.121738 = idf(docFreq=264, maxDocs=44421)
                0.078125 = fieldNorm(doc=3651)
          0.20920506 = weight(abstract_txt:classification in 3651) [ClassicSimilarity], result of:
            0.20920506 = score(doc=3651,freq=6.0), product of:
              0.27384108 = queryWeight, product of:
                3.7423015 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.018329557 = queryNorm
              0.7639652 = fieldWeight in 3651, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.078125 = fieldNorm(doc=3651)
        0.2 = coord(5/25)
    
  4. Rafols, I.; Leydesdorff, L.: Content-based and algorithmic classifications of journals : perspectives on the dynamics of scientific communication and indexer effects (2009) 0.12
    0.11899527 = sum of:
      0.11899527 = product of:
        0.59497637 = sum of:
          0.023233972 = weight(abstract_txt:bibliographic in 82) [ClassicSimilarity], result of:
            0.023233972 = score(doc=82,freq=1.0), product of:
              0.08786864 = queryWeight, product of:
                1.1331109 = boost
                4.230674 = idf(docFreq=1755, maxDocs=44421)
                0.018329557 = queryNorm
              0.2644171 = fieldWeight in 82, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.230674 = idf(docFreq=1755, maxDocs=44421)
                0.0625 = fieldNorm(doc=82)
          0.14929573 = weight(abstract_txt:decompositions in 82) [ClassicSimilarity], result of:
            0.14929573 = score(doc=82,freq=1.0), product of:
              0.24104966 = queryWeight, product of:
                1.3270696 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.018329557 = queryNorm
              0.61935675 = fieldWeight in 82, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0625 = fieldNorm(doc=82)
          0.044253796 = weight(abstract_txt:based in 82) [ClassicSimilarity], result of:
            0.044253796 = score(doc=82,freq=5.0), product of:
              0.099480644 = queryWeight, product of:
                1.7050602 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.018329557 = queryNorm
              0.4448483 = fieldWeight in 82, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=82)
          0.28156522 = weight(abstract_txt:classifications in 82) [ClassicSimilarity], result of:
            0.28156522 = score(doc=82,freq=4.0), product of:
              0.3679546 = queryWeight, product of:
                3.2791975 = boost
                6.121738 = idf(docFreq=264, maxDocs=44421)
                0.018329557 = queryNorm
              0.76521724 = fieldWeight in 82, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.121738 = idf(docFreq=264, maxDocs=44421)
                0.0625 = fieldNorm(doc=82)
          0.09662766 = weight(abstract_txt:classification in 82) [ClassicSimilarity], result of:
            0.09662766 = score(doc=82,freq=2.0), product of:
              0.27384108 = queryWeight, product of:
                3.7423015 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.018329557 = queryNorm
              0.35286036 = fieldWeight in 82, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=82)
        0.2 = coord(5/25)
    
  5. Gnoli, C.; Santis, R. de; Pusterla, L.: Commerce, see also Rhetoric : cross-discipline relationships as authority data for enhanced retrieval (2015) 0.11
    0.109412305 = sum of:
      0.109412305 = product of:
        0.5470615 = sum of:
          0.023233972 = weight(abstract_txt:bibliographic in 3299) [ClassicSimilarity], result of:
            0.023233972 = score(doc=3299,freq=1.0), product of:
              0.08786864 = queryWeight, product of:
                1.1331109 = boost
                4.230674 = idf(docFreq=1755, maxDocs=44421)
                0.018329557 = queryNorm
              0.2644171 = fieldWeight in 3299, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.230674 = idf(docFreq=1755, maxDocs=44421)
                0.0625 = fieldNorm(doc=3299)
          0.12354185 = weight(abstract_txt:ontologically in 3299) [ClassicSimilarity], result of:
            0.12354185 = score(doc=3299,freq=1.0), product of:
              0.2124635 = queryWeight, product of:
                1.2458984 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.018329557 = queryNorm
              0.5814733 = fieldWeight in 3299, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.0625 = fieldNorm(doc=3299)
          0.019790899 = weight(abstract_txt:based in 3299) [ClassicSimilarity], result of:
            0.019790899 = score(doc=3299,freq=1.0), product of:
              0.099480644 = queryWeight, product of:
                1.7050602 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.018329557 = queryNorm
              0.1989422 = fieldWeight in 3299, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=3299)
          0.24384262 = weight(abstract_txt:classifications in 3299) [ClassicSimilarity], result of:
            0.24384262 = score(doc=3299,freq=3.0), product of:
              0.3679546 = queryWeight, product of:
                3.2791975 = boost
                6.121738 = idf(docFreq=264, maxDocs=44421)
                0.018329557 = queryNorm
              0.66269755 = fieldWeight in 3299, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.121738 = idf(docFreq=264, maxDocs=44421)
                0.0625 = fieldNorm(doc=3299)
          0.13665216 = weight(abstract_txt:classification in 3299) [ClassicSimilarity], result of:
            0.13665216 = score(doc=3299,freq=4.0), product of:
              0.27384108 = queryWeight, product of:
                3.7423015 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.018329557 = queryNorm
              0.49901992 = fieldWeight in 3299, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=3299)
        0.2 = coord(5/25)
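
The content-based scores combine several query terms: each term contributes queryWeight * fieldWeight, the contributions are summed, and the sum is scaled by coord (matched terms divided by query terms). As a rough check, the sketch below recomputes the first content match (doc=48) from the values in its explain tree. The idf and tf forms are the same assumptions as in the tree itself (1 + ln(maxDocs / (docFreq + 1)) and sqrt(freq)), and all names are illustrative.

```python
import math

MAX_DOCS = 44421          # maxDocs from the listing
QUERY_NORM = 0.018329557  # queryNorm from the listing
QUERY_TERMS = 25          # denominator of coord(5/25)
FIELD_NORM = 0.09375      # fieldNorm(doc=48)


def idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """Assumed idf form: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float, field_norm: float) -> float:
    """One term's contribution: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = math.sqrt(freq) * term_idf * field_norm
    return query_weight * field_weight


# (term, boost, docFreq, freq) copied from the explain tree for doc=48.
matched_terms = [
    ("unify", 1.2043636, 14, 1.0),
    ("framework", 1.2171667, 1282, 3.0),
    ("based", 1.7050602, 5005, 1.0),
    ("classifications", 3.2791975, 264, 1.0),
    ("classification", 3.7423015, 2228, 3.0),
]

term_sum = sum(term_score(b, df, f, FIELD_NORM) for _, b, df, f in matched_terms)
coord = len(matched_terms) / QUERY_TERMS
total = coord * term_sum

# The listing shows 0.66058534 for the sum and 0.13211708 for the total;
# expect agreement to several decimal places, not bit-for-bit.
print(f"sum = {term_sum:.6f}, coord = {coord}, total = {total:.6f}")
```

The coord factor explains why a document matching 7 of 25 terms (entry 2) can rank close to one matching only 5 terms with higher individual weights.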