Document (#35944)

Author
Wagger, S.
Park, R.
Bedford, D.A.D.
Title
Lessons learned in content architecture harmonization and metadata models
Source
Aslib proceedings. 62(2010) nos.4/5, S.387-405
Year
2010
Abstract
Purpose - This paper aims to review key content, architecture, and metadata model decisions and strategies in creation of a publication portal (on DVD to start), based on a 30+ year series of flagship reports from the World Bank. Design/methodology/approach - The paper describes and analyzes key considerations and aspects of the project, including content architecture, content analysis, DTD selection, retrospective conversion, vendor management, design of metadata architectures, use of automated profiling methods, user-information behavior, and search architectures supporting complex content architectures. It includes the challenges of applying an institutionally based taxonomy required to express subject-matter responsibilities and relationships within the World Bank. Findings - The team learned that the metadata behavior and architecture (inheritance, relationships, variations) are more complex than simple links between parent and child objects. The project also reinforced the importance of comprehensive and dynamic topic taxonomy for classifying content that is both historical and current. The approach to defining classes for each full report (parent) will be likely to change, given what has been learned. The team would recommend that parts be classified and the sum of the part classes be assigned to the whole report. As a result of this exploratory work, the Bank's approach to classification and indexing of report series is changing from a top-down to a bottom-up inheritance. Originality/value - The study provides insights into both general and World Bank-specific challenges in creating a publication portal and derives some best practices for content architecture, metadata architecture, and use of automated profiling methods.
Footnote
Beitrag in einem Special Issue: Content architecture: exploiting and managing diverse resources: proceedings of the first national conference of the United Kingdom chapter of the International Society for Knowedge Organization (ISKO)

Similar documents (author)

  1. Park, A.L.: ¬A comparison of a new OCLC/PRISM searches with earlier OCLC derived searches (1992) 4.65
    4.649242 = sum of:
      4.649242 = weight(author_txt:park in 4238) [ClassicSimilarity], result of:
        4.649242 = score(doc=4238,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.438788 = idf(docFreq=70, maxDocs=44421)
            0.1344305 = queryNorm
          4.6492424 = fieldWeight in 4238, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.438788 = idf(docFreq=70, maxDocs=44421)
            0.625 = fieldNorm(doc=4238)
    
  2. Park, T.K.: ¬The nature of relevance in information retrieval : an empirical study (1993) 4.65
    4.649242 = sum of:
      4.649242 = weight(author_txt:park in 5335) [ClassicSimilarity], result of:
        4.649242 = score(doc=5335,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.438788 = idf(docFreq=70, maxDocs=44421)
            0.1344305 = queryNorm
          4.6492424 = fieldWeight in 5335, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.438788 = idf(docFreq=70, maxDocs=44421)
            0.625 = fieldNorm(doc=5335)
    
  3. Park, T.K.: ¬The nature of relevance in information retrieval : an empirical study (1992) 4.65
    4.649242 = sum of:
      4.649242 = weight(author_txt:park in 5369) [ClassicSimilarity], result of:
        4.649242 = score(doc=5369,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.438788 = idf(docFreq=70, maxDocs=44421)
            0.1344305 = queryNorm
          4.6492424 = fieldWeight in 5369, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.438788 = idf(docFreq=70, maxDocs=44421)
            0.625 = fieldNorm(doc=5369)
    
  4. Park, A.L.: Automated authority control : making the transition (1992) 4.65
    4.649242 = sum of:
      4.649242 = weight(author_txt:park in 5393) [ClassicSimilarity], result of:
        4.649242 = score(doc=5393,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.438788 = idf(docFreq=70, maxDocs=44421)
            0.1344305 = queryNorm
          4.6492424 = fieldWeight in 5393, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.438788 = idf(docFreq=70, maxDocs=44421)
            0.625 = fieldNorm(doc=5393)
    
  5. Park, T.K.: Toward a theory of user-based relevance : a call for a new paradigm of inquiry (1994) 4.65
    4.649242 = sum of:
      4.649242 = weight(author_txt:park in 6925) [ClassicSimilarity], result of:
        4.649242 = score(doc=6925,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.438788 = idf(docFreq=70, maxDocs=44421)
            0.1344305 = queryNorm
          4.6492424 = fieldWeight in 6925, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.438788 = idf(docFreq=70, maxDocs=44421)
            0.625 = fieldNorm(doc=6925)
    

Similar documents (content)

  1. Willis, C.; Greenberg, J.; White, H.: Analysis and synthesis of metadata goals for scientific data (2012) 0.12
    0.119122684 = sum of:
      0.119122684 = product of:
        0.49634454 = sum of:
          0.032418482 = weight(abstract_txt:relationships in 1367) [ClassicSimilarity], result of:
            0.032418482 = score(doc=1367,freq=2.0), product of:
              0.08733187 = queryWeight, product of:
                4.7997303 = idf(docFreq=993, maxDocs=44421)
                0.018195162 = queryNorm
              0.37121022 = fieldWeight in 1367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7997303 = idf(docFreq=993, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1367)
          0.031298432 = weight(abstract_txt:publication in 1367) [ClassicSimilarity], result of:
            0.031298432 = score(doc=1367,freq=1.0), product of:
              0.107482076 = queryWeight, product of:
                1.1093833 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.018195162 = queryNorm
              0.29119676 = fieldWeight in 1367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1367)
          0.04908523 = weight(abstract_txt:report in 1367) [ClassicSimilarity], result of:
            0.04908523 = score(doc=1367,freq=1.0), product of:
              0.16608049 = queryWeight, product of:
                1.688957 = boost
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.018195162 = queryNorm
              0.29555085 = fieldWeight in 1367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1367)
          0.12800442 = weight(abstract_txt:architectures in 1367) [ClassicSimilarity], result of:
            0.12800442 = score(doc=1367,freq=1.0), product of:
              0.31465507 = queryWeight, product of:
                2.3247519 = boost
                7.438788 = idf(docFreq=70, maxDocs=44421)
                0.018195162 = queryNorm
              0.4068087 = fieldWeight in 1367, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.438788 = idf(docFreq=70, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1367)
          0.18061465 = weight(abstract_txt:metadata in 1367) [ClassicSimilarity], result of:
            0.18061465 = score(doc=1367,freq=9.0), product of:
              0.22562583 = queryWeight, product of:
                2.5414293 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.018195162 = queryNorm
              0.8005052 = fieldWeight in 1367, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1367)
          0.074923314 = weight(abstract_txt:content in 1367) [ClassicSimilarity], result of:
            0.074923314 = score(doc=1367,freq=2.0), product of:
              0.23178126 = queryWeight, product of:
                3.0478024 = boost
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.018195162 = queryNorm
              0.3232501 = fieldWeight in 1367, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1367)
        0.24 = coord(6/25)
    
  2. Braun, S.: Manifold: a custom analytics platform to visualize research impact (2015) 0.12
    0.11909858 = sum of:
      0.11909858 = product of:
        0.4962441 = sum of:
          0.040062837 = weight(abstract_txt:challenges in 3906) [ClassicSimilarity], result of:
            0.040062837 = score(doc=3906,freq=1.0), product of:
              0.09989586 = queryWeight, product of:
                1.0695162 = boost
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.018195162 = queryNorm
              0.40104604 = fieldWeight in 3906, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.078125 = fieldNorm(doc=3906)
          0.044712048 = weight(abstract_txt:publication in 3906) [ClassicSimilarity], result of:
            0.044712048 = score(doc=3906,freq=1.0), product of:
              0.107482076 = queryWeight, product of:
                1.1093833 = boost
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.018195162 = queryNorm
              0.4159954 = fieldWeight in 3906, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.324741 = idf(docFreq=587, maxDocs=44421)
                0.078125 = fieldNorm(doc=3906)
          0.05204277 = weight(abstract_txt:automated in 3906) [ClassicSimilarity], result of:
            0.05204277 = score(doc=3906,freq=1.0), product of:
              0.11893051 = queryWeight, product of:
                1.1669716 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.018195162 = queryNorm
              0.43758973 = fieldWeight in 3906, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.078125 = fieldNorm(doc=3906)
          0.07012176 = weight(abstract_txt:report in 3906) [ClassicSimilarity], result of:
            0.07012176 = score(doc=3906,freq=1.0), product of:
              0.16608049 = queryWeight, product of:
                1.688957 = boost
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.018195162 = queryNorm
              0.4222155 = fieldWeight in 3906, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.4043584 = idf(docFreq=542, maxDocs=44421)
                0.078125 = fieldNorm(doc=3906)
          0.12539868 = weight(abstract_txt:learned in 3906) [ClassicSimilarity], result of:
            0.12539868 = score(doc=3906,freq=1.0), product of:
              0.24468768 = queryWeight, product of:
                2.050054 = boost
                6.559804 = idf(docFreq=170, maxDocs=44421)
                0.018195162 = queryNorm
              0.51248467 = fieldWeight in 3906, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.559804 = idf(docFreq=170, maxDocs=44421)
                0.078125 = fieldNorm(doc=3906)
          0.16390601 = weight(abstract_txt:architecture in 3906) [ClassicSimilarity], result of:
            0.16390601 = score(doc=3906,freq=1.0), product of:
              0.36854458 = queryWeight, product of:
                3.5581093 = boost
                5.6926546 = idf(docFreq=406, maxDocs=44421)
                0.018195162 = queryNorm
              0.44473863 = fieldWeight in 3906, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6926546 = idf(docFreq=406, maxDocs=44421)
                0.078125 = fieldNorm(doc=3906)
        0.24 = coord(6/25)
    
  3. Kurth, M.; Ruddy, D.; Rupp, N.: Repurposing MARC metadata : using digital project experience to develop a metadata management design (2004) 0.11
    0.111005686 = sum of:
      0.111005686 = product of:
        0.55502844 = sum of:
          0.03274761 = weight(abstract_txt:relationships in 5748) [ClassicSimilarity], result of:
            0.03274761 = score(doc=5748,freq=1.0), product of:
              0.08733187 = queryWeight, product of:
                4.7997303 = idf(docFreq=993, maxDocs=44421)
                0.018195162 = queryNorm
              0.37497893 = fieldWeight in 5748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7997303 = idf(docFreq=993, maxDocs=44421)
                0.078125 = fieldNorm(doc=5748)
          0.023261279 = weight(abstract_txt:approach in 5748) [ClassicSimilarity], result of:
            0.023261279 = score(doc=5748,freq=1.0), product of:
              0.079586454 = queryWeight, product of:
                1.1691731 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.018195162 = queryNorm
              0.29227686 = fieldWeight in 5748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=5748)
          0.12539868 = weight(abstract_txt:learned in 5748) [ClassicSimilarity], result of:
            0.12539868 = score(doc=5748,freq=1.0), product of:
              0.24468768 = queryWeight, product of:
                2.050054 = boost
                6.559804 = idf(docFreq=170, maxDocs=44421)
                0.018195162 = queryNorm
              0.51248467 = fieldWeight in 5748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.559804 = idf(docFreq=170, maxDocs=44421)
                0.078125 = fieldNorm(doc=5748)
          0.2979369 = weight(abstract_txt:metadata in 5748) [ClassicSimilarity], result of:
            0.2979369 = score(doc=5748,freq=12.0), product of:
              0.22562583 = queryWeight, product of:
                2.5414293 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.018195162 = queryNorm
              1.3204911 = fieldWeight in 5748, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.078125 = fieldNorm(doc=5748)
          0.075683974 = weight(abstract_txt:content in 5748) [ClassicSimilarity], result of:
            0.075683974 = score(doc=5748,freq=1.0), product of:
              0.23178126 = queryWeight, product of:
                3.0478024 = boost
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.018195162 = queryNorm
              0.3265319 = fieldWeight in 5748, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.078125 = fieldNorm(doc=5748)
        0.2 = coord(5/25)
    
  4. Zimmermann, E.H.: CRIS-Cross : Current Research Information Systems at a Crossroads (2002) 0.10
    0.10456817 = sum of:
      0.10456817 = product of:
        0.4357007 = sum of:
          0.038827207 = weight(abstract_txt:complex in 4590) [ClassicSimilarity], result of:
            0.038827207 = score(doc=4590,freq=1.0), product of:
              0.09783114 = queryWeight, product of:
                1.0584058 = boost
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.018195162 = queryNorm
              0.39687985 = fieldWeight in 4590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.078125 = fieldNorm(doc=4590)
          0.040062837 = weight(abstract_txt:challenges in 4590) [ClassicSimilarity], result of:
            0.040062837 = score(doc=4590,freq=1.0), product of:
              0.09989586 = queryWeight, product of:
                1.0695162 = boost
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.018195162 = queryNorm
              0.40104604 = fieldWeight in 4590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.078125 = fieldNorm(doc=4590)
          0.07867769 = weight(abstract_txt:taxonomy in 4590) [ClassicSimilarity], result of:
            0.07867769 = score(doc=4590,freq=1.0), product of:
              0.15665855 = queryWeight, product of:
                1.3393395 = boost
                6.428468 = idf(docFreq=194, maxDocs=44421)
                0.018195162 = queryNorm
              0.5022241 = fieldWeight in 4590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.428468 = idf(docFreq=194, maxDocs=44421)
                0.078125 = fieldNorm(doc=4590)
          0.038542993 = weight(abstract_txt:world in 4590) [ClassicSimilarity], result of:
            0.038542993 = score(doc=4590,freq=1.0), product of:
              0.11144152 = queryWeight, product of:
                1.3835114 = boost
                4.426988 = idf(docFreq=1442, maxDocs=44421)
                0.018195162 = queryNorm
              0.34585845 = fieldWeight in 4590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.426988 = idf(docFreq=1442, maxDocs=44421)
                0.078125 = fieldNorm(doc=4590)
          0.075683974 = weight(abstract_txt:content in 4590) [ClassicSimilarity], result of:
            0.075683974 = score(doc=4590,freq=1.0), product of:
              0.23178126 = queryWeight, product of:
                3.0478024 = boost
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.018195162 = queryNorm
              0.3265319 = fieldWeight in 4590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.078125 = fieldNorm(doc=4590)
          0.16390601 = weight(abstract_txt:architecture in 4590) [ClassicSimilarity], result of:
            0.16390601 = score(doc=4590,freq=1.0), product of:
              0.36854458 = queryWeight, product of:
                3.5581093 = boost
                5.6926546 = idf(docFreq=406, maxDocs=44421)
                0.018195162 = queryNorm
              0.44473863 = fieldWeight in 4590, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6926546 = idf(docFreq=406, maxDocs=44421)
                0.078125 = fieldNorm(doc=4590)
        0.24 = coord(6/25)
    
  5. Tsui, E.; Wang, W.M.; Cheung, C.F.; Lau, A.S.M.: ¬A concept-relationship acquisition and inference approach for hierarchical taxonomy construction from tags (2010) 0.10
    0.10067449 = sum of:
      0.10067449 = product of:
        0.41947705 = sum of:
          0.033346493 = weight(abstract_txt:behavior in 220) [ClassicSimilarity], result of:
            0.033346493 = score(doc=220,freq=1.0), product of:
              0.10257144 = queryWeight, product of:
                1.0837444 = boost
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.018195162 = queryNorm
              0.32510504 = fieldWeight in 220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.0625 = fieldNorm(doc=220)
          0.041634217 = weight(abstract_txt:automated in 220) [ClassicSimilarity], result of:
            0.041634217 = score(doc=220,freq=1.0), product of:
              0.11893051 = queryWeight, product of:
                1.1669716 = boost
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.018195162 = queryNorm
              0.3500718 = fieldWeight in 220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6011486 = idf(docFreq=445, maxDocs=44421)
                0.0625 = fieldNorm(doc=220)
          0.026317133 = weight(abstract_txt:approach in 220) [ClassicSimilarity], result of:
            0.026317133 = score(doc=220,freq=2.0), product of:
              0.079586454 = queryWeight, product of:
                1.1691731 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.018195162 = queryNorm
              0.33067352 = fieldWeight in 220, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=220)
          0.18882646 = weight(abstract_txt:taxonomy in 220) [ClassicSimilarity], result of:
            0.18882646 = score(doc=220,freq=9.0), product of:
              0.15665855 = queryWeight, product of:
                1.3393395 = boost
                6.428468 = idf(docFreq=194, maxDocs=44421)
                0.018195162 = queryNorm
              1.2053378 = fieldWeight in 220, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.428468 = idf(docFreq=194, maxDocs=44421)
                0.0625 = fieldNorm(doc=220)
          0.06880558 = weight(abstract_txt:metadata in 220) [ClassicSimilarity], result of:
            0.06880558 = score(doc=220,freq=1.0), product of:
              0.22562583 = queryWeight, product of:
                2.5414293 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.018195162 = queryNorm
              0.30495438 = fieldWeight in 220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.0625 = fieldNorm(doc=220)
          0.06054718 = weight(abstract_txt:content in 220) [ClassicSimilarity], result of:
            0.06054718 = score(doc=220,freq=1.0), product of:
              0.23178126 = queryWeight, product of:
                3.0478024 = boost
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.018195162 = queryNorm
              0.26122552 = fieldWeight in 220, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.0625 = fieldNorm(doc=220)
        0.24 = coord(6/25)