Document (#28814)

Author
Kuhr, P.S.
Title
Putting the world back together : mapping multiple vocabularies into a single thesaurus
Source
Subject retrieval in a networked environment: Proceedings of the IFLA Satellite Meeting held in Dublin, OH, 14-16 August 2001 and sponsored by the IFLA Classification and Indexing Section, the IFLA Information Technology Section and OCLC. Ed.: I.C. McIlwaine
Imprint
München : Saur
Year
2003
Pages
pp. 37-42
Series
UBCIM publications: new series; vol.25
Abstract
This paper describes an ongoing project in which the subject headings contained in twelve controlled vocabularies, covering multiple disciplines from the humanities to the sciences and including law and education, among others, are being collapsed into a single vocabulary and reference structure. The design of the database, the algorithms created to programmatically link like concepts, and daily maintenance are detailed. The problems and pitfalls of dealing with multiple vocabularies are noted, as well as the difficulties of relying purely on computer-generated algorithms. The application of this megathesaurus to bibliographic records and the methodology of retrieval are explained.
Footnote
A contribution on the topic of merging or combining different thesauri into a
Theme
Conception and application of the thesaurus principle

Similar documents (content)

  1. Kempf, A.O.; Ritze, D.; Eckert, K.; Zapilko, B.: New ways of mapping knowledge organization systems : using a semi-automatic matching procedure for building up vocabulary crosswalks (2014) 0.11
    0.110316105 = sum of:
      0.110316105 = product of:
        0.45965046 = sum of:
          0.040361214 = weight(abstract_txt:generated in 2371) [ClassicSimilarity], result of:
            0.040361214 = score(doc=2371,freq=1.0), product of:
              0.11722039 = queryWeight, product of:
                5.509105 = idf(docFreq=488, maxDocs=44421)
                0.021277573 = queryNorm
              0.34431908 = fieldWeight in 2371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.509105 = idf(docFreq=488, maxDocs=44421)
                0.0625 = fieldNorm(doc=2371)
          0.04488076 = weight(abstract_txt:link in 2371) [ClassicSimilarity], result of:
            0.04488076 = score(doc=2371,freq=1.0), product of:
              0.12581539 = queryWeight, product of:
                1.0360132 = boost
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.021277573 = queryNorm
              0.35671914 = fieldWeight in 2371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.707506 = idf(docFreq=400, maxDocs=44421)
                0.0625 = fieldNorm(doc=2371)
          0.061384965 = weight(abstract_txt:maintenance in 2371) [ClassicSimilarity], result of:
            0.061384965 = score(doc=2371,freq=1.0), product of:
              0.15502498 = queryWeight, product of:
                1.1500038 = boost
                6.3354917 = idf(docFreq=213, maxDocs=44421)
                0.021277573 = queryNorm
              0.39596823 = fieldWeight in 2371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3354917 = idf(docFreq=213, maxDocs=44421)
                0.0625 = fieldNorm(doc=2371)
          0.06522765 = weight(abstract_txt:back in 2371) [ClassicSimilarity], result of:
            0.06522765 = score(doc=2371,freq=1.0), product of:
              0.16142897 = queryWeight, product of:
                1.1735164 = boost
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.021277573 = queryNorm
              0.4040641 = fieldWeight in 2371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4650254 = idf(docFreq=187, maxDocs=44421)
                0.0625 = fieldNorm(doc=2371)
          0.09796138 = weight(abstract_txt:multiple in 2371) [ClassicSimilarity], result of:
            0.09796138 = score(doc=2371,freq=1.0), product of:
              0.30533084 = queryWeight, product of:
                2.7954028 = boost
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.021277573 = queryNorm
              0.32083684 = fieldWeight in 2371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.0625 = fieldNorm(doc=2371)
          0.14983453 = weight(abstract_txt:vocabularies in 2371) [ClassicSimilarity], result of:
            0.14983453 = score(doc=2371,freq=1.0), product of:
              0.40532994 = queryWeight, product of:
                3.2207973 = boost
                5.9145703 = idf(docFreq=325, maxDocs=44421)
                0.021277573 = queryNorm
              0.36966065 = fieldWeight in 2371, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9145703 = idf(docFreq=325, maxDocs=44421)
                0.0625 = fieldNorm(doc=2371)
        0.24 = coord(6/25)
    
  2. Purpura, A.; Silvello, G.; Susto, G.A.: Learning to rank from relevance judgments distributions (2022) 0.09
    0.09361587 = sum of:
      0.09361587 = product of:
        0.46807933 = sum of:
          0.040361214 = weight(abstract_txt:generated in 1646) [ClassicSimilarity], result of:
            0.040361214 = score(doc=1646,freq=1.0), product of:
              0.11722039 = queryWeight, product of:
                5.509105 = idf(docFreq=488, maxDocs=44421)
                0.021277573 = queryNorm
              0.34431908 = fieldWeight in 1646, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.509105 = idf(docFreq=488, maxDocs=44421)
                0.0625 = fieldNorm(doc=1646)
          0.14298227 = weight(abstract_txt:relying in 1646) [ClassicSimilarity], result of:
            0.14298227 = score(doc=1646,freq=2.0), product of:
              0.21620803 = queryWeight, product of:
                1.358108 = boost
                7.48196 = idf(docFreq=67, maxDocs=44421)
                0.021277573 = queryNorm
              0.66131806 = fieldWeight in 1646, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.48196 = idf(docFreq=67, maxDocs=44421)
                0.0625 = fieldNorm(doc=1646)
          0.09736415 = weight(abstract_txt:single in 1646) [ClassicSimilarity], result of:
            0.09736415 = score(doc=1646,freq=2.0), product of:
              0.21084325 = queryWeight, product of:
                1.8966765 = boost
                5.2244954 = idf(docFreq=649, maxDocs=44421)
                0.021277573 = queryNorm
              0.4617845 = fieldWeight in 1646, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2244954 = idf(docFreq=649, maxDocs=44421)
                0.0625 = fieldNorm(doc=1646)
          0.08941031 = weight(abstract_txt:algorithms in 1646) [ClassicSimilarity], result of:
            0.08941031 = score(doc=1646,freq=1.0), product of:
              0.250974 = queryWeight, product of:
                2.0693207 = boost
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.021277573 = queryNorm
              0.3562533 = fieldWeight in 1646, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7000527 = idf(docFreq=403, maxDocs=44421)
                0.0625 = fieldNorm(doc=1646)
          0.09796138 = weight(abstract_txt:multiple in 1646) [ClassicSimilarity], result of:
            0.09796138 = score(doc=1646,freq=1.0), product of:
              0.30533084 = queryWeight, product of:
                2.7954028 = boost
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.021277573 = queryNorm
              0.32083684 = fieldWeight in 1646, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.0625 = fieldNorm(doc=1646)
        0.2 = coord(5/25)
    
  3. ISO 25964-2: Thesauri and interoperability with other vocabularies : Part 2: Interoperability with other vocabularies (2013) 0.09
    0.08909865 = sum of:
      0.08909865 = product of:
        0.74248874 = sum of:
          0.12276993 = weight(abstract_txt:maintenance in 832) [ClassicSimilarity], result of:
            0.12276993 = score(doc=832,freq=1.0), product of:
              0.15502498 = queryWeight, product of:
                1.1500038 = boost
                6.3354917 = idf(docFreq=213, maxDocs=44421)
                0.021277573 = queryNorm
              0.79193646 = fieldWeight in 832, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3354917 = idf(docFreq=213, maxDocs=44421)
                0.125 = fieldNorm(doc=832)
          0.19592276 = weight(abstract_txt:multiple in 832) [ClassicSimilarity], result of:
            0.19592276 = score(doc=832,freq=1.0), product of:
              0.30533084 = queryWeight, product of:
                2.7954028 = boost
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.021277573 = queryNorm
              0.6416737 = fieldWeight in 832, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.125 = fieldNorm(doc=832)
          0.42379606 = weight(abstract_txt:vocabularies in 832) [ClassicSimilarity], result of:
            0.42379606 = score(doc=832,freq=2.0), product of:
              0.40532994 = queryWeight, product of:
                3.2207973 = boost
                5.9145703 = idf(docFreq=325, maxDocs=44421)
                0.021277573 = queryNorm
              1.0455582 = fieldWeight in 832, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9145703 = idf(docFreq=325, maxDocs=44421)
                0.125 = fieldNorm(doc=832)
        0.12 = coord(3/25)
    
  4. Will, L.: Thesaurus management software (2009) 0.09
    0.08716515 = sum of:
      0.08716515 = product of:
        0.54478216 = sum of:
          0.06981797 = weight(abstract_txt:mapping in 879) [ClassicSimilarity], result of:
            0.06981797 = score(doc=879,freq=1.0), product of:
              0.12890734 = queryWeight, product of:
                1.0486661 = boost
                5.7772117 = idf(docFreq=373, maxDocs=44421)
                0.021277573 = queryNorm
              0.5416136 = fieldWeight in 879, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.7772117 = idf(docFreq=373, maxDocs=44421)
                0.09375 = fieldNorm(doc=879)
          0.10327028 = weight(abstract_txt:single in 879) [ClassicSimilarity], result of:
            0.10327028 = score(doc=879,freq=1.0), product of:
              0.21084325 = queryWeight, product of:
                1.8966765 = boost
                5.2244954 = idf(docFreq=649, maxDocs=44421)
                0.021277573 = queryNorm
              0.48979646 = fieldWeight in 879, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2244954 = idf(docFreq=649, maxDocs=44421)
                0.09375 = fieldNorm(doc=879)
          0.14694208 = weight(abstract_txt:multiple in 879) [ClassicSimilarity], result of:
            0.14694208 = score(doc=879,freq=1.0), product of:
              0.30533084 = queryWeight, product of:
                2.7954028 = boost
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.021277573 = queryNorm
              0.48125526 = fieldWeight in 879, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1333895 = idf(docFreq=711, maxDocs=44421)
                0.09375 = fieldNorm(doc=879)
          0.2247518 = weight(abstract_txt:vocabularies in 879) [ClassicSimilarity], result of:
            0.2247518 = score(doc=879,freq=1.0), product of:
              0.40532994 = queryWeight, product of:
                3.2207973 = boost
                5.9145703 = idf(docFreq=325, maxDocs=44421)
                0.021277573 = queryNorm
              0.554491 = fieldWeight in 879, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9145703 = idf(docFreq=325, maxDocs=44421)
                0.09375 = fieldNorm(doc=879)
        0.16 = coord(4/25)
    
  5. Lauser, B.; Johannsen, G.; Caracciolo, C.; Hage, W.R. van; Keizer, J.; Mayr, P.: Comparing human and automatic thesaurus mapping approaches in the agricultural domain (2008) 0.08
    0.078596324 = sum of:
      0.078596324 = product of:
        0.49122703 = sum of:
          0.1300981 = weight(abstract_txt:mapping in 3627) [ClassicSimilarity], result of:
            0.1300981 = score(doc=3627,freq=5.0), product of:
              0.12890734 = queryWeight, product of:
                1.0486661 = boost
                5.7772117 = idf(docFreq=373, maxDocs=44421)
                0.021277573 = queryNorm
              1.0092374 = fieldWeight in 3627, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.7772117 = idf(docFreq=373, maxDocs=44421)
                0.078125 = fieldNorm(doc=3627)
          0.06576024 = weight(abstract_txt:difficulties in 3627) [ClassicSimilarity], result of:
            0.06576024 = score(doc=3627,freq=1.0), product of:
              0.13987151 = queryWeight, product of:
                1.0923531 = boost
                6.017888 = idf(docFreq=293, maxDocs=44421)
                0.021277573 = queryNorm
              0.4701475 = fieldWeight in 3627, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.017888 = idf(docFreq=293, maxDocs=44421)
                0.078125 = fieldNorm(doc=3627)
          0.030496174 = weight(abstract_txt:into in 3627) [ClassicSimilarity], result of:
            0.030496174 = score(doc=3627,freq=1.0), product of:
              0.105582975 = queryWeight, product of:
                1.3421788 = boost
                3.697102 = idf(docFreq=2993, maxDocs=44421)
                0.021277573 = queryNorm
              0.2888361 = fieldWeight in 3627, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.697102 = idf(docFreq=2993, maxDocs=44421)
                0.078125 = fieldNorm(doc=3627)
          0.26487252 = weight(abstract_txt:vocabularies in 3627) [ClassicSimilarity], result of:
            0.26487252 = score(doc=3627,freq=2.0), product of:
              0.40532994 = queryWeight, product of:
                3.2207973 = boost
                5.9145703 = idf(docFreq=325, maxDocs=44421)
                0.021277573 = queryNorm
              0.65347385 = fieldWeight in 3627, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9145703 = idf(docFreq=325, maxDocs=44421)
                0.078125 = fieldNorm(doc=3627)
        0.16 = coord(4/25)
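
The score explanations above can be reproduced by hand. The following is a minimal sketch, assuming the arithmetic is exactly what the explanation trees show: each term score is queryWeight (boost × idf × queryNorm) times fieldWeight (tf × idf × fieldNorm), with tf equal to the square root of the term frequency, and the document score is the sum of the term scores multiplied by the coord factor. The idf formula used here, 1 + ln(maxDocs / (docFreq + 1)), is the one ClassicSimilarity is known to use and agrees with the listed values. The function names `idf`, `term_score`, and `document_score` are hypothetical helpers for illustration and are not part of any Lucene API.

```python
import math


def idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency as it appears in the explanations above."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq: float, doc_freq: int, max_docs: int,
               query_norm: float, field_norm: float,
               boost: float = 1.0) -> float:
    """Score of one query term in one document: queryWeight * fieldWeight."""
    tf = math.sqrt(freq)                            # e.g. 1.4142135 for freq=2.0
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm    # query side
    field_weight = tf * term_idf * field_norm       # document side
    return query_weight * field_weight


def document_score(term_scores, matched: int, total: int) -> float:
    """Sum of the term scores, scaled by coord(matched/total)."""
    return sum(term_scores) * (matched / total)


# Term "generated" in document 2371 (values copied from the listing above).
generated = term_score(freq=1.0, doc_freq=488, max_docs=44421,
                       query_norm=0.021277573, field_norm=0.0625)

# All six matching term scores of document 2371, combined with coord(6/25).
doc_2371 = document_score(
    [0.040361214, 0.04488076, 0.061384965,
     0.06522765, 0.09796138, 0.14983453],
    matched=6, total=25,
)

print(f"{generated:.6f}")
print(f"{doc_2371:.6f}")
```

Because idf enters both queryWeight and fieldWeight, rare terms such as "relying" (docFreq=67) weigh quadratically more than common ones such as "into" (docFreq=2993). The coord factor then penalises documents that match only a few of the 25 query terms.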