Document (#40196)

Author
Lamb, I.
Larson, C.
Title
Shining a light on scientific data : building a data catalog to foster data sharing and reuse
Source
Code4Lib journal. Issue 32(2016), [http://journal.code4lib.org]
Year
2016
Abstract
The scientific community's growing eagerness to make research data available to the public provides libraries - with our expertise in metadata and discovery - an interesting new opportunity. This paper details the in-house creation of a "data catalog" which describes datasets ranging from population-level studies like the US Census to small, specialized datasets created by researchers at our own institution. Based on Symfony2 and Solr, the data catalog provides a powerful search interface to help researchers locate the data that can help them, and an administrative interface so librarians can add, edit, and manage metadata elements at will. This paper will outline the successes, failures, and total redos that culminated in the current manifestation of our data catalog.
Content
Cf.: http://journal.code4lib.org/articles/11421.
Theme
Informetrics
Visualization

Similar documents (author)

  1. Larson, R.R.: Between Scylla and Charybdis : searching in the online catalog (1991) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:larson in 461) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 461, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=461)
    
  2. Larson, R.R.: Evaluation of advanced retrieval techniques in an experimental online catalog (1992) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:larson in 480) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 480, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=480)
    
  3. Larson, R.R.: Experiments in automatic Library of Congress Classification (1992) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:larson in 1053) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 1053, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=1053)
    
  4. Larson, R.R.: Classification clustering, probabilistic information retrieval, and the online catalog (1991) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:larson in 1069) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 1069, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=1069)
    
  5. Larson, R.R.: The decline of subject searching : long-term trends and patterns of index use in an online catalog (1991) 5.21
    5.2088575 = sum of:
      5.2088575 = weight(author_txt:larson in 1103) [ClassicSimilarity], result of:
        5.2088575 = fieldWeight in 1103, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.334172 = idf(docFreq=28, maxDocs=44421)
          0.625 = fieldNorm(doc=1103)
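
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming the standard Lucene ClassicSimilarity formulas (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), which are consistent with the figures shown. The function names are illustrative and not part of any library, and the fieldNorm value is copied from the explanation rather than derived.

```python
import math


def classic_tf(freq: float) -> float:
    """Term-frequency factor: square root of the raw term frequency."""
    return math.sqrt(freq)


def classic_idf(doc_freq: int, max_docs: int) -> float:
    """Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the explanation tree."""
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm


# Values taken from the author_txt:larson explanations above.
idf = classic_idf(doc_freq=28, max_docs=44421)
score = field_weight(freq=1.0, doc_freq=28, max_docs=44421, field_norm=0.625)

print(round(idf, 4))    # close to the listed 8.334172
print(round(score, 4))  # close to the listed 5.2088575
```

All five author matches share the same freq, docFreq, and fieldNorm, which is why they all receive the same score of 5.21. The last digits may differ slightly from the listing because Lucene computes with 32-bit floats.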
    

Similar documents (content)

  1. Senzig, D.: Library catalogs for library users (1984) 0.18
    0.17580597 = sum of:
      0.17580597 = product of:
        0.87902987 = sum of:
          0.055910025 = weight(abstract_txt:will in 830) [ClassicSimilarity], result of:
            0.055910025 = score(doc=830,freq=2.0), product of:
              0.093604155 = queryWeight, product of:
                1.1460941 = boost
                3.8615482 = idf(docFreq=2539, maxDocs=44421)
                0.021150148 = queryNorm
              0.5973028 = fieldWeight in 830, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.8615482 = idf(docFreq=2539, maxDocs=44421)
                0.109375 = fieldNorm(doc=830)
          0.15980966 = weight(abstract_txt:failures in 830) [ClassicSimilarity], result of:
            0.15980966 = score(doc=830,freq=1.0), product of:
              0.18852577 = queryWeight, product of:
                1.1501198 = boost
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.021150148 = queryNorm
              0.84768075 = fieldWeight in 830, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.109375 = fieldNorm(doc=830)
          0.20387843 = weight(abstract_txt:successes in 830) [ClassicSimilarity], result of:
            0.20387843 = score(doc=830,freq=1.0), product of:
              0.22175984 = queryWeight, product of:
                1.2473811 = boost
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.021150148 = queryNorm
              0.9193659 = fieldWeight in 830, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.405631 = idf(docFreq=26, maxDocs=44421)
                0.109375 = fieldNorm(doc=830)
          0.07654371 = weight(abstract_txt:help in 830) [ClassicSimilarity], result of:
            0.07654371 = score(doc=830,freq=1.0), product of:
              0.14540692 = queryWeight, product of:
                1.4284507 = boost
                4.8128953 = idf(docFreq=980, maxDocs=44421)
                0.021150148 = queryNorm
              0.5264104 = fieldWeight in 830, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8128953 = idf(docFreq=980, maxDocs=44421)
                0.109375 = fieldNorm(doc=830)
          0.38288805 = weight(abstract_txt:catalog in 830) [ClassicSimilarity], result of:
            0.38288805 = score(doc=830,freq=3.0), product of:
              0.37153193 = queryWeight, product of:
                3.229132 = boost
                5.4399757 = idf(docFreq=523, maxDocs=44421)
                0.021150148 = queryNorm
              1.0305656 = fieldWeight in 830, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.4399757 = idf(docFreq=523, maxDocs=44421)
                0.109375 = fieldNorm(doc=830)
        0.2 = coord(5/25)
    
  2. Daniel Jr., R.; Lagoze, C.: Extending the Warwick framework : from metadata containers to active digital objects (1997) 0.14
    0.14078863 = sum of:
      0.14078863 = product of:
        0.50281656 = sum of:
          0.017616816 = weight(abstract_txt:paper in 2264) [ClassicSimilarity], result of:
            0.017616816 = score(doc=2264,freq=3.0), product of:
              0.07521918 = queryWeight, product of:
                1.0273939 = boost
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.021150148 = queryNorm
              0.23420644 = fieldWeight in 2264, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2264)
          0.014119413 = weight(abstract_txt:will in 2264) [ClassicSimilarity], result of:
            0.014119413 = score(doc=2264,freq=1.0), product of:
              0.093604155 = queryWeight, product of:
                1.1460941 = boost
                3.8615482 = idf(docFreq=2539, maxDocs=44421)
                0.021150148 = queryNorm
              0.15084173 = fieldWeight in 2264, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.8615482 = idf(docFreq=2539, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2264)
          0.025903394 = weight(abstract_txt:provides in 2264) [ClassicSimilarity], result of:
            0.025903394 = score(doc=2264,freq=2.0), product of:
              0.11133845 = queryWeight, product of:
                1.2499577 = boost
                4.211497 = idf(docFreq=1789, maxDocs=44421)
                0.021150148 = queryNorm
              0.23265453 = fieldWeight in 2264, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.211497 = idf(docFreq=1789, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2264)
          0.09867052 = weight(abstract_txt:metadata in 2264) [ClassicSimilarity], result of:
            0.09867052 = score(doc=2264,freq=12.0), product of:
              0.14944519 = queryWeight, product of:
                1.4481504 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.021150148 = queryNorm
              0.66024554 = fieldWeight in 2264, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2264)
          0.068848304 = weight(abstract_txt:datasets in 2264) [ClassicSimilarity], result of:
            0.068848304 = score(doc=2264,freq=1.0), product of:
              0.26916146 = queryWeight, product of:
                1.9434758 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.021150148 = queryNorm
              0.25578812 = fieldWeight in 2264, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2264)
          0.11165243 = weight(abstract_txt:catalog in 2264) [ClassicSimilarity], result of:
            0.11165243 = score(doc=2264,freq=2.0), product of:
              0.37153193 = queryWeight, product of:
                3.229132 = boost
                5.4399757 = idf(docFreq=523, maxDocs=44421)
                0.021150148 = queryNorm
              0.30051905 = fieldWeight in 2264, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.4399757 = idf(docFreq=523, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2264)
          0.16600564 = weight(abstract_txt:data in 2264) [ClassicSimilarity], result of:
            0.16600564 = score(doc=2264,freq=21.0), product of:
              0.2784707 = queryWeight, product of:
                3.9535975 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.021150148 = queryNorm
              0.59613323 = fieldWeight in 2264, product of:
                4.582576 = tf(freq=21.0), with freq of:
                  21.0 = termFreq=21.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2264)
        0.28 = coord(7/25)
    
  3. Baker, T.; Bermès, E.; Coyle, K.; Dunsire, G.; Isaac, A.; Murray, P.; Panzer, M.; Schneider, J.; Singer, R.; Summers, E.; Waites, W.; Young, J.; Zeng, M.: Library Linked Data Incubator Group Final Report (2011) 0.13
    0.13162448 = sum of:
      0.13162448 = product of:
        0.54843533 = sum of:
          0.04727681 = weight(abstract_txt:ranging in 796) [ClassicSimilarity], result of:
            0.04727681 = score(doc=796,freq=1.0), product of:
              0.14724888 = queryWeight, product of:
                1.0164446 = boost
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.021150148 = queryNorm
              0.32106736 = fieldWeight in 796, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.046875 = fieldNorm(doc=796)
          0.06519589 = weight(abstract_txt:foster in 796) [ClassicSimilarity], result of:
            0.06519589 = score(doc=796,freq=1.0), product of:
              0.1824316 = queryWeight, product of:
                1.131378 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.021150148 = queryNorm
              0.35737172 = fieldWeight in 796, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.046875 = fieldNorm(doc=796)
          0.032804452 = weight(abstract_txt:help in 796) [ClassicSimilarity], result of:
            0.032804452 = score(doc=796,freq=1.0), product of:
              0.14540692 = queryWeight, product of:
                1.4284507 = boost
                4.8128953 = idf(docFreq=980, maxDocs=44421)
                0.021150148 = queryNorm
              0.22560447 = fieldWeight in 796, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8128953 = idf(docFreq=980, maxDocs=44421)
                0.046875 = fieldNorm(doc=796)
          0.034180474 = weight(abstract_txt:metadata in 796) [ClassicSimilarity], result of:
            0.034180474 = score(doc=796,freq=1.0), product of:
              0.14944519 = queryWeight, product of:
                1.4481504 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.021150148 = queryNorm
              0.22871578 = fieldWeight in 796, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.046875 = fieldNorm(doc=796)
          0.1430985 = weight(abstract_txt:datasets in 796) [ClassicSimilarity], result of:
            0.1430985 = score(doc=796,freq=3.0), product of:
              0.26916146 = queryWeight, product of:
                1.9434758 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.021150148 = queryNorm
              0.5316456 = fieldWeight in 796, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.046875 = fieldNorm(doc=796)
          0.2258792 = weight(abstract_txt:data in 796) [ClassicSimilarity], result of:
            0.2258792 = score(doc=796,freq=27.0), product of:
              0.2784707 = queryWeight, product of:
                3.9535975 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.021150148 = queryNorm
              0.8111417 = fieldWeight in 796, product of:
                5.196152 = tf(freq=27.0), with freq of:
                  27.0 = termFreq=27.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.046875 = fieldNorm(doc=796)
        0.24 = coord(6/25)
    
  4. McCutcheon, S.; Kreyche, M.; Maurer, M.B.; Nickerson, J.: Morphing metadata : maximizing access to electronic theses and dissertations (2008) 0.13
    0.13105287 = sum of:
      0.13105287 = product of:
        0.54605365 = sum of:
          0.016273716 = weight(abstract_txt:paper in 3394) [ClassicSimilarity], result of:
            0.016273716 = score(doc=3394,freq=1.0), product of:
              0.07521918 = queryWeight, product of:
                1.0273939 = boost
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.021150148 = queryNorm
              0.21635064 = fieldWeight in 3394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.0625 = fieldNorm(doc=3394)
          0.08692785 = weight(abstract_txt:foster in 3394) [ClassicSimilarity], result of:
            0.08692785 = score(doc=3394,freq=1.0), product of:
              0.1824316 = queryWeight, product of:
                1.131378 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.021150148 = queryNorm
              0.47649562 = fieldWeight in 3394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.0625 = fieldNorm(doc=3394)
          0.029306347 = weight(abstract_txt:provides in 3394) [ClassicSimilarity], result of:
            0.029306347 = score(doc=3394,freq=1.0), product of:
              0.11133845 = queryWeight, product of:
                1.2499577 = boost
                4.211497 = idf(docFreq=1789, maxDocs=44421)
                0.021150148 = queryNorm
              0.26321855 = fieldWeight in 3394, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.211497 = idf(docFreq=1789, maxDocs=44421)
                0.0625 = fieldNorm(doc=3394)
          0.07893642 = weight(abstract_txt:metadata in 3394) [ClassicSimilarity], result of:
            0.07893642 = score(doc=3394,freq=3.0), product of:
              0.14944519 = queryWeight, product of:
                1.4481504 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.021150148 = queryNorm
              0.52819645 = fieldWeight in 3394, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.0625 = fieldNorm(doc=3394)
          0.25264058 = weight(abstract_txt:catalog in 3394) [ClassicSimilarity], result of:
            0.25264058 = score(doc=3394,freq=4.0), product of:
              0.37153193 = queryWeight, product of:
                3.229132 = boost
                5.4399757 = idf(docFreq=523, maxDocs=44421)
                0.021150148 = queryNorm
              0.67999697 = fieldWeight in 3394, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.4399757 = idf(docFreq=523, maxDocs=44421)
                0.0625 = fieldNorm(doc=3394)
          0.08196872 = weight(abstract_txt:data in 3394) [ClassicSimilarity], result of:
            0.08196872 = score(doc=3394,freq=2.0), product of:
              0.2784707 = queryWeight, product of:
                3.9535975 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.021150148 = queryNorm
              0.29435313 = fieldWeight in 3394, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=3394)
        0.24 = coord(6/25)
    
  5. Jiao, H.; Qiu, Y.; Ma, X.; Yang, B.: Dissemination effect of data papers on scientific datasets (2024) 0.13
    0.12721547 = sum of:
      0.12721547 = product of:
        0.63607734 = sum of:
          0.060025495 = weight(abstract_txt:reuse in 2206) [ClassicSimilarity], result of:
            0.060025495 = score(doc=2206,freq=1.0), product of:
              0.14252287 = queryWeight, product of:
                6.738623 = idf(docFreq=142, maxDocs=44421)
                0.021150148 = queryNorm
              0.42116395 = fieldWeight in 2206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.738623 = idf(docFreq=142, maxDocs=44421)
                0.0625 = fieldNorm(doc=2206)
          0.016273716 = weight(abstract_txt:paper in 2206) [ClassicSimilarity], result of:
            0.016273716 = score(doc=2206,freq=1.0), product of:
              0.07521918 = queryWeight, product of:
                1.0273939 = boost
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.021150148 = queryNorm
              0.21635064 = fieldWeight in 2206, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4616103 = idf(docFreq=3788, maxDocs=44421)
                0.0625 = fieldNorm(doc=2206)
          0.06755282 = weight(abstract_txt:scientific in 2206) [ClassicSimilarity], result of:
            0.06755282 = score(doc=2206,freq=3.0), product of:
              0.13470776 = queryWeight, product of:
                1.3748934 = boost
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.021150148 = queryNorm
              0.5014768 = fieldWeight in 2206, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.6324444 = idf(docFreq=1174, maxDocs=44421)
                0.0625 = fieldNorm(doc=2206)
          0.24631917 = weight(abstract_txt:datasets in 2206) [ClassicSimilarity], result of:
            0.24631917 = score(doc=2206,freq=5.0), product of:
              0.26916146 = queryWeight, product of:
                1.9434758 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.021150148 = queryNorm
              0.9151354 = fieldWeight in 2206, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0625 = fieldNorm(doc=2206)
          0.24590614 = weight(abstract_txt:data in 2206) [ClassicSimilarity], result of:
            0.24590614 = score(doc=2206,freq=18.0), product of:
              0.2784707 = queryWeight, product of:
                3.9535975 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.021150148 = queryNorm
              0.8830593 = fieldWeight in 2206, product of:
                4.2426405 = tf(freq=18.0), with freq of:
                  18.0 = termFreq=18.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=2206)
        0.2 = coord(5/25)
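
The content scores combine several matching terms. As a rough sketch of how the last entry (doc=2206) adds up, the code below recomputes each term score as queryWeight * fieldWeight, sums the terms, and applies the coord factor. The boost, idf, freq, queryNorm, and fieldNorm values are copied from the explanation above. The boost of 1.0 for "reuse" is an assumption, since that term lists no boost line, and the helper name term_score is illustrative only.

```python
import math

QUERY_NORM = 0.021150148   # queryNorm from the explanation
FIELD_NORM = 0.0625        # fieldNorm(doc=2206)
QUERY_TERMS = 25           # denominator of coord(5/25)

# (term, boost, idf, freq) as listed for doc=2206.
# Assumption: "reuse" shows no boost line, so a boost of 1.0 is used.
matched_terms = [
    ("reuse", 1.0, 6.738623, 1.0),
    ("paper", 1.0273939, 3.4616103, 1.0),
    ("scientific", 1.3748934, 4.6324444, 3.0),
    ("datasets", 1.9434758, 6.548176, 5.0),
    ("data", 3.9535975, 3.3302255, 18.0),
]


def term_score(boost: float, idf: float, freq: float) -> float:
    """Score of one term: queryWeight * fieldWeight."""
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


# Sum over matching terms, then scale by the fraction of query terms matched.
term_sum = sum(term_score(boost, idf, freq) for _, boost, idf, freq in matched_terms)
coord = len(matched_terms) / QUERY_TERMS
total = term_sum * coord

print(f"{term_sum:.4f}")  # close to the listed 0.63607734
print(f"{total:.4f}")     # close to the listed 0.12721547
```

The coord factor explains why a document matching only five of the 25 query terms ends up with a low final score even when individual terms such as "datasets" and "data" score well.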