Document (#39276)

Author
Godfrey, B.
Johnson, J.
Title
¬The geospatial metadata manager's toolbox : three techniques for maintaining records
Source
Code4Lib journal. Issue 29(2015), [http://journal.code4lib.org/issues/issues/issue29]
Year
2015
Abstract
Managing geospatial metadata records requires a range of techniques. At the University of Idaho Library, we maintain tens of thousands of existing records, and new records must be normalized before they are added to the collections. We show a graphical user interface (GUI) tool that was developed to make simple modifications, a simple XSLT that operates on complex metadata, and a Python script that enables parallel processing to make maintenance tasks more efficient. Throughout, we compare these techniques and discuss when they may be useful.
Content
Cf.: http://journal.code4lib.org/articles/10601.
Field
Geosciences

Similar documents (author)

  1. Johnson, S.W.: Do-it-yourself CD-ROMs (1992) 4.57
    4.566886 = sum of:
      4.566886 = weight(author_txt:johnson in 4284) [ClassicSimilarity], result of:
        4.566886 = score(doc=4284,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.3070183 = idf(docFreq=80, maxDocs=44421)
            0.13685472 = queryNorm
          4.5668864 = fieldWeight in 4284, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.3070183 = idf(docFreq=80, maxDocs=44421)
            0.625 = fieldNorm(doc=4284)
    
  2. Johnson, S.: Virtual documents : the past, the present and some standards for the future (1993) 4.57
    4.566886 = sum of:
      4.566886 = weight(author_txt:johnson in 4420) [ClassicSimilarity], result of:
        4.566886 = score(doc=4420,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.3070183 = idf(docFreq=80, maxDocs=44421)
            0.13685472 = queryNorm
          4.5668864 = fieldWeight in 4420, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.3070183 = idf(docFreq=80, maxDocs=44421)
            0.625 = fieldNorm(doc=4420)
    
  3. Johnson, R.D.: Public libraries and the Internet / NREN : new challenges, new opportunities (1992) 4.57
    4.566886 = sum of:
      4.566886 = weight(author_txt:johnson in 6247) [ClassicSimilarity], result of:
        4.566886 = score(doc=6247,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.3070183 = idf(docFreq=80, maxDocs=44421)
            0.13685472 = queryNorm
          4.5668864 = fieldWeight in 6247, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.3070183 = idf(docFreq=80, maxDocs=44421)
            0.625 = fieldNorm(doc=6247)
    
  4. Johnson, F.C.: ¬A classification of ellipsis based on a corpus of information seeking dialogues (1994) 4.57
    4.566886 = sum of:
      4.566886 = weight(author_txt:johnson in 7802) [ClassicSimilarity], result of:
        4.566886 = score(doc=7802,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.3070183 = idf(docFreq=80, maxDocs=44421)
            0.13685472 = queryNorm
          4.5668864 = fieldWeight in 7802, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.3070183 = idf(docFreq=80, maxDocs=44421)
            0.625 = fieldNorm(doc=7802)
    
  5. Johnson, A.: Information brokers (1991) 4.57
    4.566886 = sum of:
      4.566886 = weight(author_txt:johnson in 1362) [ClassicSimilarity], result of:
        4.566886 = score(doc=1362,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            7.3070183 = idf(docFreq=80, maxDocs=44421)
            0.13685472 = queryNorm
          4.5668864 = fieldWeight in 1362, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            7.3070183 = idf(docFreq=80, maxDocs=44421)
            0.625 = fieldNorm(doc=1362)
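
The five author matches above share one score because each tree multiplies the same factors. The sketch below is a minimal reproduction of that arithmetic, assuming the standard ClassicSimilarity definitions (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))). The function names are hypothetical, and the numeric inputs are copied from the listing rather than read from an index.

```python
import math


def classic_idf(doc_freq, max_docs):
    """Inverse document frequency as ClassicSimilarity defines it."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq):
    """Term frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    """Score of one term in one document: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm           # "queryWeight" in the tree
    field_weight = classic_tf(freq) * idf * field_norm  # "fieldWeight" in the tree
    return query_weight * field_weight


# Inputs taken from the author_txt:johnson trees above.
author_score = term_score(
    freq=1.0,
    doc_freq=80,
    max_docs=44421,
    query_norm=0.13685472,
    field_norm=0.625,
)
print(round(author_score, 4))
```

The result should land close to the 4.566886 shown in each tree; small differences in the last digits are expected because Lucene computes in single precision.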
    

Similar documents (content)

  1. Lagoze, C.: Keeping Dublin Core simple : Cross-domain discovery or resource description? (2001) 0.14
    0.13576363 = sum of:
      0.13576363 = product of:
        0.42426136 = sum of:
          0.03383809 = weight(abstract_txt:managing in 2216) [ClassicSimilarity], result of:
            0.03383809 = score(doc=2216,freq=2.0), product of:
              0.122498564 = queryWeight, product of:
                1.0314987 = boost
                6.250429 = idf(docFreq=232, maxDocs=44421)
                0.018999951 = queryNorm
              0.27623254 = fieldWeight in 2216, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.250429 = idf(docFreq=232, maxDocs=44421)
                0.03125 = fieldNorm(doc=2216)
          0.026608827 = weight(abstract_txt:maintaining in 2216) [ClassicSimilarity], result of:
            0.026608827 = score(doc=2216,freq=1.0), product of:
              0.13148844 = queryWeight, product of:
                1.0686783 = boost
                6.475721 = idf(docFreq=185, maxDocs=44421)
                0.018999951 = queryNorm
              0.20236628 = fieldWeight in 2216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.475721 = idf(docFreq=185, maxDocs=44421)
                0.03125 = fieldNorm(doc=2216)
          0.041040014 = weight(abstract_txt:normalized in 2216) [ClassicSimilarity], result of:
            0.041040014 = score(doc=2216,freq=1.0), product of:
              0.17552626 = queryWeight, product of:
                1.2347363 = boost
                7.48196 = idf(docFreq=67, maxDocs=44421)
                0.018999951 = queryNorm
              0.23381124 = fieldWeight in 2216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.48196 = idf(docFreq=67, maxDocs=44421)
                0.03125 = fieldNorm(doc=2216)
          0.014286441 = weight(abstract_txt:need in 2216) [ClassicSimilarity], result of:
            0.014286441 = score(doc=2216,freq=1.0), product of:
              0.109436736 = queryWeight, product of:
                1.3787951 = boost
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.018999951 = queryNorm
              0.1305452 = fieldWeight in 2216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.03125 = fieldNorm(doc=2216)
          0.08825137 = weight(abstract_txt:simple in 2216) [ClassicSimilarity], result of:
            0.08825137 = score(doc=2216,freq=9.0), product of:
              0.17712529 = queryWeight, product of:
                1.7541167 = boost
                5.314588 = idf(docFreq=593, maxDocs=44421)
                0.018999951 = queryNorm
              0.49824262 = fieldWeight in 2216, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.314588 = idf(docFreq=593, maxDocs=44421)
                0.03125 = fieldNorm(doc=2216)
          0.027364407 = weight(abstract_txt:techniques in 2216) [ClassicSimilarity], result of:
            0.027364407 = score(doc=2216,freq=1.0), product of:
              0.19321235 = queryWeight, product of:
                2.243785 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.018999951 = queryNorm
              0.14162867 = fieldWeight in 2216, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.03125 = fieldNorm(doc=2216)
          0.14487164 = weight(abstract_txt:metadata in 2216) [ClassicSimilarity], result of:
            0.14487164 = score(doc=2216,freq=18.0), product of:
              0.22394545 = queryWeight, product of:
                2.4156551 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.018999951 = queryNorm
              0.6469059 = fieldWeight in 2216, product of:
                4.2426405 = tf(freq=18.0), with freq of:
                  18.0 = termFreq=18.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.03125 = fieldNorm(doc=2216)
          0.04800058 = weight(abstract_txt:records in 2216) [ClassicSimilarity], result of:
            0.04800058 = score(doc=2216,freq=2.0), product of:
              0.2454962 = queryWeight, product of:
                2.920489 = boost
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.018999951 = queryNorm
              0.19552475 = fieldWeight in 2216, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.03125 = fieldNorm(doc=2216)
        0.32 = coord(8/25)
    
  2. Roy, W.; Gray, C.: Preparing existing metadata for repository batch import : a recipe for a fickle food (2018) 0.11
    0.10633844 = sum of:
      0.10633844 = product of:
        0.5316922 = sum of:
          0.04983475 = weight(abstract_txt:maintenance in 550) [ClassicSimilarity], result of:
            0.04983475 = score(doc=550,freq=1.0), product of:
              0.12585543 = queryWeight, product of:
                1.0455364 = boost
                6.3354917 = idf(docFreq=213, maxDocs=44421)
                0.018999951 = queryNorm
              0.39596823 = fieldWeight in 550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3354917 = idf(docFreq=213, maxDocs=44421)
                0.0625 = fieldNorm(doc=550)
          0.09898105 = weight(abstract_txt:script in 550) [ClassicSimilarity], result of:
            0.09898105 = score(doc=550,freq=1.0), product of:
              0.198862 = queryWeight, product of:
                1.3142533 = boost
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.018999951 = queryNorm
              0.49773738 = fieldWeight in 550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.0625 = fieldNorm(doc=550)
          0.028572882 = weight(abstract_txt:need in 550) [ClassicSimilarity], result of:
            0.028572882 = score(doc=550,freq=1.0), product of:
              0.109436736 = queryWeight, product of:
                1.3787951 = boost
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.018999951 = queryNorm
              0.2610904 = fieldWeight in 550, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.0625 = fieldNorm(doc=550)
          0.20159541 = weight(abstract_txt:python in 550) [ClassicSimilarity], result of:
            0.20159541 = score(doc=550,freq=2.0), product of:
              0.2536068 = queryWeight, product of:
                1.48417 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.018999951 = queryNorm
              0.7949133 = fieldWeight in 550, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0625 = fieldNorm(doc=550)
          0.15270811 = weight(abstract_txt:metadata in 550) [ClassicSimilarity], result of:
            0.15270811 = score(doc=550,freq=5.0), product of:
              0.22394545 = queryWeight, product of:
                2.4156551 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.018999951 = queryNorm
              0.6818987 = fieldWeight in 550, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.0625 = fieldNorm(doc=550)
        0.2 = coord(5/25)
    
  3. Gilchrist, A.: Reflections on knowledge, communication and knowledge organization (2015) 0.09
    0.08520068 = sum of:
      0.08520068 = product of:
        0.4260034 = sum of:
          0.0357161 = weight(abstract_txt:need in 3375) [ClassicSimilarity], result of:
            0.0357161 = score(doc=3375,freq=1.0), product of:
              0.109436736 = queryWeight, product of:
                1.3787951 = boost
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.018999951 = queryNorm
              0.326363 = fieldWeight in 3375, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1774464 = idf(docFreq=1851, maxDocs=44421)
                0.078125 = fieldNorm(doc=3375)
          0.05028902 = weight(abstract_txt:make in 3375) [ClassicSimilarity], result of:
            0.05028902 = score(doc=3375,freq=1.0), product of:
              0.13747884 = queryWeight, product of:
                1.545383 = boost
                4.682171 = idf(docFreq=1117, maxDocs=44421)
                0.018999951 = queryNorm
              0.3657946 = fieldWeight in 3375, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.682171 = idf(docFreq=1117, maxDocs=44421)
                0.078125 = fieldNorm(doc=3375)
          0.07354281 = weight(abstract_txt:simple in 3375) [ClassicSimilarity], result of:
            0.07354281 = score(doc=3375,freq=1.0), product of:
              0.17712529 = queryWeight, product of:
                1.7541167 = boost
                5.314588 = idf(docFreq=593, maxDocs=44421)
                0.018999951 = queryNorm
              0.4152022 = fieldWeight in 3375, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.314588 = idf(docFreq=593, maxDocs=44421)
                0.078125 = fieldNorm(doc=3375)
          0.096747786 = weight(abstract_txt:techniques in 3375) [ClassicSimilarity], result of:
            0.096747786 = score(doc=3375,freq=2.0), product of:
              0.19321235 = queryWeight, product of:
                2.243785 = boost
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.018999951 = queryNorm
              0.50073296 = fieldWeight in 3375, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.5321174 = idf(docFreq=1298, maxDocs=44421)
                0.078125 = fieldNorm(doc=3375)
          0.1697077 = weight(abstract_txt:records in 3375) [ClassicSimilarity], result of:
            0.1697077 = score(doc=3375,freq=4.0), product of:
              0.2454962 = queryWeight, product of:
                2.920489 = boost
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.018999951 = queryNorm
              0.6912844 = fieldWeight in 3375, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.078125 = fieldNorm(doc=3375)
        0.2 = coord(5/25)
    
  4. Jayakanth, F.; Aswath, L.: ¬A PFT-based approach to make CDS/ISIS data based OAI-compliant (2006) 0.08
    0.08408217 = sum of:
      0.08408217 = product of:
        0.3503424 = sum of:
          0.047854286 = weight(abstract_txt:managing in 2495) [ClassicSimilarity], result of:
            0.047854286 = score(doc=2495,freq=1.0), product of:
              0.122498564 = queryWeight, product of:
                1.0314987 = boost
                6.250429 = idf(docFreq=232, maxDocs=44421)
                0.018999951 = queryNorm
              0.39065182 = fieldWeight in 2495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.250429 = idf(docFreq=232, maxDocs=44421)
                0.0625 = fieldNorm(doc=2495)
          0.06724642 = weight(abstract_txt:maintained in 2495) [ClassicSimilarity], result of:
            0.06724642 = score(doc=2495,freq=1.0), product of:
              0.15368444 = queryWeight, product of:
                1.1553621 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.018999951 = queryNorm
              0.4375617 = fieldWeight in 2495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0625 = fieldNorm(doc=2495)
          0.040231213 = weight(abstract_txt:make in 2495) [ClassicSimilarity], result of:
            0.040231213 = score(doc=2495,freq=1.0), product of:
              0.13747884 = queryWeight, product of:
                1.545383 = boost
                4.682171 = idf(docFreq=1117, maxDocs=44421)
                0.018999951 = queryNorm
              0.29263568 = fieldWeight in 2495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.682171 = idf(docFreq=1117, maxDocs=44421)
                0.0625 = fieldNorm(doc=2495)
          0.058834247 = weight(abstract_txt:simple in 2495) [ClassicSimilarity], result of:
            0.058834247 = score(doc=2495,freq=1.0), product of:
              0.17712529 = queryWeight, product of:
                1.7541167 = boost
                5.314588 = idf(docFreq=593, maxDocs=44421)
                0.018999951 = queryNorm
              0.33216175 = fieldWeight in 2495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.314588 = idf(docFreq=593, maxDocs=44421)
                0.0625 = fieldNorm(doc=2495)
          0.06829315 = weight(abstract_txt:metadata in 2495) [ClassicSimilarity], result of:
            0.06829315 = score(doc=2495,freq=1.0), product of:
              0.22394545 = queryWeight, product of:
                2.4156551 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.018999951 = queryNorm
              0.30495438 = fieldWeight in 2495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.0625 = fieldNorm(doc=2495)
          0.067883074 = weight(abstract_txt:records in 2495) [ClassicSimilarity], result of:
            0.067883074 = score(doc=2495,freq=1.0), product of:
              0.2454962 = queryWeight, product of:
                2.920489 = boost
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.018999951 = queryNorm
              0.27651376 = fieldWeight in 2495, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.0625 = fieldNorm(doc=2495)
        0.24 = coord(6/25)
    
  5. Kaiser, M.; Lieder, H.J.; Majcen, K.; Vallant, H.: New ways of sharing and using authority information : the LEAF project (2003) 0.08
    0.083895445 = sum of:
      0.083895445 = product of:
        0.34956437 = sum of:
          0.024917375 = weight(abstract_txt:maintenance in 2166) [ClassicSimilarity], result of:
            0.024917375 = score(doc=2166,freq=1.0), product of:
              0.12585543 = queryWeight, product of:
                1.0455364 = boost
                6.3354917 = idf(docFreq=213, maxDocs=44421)
                0.018999951 = queryNorm
              0.19798411 = fieldWeight in 2166, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.3354917 = idf(docFreq=213, maxDocs=44421)
                0.03125 = fieldNorm(doc=2166)
          0.0475504 = weight(abstract_txt:maintained in 2166) [ClassicSimilarity], result of:
            0.0475504 = score(doc=2166,freq=2.0), product of:
              0.15368444 = queryWeight, product of:
                1.1553621 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.018999951 = queryNorm
              0.30940282 = fieldWeight in 2166, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.03125 = fieldNorm(doc=2166)
          0.049490526 = weight(abstract_txt:script in 2166) [ClassicSimilarity], result of:
            0.049490526 = score(doc=2166,freq=1.0), product of:
              0.198862 = queryWeight, product of:
                1.3142533 = boost
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.018999951 = queryNorm
              0.24886869 = fieldWeight in 2166, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.963798 = idf(docFreq=41, maxDocs=44421)
                0.03125 = fieldNorm(doc=2166)
          0.020115606 = weight(abstract_txt:make in 2166) [ClassicSimilarity], result of:
            0.020115606 = score(doc=2166,freq=1.0), product of:
              0.13747884 = queryWeight, product of:
                1.545383 = boost
                4.682171 = idf(docFreq=1117, maxDocs=44421)
                0.018999951 = queryNorm
              0.14631784 = fieldWeight in 2166, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.682171 = idf(docFreq=1117, maxDocs=44421)
                0.03125 = fieldNorm(doc=2166)
          0.048290543 = weight(abstract_txt:metadata in 2166) [ClassicSimilarity], result of:
            0.048290543 = score(doc=2166,freq=2.0), product of:
              0.22394545 = queryWeight, product of:
                2.4156551 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.018999951 = queryNorm
              0.2156353 = fieldWeight in 2166, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.03125 = fieldNorm(doc=2166)
          0.15919992 = weight(abstract_txt:records in 2166) [ClassicSimilarity], result of:
            0.15919992 = score(doc=2166,freq=22.0), product of:
              0.2454962 = queryWeight, product of:
                2.920489 = boost
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.018999951 = queryNorm
              0.64848226 = fieldWeight in 2166, product of:
                4.690416 = tf(freq=22.0), with freq of:
                  22.0 = termFreq=22.0
                4.42422 = idf(docFreq=1446, maxDocs=44421)
                0.03125 = fieldNorm(doc=2166)
        0.24 = coord(6/25)
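
The content-based trees add one step to the per-term arithmetic: the per-term scores are summed and then scaled by a coordination factor, the share of query terms that the document matched. The sketch below recomputes the total for the second entry (doc=550) under the same ClassicSimilarity assumptions. The boost, docFreq, and freq values are copied from the listing; the variable and function names are hypothetical.

```python
import math

QUERY_NORM = 0.018999951  # queryNorm shown in every content tree
MAX_DOCS = 44421
FIELD_NORM = 0.0625       # fieldNorm(doc=550)
QUERY_TERMS = 25          # denominator of coord(5/25)

# term -> (boost, docFreq, freq), as listed for doc=550
matched_terms = {
    "maintenance": (1.0455364, 213, 1.0),
    "script": (1.3142533, 41, 1.0),
    "need": (1.3787951, 1851, 1.0),
    "python": (1.48417, 14, 2.0),
    "metadata": (2.4156551, 917, 5.0),
}


def term_score(boost, doc_freq, freq):
    """queryWeight * fieldWeight for a single matched term."""
    idf = 1.0 + math.log(MAX_DOCS / (doc_freq + 1))
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


raw_sum = sum(term_score(*values) for values in matched_terms.values())
coord = len(matched_terms) / QUERY_TERMS  # 5 of 25 query terms matched
total = raw_sum * coord
print(round(raw_sum, 4), round(total, 4))
```

The coordination factor explains why the first entry, which matches 8 of 25 terms, outranks entries that have a larger raw sum but fewer matching terms.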