Document (#29777)

Author
Hagedorn, K.
Title
OAIster: a "no dead ends" OAI service provider
Source
Library hi tech. 21(2003) no.2, S.170-181
Year
2003
Abstract
OAIster, at the University of Michigan, University Libraries, Digital Library Production Service (DLPS), is an Andrew W. Mellon Foundation grant-funded project designed to test the feasibility of using the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) to harvest digital object metadata from multiple and varied digital object repositories and develop a service to allow end-users to access that metadata. This article describes in-depth the development of our system to harvest, store, transform the metadata into Digital Library eXtension Service (DLXS) Bibliographic Class format, build indexes and make the metadata searchable through an interface using the XPAT search engine. Results of the testing of our service and statistics on usage are reported, as well as the issues that we have encountered during our harvesting and transformation operations. The article closes by discussing the future improvements and potential of OAIster and the OAI-PMH protocol.
Content
Vgl. auch unter: http://www.emeraldinsight.com/10.1108/07378830310479811.
Theme
Metadaten
Object
OAI-PMH

Similar documents (content)

  1. Shreeves, S.L.; Kaczmarek, J.S.; Cole, T.W.: Harvesting cultural heritage metadata using OAI Protocol (2003) 0.41
    0.41200265 = sum of:
      0.41200265 = product of:
        1.0300066 = sum of:
          0.013999107 = weight(abstract_txt:using in 5775) [ClassicSimilarity], result of:
            0.013999107 = score(doc=5775,freq=1.0), product of:
              0.05183549 = queryWeight, product of:
                1.0384046 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.014440342 = queryNorm
              0.27006802 = fieldWeight in 5775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.078125 = fieldNorm(doc=5775)
          0.018585697 = weight(abstract_txt:article in 5775) [ClassicSimilarity], result of:
            0.018585697 = score(doc=5775,freq=1.0), product of:
              0.06261516 = queryWeight, product of:
                1.1412814 = boost
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.014440342 = queryNorm
              0.29682422 = fieldWeight in 5775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.078125 = fieldNorm(doc=5775)
          0.026321024 = weight(abstract_txt:university in 5775) [ClassicSimilarity], result of:
            0.026321024 = score(doc=5775,freq=1.0), product of:
              0.078963935 = queryWeight, product of:
                1.2816439 = boost
                4.2666197 = idf(docFreq=1693, maxDocs=44421)
                0.014440342 = queryNorm
              0.33332968 = fieldWeight in 5775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2666197 = idf(docFreq=1693, maxDocs=44421)
                0.078125 = fieldNorm(doc=5775)
          0.10992968 = weight(abstract_txt:mellon in 5775) [ClassicSimilarity], result of:
            0.10992968 = score(doc=5775,freq=1.0), product of:
              0.16254 = queryWeight, product of:
                1.3002238 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.014440342 = queryNorm
              0.67632383 = fieldWeight in 5775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.078125 = fieldNorm(doc=5775)
          0.111798845 = weight(abstract_txt:andrew in 5775) [ClassicSimilarity], result of:
            0.111798845 = score(doc=5775,freq=1.0), product of:
              0.1643773 = queryWeight, product of:
                1.3075517 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.014440342 = queryNorm
              0.68013555 = fieldWeight in 5775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.078125 = fieldNorm(doc=5775)
          0.09973134 = weight(abstract_txt:protocol in 5775) [ClassicSimilarity], result of:
            0.09973134 = score(doc=5775,freq=1.0), product of:
              0.19191755 = queryWeight, product of:
                1.9980683 = boost
                6.651612 = idf(docFreq=155, maxDocs=44421)
                0.014440342 = queryNorm
              0.5196572 = fieldWeight in 5775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.651612 = idf(docFreq=155, maxDocs=44421)
                0.078125 = fieldNorm(doc=5775)
          0.21237463 = weight(abstract_txt:harvesting in 5775) [ClassicSimilarity], result of:
            0.21237463 = score(doc=5775,freq=2.0), product of:
              0.25212663 = queryWeight, product of:
                2.2901416 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.014440342 = queryNorm
              0.8423332 = fieldWeight in 5775, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.078125 = fieldNorm(doc=5775)
          0.07767777 = weight(abstract_txt:digital in 5775) [ClassicSimilarity], result of:
            0.07767777 = score(doc=5775,freq=2.0), product of:
              0.16246437 = queryWeight, product of:
                2.5998425 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.014440342 = queryNorm
              0.4781219 = fieldWeight in 5775, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.078125 = fieldNorm(doc=5775)
          0.08123264 = weight(abstract_txt:service in 5775) [ClassicSimilarity], result of:
            0.08123264 = score(doc=5775,freq=1.0), product of:
              0.22717506 = queryWeight, product of:
                3.4371881 = boost
                4.576989 = idf(docFreq=1241, maxDocs=44421)
                0.014440342 = queryNorm
              0.35757726 = fieldWeight in 5775, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.576989 = idf(docFreq=1241, maxDocs=44421)
                0.078125 = fieldNorm(doc=5775)
          0.27835596 = weight(abstract_txt:metadata in 5775) [ClassicSimilarity], result of:
            0.27835596 = score(doc=5775,freq=8.0), product of:
              0.2581729 = queryWeight, product of:
                3.6641927 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.014440342 = queryNorm
              1.0781765 = fieldWeight in 5775, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.078125 = fieldNorm(doc=5775)
        0.4 = coord(10/25)
    
  2. Halbert, M.: ¬The Metascholar Initiative : AmericanSouth.Org and MetaArchive.Org (2003) 0.23
    0.22595814 = sum of:
      0.22595814 = product of:
        0.80699337 = sum of:
          0.05001043 = weight(abstract_txt:feasibility in 5777) [ClassicSimilarity], result of:
            0.05001043 = score(doc=5777,freq=1.0), product of:
              0.09614441 = queryWeight, product of:
                6.6580424 = idf(docFreq=154, maxDocs=44421)
                0.014440342 = queryNorm
              0.52015954 = fieldWeight in 5777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6580424 = idf(docFreq=154, maxDocs=44421)
                0.078125 = fieldNorm(doc=5777)
          0.026321024 = weight(abstract_txt:university in 5777) [ClassicSimilarity], result of:
            0.026321024 = score(doc=5777,freq=1.0), product of:
              0.078963935 = queryWeight, product of:
                1.2816439 = boost
                4.2666197 = idf(docFreq=1693, maxDocs=44421)
                0.014440342 = queryNorm
              0.33332968 = fieldWeight in 5777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2666197 = idf(docFreq=1693, maxDocs=44421)
                0.078125 = fieldNorm(doc=5777)
          0.10992968 = weight(abstract_txt:mellon in 5777) [ClassicSimilarity], result of:
            0.10992968 = score(doc=5777,freq=1.0), product of:
              0.16254 = queryWeight, product of:
                1.3002238 = boost
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.014440342 = queryNorm
              0.67632383 = fieldWeight in 5777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.656945 = idf(docFreq=20, maxDocs=44421)
                0.078125 = fieldNorm(doc=5777)
          0.111798845 = weight(abstract_txt:andrew in 5777) [ClassicSimilarity], result of:
            0.111798845 = score(doc=5777,freq=1.0), product of:
              0.1643773 = queryWeight, product of:
                1.3075517 = boost
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.014440342 = queryNorm
              0.68013555 = fieldWeight in 5777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.705735 = idf(docFreq=19, maxDocs=44421)
                0.078125 = fieldNorm(doc=5777)
          0.09973134 = weight(abstract_txt:protocol in 5777) [ClassicSimilarity], result of:
            0.09973134 = score(doc=5777,freq=1.0), product of:
              0.19191755 = queryWeight, product of:
                1.9980683 = boost
                6.651612 = idf(docFreq=155, maxDocs=44421)
                0.014440342 = queryNorm
              0.5196572 = fieldWeight in 5777, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.651612 = idf(docFreq=155, maxDocs=44421)
                0.078125 = fieldNorm(doc=5777)
          0.21237463 = weight(abstract_txt:harvesting in 5777) [ClassicSimilarity], result of:
            0.21237463 = score(doc=5777,freq=2.0), product of:
              0.25212663 = queryWeight, product of:
                2.2901416 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.014440342 = queryNorm
              0.8423332 = fieldWeight in 5777, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.078125 = fieldNorm(doc=5777)
          0.1968274 = weight(abstract_txt:metadata in 5777) [ClassicSimilarity], result of:
            0.1968274 = score(doc=5777,freq=4.0), product of:
              0.2581729 = queryWeight, product of:
                3.6641927 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.014440342 = queryNorm
              0.76238596 = fieldWeight in 5777, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.078125 = fieldNorm(doc=5777)
        0.28 = coord(7/25)
    
  3. Van de Sompel, H.; Nelson, M.L.; Lagoze, C.; Warner, S.: Resource harvesting within the OAI-PMH framework (2004) 0.21
    0.20593505 = sum of:
      0.20593505 = product of:
        0.8580627 = sum of:
          0.0290966 = weight(abstract_txt:using in 5110) [ClassicSimilarity], result of:
            0.0290966 = score(doc=5110,freq=3.0), product of:
              0.05183549 = queryWeight, product of:
                1.0384046 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.014440342 = queryNorm
              0.56132585 = fieldWeight in 5110, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.09375 = fieldNorm(doc=5110)
          0.10408349 = weight(abstract_txt:object in 5110) [ClassicSimilarity], result of:
            0.10408349 = score(doc=5110,freq=2.0), product of:
              0.13878761 = queryWeight, product of:
                1.6991367 = boost
                5.656462 = idf(docFreq=421, maxDocs=44421)
                0.014440342 = queryNorm
              0.749948 = fieldWeight in 5110, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.656462 = idf(docFreq=421, maxDocs=44421)
                0.09375 = fieldNorm(doc=5110)
          0.1196776 = weight(abstract_txt:protocol in 5110) [ClassicSimilarity], result of:
            0.1196776 = score(doc=5110,freq=1.0), product of:
              0.19191755 = queryWeight, product of:
                1.9980683 = boost
                6.651612 = idf(docFreq=155, maxDocs=44421)
                0.014440342 = queryNorm
              0.6235886 = fieldWeight in 5110, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.651612 = idf(docFreq=155, maxDocs=44421)
                0.09375 = fieldNorm(doc=5110)
          0.25484958 = weight(abstract_txt:harvesting in 5110) [ClassicSimilarity], result of:
            0.25484958 = score(doc=5110,freq=2.0), product of:
              0.25212663 = queryWeight, product of:
                2.2901416 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.014440342 = queryNorm
              1.0107999 = fieldWeight in 5110, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.09375 = fieldNorm(doc=5110)
          0.11416254 = weight(abstract_txt:digital in 5110) [ClassicSimilarity], result of:
            0.11416254 = score(doc=5110,freq=3.0), product of:
              0.16246437 = queryWeight, product of:
                2.5998425 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.014440342 = queryNorm
              0.7026928 = fieldWeight in 5110, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.09375 = fieldNorm(doc=5110)
          0.23619287 = weight(abstract_txt:metadata in 5110) [ClassicSimilarity], result of:
            0.23619287 = score(doc=5110,freq=4.0), product of:
              0.2581729 = queryWeight, product of:
                3.6641927 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.014440342 = queryNorm
              0.9148631 = fieldWeight in 5110, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.09375 = fieldNorm(doc=5110)
        0.24 = coord(6/25)
    
  4. Hagedorn, K.; Chapman, S.; Newman, D.: Enhancing search and browse using automated clustering of subject metadata (2007) 0.19
    0.19327693 = sum of:
      0.19327693 = product of:
        0.80532056 = sum of:
          0.051370528 = weight(abstract_txt:varied in 2168) [ClassicSimilarity], result of:
            0.051370528 = score(doc=2168,freq=1.0), product of:
              0.097879775 = queryWeight, product of:
                1.0089844 = boost
                6.717861 = idf(docFreq=145, maxDocs=44421)
                0.014440342 = queryNorm
              0.5248329 = fieldWeight in 2168, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.717861 = idf(docFreq=145, maxDocs=44421)
                0.078125 = fieldNorm(doc=2168)
          0.024247168 = weight(abstract_txt:using in 2168) [ClassicSimilarity], result of:
            0.024247168 = score(doc=2168,freq=3.0), product of:
              0.05183549 = queryWeight, product of:
                1.0384046 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.014440342 = queryNorm
              0.46777156 = fieldWeight in 2168, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.078125 = fieldNorm(doc=2168)
          0.081348695 = weight(abstract_txt:michigan in 2168) [ClassicSimilarity], result of:
            0.081348695 = score(doc=2168,freq=1.0), product of:
              0.13297929 = queryWeight, product of:
                1.1760614 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.014440342 = queryNorm
              0.6117396 = fieldWeight in 2168, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.078125 = fieldNorm(doc=2168)
          0.037223544 = weight(abstract_txt:university in 2168) [ClassicSimilarity], result of:
            0.037223544 = score(doc=2168,freq=2.0), product of:
              0.078963935 = queryWeight, product of:
                1.2816439 = boost
                4.2666197 = idf(docFreq=1693, maxDocs=44421)
                0.014440342 = queryNorm
              0.4713993 = fieldWeight in 2168, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.2666197 = idf(docFreq=1693, maxDocs=44421)
                0.078125 = fieldNorm(doc=2168)
          0.13917798 = weight(abstract_txt:metadata in 2168) [ClassicSimilarity], result of:
            0.13917798 = score(doc=2168,freq=2.0), product of:
              0.2581729 = queryWeight, product of:
                3.6641927 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.014440342 = queryNorm
              0.53908825 = fieldWeight in 2168, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.078125 = fieldNorm(doc=2168)
          0.47195265 = weight(abstract_txt:oaister in 2168) [ClassicSimilarity], result of:
            0.47195265 = score(doc=2168,freq=1.0), product of:
              0.6192362 = queryWeight, product of:
                4.395687 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.014440342 = queryNorm
              0.7621529 = fieldWeight in 2168, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.078125 = fieldNorm(doc=2168)
        0.24 = coord(6/25)
    
  5. Arms, C.R.: Available and useful : OAI at the Library of Congress (2003) 0.19
    0.18826 = sum of:
      0.18826 = product of:
        0.7844167 = sum of:
          0.018585697 = weight(abstract_txt:article in 5773) [ClassicSimilarity], result of:
            0.018585697 = score(doc=5773,freq=1.0), product of:
              0.06261516 = queryWeight, product of:
                1.1412814 = boost
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.014440342 = queryNorm
              0.29682422 = fieldWeight in 5773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.79935 = idf(docFreq=2702, maxDocs=44421)
                0.078125 = fieldNorm(doc=5773)
          0.17273973 = weight(abstract_txt:protocol in 5773) [ClassicSimilarity], result of:
            0.17273973 = score(doc=5773,freq=3.0), product of:
              0.19191755 = queryWeight, product of:
                1.9980683 = boost
                6.651612 = idf(docFreq=155, maxDocs=44421)
                0.014440342 = queryNorm
              0.9000726 = fieldWeight in 5773, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.651612 = idf(docFreq=155, maxDocs=44421)
                0.078125 = fieldNorm(doc=5773)
          0.26010475 = weight(abstract_txt:harvesting in 5773) [ClassicSimilarity], result of:
            0.26010475 = score(doc=5773,freq=3.0), product of:
              0.25212663 = queryWeight, product of:
                2.2901416 = boost
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.014440342 = queryNorm
              1.0316433 = fieldWeight in 5773, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.62393 = idf(docFreq=58, maxDocs=44421)
                0.078125 = fieldNorm(doc=5773)
          0.054926477 = weight(abstract_txt:digital in 5773) [ClassicSimilarity], result of:
            0.054926477 = score(doc=5773,freq=1.0), product of:
              0.16246437 = queryWeight, product of:
                2.5998425 = boost
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.014440342 = queryNorm
              0.33808324 = fieldWeight in 5773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3274655 = idf(docFreq=1593, maxDocs=44421)
                0.078125 = fieldNorm(doc=5773)
          0.08123264 = weight(abstract_txt:service in 5773) [ClassicSimilarity], result of:
            0.08123264 = score(doc=5773,freq=1.0), product of:
              0.22717506 = queryWeight, product of:
                3.4371881 = boost
                4.576989 = idf(docFreq=1241, maxDocs=44421)
                0.014440342 = queryNorm
              0.35757726 = fieldWeight in 5773, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.576989 = idf(docFreq=1241, maxDocs=44421)
                0.078125 = fieldNorm(doc=5773)
          0.1968274 = weight(abstract_txt:metadata in 5773) [ClassicSimilarity], result of:
            0.1968274 = score(doc=5773,freq=4.0), product of:
              0.2581729 = queryWeight, product of:
                3.6641927 = boost
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.014440342 = queryNorm
              0.76238596 = fieldWeight in 5773, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.87927 = idf(docFreq=917, maxDocs=44421)
                0.078125 = fieldNorm(doc=5773)
        0.24 = coord(6/25)