Document (#23783)

Author
Danowski, P.
Voß, J.
Title
Wikipedia sammelt Metadaten
Source
Bibliotheksdienst. 39(2005) H.3, S.385
Year
2005
Abstract
Im Rahmen der Vorbereitung auf die Wikipedia-DVD, die zur Buchmesse in Leipzig erscheinen soll, wurden fast 30.000 Artikel der freien Enzyklopädie Wikipedia mit Personendaten versehen. Damit sind die biographischen Artikel erstmals mit strukturierten Metadaten versehen, die wie alle Inhalte des Projekts unter den Bedingungen der GFDL frei weiterverwendet werden können. Die Personendaten umfassen Angaben zu Namen, Geburtsdatum, Geburtsort, Sterbedatum und Sterbeort. Gleichzeitig wird eine Kurzbeschreibung zu den einzelnen Personen gespeichert. Bisher waren diese Daten nur im Fließtext und Personennamen nur in der Form "Vorname Nachname" abgespeichert. Da auf der DVD jedoch eine gezielte Suche nach Personen möglich sein soll, müssen die Namen und anderen Angaben einheitlich, wie es in bibliothekarischen Datenbanken die Regel ist, in der Form "Nachname, Vorname" angesetzt werden. Ziel der Sammlung von Personendaten ist die dokumentarische Erschließung aller biographischen Artikel. Da wie an der gesamten Wikipedia viele Freiwillige an diesem Prozess beteiligt sind, entsprechen die Ergebnisse sicherlich noch nicht professionellen Regelwerken wie RAK. Sie sind ein erster Schritt um die Wikipedia besser automatisch weiterverwendbar zu machen und somit neue Möglichkeiten der Anwendung zu erschließen. Die Personendaten wurden zum größten Teil in einer vom Verlag Directmedia Publishing ausgerichteten "Tagging-Party" vom 28. bis 30. Januar mit Hilfe eines selbst entwickelten Softwaretools direkt in Online-Enzyklopädie eingetragen. Dazu wurden alle Artikel angeschaut und Fehler in den Datenfeldern korrigiert. Die Strukturierung der Personendaten könnte noch wesentlich durch bestehende bibliothekarische Datenbanken wie die Personennormdatei (PND) verbessert werden. Bibliotheken könnten im Gegenzug die Informationen aus der Wikipedia zur Kataloganreicherung nutzen - beispielsweise zur Anzeige von Kurzbiographien zu einzelnen Autoren. Auch weitere Kooperationsmöglichkeiten sind denkbar. Bei Interesse können Sie sich an Jakob Voss oder Patrick Danowski wenden.
Footnote
Ansprechpartner: zu den Personendaten [email protected] (Jakob Voß), [email protected] (Patrick Danowski); zur DVD [email protected]; zu allgemeinen Fragen über die Wikipedia [email protected]. Informationen zur deutschsprachigen Wikipedia: http://www.wikipedia.de/
Theme
Informationsmittel
Internet
Object
Wikipedia

Similar documents (author)

  1. Danowski, J.A.: Network analysis of message content (1993) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:danowski in 907) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 907, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=907)
    
  2. Danowski, P.: Kontext Open Access : Creative Commons (2012) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:danowski in 828) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 828, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=828)
    
  3. Danowski, P.: Authority files and Web 2.0 : Wikipedia and the PND. An Example (2007) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:danowski in 2291) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 2291, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=2291)
    
  4. Danowski, P.: Step one: blow up the silo! : Open bibliographic data, the first step towards Linked Open Data (2010) 5.58
    5.5805492 = sum of:
      5.5805492 = weight(author_txt:danowski in 949) [ClassicSimilarity], result of:
        5.5805492 = fieldWeight in 949, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.625 = fieldNorm(doc=949)
    
  5. Voß, J.; Danowski, P.: Bibliothek, Information und Dokumentation in der Wikipedia (2004) 4.46
    4.4644394 = sum of:
      4.4644394 = weight(author_txt:danowski in 4046) [ClassicSimilarity], result of:
        4.4644394 = fieldWeight in 4046, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.928879 = idf(docFreq=15, maxDocs=44421)
          0.5 = fieldNorm(doc=4046)
    

Similar documents (content)

  1. Online-Enzyklopädie Wikipedia (2003) 0.23
    0.22963892 = sum of:
      0.22963892 = product of:
        0.7176216 = sum of:
          0.027994806 = weight(abstract_txt:alle in 2410) [ClassicSimilarity], result of:
            0.027994806 = score(doc=2410,freq=2.0), product of:
              0.09993723 = queryWeight, product of:
                1.0790185 = boost
                5.070784 = idf(docFreq=757, maxDocs=44421)
                0.018265152 = queryNorm
              0.2801239 = fieldWeight in 2410, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.070784 = idf(docFreq=757, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2410)
          0.017025119 = weight(abstract_txt:werden in 2410) [ClassicSimilarity], result of:
            0.017025119 = score(doc=2410,freq=3.0), product of:
              0.07173577 = queryWeight, product of:
                1.1196408 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.018265152 = queryNorm
              0.23733094 = fieldWeight in 2410, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2410)
          0.049998377 = weight(abstract_txt:personen in 2410) [ClassicSimilarity], result of:
            0.049998377 = score(doc=2410,freq=1.0), product of:
              0.18534872 = queryWeight, product of:
                1.4694676 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.018265152 = queryNorm
              0.269753 = fieldWeight in 2410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2410)
          0.058067683 = weight(abstract_txt:enzyklopädie in 2410) [ClassicSimilarity], result of:
            0.058067683 = score(doc=2410,freq=1.0), product of:
              0.20478995 = queryWeight, product of:
                1.5446125 = boost
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.018265152 = queryNorm
              0.28354752 = fieldWeight in 2410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2410)
          0.05864099 = weight(abstract_txt:angaben in 2410) [ClassicSimilarity], result of:
            0.05864099 = score(doc=2410,freq=1.0), product of:
              0.20613569 = queryWeight, product of:
                1.5496793 = boost
                7.282627 = idf(docFreq=82, maxDocs=44421)
                0.018265152 = queryNorm
              0.28447762 = fieldWeight in 2410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.282627 = idf(docFreq=82, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2410)
          0.040856857 = weight(abstract_txt:sind in 2410) [ClassicSimilarity], result of:
            0.040856857 = score(doc=2410,freq=5.0), product of:
              0.11936646 = queryWeight, product of:
                1.6677133 = boost
                3.9186604 = idf(docFreq=2398, maxDocs=44421)
                0.018265152 = queryNorm
              0.3422809 = fieldWeight in 2410, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.9186604 = idf(docFreq=2398, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2410)
          0.17016608 = weight(abstract_txt:artikel in 2410) [ClassicSimilarity], result of:
            0.17016608 = score(doc=2410,freq=10.0), product of:
              0.24525112 = queryWeight, product of:
                2.3904834 = boost
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.018265152 = queryNorm
              0.69384426 = fieldWeight in 2410, product of:
                3.1622777 = tf(freq=10.0), with freq of:
                  10.0 = termFreq=10.0
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2410)
          0.29487172 = weight(abstract_txt:wikipedia in 2410) [ClassicSimilarity], result of:
            0.29487172 = score(doc=2410,freq=7.0), product of:
              0.4561582 = queryWeight, product of:
                3.9928555 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.018265152 = queryNorm
              0.64642423 = fieldWeight in 2410, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.0390625 = fieldNorm(doc=2410)
        0.32 = coord(8/25)
    
  2. Kleinz, T.: Wikipedia professionalisiert sich : Das Büro der deutschen Sektion soll im Oktober in Frankfurt eröffnen - Schreiber und Spender werden umworben (2006) 0.20
    0.20484608 = sum of:
      0.20484608 = product of:
        0.8535254 = sum of:
          0.052661207 = weight(abstract_txt:soll in 3871) [ClassicSimilarity], result of:
            0.052661207 = score(doc=3871,freq=1.0), product of:
              0.10703818 = queryWeight, product of:
                1.116695 = boost
                5.247843 = idf(docFreq=634, maxDocs=44421)
                0.018265152 = queryNorm
              0.49198526 = fieldWeight in 3871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.247843 = idf(docFreq=634, maxDocs=44421)
                0.09375 = fieldNorm(doc=3871)
          0.023590695 = weight(abstract_txt:werden in 3871) [ClassicSimilarity], result of:
            0.023590695 = score(doc=3871,freq=1.0), product of:
              0.07173577 = queryWeight, product of:
                1.1196408 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.018265152 = queryNorm
              0.3288554 = fieldWeight in 3871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.09375 = fieldNorm(doc=3871)
          0.13936244 = weight(abstract_txt:enzyklopädie in 3871) [ClassicSimilarity], result of:
            0.13936244 = score(doc=3871,freq=1.0), product of:
              0.20478995 = queryWeight, product of:
                1.5446125 = boost
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.018265152 = queryNorm
              0.68051404 = fieldWeight in 3871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.09375 = fieldNorm(doc=3871)
          0.07699233 = weight(abstract_txt:wurden in 3871) [ClassicSimilarity], result of:
            0.07699233 = score(doc=3871,freq=1.0), product of:
              0.1578363 = queryWeight, product of:
                1.6607884 = boost
                5.2031856 = idf(docFreq=663, maxDocs=44421)
                0.018265152 = queryNorm
              0.48779863 = fieldWeight in 3871, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2031856 = idf(docFreq=663, maxDocs=44421)
                0.09375 = fieldNorm(doc=3871)
          0.1826414 = weight(abstract_txt:artikel in 3871) [ClassicSimilarity], result of:
            0.1826414 = score(doc=3871,freq=2.0), product of:
              0.24525112 = queryWeight, product of:
                2.3904834 = boost
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.018265152 = queryNorm
              0.7447118 = fieldWeight in 3871, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.09375 = fieldNorm(doc=3871)
          0.37827733 = weight(abstract_txt:wikipedia in 3871) [ClassicSimilarity], result of:
            0.37827733 = score(doc=3871,freq=2.0), product of:
              0.4561582 = queryWeight, product of:
                3.9928555 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.018265152 = queryNorm
              0.82926786 = fieldWeight in 3871, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.09375 = fieldNorm(doc=3871)
        0.24 = coord(6/25)
    
  3. Ersch, J.S.; Gruber, J.G.: Allgemeine Encyclopädie der Wissenschaften und Künste (1996) 0.20
    0.1970445 = sum of:
      0.1970445 = product of:
        0.9852225 = sum of:
          0.09554713 = weight(abstract_txt:einzelnen in 1927) [ClassicSimilarity], result of:
            0.09554713 = score(doc=1927,freq=1.0), product of:
              0.13144098 = queryWeight, product of:
                1.2374585 = boost
                5.8153634 = idf(docFreq=359, maxDocs=44421)
                0.018265152 = queryNorm
              0.7269204 = fieldWeight in 1927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8153634 = idf(docFreq=359, maxDocs=44421)
                0.125 = fieldNorm(doc=1927)
          0.23140958 = weight(abstract_txt:versehen in 1927) [ClassicSimilarity], result of:
            0.23140958 = score(doc=1927,freq=1.0), product of:
              0.23704997 = queryWeight, product of:
                1.6618246 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.018265152 = queryNorm
              0.9762059 = fieldWeight in 1927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.125 = fieldNorm(doc=1927)
          0.082688466 = weight(abstract_txt:sind in 1927) [ClassicSimilarity], result of:
            0.082688466 = score(doc=1927,freq=2.0), product of:
              0.11936646 = queryWeight, product of:
                1.6677133 = boost
                3.9186604 = idf(docFreq=2398, maxDocs=44421)
                0.018265152 = queryNorm
              0.6927278 = fieldWeight in 1927, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9186604 = idf(docFreq=2398, maxDocs=44421)
                0.125 = fieldNorm(doc=1927)
          0.40338132 = weight(abstract_txt:biographischen in 1927) [ClassicSimilarity], result of:
            0.40338132 = score(doc=1927,freq=1.0), product of:
              0.34334406 = queryWeight, product of:
                2.0 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.018265152 = queryNorm
              1.1748604 = fieldWeight in 1927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.125 = fieldNorm(doc=1927)
          0.17219597 = weight(abstract_txt:artikel in 1927) [ClassicSimilarity], result of:
            0.17219597 = score(doc=1927,freq=1.0), product of:
              0.24525112 = queryWeight, product of:
                2.3904834 = boost
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.018265152 = queryNorm
              0.702121 = fieldWeight in 1927, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.125 = fieldNorm(doc=1927)
        0.2 = coord(5/25)
    
  4. Portal "Bibliothek Information Dokumentation" eingestellt (2004) 0.19
    0.18555011 = sum of:
      0.18555011 = product of:
        1.1596882 = sum of:
          0.27872488 = weight(abstract_txt:enzyklopädie in 4293) [ClassicSimilarity], result of:
            0.27872488 = score(doc=4293,freq=1.0), product of:
              0.20478995 = queryWeight, product of:
                1.5446125 = boost
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.018265152 = queryNorm
              1.3610281 = fieldWeight in 4293, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.1875 = fieldNorm(doc=4293)
          0.08770437 = weight(abstract_txt:sind in 4293) [ClassicSimilarity], result of:
            0.08770437 = score(doc=4293,freq=1.0), product of:
              0.11936646 = queryWeight, product of:
                1.6677133 = boost
                3.9186604 = idf(docFreq=2398, maxDocs=44421)
                0.018265152 = queryNorm
              0.73474884 = fieldWeight in 4293, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9186604 = idf(docFreq=2398, maxDocs=44421)
                0.1875 = fieldNorm(doc=4293)
          0.25829396 = weight(abstract_txt:artikel in 4293) [ClassicSimilarity], result of:
            0.25829396 = score(doc=4293,freq=1.0), product of:
              0.24525112 = queryWeight, product of:
                2.3904834 = boost
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.018265152 = queryNorm
              1.0531815 = fieldWeight in 4293, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.616968 = idf(docFreq=438, maxDocs=44421)
                0.1875 = fieldNorm(doc=4293)
          0.534965 = weight(abstract_txt:wikipedia in 4293) [ClassicSimilarity], result of:
            0.534965 = score(doc=4293,freq=1.0), product of:
              0.4561582 = queryWeight, product of:
                3.9928555 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.018265152 = queryNorm
              1.1727619 = fieldWeight in 4293, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.1875 = fieldNorm(doc=4293)
        0.16 = coord(4/25)
    
  5. Stöcklin, N.: Wikipedia clever nutzen : in Schule und Beruf (2010) 0.17
    0.16929291 = sum of:
      0.16929291 = product of:
        0.7053872 = sum of:
          0.031672508 = weight(abstract_txt:alle in 531) [ClassicSimilarity], result of:
            0.031672508 = score(doc=531,freq=1.0), product of:
              0.09993723 = queryWeight, product of:
                1.0790185 = boost
                5.070784 = idf(docFreq=757, maxDocs=44421)
                0.018265152 = queryNorm
              0.316924 = fieldWeight in 531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.070784 = idf(docFreq=757, maxDocs=44421)
                0.0625 = fieldNorm(doc=531)
          0.02224152 = weight(abstract_txt:werden in 531) [ClassicSimilarity], result of:
            0.02224152 = score(doc=531,freq=2.0), product of:
              0.07173577 = queryWeight, product of:
                1.1196408 = boost
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.018265152 = queryNorm
              0.31004784 = fieldWeight in 531, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.507791 = idf(docFreq=3617, maxDocs=44421)
                0.0625 = fieldNorm(doc=531)
          0.0799974 = weight(abstract_txt:personen in 531) [ClassicSimilarity], result of:
            0.0799974 = score(doc=531,freq=1.0), product of:
              0.18534872 = queryWeight, product of:
                1.4694676 = boost
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.018265152 = queryNorm
              0.4316048 = fieldWeight in 531, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.905677 = idf(docFreq=120, maxDocs=44421)
                0.0625 = fieldNorm(doc=531)
          0.13139217 = weight(abstract_txt:enzyklopädie in 531) [ClassicSimilarity], result of:
            0.13139217 = score(doc=531,freq=2.0), product of:
              0.20478995 = queryWeight, product of:
                1.5446125 = boost
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.018265152 = queryNorm
              0.64159477 = fieldWeight in 531, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.0625 = fieldNorm(doc=531)
          0.041344233 = weight(abstract_txt:sind in 531) [ClassicSimilarity], result of:
            0.041344233 = score(doc=531,freq=2.0), product of:
              0.11936646 = queryWeight, product of:
                1.6677133 = boost
                3.9186604 = idf(docFreq=2398, maxDocs=44421)
                0.018265152 = queryNorm
              0.3463639 = fieldWeight in 531, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9186604 = idf(docFreq=2398, maxDocs=44421)
                0.0625 = fieldNorm(doc=531)
          0.39873934 = weight(abstract_txt:wikipedia in 531) [ClassicSimilarity], result of:
            0.39873934 = score(doc=531,freq=5.0), product of:
              0.4561582 = queryWeight, product of:
                3.9928555 = boost
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.018265152 = queryNorm
              0.8741251 = fieldWeight in 531, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.25473 = idf(docFreq=231, maxDocs=44421)
                0.0625 = fieldNorm(doc=531)
        0.24 = coord(6/25)