Document (#38022)

Author
Kempf, A.O.
Zapilko, B.
Title
Normdatenpflege in Zeiten der Automatisierung : Erstellung und Evaluation automatisch aufgebauter Thesaurus-Crosskonkordanzen
Source
Information - Wissenschaft und Praxis. 64(2013) H.4, S.199-208
Year
2013
Abstract
Thesaurus-Crosskonkordanzen bilden eine wichtige Voraussetzung für die integrierte Suche in einer verteilten Datenstruktur. Ihr Aufbau erfordert allerdings erhebliche personelle Ressourcen. Der vorliegende Beitrag liefert Evaluationsergebnisse des Library Track 2012 der Ontology Alignment Evaluation Initiative (OAEI), in dem Crosskonkordanzen zwischen dem Thesaurus Sozialwissenschaften (TheSoz) und dem Standard Thesaurus Wirtschaft (STW) erstmals automatisch erstellt wurden. Die Evaluation weist auf deutliche Unterschiede in den getesteten Matching- Tools hin und stellt die qualitativen Unterschiede einer automatisch im Vergleich zu einer intellektuell erstellten Crosskonkordanz heraus. Die Ergebnisse sprechen für einen Einsatz automatisch generierter Thesaurus-Crosskonkordanzen, um Domänenexperten eine maschinell erzeugte Vorselektion von möglichen Äquivalenzrelationen anzubieten.
Content
Vgl.: http://www.degruyter.com/view/j/iwp.2013.64.issue-4/iwp-2013-0025/iwp-2013-0025.xml?format=INT.
Theme
Semantische Interoperabilität
Object
Thesaurus Sozialwissenschaften
Standard Thesaurus Wirtschaft

Similar documents (author)

  1. Kempf, A.O.; Ritze, D.; Eckert, K.; Zapilko, B.: New ways of mapping knowledge organization systems : using a semi­automatic matching­procedure for building up vocabulary crosswalks (2013) 3.91
    3.9053016 = sum of:
      3.9053016 = sum of:
        1.6452473 = weight(author_txt:kempf in 1989) [ClassicSimilarity], result of:
          1.6452473 = score(doc=1989,freq=1.0), product of:
            0.6290628 = queryWeight, product of:
              8.369263 = idf(docFreq=27, maxDocs=44421)
              0.07516346 = queryNorm
            2.6153946 = fieldWeight in 1989, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.369263 = idf(docFreq=27, maxDocs=44421)
              0.3125 = fieldNorm(doc=1989)
        2.2600543 = weight(author_txt:zapilko in 1989) [ClassicSimilarity], result of:
          2.2600543 = score(doc=1989,freq=1.0), product of:
            0.7773545 = queryWeight, product of:
              1.1116359 = boost
              9.303573 = idf(docFreq=10, maxDocs=44421)
              0.07516346 = queryNorm
            2.9073665 = fieldWeight in 1989, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.303573 = idf(docFreq=10, maxDocs=44421)
              0.3125 = fieldNorm(doc=1989)
    
  2. Kempf, A.O.; Ritze, D.; Eckert, K.; Zapilko, B.: New ways of mapping knowledge organization systems : using a semi-automatic matching procedure for building up vocabulary crosswalks (2014) 3.91
    3.9053016 = sum of:
      3.9053016 = sum of:
        1.6452473 = weight(author_txt:kempf in 2371) [ClassicSimilarity], result of:
          1.6452473 = score(doc=2371,freq=1.0), product of:
            0.6290628 = queryWeight, product of:
              8.369263 = idf(docFreq=27, maxDocs=44421)
              0.07516346 = queryNorm
            2.6153946 = fieldWeight in 2371, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              8.369263 = idf(docFreq=27, maxDocs=44421)
              0.3125 = fieldNorm(doc=2371)
        2.2600543 = weight(author_txt:zapilko in 2371) [ClassicSimilarity], result of:
          2.2600543 = score(doc=2371,freq=1.0), product of:
            0.7773545 = queryWeight, product of:
              1.1116359 = boost
              9.303573 = idf(docFreq=10, maxDocs=44421)
              0.07516346 = queryNorm
            2.9073665 = fieldWeight in 2371, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              9.303573 = idf(docFreq=10, maxDocs=44421)
              0.3125 = fieldNorm(doc=2371)
    
  3. Zapilko, B.: Dynamisches Browsing im Kontext von Informationsarchitekturen (2010) 2.26
    2.2600543 = sum of:
      2.2600543 = product of:
        4.5201087 = sum of:
          4.5201087 = weight(author_txt:zapilko in 731) [ClassicSimilarity], result of:
            4.5201087 = score(doc=731,freq=1.0), product of:
              0.7773545 = queryWeight, product of:
                1.1116359 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.07516346 = queryNorm
              5.814733 = fieldWeight in 731, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.625 = fieldNorm(doc=731)
        0.5 = coord(1/2)
    
  4. Zapilko, B.: InFoLiS (2017) 2.26
    2.2600543 = sum of:
      2.2600543 = product of:
        4.5201087 = sum of:
          4.5201087 = weight(author_txt:zapilko in 2031) [ClassicSimilarity], result of:
            4.5201087 = score(doc=2031,freq=1.0), product of:
              0.7773545 = queryWeight, product of:
                1.1116359 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.07516346 = queryNorm
              5.814733 = fieldWeight in 2031, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.625 = fieldNorm(doc=2031)
        0.5 = coord(1/2)
    
  5. Stempfhuber, M.; Zapilko, B.: Modelling text-fact-integration in digital libraries (2009) 1.81
    1.8080435 = sum of:
      1.8080435 = product of:
        3.616087 = sum of:
          3.616087 = weight(author_txt:zapilko in 380) [ClassicSimilarity], result of:
            3.616087 = score(doc=380,freq=1.0), product of:
              0.7773545 = queryWeight, product of:
                1.1116359 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.07516346 = queryNorm
              4.6517863 = fieldWeight in 380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.5 = fieldNorm(doc=380)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Mayr, P.; Petras, V.: Crosskonkordanzen : Terminologie Mapping und deren Effektivität für das Information Retrieval 0.29
    0.29324722 = sum of:
      0.29324722 = product of:
        1.4662361 = sum of:
          0.07492917 = weight(abstract_txt:sozialwissenschaften in 2996) [ClassicSimilarity], result of:
            0.07492917 = score(doc=2996,freq=1.0), product of:
              0.106398344 = queryWeight, product of:
                1.0227447 = boost
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.013849143 = queryNorm
              0.70423245 = fieldWeight in 2996, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.09375 = fieldNorm(doc=2996)
          0.15747668 = weight(abstract_txt:crosskonkordanz in 2996) [ClassicSimilarity], result of:
            0.15747668 = score(doc=2996,freq=1.0), product of:
              0.17457354 = queryWeight, product of:
                1.3100533 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.013849143 = queryNorm
              0.902065 = fieldWeight in 2996, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.09375 = fieldNorm(doc=2996)
          0.15747668 = weight(abstract_txt:evaluationsergebnisse in 2996) [ClassicSimilarity], result of:
            0.15747668 = score(doc=2996,freq=1.0), product of:
              0.17457354 = queryWeight, product of:
                1.3100533 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.013849143 = queryNorm
              0.902065 = fieldWeight in 2996, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.09375 = fieldNorm(doc=2996)
          0.047672167 = weight(abstract_txt:evaluation in 2996) [ClassicSimilarity], result of:
            0.047672167 = score(doc=2996,freq=1.0), product of:
              0.11351448 = queryWeight, product of:
                1.829726 = boost
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.013849143 = queryNorm
              0.4199655 = fieldWeight in 2996, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.09375 = fieldNorm(doc=2996)
          1.0286815 = weight(abstract_txt:crosskonkordanzen in 2996) [ClassicSimilarity], result of:
            1.0286815 = score(doc=2996,freq=4.0), product of:
              0.6100352 = queryWeight, product of:
                4.8978696 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.013849143 = queryNorm
              1.6862658 = fieldWeight in 2996, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.09375 = fieldNorm(doc=2996)
        0.2 = coord(5/25)
    
  2. Schott, H.; Schroeder, A.: Crosskonkordanzen von Thesauri und Klassifikationen (2004) 0.23
    0.2300239 = sum of:
      0.2300239 = product of:
        1.1501195 = sum of:
          0.064848006 = weight(abstract_txt:verteilten in 4126) [ClassicSimilarity], result of:
            0.064848006 = score(doc=4126,freq=1.0), product of:
              0.10911543 = queryWeight, product of:
                1.0357212 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.013849143 = queryNorm
              0.59430647 = fieldWeight in 4126, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.078125 = fieldNorm(doc=4126)
          0.07016647 = weight(abstract_txt:integrierte in 4126) [ClassicSimilarity], result of:
            0.07016647 = score(doc=4126,freq=1.0), product of:
              0.11500274 = queryWeight, product of:
                1.0632952 = boost
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.013849143 = queryNorm
              0.6101287 = fieldWeight in 4126, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.809647 = idf(docFreq=48, maxDocs=44421)
                0.078125 = fieldNorm(doc=4126)
          0.076500475 = weight(abstract_txt:intellektuell in 4126) [ClassicSimilarity], result of:
            0.076500475 = score(doc=4126,freq=1.0), product of:
              0.12182353 = queryWeight, product of:
                1.094373 = boost
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.013849143 = queryNorm
              0.6279614 = fieldWeight in 4126, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.078125 = fieldNorm(doc=4126)
          0.08137008 = weight(abstract_txt:maschinell in 4126) [ClassicSimilarity], result of:
            0.08137008 = score(doc=4126,freq=1.0), product of:
              0.12693992 = queryWeight, product of:
                1.1171176 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.013849143 = queryNorm
              0.6410125 = fieldWeight in 4126, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.078125 = fieldNorm(doc=4126)
          0.85723454 = weight(abstract_txt:crosskonkordanzen in 4126) [ClassicSimilarity], result of:
            0.85723454 = score(doc=4126,freq=4.0), product of:
              0.6100352 = queryWeight, product of:
                4.8978696 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.013849143 = queryNorm
              1.4052215 = fieldWeight in 4126, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.078125 = fieldNorm(doc=4126)
        0.2 = coord(5/25)
    
  3. Mayr, P.: Re-Ranking auf Basis von Bradfordizing für die verteilte Suche in Digitalen Bibliotheken (2009) 0.19
    0.19387275 = sum of:
      0.19387275 = product of:
        0.80780315 = sum of:
          0.04370868 = weight(abstract_txt:sozialwissenschaften in 302) [ClassicSimilarity], result of:
            0.04370868 = score(doc=302,freq=1.0), product of:
              0.106398344 = queryWeight, product of:
                1.0227447 = boost
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.013849143 = queryNorm
              0.41080225 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.0546875 = fieldNorm(doc=302)
          0.053045917 = weight(abstract_txt:qualitativen in 302) [ClassicSimilarity], result of:
            0.053045917 = score(doc=302,freq=1.0), product of:
              0.12105732 = queryWeight, product of:
                1.090926 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.013849143 = queryNorm
              0.43818843 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0546875 = fieldNorm(doc=302)
          0.05355033 = weight(abstract_txt:intellektuell in 302) [ClassicSimilarity], result of:
            0.05355033 = score(doc=302,freq=1.0), product of:
              0.12182353 = queryWeight, product of:
                1.094373 = boost
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.013849143 = queryNorm
              0.43957296 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.0546875 = fieldNorm(doc=302)
          0.018106494 = weight(abstract_txt:einer in 302) [ClassicSimilarity], result of:
            0.018106494 = score(doc=302,freq=1.0), product of:
              0.08527461 = queryWeight, product of:
                1.5858798 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.013849143 = queryNorm
              0.21233161 = fieldWeight in 302, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.0546875 = fieldNorm(doc=302)
          0.03932753 = weight(abstract_txt:evaluation in 302) [ClassicSimilarity], result of:
            0.03932753 = score(doc=302,freq=2.0), product of:
              0.11351448 = queryWeight, product of:
                1.829726 = boost
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.013849143 = queryNorm
              0.34645385 = fieldWeight in 302, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.479632 = idf(docFreq=1368, maxDocs=44421)
                0.0546875 = fieldNorm(doc=302)
          0.6000642 = weight(abstract_txt:crosskonkordanzen in 302) [ClassicSimilarity], result of:
            0.6000642 = score(doc=302,freq=4.0), product of:
              0.6100352 = queryWeight, product of:
                4.8978696 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.013849143 = queryNorm
              0.9836551 = fieldWeight in 302, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0546875 = fieldNorm(doc=302)
        0.24 = coord(6/25)
    
  4. Strötgen, R.; Kokkelink, S.: Metadatenextraktion aus Internetquellen : Heterogenitätsbehandlung im Projekt CARMEN (2001) 0.17
    0.17129414 = sum of:
      0.17129414 = product of:
        0.71372557 = sum of:
          0.049952775 = weight(abstract_txt:sozialwissenschaften in 6808) [ClassicSimilarity], result of:
            0.049952775 = score(doc=6808,freq=1.0), product of:
              0.106398344 = queryWeight, product of:
                1.0227447 = boost
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.013849143 = queryNorm
              0.4694883 = fieldWeight in 6808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.0625 = fieldNorm(doc=6808)
          0.051878404 = weight(abstract_txt:verteilten in 6808) [ClassicSimilarity], result of:
            0.051878404 = score(doc=6808,freq=1.0), product of:
              0.10911543 = queryWeight, product of:
                1.0357212 = boost
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.013849143 = queryNorm
              0.47544518 = fieldWeight in 6808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.607123 = idf(docFreq=59, maxDocs=44421)
                0.0625 = fieldNorm(doc=6808)
          0.0865504 = weight(abstract_txt:intellektuell in 6808) [ClassicSimilarity], result of:
            0.0865504 = score(doc=6808,freq=2.0), product of:
              0.12182353 = queryWeight, product of:
                1.094373 = boost
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.013849143 = queryNorm
              0.7104572 = fieldWeight in 6808, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.037906 = idf(docFreq=38, maxDocs=44421)
                0.0625 = fieldNorm(doc=6808)
          0.020693136 = weight(abstract_txt:einer in 6808) [ClassicSimilarity], result of:
            0.020693136 = score(doc=6808,freq=1.0), product of:
              0.08527461 = queryWeight, product of:
                1.5858798 = boost
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.013849143 = queryNorm
              0.2426647 = fieldWeight in 6808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.882635 = idf(docFreq=2486, maxDocs=44421)
                0.0625 = fieldNorm(doc=6808)
          0.16175704 = weight(abstract_txt:automatisch in 6808) [ClassicSimilarity], result of:
            0.16175704 = score(doc=6808,freq=1.0), product of:
              0.36967823 = queryWeight, product of:
                3.8127797 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.013849143 = queryNorm
              0.4375617 = fieldWeight in 6808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.0625 = fieldNorm(doc=6808)
          0.34289384 = weight(abstract_txt:crosskonkordanzen in 6808) [ClassicSimilarity], result of:
            0.34289384 = score(doc=6808,freq=1.0), product of:
              0.6100352 = queryWeight, product of:
                4.8978696 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.013849143 = queryNorm
              0.5620886 = fieldWeight in 6808, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.0625 = fieldNorm(doc=6808)
        0.24 = coord(6/25)
    
  5. Mayr, P.; Zapilko, B.; Sure, Y.: ¬Ein Mehr-Thesauri-Szenario auf Basis von SKOS und Crosskonkordanzen (2010) 0.15
    0.14585283 = sum of:
      0.14585283 = product of:
        0.9115802 = sum of:
          0.06244097 = weight(abstract_txt:sozialwissenschaften in 379) [ClassicSimilarity], result of:
            0.06244097 = score(doc=379,freq=1.0), product of:
              0.106398344 = queryWeight, product of:
                1.0227447 = boost
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.013849143 = queryNorm
              0.58686036 = fieldWeight in 379, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.5118127 = idf(docFreq=65, maxDocs=44421)
                0.078125 = fieldNorm(doc=379)
          0.06662849 = weight(abstract_txt:anzubieten in 379) [ClassicSimilarity], result of:
            0.06662849 = score(doc=379,freq=1.0), product of:
              0.11110367 = queryWeight, product of:
                1.0451148 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.013849143 = queryNorm
              0.5996966 = fieldWeight in 379, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.078125 = fieldNorm(doc=379)
          0.17635442 = weight(abstract_txt:thesaurus in 379) [ClassicSimilarity], result of:
            0.17635442 = score(doc=379,freq=3.0), product of:
              0.25205514 = queryWeight, product of:
                3.519918 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.013849143 = queryNorm
              0.699666 = fieldWeight in 379, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.078125 = fieldNorm(doc=379)
          0.60615635 = weight(abstract_txt:crosskonkordanzen in 379) [ClassicSimilarity], result of:
            0.60615635 = score(doc=379,freq=2.0), product of:
              0.6100352 = queryWeight, product of:
                4.8978696 = boost
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.013849143 = queryNorm
              0.9936416 = fieldWeight in 379, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.993418 = idf(docFreq=14, maxDocs=44421)
                0.078125 = fieldNorm(doc=379)
        0.16 = coord(4/25)