Document (#39516)

Author
Posch, L.
Schaer, P.
Bleier, A.
Strohmaier, M.
Title
¬A system for probabilistic linking of thesauri and classification systems
Source
Künstliche Intelligenz. 2015, pp. 1-4
Year
2015
Abstract
This paper presents a system which creates and visualizes probabilistic semantic links between concepts in a thesaurus and classes in a classification system. For creating the links, we build on the Polylingual Labeled Topic Model (PLL-TM) (Posch et al., in KI 2015: advances in artificial intelligence, 2015). PLL-TM identifies probable thesaurus descriptors for each class in the classification system by using information from the natural language text of documents, their assigned thesaurus descriptors and their designated classes. The links are then presented to users of the system in an interactive visualization, providing them with an automatically generated overview of the relations between the thesaurus and the classification system.
Content
Cf.: http://link.springer.com/article/10.1007%2Fs13218-015-0413-9.
Theme
Semantic interoperability

Similar documents (author)

  1. Schaer, P.: Integration von Open-Access-Repositorien in Fachportale (2010) 5.54
    5.5426593 = sum of:
      5.5426593 = weight(author_txt:schaer in 3320) [ClassicSimilarity], result of:
        5.5426593 = fieldWeight in 3320, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.868255 = idf(docFreq=16, maxDocs=44421)
          0.625 = fieldNorm(doc=3320)
    
  2. Schaer, P.: Sprachmodelle und neuronale Netze im Information Retrieval (2023) 5.54
    5.5426593 = sum of:
      5.5426593 = weight(author_txt:schaer in 1800) [ClassicSimilarity], result of:
        5.5426593 = fieldWeight in 1800, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.868255 = idf(docFreq=16, maxDocs=44421)
          0.625 = fieldNorm(doc=1800)
    
  3. Munkelt, J.; Schaer, P.: Towards an IR test collection for the German National Library (2018) 4.43
    4.4341273 = sum of:
      4.4341273 = weight(author_txt:schaer in 780) [ClassicSimilarity], result of:
        4.4341273 = fieldWeight in 780, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.868255 = idf(docFreq=16, maxDocs=44421)
          0.5 = fieldNorm(doc=780)
    
  4. Mayr, P.; Schaer, P.; Mutschke, P.: ¬A science model driven retrieval prototype (2011) 3.33
    3.3255954 = sum of:
      3.3255954 = weight(author_txt:schaer in 1649) [ClassicSimilarity], result of:
        3.3255954 = fieldWeight in 1649, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.868255 = idf(docFreq=16, maxDocs=44421)
          0.375 = fieldNorm(doc=1649)
    
  5. Neumann, M.; Steinberg, J.; Schaer, P.: Web-scraping for non-programmers : introducing OXPath for digital library metadata harvesting (2017) 3.33
    3.3255954 = sum of:
      3.3255954 = weight(author_txt:schaer in 4895) [ClassicSimilarity], result of:
        3.3255954 = fieldWeight in 4895, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.868255 = idf(docFreq=16, maxDocs=44421)
          0.375 = fieldNorm(doc=4895)
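
The author scores above can be recomputed by hand. The following is a minimal sketch, not the catalog's actual code: it assumes the ClassicSimilarity forms tf = sqrt(termFreq) and idf = 1 + ln(maxDocs / (docFreq + 1)), which reproduce the printed values to about six decimal places. The function names and the AUTHOR_HITS list are illustrative; the numbers are copied from the five entries.

```python
import math

MAX_DOCS = 44421  # maxDocs as printed in the explain output


def classic_idf(doc_freq: int, max_docs: int = MAX_DOCS) -> float:
    """idf = 1 + ln(maxDocs / (docFreq + 1)); assumed form, checked against the listing."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq: float) -> float:
    """tf = sqrt(termFreq)."""
    return math.sqrt(freq)


def field_weight(freq: float, doc_freq: int, field_norm: float) -> float:
    """fieldWeight = tf * idf * fieldNorm, as in the 'product of' lines."""
    return classic_tf(freq) * classic_idf(doc_freq) * field_norm


# (internal doc id, fieldNorm, listed score), copied from the entries above.
AUTHOR_HITS = [
    (3320, 0.625, 5.5426593),
    (1800, 0.625, 5.5426593),
    (780, 0.5, 4.4341273),
    (1649, 0.375, 3.3255954),
    (4895, 0.375, 3.3255954),
]

for doc_id, norm, listed in AUTHOR_HITS:
    # The term "schaer" occurs once per author field and in 16 documents overall.
    computed = field_weight(freq=1.0, doc_freq=16, field_norm=norm)
    print(f"doc={doc_id}: computed={computed:.7f} listed={listed}")
```

Because tf and idf are identical for all five hits, the ranking here depends only on fieldNorm, which in this scoring scheme is larger for shorter fields; records with fewer co-authors therefore rank higher.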
    

Similar documents (content)

  1. Loosjes, T.P.; Tichelaar, P.A.; Goossens, J.; Stuurman, P.: Ontsluiting op onderwerp (1977) 0.19
    0.18542497 = sum of:
      0.18542497 = product of:
        0.77260405 = sum of:
          0.020924196 = weight(abstract_txt:between in 978) [ClassicSimilarity], result of:
            0.020924196 = score(doc=978,freq=1.0), product of:
              0.07746572 = queryWeight, product of:
                1.244276 = boost
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.018007096 = queryNorm
              0.2701091 = fieldWeight in 978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.078125 = fieldNorm(doc=978)
          0.100145794 = weight(abstract_txt:classes in 978) [ClassicSimilarity], result of:
            0.100145794 = score(doc=978,freq=1.0), product of:
              0.22000483 = queryWeight, product of:
                2.0969017 = boost
                5.8265367 = idf(docFreq=355, maxDocs=44421)
                0.018007096 = queryNorm
              0.45519817 = fieldWeight in 978, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8265367 = idf(docFreq=355, maxDocs=44421)
                0.078125 = fieldNorm(doc=978)
          0.2595778 = weight(abstract_txt:descriptors in 978) [ClassicSimilarity], result of:
            0.2595778 = score(doc=978,freq=3.0), product of:
              0.28783816 = queryWeight, product of:
                2.3984802 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.018007096 = queryNorm
              0.90181863 = fieldWeight in 978, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.078125 = fieldNorm(doc=978)
          0.11158748 = weight(abstract_txt:classification in 978) [ClassicSimilarity], result of:
            0.11158748 = score(doc=978,freq=3.0), product of:
              0.20656511 = queryWeight, product of:
                2.8734617 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.018007096 = queryNorm
              0.5402049 = fieldWeight in 978, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.078125 = fieldNorm(doc=978)
          0.08241375 = weight(abstract_txt:system in 978) [ClassicSimilarity], result of:
            0.08241375 = score(doc=978,freq=2.0), product of:
              0.22116035 = queryWeight, product of:
                3.6414654 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.018007096 = queryNorm
              0.37264252 = fieldWeight in 978, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.078125 = fieldNorm(doc=978)
          0.19795503 = weight(abstract_txt:thesaurus in 978) [ClassicSimilarity], result of:
            0.19795503 = score(doc=978,freq=2.0), product of:
              0.3465145 = queryWeight, product of:
                3.7216682 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.018007096 = queryNorm
              0.5712749 = fieldWeight in 978, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.078125 = fieldNorm(doc=978)
        0.24 = coord(6/25)
    
  2. Williamson, N.J.: Deriving a thesaurus from a restructured UDC (1996) 0.17
    0.16889998 = sum of:
      0.16889998 = product of:
        0.70374995 = sum of:
          0.08806705 = weight(abstract_txt:class in 5262) [ClassicSimilarity], result of:
            0.08806705 = score(doc=5262,freq=2.0), product of:
              0.11265363 = queryWeight, product of:
                1.0610101 = boost
                5.8963327 = idf(docFreq=331, maxDocs=44421)
                0.018007096 = queryNorm
              0.7817507 = fieldWeight in 5262, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8963327 = idf(docFreq=331, maxDocs=44421)
                0.09375 = fieldNorm(doc=5262)
          0.019032756 = weight(abstract_txt:their in 5262) [ClassicSimilarity], result of:
            0.019032756 = score(doc=5262,freq=1.0), product of:
              0.06440072 = queryWeight, product of:
                1.1345073 = boost
                3.1523883 = idf(docFreq=5161, maxDocs=44421)
                0.018007096 = queryNorm
              0.2955364 = fieldWeight in 5262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1523883 = idf(docFreq=5161, maxDocs=44421)
                0.09375 = fieldNorm(doc=5262)
          0.17984079 = weight(abstract_txt:descriptors in 5262) [ClassicSimilarity], result of:
            0.17984079 = score(doc=5262,freq=1.0), product of:
              0.28783816 = queryWeight, product of:
                2.3984802 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.018007096 = queryNorm
              0.6247983 = fieldWeight in 5262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.09375 = fieldNorm(doc=5262)
          0.109332964 = weight(abstract_txt:classification in 5262) [ClassicSimilarity], result of:
            0.109332964 = score(doc=5262,freq=2.0), product of:
              0.20656511 = queryWeight, product of:
                2.8734617 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.018007096 = queryNorm
              0.52929056 = fieldWeight in 5262, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.09375 = fieldNorm(doc=5262)
          0.06993039 = weight(abstract_txt:system in 5262) [ClassicSimilarity], result of:
            0.06993039 = score(doc=5262,freq=1.0), product of:
              0.22116035 = queryWeight, product of:
                3.6414654 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.018007096 = queryNorm
              0.31619766 = fieldWeight in 5262, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.09375 = fieldNorm(doc=5262)
          0.23754603 = weight(abstract_txt:thesaurus in 5262) [ClassicSimilarity], result of:
            0.23754603 = score(doc=5262,freq=2.0), product of:
              0.3465145 = queryWeight, product of:
                3.7216682 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.018007096 = queryNorm
              0.6855298 = fieldWeight in 5262, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.09375 = fieldNorm(doc=5262)
        0.24 = coord(6/25)
    
  3. Francu, V.: Multilingual access to information using an intermediate language (2003) 0.16
    0.16061598 = sum of:
      0.16061598 = product of:
        0.66923326 = sum of:
          0.014646937 = weight(abstract_txt:between in 2742) [ClassicSimilarity], result of:
            0.014646937 = score(doc=2742,freq=1.0), product of:
              0.07746572 = queryWeight, product of:
                1.244276 = boost
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.018007096 = queryNorm
              0.18907636 = fieldWeight in 2742, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2742)
          0.09913928 = weight(abstract_txt:classes in 2742) [ClassicSimilarity], result of:
            0.09913928 = score(doc=2742,freq=2.0), product of:
              0.22000483 = queryWeight, product of:
                2.0969017 = boost
                5.8265367 = idf(docFreq=355, maxDocs=44421)
                0.018007096 = queryNorm
              0.4506232 = fieldWeight in 2742, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8265367 = idf(docFreq=355, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2742)
          0.104907125 = weight(abstract_txt:descriptors in 2742) [ClassicSimilarity], result of:
            0.104907125 = score(doc=2742,freq=1.0), product of:
              0.28783816 = queryWeight, product of:
                2.3984802 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.018007096 = queryNorm
              0.36446565 = fieldWeight in 2742, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2742)
          0.11931688 = weight(abstract_txt:classification in 2742) [ClassicSimilarity], result of:
            0.11931688 = score(doc=2742,freq=7.0), product of:
              0.20656511 = queryWeight, product of:
                2.8734617 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.018007096 = queryNorm
              0.5776236 = fieldWeight in 2742, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2742)
          0.091215305 = weight(abstract_txt:system in 2742) [ClassicSimilarity], result of:
            0.091215305 = score(doc=2742,freq=5.0), product of:
              0.22116035 = queryWeight, product of:
                3.6414654 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.018007096 = queryNorm
              0.41243967 = fieldWeight in 2742, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2742)
          0.24000771 = weight(abstract_txt:thesaurus in 2742) [ClassicSimilarity], result of:
            0.24000771 = score(doc=2742,freq=6.0), product of:
              0.3465145 = queryWeight, product of:
                3.7216682 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.018007096 = queryNorm
              0.692634 = fieldWeight in 2742, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2742)
        0.24 = coord(6/25)
    
  4. Tudhope, D.; Binding, C.; Blocks, D.; Cunliffe, D.: FACET: thesaurus retrieval with semantic term expansion (2002) 0.15
    0.15429652 = sum of:
      0.15429652 = product of:
        0.6429022 = sum of:
          0.014646937 = weight(abstract_txt:between in 1175) [ClassicSimilarity], result of:
            0.014646937 = score(doc=1175,freq=1.0), product of:
              0.07746572 = queryWeight, product of:
                1.244276 = boost
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.018007096 = queryNorm
              0.18907636 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4573963 = idf(docFreq=3804, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.07010206 = weight(abstract_txt:classes in 1175) [ClassicSimilarity], result of:
            0.07010206 = score(doc=1175,freq=1.0), product of:
              0.22000483 = queryWeight, product of:
                2.0969017 = boost
                5.8265367 = idf(docFreq=355, maxDocs=44421)
                0.018007096 = queryNorm
              0.3186387 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8265367 = idf(docFreq=355, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.18170446 = weight(abstract_txt:descriptors in 1175) [ClassicSimilarity], result of:
            0.18170446 = score(doc=1175,freq=3.0), product of:
              0.28783816 = queryWeight, product of:
                2.3984802 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.018007096 = queryNorm
              0.63127303 = fieldWeight in 1175, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.076418065 = weight(abstract_txt:links in 1175) [ClassicSimilarity], result of:
            0.076418065 = score(doc=1175,freq=1.0), product of:
              0.26675105 = queryWeight, product of:
                2.827878 = boost
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.018007096 = queryNorm
              0.2864771 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.040792722 = weight(abstract_txt:system in 1175) [ClassicSimilarity], result of:
            0.040792722 = score(doc=1175,freq=1.0), product of:
              0.22116035 = queryWeight, product of:
                3.6414654 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.018007096 = queryNorm
              0.18444863 = fieldWeight in 1175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
          0.25923795 = weight(abstract_txt:thesaurus in 1175) [ClassicSimilarity], result of:
            0.25923795 = score(doc=1175,freq=7.0), product of:
              0.3465145 = queryWeight, product of:
                3.7216682 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.018007096 = queryNorm
              0.7481302 = fieldWeight in 1175, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.0546875 = fieldNorm(doc=1175)
        0.24 = coord(6/25)
    
  5. Doorn, M. van; Polman, K.: From classification to thesaurus ... and back? : subject indexing tools at the library of the Afrika-Studiecentrum Leiden (2010) 0.15
    0.14505947 = sum of:
      0.14505947 = product of:
        0.60441446 = sum of:
          0.045869567 = weight(abstract_txt:linking in 62) [ClassicSimilarity], result of:
            0.045869567 = score(doc=62,freq=1.0), product of:
              0.12039917 = queryWeight, product of:
                1.0968789 = boost
                6.0956655 = idf(docFreq=271, maxDocs=44421)
                0.018007096 = queryNorm
              0.3809791 = fieldWeight in 62, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0956655 = idf(docFreq=271, maxDocs=44421)
                0.0625 = fieldNorm(doc=62)
          0.06619625 = weight(abstract_txt:assigned in 62) [ClassicSimilarity], result of:
            0.06619625 = score(doc=62,freq=2.0), product of:
              0.122035444 = queryWeight, product of:
                1.1043073 = boost
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.018007096 = queryNorm
              0.54243463 = fieldWeight in 62, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.136947 = idf(docFreq=260, maxDocs=44421)
                0.0625 = fieldNorm(doc=62)
          0.11989386 = weight(abstract_txt:descriptors in 62) [ClassicSimilarity], result of:
            0.11989386 = score(doc=62,freq=1.0), product of:
              0.28783816 = queryWeight, product of:
                2.3984802 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.018007096 = queryNorm
              0.4165322 = fieldWeight in 62, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.0625 = fieldNorm(doc=62)
          0.051540054 = weight(abstract_txt:classification in 62) [ClassicSimilarity], result of:
            0.051540054 = score(doc=62,freq=1.0), product of:
              0.20656511 = queryWeight, product of:
                2.8734617 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.018007096 = queryNorm
              0.24950996 = fieldWeight in 62, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=62)
          0.046620257 = weight(abstract_txt:system in 62) [ClassicSimilarity], result of:
            0.046620257 = score(doc=62,freq=1.0), product of:
              0.22116035 = queryWeight, product of:
                3.6414654 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.018007096 = queryNorm
              0.21079844 = fieldWeight in 62, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.0625 = fieldNorm(doc=62)
          0.27429453 = weight(abstract_txt:thesaurus in 62) [ClassicSimilarity], result of:
            0.27429453 = score(doc=62,freq=6.0), product of:
              0.3465145 = queryWeight, product of:
                3.7216682 = boost
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.018007096 = queryNorm
              0.7915817 = fieldWeight in 62, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.17059 = idf(docFreq=685, maxDocs=44421)
                0.0625 = fieldNorm(doc=62)
        0.24 = coord(6/25)
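
The content scores combine several terms. As a hedged sketch under the same assumptions (tf = sqrt(termFreq), idf = 1 + ln(maxDocs / (docFreq + 1))), the score of entry 5 (doc=62) can be rebuilt as coord * sum(queryWeight * fieldWeight), with queryWeight = boost * idf * queryNorm and fieldWeight = tf * idf * fieldNorm. The helper names and the TERMS list are illustrative; the boosts, document frequencies, and norms are copied from the listing.

```python
import math

MAX_DOCS = 44421          # maxDocs from the listing
QUERY_NORM = 0.018007096  # queryNorm from the listing
FIELD_NORM = 0.0625       # fieldNorm(doc=62)
COORD = 6 / 25            # 6 of the 25 query terms matched this document


def classic_idf(doc_freq: int) -> float:
    """idf = 1 + ln(maxDocs / (docFreq + 1)); assumed form, checked against the listing."""
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))


def term_score(boost: float, doc_freq: int, freq: float) -> float:
    """Score of one matching term: queryWeight * fieldWeight."""
    idf = classic_idf(doc_freq)
    query_weight = boost * idf * QUERY_NORM
    field_weight = math.sqrt(freq) * idf * FIELD_NORM
    return query_weight * field_weight


# (term, boost, docFreq, termFreq) for doc=62, copied from entry 5.
TERMS = [
    ("linking", 1.0968789, 271, 1.0),
    ("assigned", 1.1043073, 260, 2.0),
    ("descriptors", 2.3984802, 153, 1.0),
    ("classification", 2.8734617, 2228, 1.0),
    ("system", 3.6414654, 4140, 1.0),
    ("thesaurus", 3.7216682, 685, 6.0),
]

term_sum = sum(term_score(boost, df, freq) for _, boost, df, freq in TERMS)
score = COORD * term_sum

print(f"sum of term scores = {term_sum:.8f} (listed 0.60441446)")
print(f"final score        = {score:.8f} (listed 0.14505947)")
```

The coord factor scales the sum by the fraction of query terms that matched, which is why all five content hits, each matching 6 of 25 terms, are multiplied by the same 0.24.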