Document (#40854)

Author
Piros, A.
Title
¬The thought behind the symbol : about the automatic interpretation and representation of UDC numbers
Source
Knowledge organization. 44(2017) no.6, S.416-424
Year
2017
Abstract
Analytico-synthetic and faceted classifications, such as Universal Decimal Classification (UDC) provide facilities to express pre-coordinated subject statements using syntactic relations. In this case, the relevance, in the process of UDC-based information retrieval, can be determined by extracting the meaning of the classmarks as precisely as is possible. The central question here is how the identification mentioned above can be supported by automatic means and an analysis of the structure of complex classmarks appears to be an obvious requirement. Many bibliographic sources contain complex UDC classmarks which are stored as simple text strings and on which it is very difficult to perform any meaningful information discovery. The paper presents results from a phase of ongoing research focused on developing a new platform-independent, machine-processable data format capable of representing the whole syntactic structure of the composite UDC numbers to support their further automatic processing. An algorithm that can produce the representation of the numbers in such a format directly from their designations has also been developed and implemented. The research also includes implementing conversion methods to provide outputs that can be employed by other software directly and, as a service, make them available for other software. The paper provides an overview of the solutions developed and implemented since 2015 and outlines future research plans.
Content
Beitrag in einem Special Issue: Selected Papers from the International UDC Seminar 2017, Faceted Classification Today: Theory, Technology and End Users, 14-15 September, London UK.
Theme
International bedeutende Universalklassifikationen
Object
UDC

Similar documents (content)

  1. Piros, A.: Az ETO-jelzetek automatikus interpretálásának és elemzésének kérdései (2018) 0.42
    0.41578755 = sum of:
      0.41578755 = product of:
        1.0394689 = sum of:
          0.0646825 = weight(abstract_txt:requirement in 1856) [ClassicSimilarity], result of:
            0.0646825 = score(doc=1856,freq=1.0), product of:
              0.13832204 = queryWeight, product of:
                1.0307411 = boost
                7.48196 = idf(docFreq=67, maxDocs=44421)
                0.017936034 = queryNorm
              0.4676225 = fieldWeight in 1856, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.48196 = idf(docFreq=67, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
          0.06710793 = weight(abstract_txt:coordinated in 1856) [ClassicSimilarity], result of:
            0.06710793 = score(doc=1856,freq=1.0), product of:
              0.1417586 = queryWeight, product of:
                1.0434667 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.017936034 = queryNorm
              0.47339582 = fieldWeight in 1856, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
          0.025564933 = weight(abstract_txt:software in 1856) [ClassicSimilarity], result of:
            0.025564933 = score(doc=1856,freq=1.0), product of:
              0.093858436 = queryWeight, product of:
                1.2007581 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.017936034 = queryNorm
              0.27237758 = fieldWeight in 1856, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
          0.025564933 = weight(abstract_txt:structure in 1856) [ClassicSimilarity], result of:
            0.025564933 = score(doc=1856,freq=1.0), product of:
              0.093858436 = queryWeight, product of:
                1.2007581 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.017936034 = queryNorm
              0.27237758 = fieldWeight in 1856, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
          0.020666443 = weight(abstract_txt:research in 1856) [ClassicSimilarity], result of:
            0.020666443 = score(doc=1856,freq=2.0), product of:
              0.074001595 = queryWeight, product of:
                1.3058252 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.017936034 = queryNorm
              0.27927023 = fieldWeight in 1856, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
          0.040492784 = weight(abstract_txt:complex in 1856) [ClassicSimilarity], result of:
            0.040492784 = score(doc=1856,freq=1.0), product of:
              0.12753478 = queryWeight, product of:
                1.3996943 = boost
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.017936034 = queryNorm
              0.31750387 = fieldWeight in 1856, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
          0.057811894 = weight(abstract_txt:format in 1856) [ClassicSimilarity], result of:
            0.057811894 = score(doc=1856,freq=2.0), product of:
              0.12834482 = queryWeight, product of:
                1.4041324 = boost
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.017936034 = queryNorm
              0.450442 = fieldWeight in 1856, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
          0.12168061 = weight(abstract_txt:syntactic in 1856) [ClassicSimilarity], result of:
            0.12168061 = score(doc=1856,freq=2.0), product of:
              0.21078892 = queryWeight, product of:
                1.7994624 = boost
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.017936034 = queryNorm
              0.5772628 = fieldWeight in 1856, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.5309834 = idf(docFreq=175, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
          0.2312248 = weight(abstract_txt:numbers in 1856) [ClassicSimilarity], result of:
            0.2312248 = score(doc=1856,freq=6.0), product of:
              0.25667277 = queryWeight, product of:
                2.4319487 = boost
                5.8843565 = idf(docFreq=335, maxDocs=44421)
                0.017936034 = queryNorm
              0.90085447 = fieldWeight in 1856, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8843565 = idf(docFreq=335, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
          0.38467202 = weight(abstract_txt:classmarks in 1856) [ClassicSimilarity], result of:
            0.38467202 = score(doc=1856,freq=1.0), product of:
              0.6548387 = queryWeight, product of:
                3.884469 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.017936034 = queryNorm
              0.5874302 = fieldWeight in 1856, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
        0.4 = coord(10/25)
    
  2. Gnoli, C.; Pullman, T.; Cousson, P.; Merli, G.; Szostak, R.: Representing the structural elements of a freely faceted classification (2011) 0.12
    0.121426046 = sum of:
      0.121426046 = product of:
        0.6071302 = sum of:
          0.02496834 = weight(abstract_txt:provide in 825) [ClassicSimilarity], result of:
            0.02496834 = score(doc=825,freq=1.0), product of:
              0.07962143 = queryWeight, product of:
                1.1059458 = boost
                4.013929 = idf(docFreq=2180, maxDocs=44421)
                0.017936034 = queryNorm
              0.3135882 = fieldWeight in 825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.013929 = idf(docFreq=2180, maxDocs=44421)
                0.078125 = fieldNorm(doc=825)
          0.031956166 = weight(abstract_txt:structure in 825) [ClassicSimilarity], result of:
            0.031956166 = score(doc=825,freq=1.0), product of:
              0.093858436 = queryWeight, product of:
                1.2007581 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.017936034 = queryNorm
              0.34047198 = fieldWeight in 825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.078125 = fieldNorm(doc=825)
          0.018266726 = weight(abstract_txt:research in 825) [ClassicSimilarity], result of:
            0.018266726 = score(doc=825,freq=1.0), product of:
              0.074001595 = queryWeight, product of:
                1.3058252 = boost
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.017936034 = queryNorm
              0.24684234 = fieldWeight in 825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.159582 = idf(docFreq=5124, maxDocs=44421)
                0.078125 = fieldNorm(doc=825)
          0.05109898 = weight(abstract_txt:format in 825) [ClassicSimilarity], result of:
            0.05109898 = score(doc=825,freq=1.0), product of:
              0.12834482 = queryWeight, product of:
                1.4041324 = boost
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.017936034 = queryNorm
              0.39813823 = fieldWeight in 825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.078125 = fieldNorm(doc=825)
          0.48084003 = weight(abstract_txt:classmarks in 825) [ClassicSimilarity], result of:
            0.48084003 = score(doc=825,freq=1.0), product of:
              0.6548387 = queryWeight, product of:
                3.884469 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.017936034 = queryNorm
              0.73428774 = fieldWeight in 825, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.078125 = fieldNorm(doc=825)
        0.2 = coord(5/25)
    
  3. Piros, A.: Automatic interpretation of complex UDC numbers : towards support for library systems (2015) 0.12
    0.11645492 = sum of:
      0.11645492 = product of:
        0.41591042 = sum of:
          0.05168281 = weight(abstract_txt:synthetic in 3301) [ClassicSimilarity], result of:
            0.05168281 = score(doc=3301,freq=1.0), product of:
              0.13019438 = queryWeight, product of:
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.017936034 = queryNorm
              0.39696652 = fieldWeight in 3301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3301)
          0.07304676 = weight(abstract_txt:analytico in 3301) [ClassicSimilarity], result of:
            0.07304676 = score(doc=3301,freq=1.0), product of:
              0.16396914 = queryWeight, product of:
                1.1222379 = boost
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.017936034 = queryNorm
              0.4454909 = fieldWeight in 3301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3301)
          0.03163499 = weight(abstract_txt:software in 3301) [ClassicSimilarity], result of:
            0.03163499 = score(doc=3301,freq=2.0), product of:
              0.093858436 = queryWeight, product of:
                1.2007581 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.017936034 = queryNorm
              0.33705005 = fieldWeight in 3301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3301)
          0.050107263 = weight(abstract_txt:complex in 3301) [ClassicSimilarity], result of:
            0.050107263 = score(doc=3301,freq=2.0), product of:
              0.12753478 = queryWeight, product of:
                1.3996943 = boost
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.017936034 = queryNorm
              0.392891 = fieldWeight in 3301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3301)
          0.035769287 = weight(abstract_txt:format in 3301) [ClassicSimilarity], result of:
            0.035769287 = score(doc=3301,freq=1.0), product of:
              0.12834482 = queryWeight, product of:
                1.4041324 = boost
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.017936034 = queryNorm
              0.27869678 = fieldWeight in 3301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3301)
          0.05685884 = weight(abstract_txt:automatic in 3301) [ClassicSimilarity], result of:
            0.05685884 = score(doc=3301,freq=1.0), product of:
              0.20010929 = queryWeight, product of:
                2.1473267 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.017936034 = queryNorm
              0.28413895 = fieldWeight in 3301, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3301)
          0.116810486 = weight(abstract_txt:numbers in 3301) [ClassicSimilarity], result of:
            0.116810486 = score(doc=3301,freq=2.0), product of:
              0.25667277 = queryWeight, product of:
                2.4319487 = boost
                5.8843565 = idf(docFreq=335, maxDocs=44421)
                0.017936034 = queryNorm
              0.45509496 = fieldWeight in 3301, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8843565 = idf(docFreq=335, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3301)
        0.28 = coord(7/25)
    
  4. Zeng, M.L.; Fan, W.; Lin, X.: SKOS for an integrated vocabulary structure (2008) 0.10
    0.10037148 = sum of:
      0.10037148 = product of:
        0.35846958 = sum of:
          0.05168281 = weight(abstract_txt:synthetic in 3654) [ClassicSimilarity], result of:
            0.05168281 = score(doc=3654,freq=1.0), product of:
              0.13019438 = queryWeight, product of:
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.017936034 = queryNorm
              0.39696652 = fieldWeight in 3654, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3654)
          0.083041824 = weight(abstract_txt:coordinated in 3654) [ClassicSimilarity], result of:
            0.083041824 = score(doc=3654,freq=2.0), product of:
              0.1417586 = queryWeight, product of:
                1.0434667 = boost
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.017936034 = queryNorm
              0.5857974 = fieldWeight in 3654, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.574333 = idf(docFreq=61, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3654)
          0.024717396 = weight(abstract_txt:provide in 3654) [ClassicSimilarity], result of:
            0.024717396 = score(doc=3654,freq=2.0), product of:
              0.07962143 = queryWeight, product of:
                1.1059458 = boost
                4.013929 = idf(docFreq=2180, maxDocs=44421)
                0.017936034 = queryNorm
              0.3104365 = fieldWeight in 3654, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.013929 = idf(docFreq=2180, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3654)
          0.03163499 = weight(abstract_txt:structure in 3654) [ClassicSimilarity], result of:
            0.03163499 = score(doc=3654,freq=2.0), product of:
              0.093858436 = queryWeight, product of:
                1.2007581 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.017936034 = queryNorm
              0.33705005 = fieldWeight in 3654, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3654)
          0.09619212 = weight(abstract_txt:processable in 3654) [ClassicSimilarity], result of:
            0.09619212 = score(doc=3654,freq=1.0), product of:
              0.19699468 = queryWeight, product of:
                1.2300737 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.017936034 = queryNorm
              0.48829806 = fieldWeight in 3654, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3654)
          0.035431188 = weight(abstract_txt:complex in 3654) [ClassicSimilarity], result of:
            0.035431188 = score(doc=3654,freq=1.0), product of:
              0.12753478 = queryWeight, product of:
                1.3996943 = boost
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.017936034 = queryNorm
              0.27781588 = fieldWeight in 3654, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3654)
          0.035769287 = weight(abstract_txt:format in 3654) [ClassicSimilarity], result of:
            0.035769287 = score(doc=3654,freq=1.0), product of:
              0.12834482 = queryWeight, product of:
                1.4041324 = boost
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.017936034 = queryNorm
              0.27869678 = fieldWeight in 3654, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.0546875 = fieldNorm(doc=3654)
        0.28 = coord(7/25)
    
  5. Slavic, A.; Davies, S.: Facet analysis in UDC : questions of structure, functionality and data formality (2017) 0.10
    0.09958715 = sum of:
      0.09958715 = product of:
        0.6224197 = sum of:
          0.083532035 = weight(abstract_txt:synthetic in 4848) [ClassicSimilarity], result of:
            0.083532035 = score(doc=4848,freq=2.0), product of:
              0.13019438 = queryWeight, product of:
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.017936034 = queryNorm
              0.64159477 = fieldWeight in 4848, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.2588162 = idf(docFreq=84, maxDocs=44421)
                0.0625 = fieldNorm(doc=4848)
          0.11806139 = weight(abstract_txt:analytico in 4848) [ClassicSimilarity], result of:
            0.11806139 = score(doc=4848,freq=2.0), product of:
              0.16396914 = queryWeight, product of:
                1.1222379 = boost
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.017936034 = queryNorm
              0.720022 = fieldWeight in 4848, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.0625 = fieldNorm(doc=4848)
          0.036154274 = weight(abstract_txt:structure in 4848) [ClassicSimilarity], result of:
            0.036154274 = score(doc=4848,freq=2.0), product of:
              0.093858436 = queryWeight, product of:
                1.2007581 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.017936034 = queryNorm
              0.38520005 = fieldWeight in 4848, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.0625 = fieldNorm(doc=4848)
          0.38467202 = weight(abstract_txt:classmarks in 4848) [ClassicSimilarity], result of:
            0.38467202 = score(doc=4848,freq=1.0), product of:
              0.6548387 = queryWeight, product of:
                3.884469 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.017936034 = queryNorm
              0.5874302 = fieldWeight in 4848, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.0625 = fieldNorm(doc=4848)
        0.16 = coord(4/25)