Document (#39302)

Author
Piros, A.
Title
Automatic interpretation of complex UDC numbers : towards support for library systems
Source
Classification and authority control: expanding resource discovery: proceedings of the International UDC Seminar 2015, 29-30 October 2015, Lisbon, Portugal. Eds.: Slavic, A. and M.I. Cordeiro
Imprint
Würzburg : Ergon-Verlag
Year
2015
Pages
pp. 177-194
Abstract
Analytico-synthetic and faceted classifications, such as the Universal Decimal Classification (UDC), express the content of documents with complex, pre-combined classification codes. Without classification authority control that would help manage and access structured notations, the use of UDC codes in searching and browsing is limited. Existing UDC parsing solutions are usually created for a particular database system or a specific task and are not widely applicable. The approach described in this paper provides a solution in which the analysis and interpretation of UDC notations are stored in an intermediate format (in this case, XML) by automatic means without any loss of data or information. Due to its richness, the output file can be converted into different formats, such as standard mark-up and data exchange formats or simple lists of the recommended entry points of a UDC number. The program can also be used to create authority records containing complex UDC numbers, which can be comprehensively analysed in order to be retrieved effectively. The Java program, as well as the corresponding schema definition it employs, is under continuous development. The current version of the interpreter software is available online for testing purposes at http://interpreter-eto.rhcloud.com. The future plan is to implement conversion methods for standard formats and to create standard online interfaces in order to make it possible to use the features of the software as a service. This would allow the algorithm to be employed in both existing and future library systems to analyse UDC numbers without any significant programming effort.
Content
Presentation at: http://www.udcds.com/seminar/2015/media/slides/Piros_InternationalUDCSeminar2015.pdf.
Theme
Automatic classification
Object
UDC
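
The abstract describes storing the analysis of a complex UDC number in an intermediate XML format. The following is only a minimal sketch of that idea, not the author's interpreter and not its schema: it splits a notation at the top-level UDC connecting symbols (+, /, :, ::) and writes the pieces as XML. The class name UdcSplitSketch, the element names udc and component, the attribute connector, and the sample notation are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch: NOT the interpreter described in the abstract.
class UdcSplitSketch {

    /**
     * Splits a notation at top-level '+', '/', ':' and '::'.
     * Text inside (), [] and "" (auxiliaries, subgroups) is left intact.
     * Each entry is {connector preceding the component, component}.
     */
    static List<String[]> split(String notation) {
        List<String[]> parts = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        String connector = "";
        int depth = 0;             // nesting depth of () and []
        boolean inQuotes = false;  // inside a "..." time auxiliary
        for (int i = 0; i < notation.length(); i++) {
            char c = notation.charAt(i);
            if (c == '"') inQuotes = !inQuotes;
            else if (!inQuotes && (c == '(' || c == '[')) depth++;
            else if (!inQuotes && (c == ')' || c == ']')) depth--;
            boolean topLevel = depth == 0 && !inQuotes;
            if (topLevel && (c == '+' || c == '/' || c == ':')) {
                parts.add(new String[] {connector, current.toString()});
                current.setLength(0);
                connector = String.valueOf(c);
                // "::" (order-fixing) is one connector, not two colons
                if (c == ':' && i + 1 < notation.length() && notation.charAt(i + 1) == ':') {
                    connector = "::";
                    i++;
                }
            } else {
                current.append(c);
            }
        }
        parts.add(new String[] {connector, current.toString()});
        return parts;
    }

    /** Serialises the split result using made-up element and attribute names. */
    static String toXml(String notation) {
        StringBuilder xml = new StringBuilder();
        xml.append("<udc notation=\"").append(escape(notation)).append("\">\n");
        for (String[] p : split(notation)) {
            xml.append("  <component");
            if (!p[0].isEmpty()) {
                xml.append(" connector=\"").append(escape(p[0])).append("\"");
            }
            xml.append(">").append(escape(p[1])).append("</component>\n");
        }
        xml.append("</udc>");
        return xml.toString();
    }

    /** Escapes the characters that are special in XML text and attributes. */
    static String escape(String s) {
        return s.replace("&", "&amp;").replace("<", "&lt;")
                .replace(">", "&gt;").replace("\"", "&quot;");
    }

    public static void main(String[] args) {
        // Made-up sample notation, used only to exercise the splitter.
        System.out.println(toXml("821.111:94(410)\"19\"+7.03"));
    }
}
```

Keeping the connector with each component means the original string can be rebuilt from the XML, which is one simple way to meet the abstract's requirement of no information loss. A real interpreter would also have to resolve auxiliaries and validate each component against the UDC schedules.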

Similar documents (content)

  1. Frâncu, V.; Sabo, C.-N.: Implementation of a UDC-based multilingual thesaurus in a library catalogue : the case of BiblioPhil (2010) 0.22
    0.22052819 = sum of:
      0.22052819 = product of:
        0.6891506 = sum of:
          0.0352656 = weight(abstract_txt:order in 684) [ClassicSimilarity], result of:
            0.0352656 = score(doc=684,freq=1.0), product of:
              0.10146765 = queryWeight, product of:
                1.193807 = boost
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.019105563 = queryNorm
              0.3475551 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.448705 = idf(docFreq=1411, maxDocs=44421)
                0.078125 = fieldNorm(doc=684)
          0.040083427 = weight(abstract_txt:existing in 684) [ClassicSimilarity], result of:
            0.040083427 = score(doc=684,freq=1.0), product of:
              0.11051044 = queryWeight, product of:
                1.2458678 = boost
                4.6427093 = idf(docFreq=1162, maxDocs=44421)
                0.019105563 = queryNorm
              0.36271167 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6427093 = idf(docFreq=1162, maxDocs=44421)
                0.078125 = fieldNorm(doc=684)
          0.060355183 = weight(abstract_txt:authority in 684) [ClassicSimilarity], result of:
            0.060355183 = score(doc=684,freq=1.0), product of:
              0.14517878 = queryWeight, product of:
                1.4279792 = boost
                5.321345 = idf(docFreq=589, maxDocs=44421)
                0.019105563 = queryNorm
              0.41573006 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.321345 = idf(docFreq=589, maxDocs=44421)
                0.078125 = fieldNorm(doc=684)
          0.066210344 = weight(abstract_txt:classification in 684) [ClassicSimilarity], result of:
            0.066210344 = score(doc=684,freq=3.0), product of:
              0.12256525 = queryWeight, product of:
                1.6069399 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.019105563 = queryNorm
              0.5402049 = fieldWeight in 684, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.078125 = fieldNorm(doc=684)
          0.06353475 = weight(abstract_txt:standard in 684) [ClassicSimilarity], result of:
            0.06353475 = score(doc=684,freq=1.0), product of:
              0.17197478 = queryWeight, product of:
                1.90348 = boost
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.019105563 = queryNorm
              0.36944228 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.7288613 = idf(docFreq=1066, maxDocs=44421)
                0.078125 = fieldNorm(doc=684)
          0.195453 = weight(abstract_txt:notations in 684) [ClassicSimilarity], result of:
            0.195453 = score(doc=684,freq=1.0), product of:
              0.3177764 = queryWeight, product of:
                2.1126676 = boost
                7.872826 = idf(docFreq=45, maxDocs=44421)
                0.019105563 = queryNorm
              0.61506456 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.872826 = idf(docFreq=45, maxDocs=44421)
                0.078125 = fieldNorm(doc=684)
          0.10583223 = weight(abstract_txt:formats in 684) [ClassicSimilarity], result of:
            0.10583223 = score(doc=684,freq=1.0), product of:
              0.2416587 = queryWeight, product of:
                2.2564056 = boost
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.019105563 = queryNorm
              0.4379409 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.6056433 = idf(docFreq=443, maxDocs=44421)
                0.078125 = fieldNorm(doc=684)
          0.1224161 = weight(abstract_txt:numbers in 684) [ClassicSimilarity], result of:
            0.1224161 = score(doc=684,freq=1.0), product of:
              0.26628673 = queryWeight, product of:
                2.3685944 = boost
                5.8843565 = idf(docFreq=335, maxDocs=44421)
                0.019105563 = queryNorm
              0.45971537 = fieldWeight in 684, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8843565 = idf(docFreq=335, maxDocs=44421)
                0.078125 = fieldNorm(doc=684)
        0.32 = coord(8/25)
    
  2. Piros, A.: Az ETO-jelzetek automatikus interpretálásának és elemzésének kérdései (2018) 0.17
    0.16851006 = sum of:
      0.16851006 = product of:
        0.60182166 = sum of:
          0.074585274 = weight(abstract_txt:richness in 1856) [ClassicSimilarity], result of:
            0.074585274 = score(doc=1856,freq=1.0), product of:
              0.15397805 = queryWeight, product of:
                1.0398836 = boost
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.019105563 = queryNorm
              0.484389 = fieldWeight in 1856, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.750224 = idf(docFreq=51, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
          0.026522495 = weight(abstract_txt:software in 1856) [ClassicSimilarity], result of:
            0.026522495 = score(doc=1856,freq=1.0), product of:
              0.097374 = queryWeight, product of:
                1.1694773 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.019105563 = queryNorm
              0.27237758 = fieldWeight in 1856, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
          0.045349214 = weight(abstract_txt:existing in 1856) [ClassicSimilarity], result of:
            0.045349214 = score(doc=1856,freq=2.0), product of:
              0.11051044 = queryWeight, product of:
                1.2458678 = boost
                4.6427093 = idf(docFreq=1162, maxDocs=44421)
                0.019105563 = queryNorm
              0.41036138 = fieldWeight in 1856, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.6427093 = idf(docFreq=1162, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
          0.055871096 = weight(abstract_txt:would in 1856) [ClassicSimilarity], result of:
            0.055871096 = score(doc=1856,freq=1.0), product of:
              0.18317041 = queryWeight, product of:
                1.9644619 = boost
                4.88036 = idf(docFreq=916, maxDocs=44421)
                0.019105563 = queryNorm
              0.3050225 = fieldWeight in 1856, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.88036 = idf(docFreq=916, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
          0.06301423 = weight(abstract_txt:complex in 1856) [ClassicSimilarity], result of:
            0.06301423 = score(doc=1856,freq=1.0), product of:
              0.1984676 = queryWeight, product of:
                2.0448468 = boost
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.019105563 = queryNorm
              0.31750387 = fieldWeight in 1856, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
          0.0965937 = weight(abstract_txt:without in 1856) [ClassicSimilarity], result of:
            0.0965937 = score(doc=1856,freq=2.0), product of:
              0.20942076 = queryWeight, product of:
                2.1005151 = boost
                5.2183604 = idf(docFreq=653, maxDocs=44421)
                0.019105563 = queryNorm
              0.46124226 = fieldWeight in 1856, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.2183604 = idf(docFreq=653, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
          0.2398856 = weight(abstract_txt:numbers in 1856) [ClassicSimilarity], result of:
            0.2398856 = score(doc=1856,freq=6.0), product of:
              0.26628673 = queryWeight, product of:
                2.3685944 = boost
                5.8843565 = idf(docFreq=335, maxDocs=44421)
                0.019105563 = queryNorm
              0.90085447 = fieldWeight in 1856, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.8843565 = idf(docFreq=335, maxDocs=44421)
                0.0625 = fieldNorm(doc=1856)
        0.28 = coord(7/25)
    
  3. Piros, A.: The thought behind the symbol : about the automatic interpretation and representation of UDC numbers (2017) 0.13
    0.13166024 = sum of:
      0.13166024 = product of:
        0.47021514 = sum of:
          0.08660892 = weight(abstract_txt:analytico in 4853) [ClassicSimilarity], result of:
            0.08660892 = score(doc=4853,freq=1.0), product of:
              0.17011078 = queryWeight, product of:
                1.0930027 = boost
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.019105563 = queryNorm
              0.50913244 = fieldWeight in 4853, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.146119 = idf(docFreq=34, maxDocs=44421)
                0.0625 = fieldNorm(doc=4853)
          0.024343017 = weight(abstract_txt:future in 4853) [ClassicSimilarity], result of:
            0.024343017 = score(doc=4853,freq=1.0), product of:
              0.091963686 = queryWeight, product of:
                1.1365237 = boost
                4.23524 = idf(docFreq=1747, maxDocs=44421)
                0.019105563 = queryNorm
              0.2647025 = fieldWeight in 4853, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.23524 = idf(docFreq=1747, maxDocs=44421)
                0.0625 = fieldNorm(doc=4853)
          0.03750847 = weight(abstract_txt:software in 4853) [ClassicSimilarity], result of:
            0.03750847 = score(doc=4853,freq=2.0), product of:
              0.097374 = queryWeight, product of:
                1.1694773 = boost
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.019105563 = queryNorm
              0.38520005 = fieldWeight in 4853, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.3580413 = idf(docFreq=1545, maxDocs=44421)
                0.0625 = fieldNorm(doc=4853)
          0.06355993 = weight(abstract_txt:automatic in 4853) [ClassicSimilarity], result of:
            0.06355993 = score(doc=4853,freq=2.0), product of:
              0.13840306 = queryWeight, product of:
                1.394258 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.019105563 = queryNorm
              0.45923787 = fieldWeight in 4853, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0625 = fieldNorm(doc=4853)
          0.03058125 = weight(abstract_txt:classification in 4853) [ClassicSimilarity], result of:
            0.03058125 = score(doc=4853,freq=1.0), product of:
              0.12256525 = queryWeight, product of:
                1.6069399 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.019105563 = queryNorm
              0.24950996 = fieldWeight in 4853, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0625 = fieldNorm(doc=4853)
          0.089115575 = weight(abstract_txt:complex in 4853) [ClassicSimilarity], result of:
            0.089115575 = score(doc=4853,freq=2.0), product of:
              0.1984676 = queryWeight, product of:
                2.0448468 = boost
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.019105563 = queryNorm
              0.44901827 = fieldWeight in 4853, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.0625 = fieldNorm(doc=4853)
          0.138498 = weight(abstract_txt:numbers in 4853) [ClassicSimilarity], result of:
            0.138498 = score(doc=4853,freq=2.0), product of:
              0.26628673 = queryWeight, product of:
                2.3685944 = boost
                5.8843565 = idf(docFreq=335, maxDocs=44421)
                0.019105563 = queryNorm
              0.5201085 = fieldWeight in 4853, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.8843565 = idf(docFreq=335, maxDocs=44421)
                0.0625 = fieldNorm(doc=4853)
        0.28 = coord(7/25)
    
  4. Classification Research Group: The need for a faceted classification as the basis of all methods of information retrieval (1985) 0.11
    0.10952391 = sum of:
      0.10952391 = product of:
        0.3422622 = sum of:
          0.015214386 = weight(abstract_txt:future in 4640) [ClassicSimilarity], result of:
            0.015214386 = score(doc=4640,freq=1.0), product of:
              0.091963686 = queryWeight, product of:
                1.1365237 = boost
                4.23524 = idf(docFreq=1747, maxDocs=44421)
                0.019105563 = queryNorm
              0.16543907 = fieldWeight in 4640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.23524 = idf(docFreq=1747, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4640)
          0.040083427 = weight(abstract_txt:existing in 4640) [ClassicSimilarity], result of:
            0.040083427 = score(doc=4640,freq=4.0), product of:
              0.11051044 = queryWeight, product of:
                1.2458678 = boost
                4.6427093 = idf(docFreq=1162, maxDocs=44421)
                0.019105563 = queryNorm
              0.36271167 = fieldWeight in 4640, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.6427093 = idf(docFreq=1162, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4640)
          0.028089784 = weight(abstract_txt:automatic in 4640) [ClassicSimilarity], result of:
            0.028089784 = score(doc=4640,freq=1.0), product of:
              0.13840306 = queryWeight, product of:
                1.394258 = boost
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.019105563 = queryNorm
              0.20295638 = fieldWeight in 4640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1956835 = idf(docFreq=668, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4640)
          0.066210344 = weight(abstract_txt:classification in 4640) [ClassicSimilarity], result of:
            0.066210344 = score(doc=4640,freq=12.0), product of:
              0.12256525 = queryWeight, product of:
                1.6069399 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.019105563 = queryNorm
              0.5402049 = fieldWeight in 4640, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4640)
          0.04938354 = weight(abstract_txt:would in 4640) [ClassicSimilarity], result of:
            0.04938354 = score(doc=4640,freq=2.0), product of:
              0.18317041 = queryWeight, product of:
                1.9644619 = boost
                4.88036 = idf(docFreq=916, maxDocs=44421)
                0.019105563 = queryNorm
              0.26960436 = fieldWeight in 4640, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.88036 = idf(docFreq=916, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4640)
          0.039383896 = weight(abstract_txt:complex in 4640) [ClassicSimilarity], result of:
            0.039383896 = score(doc=4640,freq=1.0), product of:
              0.1984676 = queryWeight, product of:
                2.0448468 = boost
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.019105563 = queryNorm
              0.19843993 = fieldWeight in 4640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4640)
          0.04268879 = weight(abstract_txt:without in 4640) [ClassicSimilarity], result of:
            0.04268879 = score(doc=4640,freq=1.0), product of:
              0.20942076 = queryWeight, product of:
                2.1005151 = boost
                5.2183604 = idf(docFreq=653, maxDocs=44421)
                0.019105563 = queryNorm
              0.20384221 = fieldWeight in 4640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2183604 = idf(docFreq=653, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4640)
          0.06120805 = weight(abstract_txt:numbers in 4640) [ClassicSimilarity], result of:
            0.06120805 = score(doc=4640,freq=1.0), product of:
              0.26628673 = queryWeight, product of:
                2.3685944 = boost
                5.8843565 = idf(docFreq=335, maxDocs=44421)
                0.019105563 = queryNorm
              0.22985768 = fieldWeight in 4640, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.8843565 = idf(docFreq=335, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4640)
        0.32 = coord(8/25)
    
  5. Riesthuis, G.J.A.: Decomposition of UDC-numbers and the text of the UDC Master Reference File (1998) 0.11
    0.10713493 = sum of:
      0.10713493 = product of:
        0.66959333 = sum of:
          0.045871876 = weight(abstract_txt:classification in 1399) [ClassicSimilarity], result of:
            0.045871876 = score(doc=1399,freq=1.0), product of:
              0.12256525 = queryWeight, product of:
                1.6069399 = boost
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.019105563 = queryNorm
              0.37426496 = fieldWeight in 1399, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9921594 = idf(docFreq=2228, maxDocs=44421)
                0.09375 = fieldNorm(doc=1399)
          0.08380665 = weight(abstract_txt:would in 1399) [ClassicSimilarity], result of:
            0.08380665 = score(doc=1399,freq=1.0), product of:
              0.18317041 = queryWeight, product of:
                1.9644619 = boost
                4.88036 = idf(docFreq=916, maxDocs=44421)
                0.019105563 = queryNorm
              0.45753378 = fieldWeight in 1399, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.88036 = idf(docFreq=916, maxDocs=44421)
                0.09375 = fieldNorm(doc=1399)
          0.13367337 = weight(abstract_txt:complex in 1399) [ClassicSimilarity], result of:
            0.13367337 = score(doc=1399,freq=2.0), product of:
              0.1984676 = queryWeight, product of:
                2.0448468 = boost
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.019105563 = queryNorm
              0.6735274 = fieldWeight in 1399, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.080062 = idf(docFreq=750, maxDocs=44421)
                0.09375 = fieldNorm(doc=1399)
          0.40624142 = weight(abstract_txt:notations in 1399) [ClassicSimilarity], result of:
            0.40624142 = score(doc=1399,freq=3.0), product of:
              0.3177764 = queryWeight, product of:
                2.1126676 = boost
                7.872826 = idf(docFreq=45, maxDocs=44421)
                0.019105563 = queryNorm
              1.2783875 = fieldWeight in 1399, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.872826 = idf(docFreq=45, maxDocs=44421)
                0.09375 = fieldNorm(doc=1399)
        0.16 = coord(4/25)
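
The score breakdowns above follow a TF-IDF pattern: each term score is queryWeight (boost × idf × queryNorm) times fieldWeight (tf × idf × fieldNorm), the term scores are summed, and the sum is multiplied by the coord factor. The sketch below recomputes two of the listed figures as a plausibility check. It is not Lucene code: the class and method names are hypothetical, and the formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are assumptions chosen because they match the numbers in the listing.

```java
// Hypothetical helper for re-deriving the figures in the listing; not part of Lucene.
class ClassicScoreSketch {

    /** Term-frequency factor: square root of the raw frequency (e.g. 3.0 gives 1.7320508). */
    static double tf(double freq) {
        return Math.sqrt(freq);
    }

    /** Inverse document frequency, assumed form: 1 + ln(maxDocs / (docFreq + 1)). */
    static double idf(long docFreq, long maxDocs) {
        return 1.0 + Math.log((double) maxDocs / (docFreq + 1));
    }

    /** One "weight(...)" node of the listing: queryWeight * fieldWeight. */
    static double termScore(double freq, long docFreq, long maxDocs,
                            double boost, double queryNorm, double fieldNorm) {
        double idf = idf(docFreq, maxDocs);
        double queryWeight = boost * idf * queryNorm;     // "queryWeight, product of"
        double fieldWeight = tf(freq) * idf * fieldNorm;  // "fieldWeight in ..., product of"
        return queryWeight * fieldWeight;
    }

    /** Coordination factor: fraction of query terms that matched the document. */
    static double coord(int matched, int total) {
        return (double) matched / total;
    }

    public static void main(String[] args) {
        // Inputs copied from the first term ("order") of similar document 1 (doc=684).
        double order = termScore(1.0, 1411, 44421, 1.193807, 0.019105563, 0.078125);
        System.out.println("term score for 'order': " + order);

        // Final score of document 1: listed sum of term scores times coord(8/25).
        System.out.println("document score: " + 0.6891506 * coord(8, 25));
    }
}
```

The listing shows single-precision values, so results computed in double precision can differ from it in the last printed digits.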