Document (#36706)

Author
Assem, M. van
Rijgersberg, H.
Wigham, M.
Top, J.
Title
Converting and annotating quantitative data tables
Source
The Semantic Web - ISWC 2010. 9th International Semantic Web Conference, ISWC 2010, Shanghai, China, November 7-11, 2010, Revised Selected Papers, Part I. Eds.: Peter F. Patel-Schneider et al
Imprint
Berlin : Springer
Year
2010
Pages
S.16-31
Series
Lecture notes in computer science; 6496
Abstract
Companies, governmental agencies and scientists produce a large amount of quantitative (research) data, consisting of measurements ranging from e.g. the surface temperatures of an ocean to the viscosity of a sample of mayonnaise. Such measurements are stored in tables in e.g. spreadsheet files and research reports. To integrate and reuse such data, it is necessary to have a semantic description of the data. However, the notation used is often ambiguous, making automatic interpretation and conversion to RDF or other suitable format diffiult. For example, the table header cell "f(Hz)" refers to frequency measured in Hertz, but the symbol "f" can also refer to the unit farad or the quantities force or luminous flux. Current annotation tools for this task either work on less ambiguous data or perform a more limited task. We introduce new disambiguation strategies based on an ontology, which allows to improve performance on "sloppy" datasets not yet targeted by existing systems.
Content
Vgl. unter: http://www.cs.vu.nl/~mark/papers/Assem10a.pdf.
Theme
Wissensrepräsentation
Object
OWL
RDF

Similar documents (author)

  1. Assem, M. van: Converting and integrating vocabularies for the Semantic Web (2010) 4.61
    4.6082807 = sum of:
      4.6082807 = weight(author_txt:assem in 639) [ClassicSimilarity], result of:
        4.6082807 = fieldWeight in 639, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.5 = fieldNorm(doc=639)
    
  2. Hollink, L.; Assem, M. van: Estimating the relevance of search results in the Culture-Web : a study of semantic distance measures (2010) 4.03
    4.0322456 = sum of:
      4.0322456 = weight(author_txt:assem in 649) [ClassicSimilarity], result of:
        4.0322456 = fieldWeight in 649, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.4375 = fieldNorm(doc=649)
    
  3. Assem, M. van; Gangemi, A.; Schreiber, G.: Conversion of WordNet to a standard RDF/OWL representation (2006) 3.46
    3.4562106 = sum of:
      3.4562106 = weight(author_txt:assem in 641) [ClassicSimilarity], result of:
        3.4562106 = fieldWeight in 641, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.375 = fieldNorm(doc=641)
    
  4. Wielinga, B.; Wielemaker, J.; Schreiber, G.; Assem, M. van: Methods for porting resources to the Semantic Web (2004) 2.88
    2.8801754 = sum of:
      2.8801754 = weight(author_txt:assem in 640) [ClassicSimilarity], result of:
        2.8801754 = fieldWeight in 640, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.3125 = fieldNorm(doc=640)
    
  5. Assem, M. van; Malaisé, V.; Miles, A.; Schreiber, G.: ¬A method to convert thesauri to SKOS (2006) 2.88
    2.8801754 = sum of:
      2.8801754 = weight(author_txt:assem in 642) [ClassicSimilarity], result of:
        2.8801754 = fieldWeight in 642, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          9.216561 = idf(docFreq=11, maxDocs=44421)
          0.3125 = fieldNorm(doc=642)
    

Similar documents (content)

  1. Whitlatch, J.B.: Reference services : research methodologies for assessment and accountability (1992) 0.07
    0.069194406 = sum of:
      0.069194406 = product of:
        0.57662004 = sum of:
          0.14587569 = weight(abstract_txt:quantitative in 4539) [ClassicSimilarity], result of:
            0.14587569 = score(doc=4539,freq=1.0), product of:
              0.19606702 = queryWeight, product of:
                1.7379748 = boost
                5.9520745 = idf(docFreq=313, maxDocs=44421)
                0.018953646 = queryNorm
              0.7440093 = fieldWeight in 4539, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9520745 = idf(docFreq=313, maxDocs=44421)
                0.125 = fieldNorm(doc=4539)
          0.34040982 = weight(abstract_txt:measurements in 4539) [ClassicSimilarity], result of:
            0.34040982 = score(doc=4539,freq=1.0), product of:
              0.34494564 = queryWeight, product of:
                2.3052418 = boost
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.018953646 = queryNorm
              0.9868506 = fieldWeight in 4539, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.894805 = idf(docFreq=44, maxDocs=44421)
                0.125 = fieldNorm(doc=4539)
          0.090334535 = weight(abstract_txt:data in 4539) [ClassicSimilarity], result of:
            0.090334535 = score(doc=4539,freq=2.0), product of:
              0.15344585 = queryWeight, product of:
                2.4310212 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.018953646 = queryNorm
              0.58870625 = fieldWeight in 4539, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.125 = fieldNorm(doc=4539)
        0.12 = coord(3/25)
    
  2. Grassi, M.; Morbidoni, C.; Nucci, M.; Fonda, S.; Ledda, G.: Pundit: semantically structured annotations for Web contents and digital libraries (2012) 0.06
    0.059514444 = sum of:
      0.059514444 = product of:
        0.3719653 = sum of:
          0.06946924 = weight(abstract_txt:ranging in 1473) [ClassicSimilarity], result of:
            0.06946924 = score(doc=1473,freq=1.0), product of:
              0.1298218 = queryWeight, product of:
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.018953646 = queryNorm
              0.53511226 = fieldWeight in 1473, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.078125 = fieldNorm(doc=1473)
          0.074768044 = weight(abstract_txt:annotation in 1473) [ClassicSimilarity], result of:
            0.074768044 = score(doc=1473,freq=1.0), product of:
              0.1363421 = queryWeight, product of:
                1.0248048 = boost
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.018953646 = queryNorm
              0.5483856 = fieldWeight in 1473, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.078125 = fieldNorm(doc=1473)
          0.14788282 = weight(abstract_txt:annotating in 1473) [ClassicSimilarity], result of:
            0.14788282 = score(doc=1473,freq=1.0), product of:
              0.2148314 = queryWeight, product of:
                1.2863971 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.018953646 = queryNorm
              0.6883669 = fieldWeight in 1473, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.078125 = fieldNorm(doc=1473)
          0.0798452 = weight(abstract_txt:data in 1473) [ClassicSimilarity], result of:
            0.0798452 = score(doc=1473,freq=4.0), product of:
              0.15344585 = queryWeight, product of:
                2.4310212 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.018953646 = queryNorm
              0.5203477 = fieldWeight in 1473, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=1473)
        0.16 = coord(4/25)
    
  3. Billal, B.; Fonseca, A.; Sadat, F.; Lounis, H.: Semi-supervised learning and social media text analysis towards multi-labeling categorization (2017) 0.06
    0.057765115 = sum of:
      0.057765115 = product of:
        0.28882557 = sum of:
          0.05233763 = weight(abstract_txt:annotation in 95) [ClassicSimilarity], result of:
            0.05233763 = score(doc=95,freq=1.0), product of:
              0.1363421 = queryWeight, product of:
                1.0248048 = boost
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.018953646 = queryNorm
              0.38386995 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.05964813 = weight(abstract_txt:disambiguation in 95) [ClassicSimilarity], result of:
            0.05964813 = score(doc=95,freq=1.0), product of:
              0.14875965 = queryWeight, product of:
                1.0704558 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.018953646 = queryNorm
              0.40096983 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.07265333 = weight(abstract_txt:quantities in 95) [ClassicSimilarity], result of:
            0.07265333 = score(doc=95,freq=1.0), product of:
              0.16966447 = queryWeight, product of:
                1.1431985 = boost
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.018953646 = queryNorm
              0.4282177 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.8302665 = idf(docFreq=47, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.035733473 = weight(abstract_txt:task in 95) [ClassicSimilarity], result of:
            0.035733473 = score(doc=95,freq=1.0), product of:
              0.13319279 = queryWeight, product of:
                1.4324569 = boost
                4.9057617 = idf(docFreq=893, maxDocs=44421)
                0.018953646 = queryNorm
              0.26828384 = fieldWeight in 95, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.9057617 = idf(docFreq=893, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
          0.06845301 = weight(abstract_txt:data in 95) [ClassicSimilarity], result of:
            0.06845301 = score(doc=95,freq=6.0), product of:
              0.15344585 = queryWeight, product of:
                2.4310212 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.018953646 = queryNorm
              0.44610527 = fieldWeight in 95, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0546875 = fieldNorm(doc=95)
        0.2 = coord(5/25)
    
  4. Stathopoulos, Y.; Baker, S.; Rei, M.; Teufel, S.: Variable typing : assigning meaning to variables in mathematical text (2018) 0.06
    0.055570442 = sum of:
      0.055570442 = product of:
        0.34731528 = sum of:
          0.08521162 = weight(abstract_txt:disambiguation in 432) [ClassicSimilarity], result of:
            0.08521162 = score(doc=432,freq=1.0), product of:
              0.14875965 = queryWeight, product of:
                1.0704558 = boost
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.018953646 = queryNorm
              0.57281405 = fieldWeight in 432, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.33202 = idf(docFreq=78, maxDocs=44421)
                0.078125 = fieldNorm(doc=432)
          0.120763175 = weight(abstract_txt:symbol in 432) [ClassicSimilarity], result of:
            0.120763175 = score(doc=432,freq=1.0), product of:
              0.1876905 = queryWeight, product of:
                1.2023954 = boost
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.018953646 = queryNorm
              0.6434166 = fieldWeight in 432, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.235732 = idf(docFreq=31, maxDocs=44421)
                0.078125 = fieldNorm(doc=432)
          0.07219252 = weight(abstract_txt:task in 432) [ClassicSimilarity], result of:
            0.07219252 = score(doc=432,freq=2.0), product of:
              0.13319279 = queryWeight, product of:
                1.4324569 = boost
                4.9057617 = idf(docFreq=893, maxDocs=44421)
                0.018953646 = queryNorm
              0.5420152 = fieldWeight in 432, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.9057617 = idf(docFreq=893, maxDocs=44421)
                0.078125 = fieldNorm(doc=432)
          0.069147974 = weight(abstract_txt:data in 432) [ClassicSimilarity], result of:
            0.069147974 = score(doc=432,freq=3.0), product of:
              0.15344585 = queryWeight, product of:
                2.4310212 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.018953646 = queryNorm
              0.45063436 = fieldWeight in 432, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=432)
        0.16 = coord(4/25)
    
  5. Kozak, M.; Hartley, J.: Presenting numerical values within sentences and text tables (2012) 0.06
    0.055486243 = sum of:
      0.055486243 = product of:
        0.34678903 = sum of:
          0.09625941 = weight(abstract_txt:table in 968) [ClassicSimilarity], result of:
            0.09625941 = score(doc=968,freq=3.0), product of:
              0.1298218 = queryWeight, product of:
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.018953646 = queryNorm
              0.7414733 = fieldWeight in 968, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.849437 = idf(docFreq=127, maxDocs=44421)
                0.0625 = fieldNorm(doc=968)
          0.072937846 = weight(abstract_txt:quantitative in 968) [ClassicSimilarity], result of:
            0.072937846 = score(doc=968,freq=1.0), product of:
              0.19606702 = queryWeight, product of:
                1.7379748 = boost
                5.9520745 = idf(docFreq=313, maxDocs=44421)
                0.018953646 = queryNorm
              0.37200466 = fieldWeight in 968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9520745 = idf(docFreq=313, maxDocs=44421)
                0.0625 = fieldNorm(doc=968)
          0.1456537 = weight(abstract_txt:tables in 968) [ClassicSimilarity], result of:
            0.1456537 = score(doc=968,freq=2.0), product of:
              0.2467783 = queryWeight, product of:
                1.9498206 = boost
                6.677587 = idf(docFreq=151, maxDocs=44421)
                0.018953646 = queryNorm
              0.59022087 = fieldWeight in 968, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.677587 = idf(docFreq=151, maxDocs=44421)
                0.0625 = fieldNorm(doc=968)
          0.03193808 = weight(abstract_txt:data in 968) [ClassicSimilarity], result of:
            0.03193808 = score(doc=968,freq=1.0), product of:
              0.15344585 = queryWeight, product of:
                2.4310212 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.018953646 = queryNorm
              0.20813909 = fieldWeight in 968, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=968)
        0.16 = coord(4/25)