Document (#34629)

Author
Spero, S.
Title
LCSH is to thesaurus as doorbell is to mammal
Issue
[20.01.2008].
Source
http://www.ibiblio.org/fred2.0/wordpress/?p=20
Year
2008
Content
"When you look at the Library of Congress Subject Headings as individual entries, it's almost impossible to understand just how confused much of the hierarchical reference structure has become. I've written some code to generate graphical representations for the Broader Terms of entries in the LCSH. The starting term appears at the bottom of the graph; according to the rules, this term is a specialization of every other term on the graph. Top-level terms are highlighted using double circles. Layout and rendering are courtesy of the wonderful graphviz (AT&T Research). I have generated dot files for all entries in the LCSH; I need to set up a dynamic renderer so they can be viewed online, but a p7zip archive of the raw dot files is available here. (5M compressed, 672M uncompressed) Let's see what the LCSH has to tell us about Doorbells. Doorbells are a Social science. Doorbells are Souls. Doorbells are even Ontologies - which would explain why Protege keeps beeping at me. But most of all, Doorbells are mammals. Obviously this conclusion is absurd. Everyone knows that doorbells aren't hairy. But where are the errors that lead us to this mistaken conclusion, and how can we start to correct them? That's the subject of tomorrow's post."
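The graph construction described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the author's code: the function names (broader_closure, to_dot) and the toy term mapping are hypothetical, and the placeholder headings are not real LCSH entries. It follows Broader Term links transitively from a starting term, draws terms that have no broader term as double circles, and uses Graphviz's rankdir=BT so the starting term ends up at the bottom.

```python
from collections import deque


def broader_closure(start, broader):
    """Collect every term reachable from `start` via Broader Term links."""
    seen = {start}
    edges = []
    queue = deque([start])
    while queue:
        term = queue.popleft()
        for parent in broader.get(term, []):
            edges.append((term, parent))
            if parent not in seen:  # guards against cycles in the data
                seen.add(parent)
                queue.append(parent)
    return seen, edges


def to_dot(start, broader):
    """Render the broader-term graph of `start` as Graphviz dot text."""
    nodes, edges = broader_closure(start, broader)
    lines = ["digraph broader_terms {", "  rankdir=BT;"]
    for term in sorted(nodes):
        # A term with no broader term is treated as top level.
        shape = "doublecircle" if not broader.get(term) else "ellipse"
        lines.append(f'  "{term}" [shape={shape}];')
    for narrow, broad in edges:
        lines.append(f'  "{narrow}" -> "{broad}";')
    lines.append("}")
    return "\n".join(lines)


# Hypothetical toy data; placeholder names only, not actual LCSH headings.
TOY_BROADER = {
    "Start term": ["Heading A", "Heading B"],
    "Heading A": ["Top heading"],
    "Heading B": ["Top heading"],
}

if __name__ == "__main__":
    print(to_dot("Start term", TOY_BROADER))
```

The resulting text can be passed to the Graphviz dot program for layout. The sketch does not escape quotation marks inside term labels, which real headings would require.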
Footnote
See also: http://www.ibiblio.org/fred2.0/wordpress/?p=28.
Object
LCSH

Similar documents (content)

  1. Hemmasi, H.: ¬The Music Thesaurus Project at Rutgers University (1995) 1.45
    1.4543347 = sum of:
      1.4543347 = sum of:
        0.6391064 = weight(abstract_txt:thesaurus in 1936) [ClassicSimilarity], result of:
          0.6391064 = score(doc=1936,freq=2.0), product of:
            0.55936855 = queryWeight, product of:
              5.17059 = idf(docFreq=685, maxDocs=44421)
              0.10818273 = queryNorm
            1.1425498 = fieldWeight in 1936, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              5.17059 = idf(docFreq=685, maxDocs=44421)
              0.15625 = fieldNorm(doc=1936)
        0.8152284 = weight(abstract_txt:lcsh in 1936) [ClassicSimilarity], result of:
          0.8152284 = score(doc=1936,freq=1.0), product of:
            0.82891905 = queryWeight, product of:
              1.2173264 = boost
              6.294296 = idf(docFreq=222, maxDocs=44421)
              0.10818273 = queryNorm
            0.98348373 = fieldWeight in 1936, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.294296 = idf(docFreq=222, maxDocs=44421)
              0.15625 = fieldNorm(doc=1936)
    
  2. Hemmasi, H.: ARIS music thesaurus : another view of LCSH (1992) 1.35
    1.3549545 = sum of:
      1.3549545 = sum of:
        0.5479196 = weight(abstract_txt:thesaurus in 3943) [ClassicSimilarity], result of:
          0.5479196 = score(doc=3943,freq=3.0), product of:
            0.55936855 = queryWeight, product of:
              5.17059 = idf(docFreq=685, maxDocs=44421)
              0.10818273 = queryNorm
            0.97953236 = fieldWeight in 3943, product of:
              1.7320508 = tf(freq=3.0), with freq of:
                3.0 = termFreq=3.0
              5.17059 = idf(docFreq=685, maxDocs=44421)
              0.109375 = fieldNorm(doc=3943)
        0.80703497 = weight(abstract_txt:lcsh in 3943) [ClassicSimilarity], result of:
          0.80703497 = score(doc=3943,freq=2.0), product of:
            0.82891905 = queryWeight, product of:
              1.2173264 = boost
              6.294296 = idf(docFreq=222, maxDocs=44421)
              0.10818273 = queryNorm
            0.97359926 = fieldWeight in 3943, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.294296 = idf(docFreq=222, maxDocs=44421)
              0.109375 = fieldNorm(doc=3943)
    
  3. Weintraub, T.S.: ¬The Dual-Thesaurus Catalog : MeSH and LCSH (1992) 1.27
    1.2671449 = sum of:
      1.2671449 = sum of:
        0.45191646 = weight(abstract_txt:thesaurus in 7207) [ClassicSimilarity], result of:
          0.45191646 = score(doc=7207,freq=1.0), product of:
            0.55936855 = queryWeight, product of:
              5.17059 = idf(docFreq=685, maxDocs=44421)
              0.10818273 = queryNorm
            0.80790466 = fieldWeight in 7207, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              5.17059 = idf(docFreq=685, maxDocs=44421)
              0.15625 = fieldNorm(doc=7207)
        0.8152284 = weight(abstract_txt:lcsh in 7207) [ClassicSimilarity], result of:
          0.8152284 = score(doc=7207,freq=1.0), product of:
            0.82891905 = queryWeight, product of:
              1.2173264 = boost
              6.294296 = idf(docFreq=222, maxDocs=44421)
              0.10818273 = queryNorm
            0.98348373 = fieldWeight in 7207, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              6.294296 = idf(docFreq=222, maxDocs=44421)
              0.15625 = fieldNorm(doc=7207)
    
  4. McKnight, M.: Improving access to music : a report of the MLA Music Thesaurus Project Working Group. (1988) 0.97
    0.9678247 = sum of:
      0.9678247 = sum of:
        0.39137116 = weight(abstract_txt:thesaurus in 3554) [ClassicSimilarity], result of:
          0.39137116 = score(doc=3554,freq=3.0), product of:
            0.55936855 = queryWeight, product of:
              5.17059 = idf(docFreq=685, maxDocs=44421)
              0.10818273 = queryNorm
            0.699666 = fieldWeight in 3554, product of:
              1.7320508 = tf(freq=3.0), with freq of:
                3.0 = termFreq=3.0
              5.17059 = idf(docFreq=685, maxDocs=44421)
              0.078125 = fieldNorm(doc=3554)
        0.5764535 = weight(abstract_txt:lcsh in 3554) [ClassicSimilarity], result of:
          0.5764535 = score(doc=3554,freq=2.0), product of:
            0.82891905 = queryWeight, product of:
              1.2173264 = boost
              6.294296 = idf(docFreq=222, maxDocs=44421)
              0.10818273 = queryNorm
            0.695428 = fieldWeight in 3554, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.294296 = idf(docFreq=222, maxDocs=44421)
              0.078125 = fieldNorm(doc=3554)
    
  5. Lucarelli, A.; Viti, E.: Florence-Washington round trip : ways and intersections between semantic indexing tools in different languages (2015) 0.96
    0.9628942 = sum of:
      0.9628942 = sum of:
        0.2711499 = weight(abstract_txt:thesaurus in 2886) [ClassicSimilarity], result of:
          0.2711499 = score(doc=2886,freq=1.0), product of:
            0.55936855 = queryWeight, product of:
              5.17059 = idf(docFreq=685, maxDocs=44421)
              0.10818273 = queryNorm
            0.48474282 = fieldWeight in 2886, product of:
              1.0 = tf(freq=1.0), with freq of:
                1.0 = termFreq=1.0
              5.17059 = idf(docFreq=685, maxDocs=44421)
              0.09375 = fieldNorm(doc=2886)
        0.69174427 = weight(abstract_txt:lcsh in 2886) [ClassicSimilarity], result of:
          0.69174427 = score(doc=2886,freq=2.0), product of:
            0.82891905 = queryWeight, product of:
              1.2173264 = boost
              6.294296 = idf(docFreq=222, maxDocs=44421)
              0.10818273 = queryNorm
            0.83451366 = fieldWeight in 2886, product of:
              1.4142135 = tf(freq=2.0), with freq of:
                2.0 = termFreq=2.0
              6.294296 = idf(docFreq=222, maxDocs=44421)
              0.09375 = fieldNorm(doc=2886)
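
The score breakdowns above can be re-derived by hand. The sketch below recomputes the score of the first similar document (doc 1936) from the listed inputs. It assumes the classic Lucene TF-IDF formulas, which appear consistent with the figures shown: tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1)), queryWeight = boost * idf * queryNorm, and fieldWeight = tf * idf * fieldNorm. The function names are illustrative, and the queryNorm, boost, and fieldNorm values are copied from the listing rather than derived.

```python
import math

MAX_DOCS = 44421          # maxDocs from the listing
QUERY_NORM = 0.10818273   # queryNorm from the listing


def tf(freq):
    """Term-frequency factor: square root of the raw frequency."""
    return math.sqrt(freq)


def idf(doc_freq, max_docs=MAX_DOCS):
    """Inverse document frequency in the form assumed above."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, field_norm, boost=1.0):
    """Score of one query term in one document: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq)
    query_weight = boost * term_idf * QUERY_NORM
    field_weight = tf(freq) * term_idf * field_norm
    return query_weight * field_weight


# Inputs for doc 1936, copied from the first entry in the list.
thesaurus = term_score(freq=2.0, doc_freq=685, field_norm=0.15625)
lcsh = term_score(freq=1.0, doc_freq=222, field_norm=0.15625, boost=1.2173264)
total = thesaurus + lcsh

if __name__ == "__main__":
    print(f"thesaurus={thesaurus:.7f} lcsh={lcsh:.7f} total={total:.7f}")
```

The printed values should land close to the 0.6391064, 0.8152284, and 1.4543347 listed for doc 1936; small differences in the last digits are expected because the listing was presumably computed in single precision.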