Document (#33266)

Author
Shirky, C.
Title
Ontology is overrated : categories, links, and tags
Source
http://www.shirky.com/writings/ontology_overrated.html
Year
2005
Series
Clay Shirky's writings about the Internet
Abstract
Today I want to talk about categorization, and I want to convince you that a lot of what we think we know about categorization is wrong. In particular, I want to convince you that many of the ways we're attempting to apply categorization to the electronic world are actually a bad fit, because we've adopted habits of mind that are left over from earlier strategies. I also want to convince you that what we're seeing when we see the Web is actually a radical break with previous categorization strategies, rather than an extension of them. The second part of the talk is more speculative, because it is often the case that old systems get broken before people know what's going to take their place. (Anyone watching the music industry can see this at work today.) That's what I think is happening with categorization. What I think is coming instead are much more organic ways of organizing information than our current categorization schemes allow, based on two units -- the link, which can point to anything, and the tag, which is a way of attaching labels to links. The strategy of tagging -- free-form labeling, without regard to categorical constraints -- seems like a recipe for disaster, but as the Web has shown us, you can extract a surprising amount of value from big messy data sets.
Footnote
This piece is based on two talks I gave in the spring of 2005 -- one at the O'Reilly ETech conference in March, entitled "Ontology Is Overrated", and one at the IMCExpo in April entitled "Folksonomies & Tags: The rise of user-developed classification." The written version is a heavily edited concatenation of those two talks.
Theme
Folksonomies
Social tagging

Similar documents (content)

  1. Pioneers in library and information science (2004) 0.11
    0.112296306 = sum of:
      0.112296306 = product of:
        0.40105823 = sum of:
          0.07427055 = weight(abstract_txt:happening in 1024) [ClassicSimilarity], result of:
            0.07427055 = score(doc=1024,freq=1.0), product of:
              0.1348673 = queryWeight, product of:
                1.0482373 = boost
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.014602158 = queryNorm
              0.5506935 = fieldWeight in 1024, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.811096 = idf(docFreq=17, maxDocs=44421)
                0.0625 = fieldNorm(doc=1024)
          0.024891641 = weight(abstract_txt:because in 1024) [ClassicSimilarity], result of:
            0.024891641 = score(doc=1024,freq=1.0), product of:
              0.08198629 = queryWeight, product of:
                1.1558245 = boost
                4.8577175 = idf(docFreq=937, maxDocs=44421)
                0.014602158 = queryNorm
              0.30360734 = fieldWeight in 1024, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8577175 = idf(docFreq=937, maxDocs=44421)
                0.0625 = fieldNorm(doc=1024)
          0.010154755 = weight(abstract_txt:that in 1024) [ClassicSimilarity], result of:
            0.010154755 = score(doc=1024,freq=2.0), product of:
              0.048579738 = queryWeight, product of:
                1.4067564 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.014602158 = queryNorm
              0.20903271 = fieldWeight in 1024, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=1024)
          0.049909733 = weight(abstract_txt:know in 1024) [ClassicSimilarity], result of:
            0.049909733 = score(doc=1024,freq=1.0), product of:
              0.13036542 = queryWeight, product of:
                1.4574797 = boost
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.014602158 = queryNorm
              0.3828449 = fieldWeight in 1024, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.0625 = fieldNorm(doc=1024)
          0.062302325 = weight(abstract_txt:actually in 1024) [ClassicSimilarity], result of:
            0.062302325 = score(doc=1024,freq=1.0), product of:
              0.15113848 = queryWeight, product of:
                1.5693103 = boost
                6.595522 = idf(docFreq=164, maxDocs=44421)
                0.014602158 = queryNorm
              0.41222012 = fieldWeight in 1024, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.595522 = idf(docFreq=164, maxDocs=44421)
                0.0625 = fieldNorm(doc=1024)
          0.085028656 = weight(abstract_txt:what in 1024) [ClassicSimilarity], result of:
            0.085028656 = score(doc=1024,freq=6.0), product of:
              0.12893635 = queryWeight, product of:
                2.049859 = boost
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.014602158 = queryNorm
              0.6594623 = fieldWeight in 1024, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.0625 = fieldNorm(doc=1024)
          0.094500564 = weight(abstract_txt:think in 1024) [ClassicSimilarity], result of:
            0.094500564 = score(doc=1024,freq=1.0), product of:
              0.22839797 = queryWeight, product of:
                2.3627243 = boost
                6.6200633 = idf(docFreq=160, maxDocs=44421)
                0.014602158 = queryNorm
              0.41375396 = fieldWeight in 1024, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6200633 = idf(docFreq=160, maxDocs=44421)
                0.0625 = fieldNorm(doc=1024)
        0.28 = coord(7/25)
    
  2. Müller, J.F.: A librarian's guide to the Internet : a guide to searching and evaluating information (2003) 0.11
    0.107788086 = sum of:
      0.107788086 = product of:
        0.44911703 = sum of:
          0.04356214 = weight(abstract_txt:strategies in 5502) [ClassicSimilarity], result of:
            0.04356214 = score(doc=5502,freq=1.0), product of:
              0.0908624 = queryWeight, product of:
                1.2167838 = boost
                5.113918 = idf(docFreq=725, maxDocs=44421)
                0.014602158 = queryNorm
              0.47942978 = fieldWeight in 5502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.113918 = idf(docFreq=725, maxDocs=44421)
                0.09375 = fieldNorm(doc=5502)
          0.046822384 = weight(abstract_txt:links in 5502) [ClassicSimilarity], result of:
            0.046822384 = score(doc=5502,freq=1.0), product of:
              0.09534115 = queryWeight, product of:
                1.2464117 = boost
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.014602158 = queryNorm
              0.4911036 = fieldWeight in 5502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.238438 = idf(docFreq=640, maxDocs=44421)
                0.09375 = fieldNorm(doc=5502)
          0.010770744 = weight(abstract_txt:that in 5502) [ClassicSimilarity], result of:
            0.010770744 = score(doc=5502,freq=1.0), product of:
              0.048579738 = queryWeight, product of:
                1.4067564 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.014602158 = queryNorm
              0.22171268 = fieldWeight in 5502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=5502)
          0.074864596 = weight(abstract_txt:know in 5502) [ClassicSimilarity], result of:
            0.074864596 = score(doc=5502,freq=1.0), product of:
              0.13036542 = queryWeight, product of:
                1.4574797 = boost
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.014602158 = queryNorm
              0.5742673 = fieldWeight in 5502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.09375 = fieldNorm(doc=5502)
          0.09018651 = weight(abstract_txt:what in 5502) [ClassicSimilarity], result of:
            0.09018651 = score(doc=5502,freq=3.0), product of:
              0.12893635 = queryWeight, product of:
                2.049859 = boost
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.014602158 = queryNorm
              0.69946533 = fieldWeight in 5502, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.09375 = fieldNorm(doc=5502)
          0.18291065 = weight(abstract_txt:want in 5502) [ClassicSimilarity], result of:
            0.18291065 = score(doc=5502,freq=1.0), product of:
              0.29795274 = queryWeight, product of:
                3.11609 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.014602158 = queryNorm
              0.6138915 = fieldWeight in 5502, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.09375 = fieldNorm(doc=5502)
        0.24 = coord(6/25)
    
  3. Larson, E.J.: The myth of artificial intelligence : why computers can't think the way we do (2021) 0.10
    0.10170824 = sum of:
      0.10170824 = product of:
        0.42378435 = sum of:
          0.021780184 = weight(abstract_txt:because in 2343) [ClassicSimilarity], result of:
            0.021780184 = score(doc=2343,freq=1.0), product of:
              0.08198629 = queryWeight, product of:
                1.1558245 = boost
                4.8577175 = idf(docFreq=937, maxDocs=44421)
                0.014602158 = queryNorm
              0.2656564 = fieldWeight in 2343, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8577175 = idf(docFreq=937, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2343)
          0.014049068 = weight(abstract_txt:that in 2343) [ClassicSimilarity], result of:
            0.014049068 = score(doc=2343,freq=5.0), product of:
              0.048579738 = queryWeight, product of:
                1.4067564 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.014602158 = queryNorm
              0.28919604 = fieldWeight in 2343, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2343)
          0.07564042 = weight(abstract_txt:know in 2343) [ClassicSimilarity], result of:
            0.07564042 = score(doc=2343,freq=3.0), product of:
              0.13036542 = queryWeight, product of:
                1.4574797 = boost
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.014602158 = queryNorm
              0.58021843 = fieldWeight in 2343, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2343)
          0.052608795 = weight(abstract_txt:what in 2343) [ClassicSimilarity], result of:
            0.052608795 = score(doc=2343,freq=3.0), product of:
              0.12893635 = queryWeight, product of:
                2.049859 = boost
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.014602158 = queryNorm
              0.40802145 = fieldWeight in 2343, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2343)
          0.15300797 = weight(abstract_txt:we're in 2343) [ClassicSimilarity], result of:
            0.15300797 = score(doc=2343,freq=1.0), product of:
              0.30072963 = queryWeight, product of:
                2.2136524 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.014602158 = queryNorm
              0.5087891 = fieldWeight in 2343, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2343)
          0.10669788 = weight(abstract_txt:want in 2343) [ClassicSimilarity], result of:
            0.10669788 = score(doc=2343,freq=1.0), product of:
              0.29795274 = queryWeight, product of:
                3.11609 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.014602158 = queryNorm
              0.35810336 = fieldWeight in 2343, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0546875 = fieldNorm(doc=2343)
        0.24 = coord(6/25)
    
  4. Allo, P.; Baumgaertner, B.; D'Alfonso, S.; Fresco, N.; Gobbo, F.; Grubaugh, C.; Iliadis, A.; Illari, P.; Kerr, E.; Primiero, G.; Russo, F.; Schulz, C.; Taddeo, M.; Turilli, M.; Vakarelov, O.; Zenil, H.: The philosophy of information : an introduction (2013) 0.09
    0.09107552 = sum of:
      0.09107552 = product of:
        0.37948135 = sum of:
          0.015557275 = weight(abstract_txt:because in 4380) [ClassicSimilarity], result of:
            0.015557275 = score(doc=4380,freq=1.0), product of:
              0.08198629 = queryWeight, product of:
                1.1558245 = boost
                4.8577175 = idf(docFreq=937, maxDocs=44421)
                0.014602158 = queryNorm
              0.18975459 = fieldWeight in 4380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.8577175 = idf(docFreq=937, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4380)
          0.007773114 = weight(abstract_txt:that in 4380) [ClassicSimilarity], result of:
            0.007773114 = score(doc=4380,freq=3.0), product of:
              0.048579738 = queryWeight, product of:
                1.4067564 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.014602158 = queryNorm
              0.16000733 = fieldWeight in 4380, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4380)
          0.05555092 = weight(abstract_txt:talk in 4380) [ClassicSimilarity], result of:
            0.05555092 = score(doc=4380,freq=1.0), product of:
              0.19153422 = queryWeight, product of:
                1.7666255 = boost
                7.4248013 = idf(docFreq=71, maxDocs=44421)
                0.014602158 = queryNorm
              0.2900313 = fieldWeight in 4380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.4248013 = idf(docFreq=71, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4380)
          0.109291404 = weight(abstract_txt:we're in 4380) [ClassicSimilarity], result of:
            0.109291404 = score(doc=4380,freq=1.0), product of:
              0.30072963 = queryWeight, product of:
                2.2136524 = boost
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.014602158 = queryNorm
              0.3634208 = fieldWeight in 4380, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.303573 = idf(docFreq=10, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4380)
          0.08352748 = weight(abstract_txt:think in 4380) [ClassicSimilarity], result of:
            0.08352748 = score(doc=4380,freq=2.0), product of:
              0.22839797 = queryWeight, product of:
                2.3627243 = boost
                6.6200633 = idf(docFreq=160, maxDocs=44421)
                0.014602158 = queryNorm
              0.3657103 = fieldWeight in 4380, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.6200633 = idf(docFreq=160, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4380)
          0.10778114 = weight(abstract_txt:want in 4380) [ClassicSimilarity], result of:
            0.10778114 = score(doc=4380,freq=2.0), product of:
              0.29795274 = queryWeight, product of:
                3.11609 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.014602158 = queryNorm
              0.36173904 = fieldWeight in 4380, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0390625 = fieldNorm(doc=4380)
        0.24 = coord(6/25)
    
  5. Stephens, O.: Introduction to OpenRefine (2014) 0.09
    0.087312676 = sum of:
      0.087312676 = product of:
        0.43656337 = sum of:
          0.02333829 = weight(abstract_txt:ways in 3884) [ClassicSimilarity], result of:
            0.02333829 = score(doc=3884,freq=1.0), product of:
              0.07853892 = queryWeight, product of:
                1.1312635 = boost
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.014602158 = queryNorm
              0.29715574 = fieldWeight in 3884, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.754492 = idf(docFreq=1039, maxDocs=44421)
                0.0625 = fieldNorm(doc=3884)
          0.09672225 = weight(abstract_txt:messy in 3884) [ClassicSimilarity], result of:
            0.09672225 = score(doc=3884,freq=1.0), product of:
              0.16083473 = queryWeight, product of:
                1.144712 = boost
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.014602158 = queryNorm
              0.60137665 = fieldWeight in 3884, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.622026 = idf(docFreq=7, maxDocs=44421)
                0.0625 = fieldNorm(doc=3884)
          0.07058302 = weight(abstract_txt:know in 3884) [ClassicSimilarity], result of:
            0.07058302 = score(doc=3884,freq=2.0), product of:
              0.13036542 = queryWeight, product of:
                1.4574797 = boost
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.014602158 = queryNorm
              0.54142445 = fieldWeight in 3884, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.0625 = fieldNorm(doc=3884)
          0.034712806 = weight(abstract_txt:what in 3884) [ClassicSimilarity], result of:
            0.034712806 = score(doc=3884,freq=1.0), product of:
              0.12893635 = queryWeight, product of:
                2.049859 = boost
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.014602158 = queryNorm
              0.26922435 = fieldWeight in 3884, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.3075895 = idf(docFreq=1625, maxDocs=44421)
                0.0625 = fieldNorm(doc=3884)
          0.21120702 = weight(abstract_txt:want in 3884) [ClassicSimilarity], result of:
            0.21120702 = score(doc=3884,freq=3.0), product of:
              0.29795274 = queryWeight, product of:
                3.11609 = boost
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.014602158 = queryNorm
              0.7088608 = fieldWeight in 3884, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.548176 = idf(docFreq=172, maxDocs=44421)
                0.0625 = fieldNorm(doc=3884)
        0.2 = coord(5/25)
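
How to read the score trees above: each listing carries the [ClassicSimilarity] label, which is Lucene's TF-IDF scoring. The numbers shown are consistent with these relations: idf = 1 + ln(maxDocs / (docFreq + 1)), tf = sqrt(termFreq), queryWeight = boost * idf * queryNorm, fieldWeight = tf * idf * fieldNorm, and term score = queryWeight * fieldWeight. The document score is the sum of the term scores multiplied by coord (matched query terms / total query terms). The following is a minimal Python sketch of that arithmetic, not Lucene's own code; the function names are hypothetical, and the inputs are copied from the first result (doc=1024). Lucene works in 32-bit floats, so the last digits may differ slightly.

```python
import math


def idf(doc_freq, max_docs):
    """Inverse document frequency as it appears in the listings above."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    """Score of one matching term: queryWeight * fieldWeight."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = boost * term_idf * query_norm
    field_weight = math.sqrt(freq) * term_idf * field_norm  # tf = sqrt(freq)
    return query_weight * field_weight


def document_score(term_scores, matched, total):
    """Sum of the term scores, scaled by coord = matched / total."""
    return (matched / total) * sum(term_scores)


# Term "happening" in doc 1024, with values taken from the listing above.
happening = term_score(
    freq=1.0,
    doc_freq=17,
    max_docs=44421,
    boost=1.0482373,
    query_norm=0.014602158,
    field_norm=0.0625,
)
print(happening)  # should be close to the listed 0.07427055

# The seven term scores listed for doc 1024, combined with coord(7/25).
doc_1024_terms = [
    0.07427055, 0.024891641, 0.010154755, 0.049909733,
    0.062302325, 0.085028656, 0.094500564,
]
print(document_score(doc_1024_terms, matched=7, total=25))  # close to 0.112296306
```

Rare terms dominate the ranking because idf enters the product twice, once in queryWeight and once in fieldWeight. A single hit on "messy" (docFreq=7) contributes far more than several hits on "that" (docFreq=11344).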