Document (#34014)

Author
Cosh, K.J.
Burns, R.
Daniel, T.
Title
Content clouds : classifying content in Web 2.0
Source
Library review. 57(2008) no.9, pp.722-729
Year
2008
Abstract
Purpose - With increasing amounts of user-generated content being produced electronically in the form of wikis, blogs, forums, etc., the purpose of this paper is to investigate a new approach to classifying ad hoc content. Design/methodology/approach - The approach applies natural language processing (NLP) tools to automatically extract the content of some text, visualizing the results in a content cloud. Findings - Content clouds share the visual simplicity of a tag cloud, but display the details of an article at a different level of abstraction, providing a complementary classification. Research limitations/implications - The paper provides the general approach to creating a content cloud. In the future, the process can be refined and enhanced by further evaluation of results. Further work is also required to better identify closely related articles. Practical implications - Being able to automatically classify the content generated by web users will enable others to find more appropriate content. Originality/value - The approach is original. Other researchers have produced a cloud simply by using skiplists to filter unwanted words; this paper's approach improves on this by applying appropriate NLP techniques.
Theme
Automatisches Klassifizieren
Object
Tag cloud
Word cloud
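
The abstract contrasts content clouds with the simpler baseline of building a cloud by filtering unwanted words through a skiplist. As a rough illustration of that baseline only (not the authors' NLP method), the sketch below counts the words that survive a skiplist and maps the counts to font sizes. The skiplist contents, the function name `skiplist_cloud` and the size range are all hypothetical.

```python
import re
from collections import Counter

# Hypothetical skiplist; a real one would be far longer.
SKIPLIST = {"the", "of", "a", "to", "in", "is", "and", "by", "this", "some"}

def skiplist_cloud(text, top_n=5, min_size=10, max_size=30):
    """Map the top_n most frequent non-skiplist words to font sizes."""
    words = re.findall(r"[a-z]+", text.lower())
    counts = Counter(w for w in words if w not in SKIPLIST)
    top = counts.most_common(top_n)
    if not top:
        return {}
    hi, lo = top[0][1], top[-1][1]
    span = max(hi - lo, 1)  # avoid division by zero when all counts are equal
    # Linear scaling: least frequent word gets min_size, most frequent max_size.
    return {w: min_size + (c - lo) * (max_size - min_size) // span
            for w, c in top}

sample = ("The approach applies NLP tools to extract the content of some text, "
          "visualizing the content in a content cloud.")
cloud = skiplist_cloud(sample)
```

A cloud built this way reflects raw word frequency only, which is the limitation the abstract says the NLP-based approach is meant to address.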

Similar documents (author)

  1. Burns, B.A.F.: Alternatives for library catalogues : tools for catalogue planning (1981) 2.55
    2.5542176 = sum of:
      2.5542176 = product of:
        5.108435 = sum of:
          5.108435 = weight(author_txt:burns in 5391) [ClassicSimilarity], result of:
            5.108435 = score(doc=5391,freq=1.0), product of:
              0.8247969 = queryWeight, product of:
                1.2077705 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.068913095 = queryNorm
              6.1935673 = fieldWeight in 5391, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.625 = fieldNorm(doc=5391)
        0.5 = coord(1/2)
    
  2. Bullard, J.; Burns, C.S.; VanScoy, A.: Warrant as a means to study classification system design (2017) 1.53
    1.5325307 = sum of:
      1.5325307 = product of:
        3.0650613 = sum of:
          3.0650613 = weight(author_txt:burns in 4360) [ClassicSimilarity], result of:
            3.0650613 = score(doc=4360,freq=1.0), product of:
              0.8247969 = queryWeight, product of:
                1.2077705 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.068913095 = queryNorm
              3.7161405 = fieldWeight in 4360, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.375 = fieldNorm(doc=4360)
        0.5 = coord(1/2)
    
  3. Bossaller, J.; Burns, C.S.; VanScoy, A.: Re-conceiving time in reference and information services work : a qualitative secondary analysis (2017) 1.53
    1.5325307 = sum of:
      1.5325307 = product of:
        3.0650613 = sum of:
          3.0650613 = weight(author_txt:burns in 4363) [ClassicSimilarity], result of:
            3.0650613 = score(doc=4363,freq=1.0), product of:
              0.8247969 = queryWeight, product of:
                1.2077705 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.068913095 = queryNorm
              3.7161405 = fieldWeight in 4363, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.375 = fieldNorm(doc=4363)
        0.5 = coord(1/2)
    
  4. Daniel, F.: Elektronische Informationsdienste in der StadtBibliothek Köln (1995) 1.45
    1.4497886 = sum of:
      1.4497886 = product of:
        2.8995771 = sum of:
          2.8995771 = weight(author_txt:daniel in 2764) [ClassicSimilarity], result of:
            2.8995771 = score(doc=2764,freq=1.0), product of:
              0.56542915 = queryWeight, product of:
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.068913095 = queryNorm
              5.1281 = fieldWeight in 2764, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.625 = fieldNorm(doc=2764)
        0.5 = coord(1/2)
    
  5. Daniel, F.: Präsentationssoftware 'infoThek' für elektronische Informationsmedien (1996) 1.45
    1.4497886 = sum of:
      1.4497886 = product of:
        2.8995771 = sum of:
          2.8995771 = weight(author_txt:daniel in 3383) [ClassicSimilarity], result of:
            2.8995771 = score(doc=3383,freq=1.0), product of:
              0.56542915 = queryWeight, product of:
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.068913095 = queryNorm
              5.1281 = fieldWeight in 3383, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.625 = fieldNorm(doc=3383)
        0.5 = coord(1/2)
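
The score trees above all follow the same arithmetic: a term score is queryWeight times fieldWeight, and the result is scaled by coord. A minimal sketch that re-derives two of the listed scores from the figures printed in the trees; the function name `term_score` is hypothetical, and a boost of 1.0 is assumed where the tree shows no boost line.

```python
import math

def term_score(idf, query_norm, freq, field_norm, boost=1.0):
    """Score of one matching term, following the layout of the trees above."""
    query_weight = boost * idf * query_norm            # queryWeight
    field_weight = math.sqrt(freq) * idf * field_norm  # fieldWeight; tf = sqrt(freq)
    return query_weight * field_weight

QUERY_NORM = 0.068913095  # queryNorm shared by all author matches above

# Entry 1 (doc 5391, author_txt:burns); the listing reports 2.5542176.
score_5391 = 0.5 * term_score(idf=9.909708, query_norm=QUERY_NORM,
                              freq=1.0, field_norm=0.625, boost=1.2077705)

# Entry 4 (doc 2764, author_txt:daniel); the listing reports 1.4497886.
# No boost line appears in this tree, so boost is assumed to be 1.0.
score_2764 = 0.5 * term_score(idf=8.20496, query_norm=QUERY_NORM,
                              freq=1.0, field_norm=0.625)
```

The factor 0.5 is coord(1/2): one of the two author terms in the query matched the candidate document. Entries 2 and 3 score lower than entry 1 only because their fieldNorm is 0.375 rather than 0.625.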
    

Similar documents (content)

  1. Leginus, M.; Zhai, C.X.; Dolog, P.: Personalized generation of word clouds from tweets (2016) 0.24
    0.23640943 = sum of:
      0.23640943 = product of:
        1.1820471 = sum of:
          0.034851953 = weight(abstract_txt:further in 3886) [ClassicSimilarity], result of:
            0.034851953 = score(doc=3886,freq=1.0), product of:
              0.0957475 = queryWeight, product of:
                1.2816409 = boost
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.016034354 = queryNorm
              0.36399856 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.078125 = fieldNorm(doc=3886)
          0.057615772 = weight(abstract_txt:generated in 3886) [ClassicSimilarity], result of:
            0.057615772 = score(doc=3886,freq=1.0), product of:
              0.133866 = queryWeight, product of:
                1.5154366 = boost
                5.509105 = idf(docFreq=488, maxDocs=44421)
                0.016034354 = queryNorm
              0.43039885 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.509105 = idf(docFreq=488, maxDocs=44421)
                0.078125 = fieldNorm(doc=3886)
          0.49555004 = weight(abstract_txt:clouds in 3886) [ClassicSimilarity], result of:
            0.49555004 = score(doc=3886,freq=3.0), product of:
              0.38963738 = queryWeight, product of:
                2.5854309 = boost
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.016034354 = queryNorm
              1.2718236 = fieldWeight in 3886, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.398883 = idf(docFreq=9, maxDocs=44421)
                0.078125 = fieldNorm(doc=3886)
          0.054129332 = weight(abstract_txt:approach in 3886) [ClassicSimilarity], result of:
            0.054129332 = score(doc=3886,freq=1.0), product of:
              0.18519883 = queryWeight, product of:
                3.087325 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.016034354 = queryNorm
              0.29227686 = fieldWeight in 3886, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=3886)
          0.53990006 = weight(abstract_txt:cloud in 3886) [ClassicSimilarity], result of:
            0.53990006 = score(doc=3886,freq=3.0), product of:
              0.51978195 = queryWeight, product of:
                4.2230697 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.016034354 = queryNorm
              1.0387049 = fieldWeight in 3886, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.078125 = fieldNorm(doc=3886)
        0.2 = coord(5/25)
    
  2. Huang, C.; Fu, T.; Chen, H.: Text-based video content classification for online video-sharing sites (2010) 0.10
    0.099981524 = sum of:
      0.099981524 = product of:
        0.41658968 = sum of:
          0.047052946 = weight(abstract_txt:blogs in 439) [ClassicSimilarity], result of:
            0.047052946 = score(doc=439,freq=1.0), product of:
              0.117749356 = queryWeight, product of:
                1.0050019 = boost
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.016034354 = queryNorm
              0.39960256 = fieldWeight in 439, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.0546875 = fieldNorm(doc=439)
          0.058370605 = weight(abstract_txt:forums in 439) [ClassicSimilarity], result of:
            0.058370605 = score(doc=439,freq=1.0), product of:
              0.13594507 = queryWeight, product of:
                1.0798647 = boost
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.016034354 = queryNorm
              0.42936906 = fieldWeight in 439, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.85132 = idf(docFreq=46, maxDocs=44421)
                0.0546875 = fieldNorm(doc=439)
          0.024396367 = weight(abstract_txt:further in 439) [ClassicSimilarity], result of:
            0.024396367 = score(doc=439,freq=1.0), product of:
              0.0957475 = queryWeight, product of:
                1.2816409 = boost
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.016034354 = queryNorm
              0.254799 = fieldWeight in 439, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.0546875 = fieldNorm(doc=439)
          0.08066208 = weight(abstract_txt:generated in 439) [ClassicSimilarity], result of:
            0.08066208 = score(doc=439,freq=4.0), product of:
              0.133866 = queryWeight, product of:
                1.5154366 = boost
                5.509105 = idf(docFreq=488, maxDocs=44421)
                0.016034354 = queryNorm
              0.6025584 = fieldWeight in 439, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.509105 = idf(docFreq=488, maxDocs=44421)
                0.0546875 = fieldNorm(doc=439)
          0.053585306 = weight(abstract_txt:approach in 439) [ClassicSimilarity], result of:
            0.053585306 = score(doc=439,freq=2.0), product of:
              0.18519883 = queryWeight, product of:
                3.087325 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.016034354 = queryNorm
              0.28933933 = fieldWeight in 439, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0546875 = fieldNorm(doc=439)
          0.15252239 = weight(abstract_txt:content in 439) [ClassicSimilarity], result of:
            0.15252239 = score(doc=439,freq=3.0), product of:
              0.38525593 = queryWeight, product of:
                5.7486024 = boost
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.016034354 = queryNorm
              0.39589888 = fieldWeight in 439, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.0546875 = fieldNorm(doc=439)
        0.24 = coord(6/25)
    
  3. Hartel, J.; Savolainen, R.: Pictorial metaphors for information (2016) 0.10
    0.0956558 = sum of:
      0.0956558 = product of:
        0.39856583 = sum of:
          0.030338427 = weight(abstract_txt:purpose in 4163) [ClassicSimilarity], result of:
            0.030338427 = score(doc=4163,freq=2.0), product of:
              0.087881215 = queryWeight, product of:
                1.2278651 = boost
                4.4636893 = idf(docFreq=1390, maxDocs=44421)
                0.016034354 = queryNorm
              0.34522083 = fieldWeight in 4163, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.4636893 = idf(docFreq=1390, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4163)
          0.030694697 = weight(abstract_txt:implications in 4163) [ClassicSimilarity], result of:
            0.030694697 = score(doc=4163,freq=2.0), product of:
              0.08856788 = queryWeight, product of:
                1.2326528 = boost
                4.481094 = idf(docFreq=1366, maxDocs=44421)
                0.016034354 = queryNorm
              0.34656692 = fieldWeight in 4163, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.481094 = idf(docFreq=1366, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4163)
          0.04033104 = weight(abstract_txt:generated in 4163) [ClassicSimilarity], result of:
            0.04033104 = score(doc=4163,freq=1.0), product of:
              0.133866 = queryWeight, product of:
                1.5154366 = boost
                5.509105 = idf(docFreq=488, maxDocs=44421)
                0.016034354 = queryNorm
              0.3012792 = fieldWeight in 4163, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.509105 = idf(docFreq=488, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4163)
          0.041113142 = weight(abstract_txt:produced in 4163) [ClassicSimilarity], result of:
            0.041113142 = score(doc=4163,freq=1.0), product of:
              0.13559107 = queryWeight, product of:
                1.5251697 = boost
                5.5444884 = idf(docFreq=471, maxDocs=44421)
                0.016034354 = queryNorm
              0.30321422 = fieldWeight in 4163, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5444884 = idf(docFreq=471, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4163)
          0.037890535 = weight(abstract_txt:approach in 4163) [ClassicSimilarity], result of:
            0.037890535 = score(doc=4163,freq=1.0), product of:
              0.18519883 = queryWeight, product of:
                3.087325 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.016034354 = queryNorm
              0.20459381 = fieldWeight in 4163, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4163)
          0.218198 = weight(abstract_txt:cloud in 4163) [ClassicSimilarity], result of:
            0.218198 = score(doc=4163,freq=1.0), product of:
              0.51978195 = queryWeight, product of:
                4.2230697 = boost
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.016034354 = queryNorm
              0.4197876 = fieldWeight in 4163, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.676116 = idf(docFreq=55, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4163)
        0.24 = coord(6/25)
    
  4. Williamson, A.: Strategies for managing digital content formats (2005) 0.10
    0.09564599 = sum of:
      0.09564599 = product of:
        0.47822994 = sum of:
          0.030646442 = weight(abstract_txt:purpose in 5745) [ClassicSimilarity], result of:
            0.030646442 = score(doc=5745,freq=1.0), product of:
              0.087881215 = queryWeight, product of:
                1.2278651 = boost
                4.4636893 = idf(docFreq=1390, maxDocs=44421)
                0.016034354 = queryNorm
              0.34872574 = fieldWeight in 5745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4636893 = idf(docFreq=1390, maxDocs=44421)
                0.078125 = fieldNorm(doc=5745)
          0.031006329 = weight(abstract_txt:implications in 5745) [ClassicSimilarity], result of:
            0.031006329 = score(doc=5745,freq=1.0), product of:
              0.08856788 = queryWeight, product of:
                1.2326528 = boost
                4.481094 = idf(docFreq=1366, maxDocs=44421)
                0.016034354 = queryNorm
              0.35008547 = fieldWeight in 5745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.481094 = idf(docFreq=1366, maxDocs=44421)
                0.078125 = fieldNorm(doc=5745)
          0.05873306 = weight(abstract_txt:produced in 5745) [ClassicSimilarity], result of:
            0.05873306 = score(doc=5745,freq=1.0), product of:
              0.13559107 = queryWeight, product of:
                1.5251697 = boost
                5.5444884 = idf(docFreq=471, maxDocs=44421)
                0.016034354 = queryNorm
              0.43316317 = fieldWeight in 5745, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.5444884 = idf(docFreq=471, maxDocs=44421)
                0.078125 = fieldNorm(doc=5745)
          0.07655043 = weight(abstract_txt:approach in 5745) [ClassicSimilarity], result of:
            0.07655043 = score(doc=5745,freq=2.0), product of:
              0.18519883 = queryWeight, product of:
                3.087325 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.016034354 = queryNorm
              0.41334188 = fieldWeight in 5745, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.078125 = fieldNorm(doc=5745)
          0.28129366 = weight(abstract_txt:content in 5745) [ClassicSimilarity], result of:
            0.28129366 = score(doc=5745,freq=5.0), product of:
              0.38525593 = queryWeight, product of:
                5.7486024 = boost
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.016034354 = queryNorm
              0.7301475 = fieldWeight in 5745, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.078125 = fieldNorm(doc=5745)
        0.2 = coord(5/25)
    
  5. Pu, H.-T.; Chuang, S.-L.; Yang, C.: Subject categorization of query terms for exploring Web users' search interests (2002) 0.09
    0.0916762 = sum of:
      0.0916762 = product of:
        0.38198417 = sum of:
          0.027881563 = weight(abstract_txt:further in 1587) [ClassicSimilarity], result of:
            0.027881563 = score(doc=1587,freq=1.0), product of:
              0.0957475 = queryWeight, product of:
                1.2816409 = boost
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.016034354 = queryNorm
              0.29119885 = fieldWeight in 1587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.6591816 = idf(docFreq=1143, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.041578397 = weight(abstract_txt:appropriate in 1587) [ClassicSimilarity], result of:
            0.041578397 = score(doc=1587,freq=1.0), product of:
              0.12497636 = queryWeight, product of:
                1.4642545 = boost
                5.3230414 = idf(docFreq=588, maxDocs=44421)
                0.016034354 = queryNorm
              0.3326901 = fieldWeight in 1587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.3230414 = idf(docFreq=588, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.04640319 = weight(abstract_txt:automatically in 1587) [ClassicSimilarity], result of:
            0.04640319 = score(doc=1587,freq=1.0), product of:
              0.13446665 = queryWeight, product of:
                1.5188327 = boost
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.016034354 = queryNorm
              0.3450907 = fieldWeight in 1587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.521451 = idf(docFreq=482, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.0788754 = weight(abstract_txt:classifying in 1587) [ClassicSimilarity], result of:
            0.0788754 = score(doc=1587,freq=1.0), product of:
              0.19151837 = queryWeight, product of:
                1.8126245 = boost
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.016034354 = queryNorm
              0.4118425 = fieldWeight in 1587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.58948 = idf(docFreq=165, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.086606935 = weight(abstract_txt:approach in 1587) [ClassicSimilarity], result of:
            0.086606935 = score(doc=1587,freq=4.0), product of:
              0.18519883 = queryWeight, product of:
                3.087325 = boost
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.016034354 = queryNorm
              0.467643 = fieldWeight in 1587, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.741144 = idf(docFreq=2864, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
          0.10063868 = weight(abstract_txt:content in 1587) [ClassicSimilarity], result of:
            0.10063868 = score(doc=1587,freq=1.0), product of:
              0.38525593 = queryWeight, product of:
                5.7486024 = boost
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.016034354 = queryNorm
              0.26122552 = fieldWeight in 1587, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.1796083 = idf(docFreq=1847, maxDocs=44421)
                0.0625 = fieldNorm(doc=1587)
        0.24 = coord(6/25)
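
The content-based scores combine several matching abstract terms. The sketch below recomputes the first entry (doc 3886) from the raw counts in its tree. It assumes idf = 1 + ln(maxDocs / (docFreq + 1)) and tf = sqrt(freq), which reproduce the figures printed above; the function names are hypothetical.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.016034354  # queryNorm shared by all abstract_txt matches above

def idf(doc_freq, max_docs=MAX_DOCS):
    # Assumed formula; it reproduces e.g. idf(docFreq=9) = 9.398883 above.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def explain_score(terms, field_norm, total_query_terms):
    """Sum the per-term scores, then scale by coord = matched / total terms."""
    total = 0.0
    for boost, doc_freq, freq in terms:
        term_idf = idf(doc_freq)
        query_weight = boost * term_idf * QUERY_NORM
        field_weight = math.sqrt(freq) * term_idf * field_norm
        total += query_weight * field_weight
    coord = len(terms) / total_query_terms
    return total * coord

# (boost, docFreq, freq) for each matching term in doc 3886
terms_3886 = [
    (1.2816409, 1143, 1.0),  # further
    (1.5154366, 488, 1.0),   # generated
    (2.5854309, 9, 3.0),     # clouds
    (3.087325, 2864, 1.0),   # approach
    (4.2230697, 55, 3.0),    # cloud
]
# The listing reports 0.23640943 for this entry.
score_3886 = explain_score(terms_3886, field_norm=0.078125, total_query_terms=25)
```

Most of this score comes from "clouds" and "cloud": both are rare in the index (docFreq 9 and 55) and each occurs three times in the candidate abstract, while common terms such as "approach" contribute little despite their boost.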