Document (#32719)

Author
Stapleton, M.
Adams, M.
Title
Faceted categorisation for the corporate desktop : visualisation and interaction using metadata to enhance user experience
Source
http://www.iskouk.org/presentations/dow_jones.pdf
Year
2007
Abstract
Mark Stapleton and Matt Adamson began their presentation by describing how Dow Jones' Factiva range of information services processed an average of 170,000 documents every day, drawn from over 10,000 sources in 22 languages. These documents are categorized within five facets: Company, Subject, Industry, Region and Language. The digital feeds received from information providers undergo a series of processing stages, initially to prepare them for automatic categorization and then to format them ready for distribution. The categorization stage is able to handle 98% of documents automatically, the remaining 2% requiring some form of human intervention. Depending on the source, categorization can involve any combination of 'Autocoding', 'Dictionary-based Categorizing', 'Rules-based Coding' or 'Manual Coding'.
Content
Presentation given at: KOnnecting KOmmunities: RANGANATHAN REVISITED: FACETS FOR THE FUTURE, 5th November 2007, London. See also the report at: http://www.iskouk.org/presentations/KOKO_event_report_2007-11-05.pdf.
Object
Factiva

Similar documents (author)

  1. Adams, J.A.: ¬The computer catalog : a democratic or authoritarian technology? (1988) 5.41
    5.4077277 = sum of:
      5.4077277 = weight(author_txt:adams in 421) [ClassicSimilarity], result of:
        5.4077277 = score(doc=421,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          5.407728 = fieldWeight in 421, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.625 = fieldNorm(doc=421)
    
  2. Adams, B.: Stand der retrospektiven Katalogisierung in Deutschland : zum gegenwärtigen Stand der Diskussion (1992) 5.41
    5.4077277 = sum of:
      5.4077277 = weight(author_txt:adams in 2368) [ClassicSimilarity], result of:
        5.4077277 = score(doc=2368,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          5.407728 = fieldWeight in 2368, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.625 = fieldNorm(doc=2368)
    
  3. Adams, B.: Charles Ami Cutters 'Expansive classification' : eine kritische Darstellung (1965) 5.41
    5.4077277 = sum of:
      5.4077277 = weight(author_txt:adams in 4943) [ClassicSimilarity], result of:
        5.4077277 = score(doc=4943,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          5.407728 = fieldWeight in 4943, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.625 = fieldNorm(doc=4943)
    
  4. Adams, J.: ¬Le catalogue informatique (1989) 5.41
    5.4077277 = sum of:
      5.4077277 = weight(author_txt:adams in 5315) [ClassicSimilarity], result of:
        5.4077277 = score(doc=5315,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          5.407728 = fieldWeight in 5315, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.625 = fieldNorm(doc=5315)
    
  5. Adams, J.: Identifizierung für Waren mit Hilfe moderner Informationssysteme (1978) 5.41
    5.4077277 = sum of:
      5.4077277 = weight(author_txt:adams in 87) [ClassicSimilarity], result of:
        5.4077277 = score(doc=87,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.115575336 = queryNorm
          5.407728 = fieldWeight in 87, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            8.652365 = idf(docFreq=20, maxDocs=44218)
            0.625 = fieldNorm(doc=87)
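
The five author matches above all score 5.41 because each tree multiplies the same three inputs: idf, queryNorm and fieldNorm, with tf = 1.0. The following is a minimal sketch of that arithmetic, not the search engine's own code. The function names are hypothetical, and the idf formula, 1 + ln(maxDocs / (docFreq + 1)), is assumed because it reproduces the values printed here; other Lucene versions may compute idf slightly differently.

```python
import math


def classic_idf(doc_freq, max_docs):
    # Assumed idf form; it reproduces idf(docFreq=20, maxDocs=44218) = 8.652365.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_tf(freq):
    # tf is the square root of the term frequency (tf(freq=2.0) = 1.4142135).
    return math.sqrt(freq)


def term_score(freq, doc_freq, max_docs, query_norm, field_norm, boost=1.0):
    # score = queryWeight * fieldWeight, as laid out in the explain trees.
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm
    field_weight = classic_tf(freq) * idf * field_norm
    return query_weight * field_weight


# Inputs copied from the author_txt:adams tree for doc=421.
author_score = term_score(
    freq=1.0,
    doc_freq=20,
    max_docs=44218,
    query_norm=0.115575336,
    field_norm=0.625,
)
print(round(author_score, 2))
```

Because docFreq, queryNorm and fieldNorm are identical for all five records, the ranking among them is a tie; only the document number differs between the trees.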
    

Similar documents (content)

  1. Sykes, J.: Making solid business decisions through intelligent indexing taxonomies : a white paper prepared for Factiva, Factiva, a Dow Jones and Reuters Company (2003) 0.23
    0.23362799 = sum of:
      0.23362799 = product of:
        1.1681399 = sum of:
          0.06810196 = weight(abstract_txt:jones in 721) [ClassicSimilarity], result of:
            0.06810196 = score(doc=721,freq=1.0), product of:
              0.18515229 = queryWeight, product of:
                1.151547 = boost
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.020490766 = queryNorm
              0.3678159 = fieldWeight in 721, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.046875 = fieldNorm(doc=721)
          0.022318 = weight(abstract_txt:them in 721) [ClassicSimilarity], result of:
            0.022318 = score(doc=721,freq=1.0), product of:
              0.11088417 = queryWeight, product of:
                1.2602795 = boost
                4.293826 = idf(docFreq=1640, maxDocs=44218)
                0.020490766 = queryNorm
              0.2012731 = fieldWeight in 721, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.293826 = idf(docFreq=1640, maxDocs=44218)
                0.046875 = fieldNorm(doc=721)
          0.7392841 = weight(title_txt:factiva in 721) [ClassicSimilarity], result of:
            0.7392841 = score(doc=721,freq=2.0), product of:
              0.28592157 = queryWeight, product of:
                1.4310031 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.020490766 = queryNorm
              2.5856185 = fieldWeight in 721, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.1875 = fieldNorm(doc=721)
          0.04186313 = weight(abstract_txt:documents in 721) [ClassicSimilarity], result of:
            0.04186313 = score(doc=721,freq=2.0), product of:
              0.15322897 = queryWeight, product of:
                1.8144633 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.020490766 = queryNorm
              0.27320635 = fieldWeight in 721, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.046875 = fieldNorm(doc=721)
          0.29657272 = weight(abstract_txt:categorization in 721) [ClassicSimilarity], result of:
            0.29657272 = score(doc=721,freq=6.0), product of:
              0.39189237 = queryWeight, product of:
                2.9017577 = boost
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.020490766 = queryNorm
              0.75677085 = fieldWeight in 721, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.046875 = fieldNorm(doc=721)
        0.2 = coord(5/25)
    
  2. Sykes, J.: ¬The value of indexing : a white paper prepared for Factiva, Factiva, a Dow Jones and Reuters Company (2001) 0.16
    0.1636784 = sum of:
      0.1636784 = product of:
        1.02299 = sum of:
          0.09631072 = weight(abstract_txt:jones in 720) [ClassicSimilarity], result of:
            0.09631072 = score(doc=720,freq=2.0), product of:
              0.18515229 = queryWeight, product of:
                1.151547 = boost
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.020490766 = queryNorm
              0.5201703 = fieldWeight in 720, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.046875 = fieldNorm(doc=720)
          0.022318 = weight(abstract_txt:them in 720) [ClassicSimilarity], result of:
            0.022318 = score(doc=720,freq=1.0), product of:
              0.11088417 = queryWeight, product of:
                1.2602795 = boost
                4.293826 = idf(docFreq=1640, maxDocs=44218)
                0.020490766 = queryNorm
              0.2012731 = fieldWeight in 720, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.293826 = idf(docFreq=1640, maxDocs=44218)
                0.046875 = fieldNorm(doc=720)
          0.8624981 = weight(title_txt:factiva in 720) [ClassicSimilarity], result of:
            0.8624981 = score(doc=720,freq=2.0), product of:
              0.28592157 = queryWeight, product of:
                1.4310031 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.020490766 = queryNorm
              3.0165548 = fieldWeight in 720, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.21875 = fieldNorm(doc=720)
          0.04186313 = weight(abstract_txt:documents in 720) [ClassicSimilarity], result of:
            0.04186313 = score(doc=720,freq=2.0), product of:
              0.15322897 = queryWeight, product of:
                1.8144633 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.020490766 = queryNorm
              0.27320635 = fieldWeight in 720, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.046875 = fieldNorm(doc=720)
        0.16 = coord(4/25)
    
  3. Ribeiro-Neto, B.; Laender, A.H.F.; Lima, L.R.S. de: ¬An experimental study in automatically categorizing medical documents (2001) 0.13
    0.12831251 = sum of:
      0.12831251 = product of:
        0.6415626 = sum of:
          0.07281875 = weight(abstract_txt:categorized in 5702) [ClassicSimilarity], result of:
            0.07281875 = score(doc=5702,freq=1.0), product of:
              0.159818 = queryWeight, product of:
                1.0698674 = boost
                7.290168 = idf(docFreq=81, maxDocs=44218)
                0.020490766 = queryNorm
              0.4556355 = fieldWeight in 5702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.290168 = idf(docFreq=81, maxDocs=44218)
                0.0625 = fieldNorm(doc=5702)
          0.029757334 = weight(abstract_txt:them in 5702) [ClassicSimilarity], result of:
            0.029757334 = score(doc=5702,freq=1.0), product of:
              0.11088417 = queryWeight, product of:
                1.2602795 = boost
                4.293826 = idf(docFreq=1640, maxDocs=44218)
                0.020490766 = queryNorm
              0.26836413 = fieldWeight in 5702, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.293826 = idf(docFreq=1640, maxDocs=44218)
                0.0625 = fieldNorm(doc=5702)
          0.078937866 = weight(abstract_txt:documents in 5702) [ClassicSimilarity], result of:
            0.078937866 = score(doc=5702,freq=4.0), product of:
              0.15322897 = queryWeight, product of:
                1.8144633 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.020490766 = queryNorm
              0.5151628 = fieldWeight in 5702, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=5702)
          0.23174687 = weight(abstract_txt:coding in 5702) [ClassicSimilarity], result of:
            0.23174687 = score(doc=5702,freq=4.0), product of:
              0.2744497 = queryWeight, product of:
                1.9827297 = boost
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.020490766 = queryNorm
              0.8444056 = fieldWeight in 5702, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.7552447 = idf(docFreq=139, maxDocs=44218)
                0.0625 = fieldNorm(doc=5702)
          0.22830178 = weight(abstract_txt:categorization in 5702) [ClassicSimilarity], result of:
            0.22830178 = score(doc=5702,freq=2.0), product of:
              0.39189237 = queryWeight, product of:
                2.9017577 = boost
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.020490766 = queryNorm
              0.58256245 = fieldWeight in 5702, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.0625 = fieldNorm(doc=5702)
        0.2 = coord(5/25)
    
  4. Multimedia-Inhalte in Factiva : Produkten gezielt recherchierbar (2007) 0.09
    0.09262945 = sum of:
      0.09262945 = product of:
        1.1578681 = sum of:
          0.1123625 = weight(abstract_txt:jones in 500) [ClassicSimilarity], result of:
            0.1123625 = score(doc=500,freq=2.0), product of:
              0.18515229 = queryWeight, product of:
                1.151547 = boost
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.020490766 = queryNorm
              0.6068653 = fieldWeight in 500, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.84674 = idf(docFreq=46, maxDocs=44218)
                0.0546875 = fieldNorm(doc=500)
          1.0455056 = weight(title_txt:factiva in 500) [ClassicSimilarity], result of:
            1.0455056 = score(doc=500,freq=1.0), product of:
              0.28592157 = queryWeight, product of:
                1.4310031 = boost
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.020490766 = queryNorm
              3.6566167 = fieldWeight in 500, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.7509775 = idf(docFreq=6, maxDocs=44218)
                0.375 = fieldNorm(doc=500)
        0.08 = coord(2/25)
    
  5. Svarre, T.; Lykke, M.: Experiences with automated categorization in e-government information retrieval (2014) 0.07
    0.06668997 = sum of:
      0.06668997 = product of:
        0.5557498 = sum of:
          0.07281875 = weight(abstract_txt:categorized in 1372) [ClassicSimilarity], result of:
            0.07281875 = score(doc=1372,freq=1.0), product of:
              0.159818 = queryWeight, product of:
                1.0698674 = boost
                7.290168 = idf(docFreq=81, maxDocs=44218)
                0.020490766 = queryNorm
              0.4556355 = fieldWeight in 1372, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.290168 = idf(docFreq=81, maxDocs=44218)
                0.0625 = fieldNorm(doc=1372)
          0.055817503 = weight(abstract_txt:documents in 1372) [ClassicSimilarity], result of:
            0.055817503 = score(doc=1372,freq=2.0), product of:
              0.15322897 = queryWeight, product of:
                1.8144633 = boost
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.020490766 = queryNorm
              0.36427513 = fieldWeight in 1372, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.1213026 = idf(docFreq=1949, maxDocs=44218)
                0.0625 = fieldNorm(doc=1372)
          0.4271135 = weight(abstract_txt:categorization in 1372) [ClassicSimilarity], result of:
            0.4271135 = score(doc=1372,freq=7.0), product of:
              0.39189237 = queryWeight, product of:
                2.9017577 = boost
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.020490766 = queryNorm
              1.0898745 = fieldWeight in 1372, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                6.590942 = idf(docFreq=164, maxDocs=44218)
                0.0625 = fieldNorm(doc=1372)
        0.12 = coord(3/25)
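
In the content-based trees above, the per-term weights are summed and then scaled by a coordination factor, coord(matched/total), so a record that matches few of the 25 query terms is penalised even when one term scores highly. A minimal sketch of that last step follows, using the two weights listed for entry 4 (doc=500); the function name is hypothetical and the code only mirrors the arithmetic shown in the trees.

```python
def combined_score(term_weights, total_query_terms):
    # coord = number of matching query terms / total query terms, e.g. coord(2/25).
    coord = len(term_weights) / total_query_terms
    return sum(term_weights) * coord


# Weights for abstract_txt:jones and title_txt:factiva in doc=500.
weights_doc_500 = [0.1123625, 1.0455056]
score_doc_500 = combined_score(weights_doc_500, total_query_terms=25)
print(round(score_doc_500, 2))
```

This is why entry 4 ranks below entry 1 despite having the largest single term weight (1.0455056 for title_txt:factiva): it matches only 2 of 25 terms, whereas entry 1 matches 5.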