Document (#19550)

Author
Carrick, C.
Watters, C.
Title
Automatic association of news items
Source
Information processing and management. 33(1997) no.5, S.615-632
Year
1997
Abstract
Examines the problem of associating related items of different media types, specifically photos and stories, in the automatic generation of electronic editions. Determines the degree to which any 2 news items refer to the same news event. This metric can be used to link multimedia items that can be shown together, such as a video, photo, and text story related to a shipwreck or state visit, and to form clusters of very similar items from a variety of sources so that 1 or 2 can be chosen to represent that event in an edition. Discusses the specific association of text and photo news items, although the approach applies to a larger domain of news, including scripted news video clips and scripted radio broadcasts.
Footnote
Contribution to a special issue devoted to electronic newspapers
Theme
Elektronisches Publizieren
Form
Zeitungen

Similar documents (author)

  1. Watters, C.: Extending the multimedia class hierarchy for hypermedia applications (1996) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:watters in 605) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 605, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=605)
    
  2. Watters, C.: Information retrieval and the virtual document (1999) 5.62
    5.6180234 = sum of:
      5.6180234 = weight(author_txt:watters in 4319) [ClassicSimilarity], result of:
        5.6180234 = fieldWeight in 4319, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.625 = fieldNorm(doc=4319)
    
  3. Watters, C.; Shepherd, M.A.: Shifting the information paradigm from data-centered to user-centered (1994) 4.49
    4.4944186 = sum of:
      4.4944186 = weight(author_txt:watters in 7290) [ClassicSimilarity], result of:
        4.4944186 = fieldWeight in 7290, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.5 = fieldNorm(doc=7290)
    
  4. Watters, C.; Wang, H.: Rating new documents for similarity (2000) 4.49
    4.4944186 = sum of:
      4.4944186 = weight(author_txt:watters in 4856) [ClassicSimilarity], result of:
        4.4944186 = fieldWeight in 4856, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.5 = fieldNorm(doc=4856)
    
  5. Watters, C.; Amoudi, A.: Geosearcher : location-based ranking of search engine results (2003) 4.49
    4.4944186 = sum of:
      4.4944186 = weight(author_txt:watters in 5152) [ClassicSimilarity], result of:
        4.4944186 = fieldWeight in 5152, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.988837 = idf(docFreq=14, maxDocs=44218)
          0.5 = fieldNorm(doc=5152)
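
The author scores above can be reproduced by hand. The following is a minimal sketch, assuming the usual ClassicSimilarity TF-IDF definitions (tf as the square root of the term frequency, idf as 1 + ln(maxDocs / (docFreq + 1))); the function names are illustrative and are not part of the search engine's API. The input values are taken from the explanation trees for documents 605 and 7290.

```python
import math

def idf(doc_freq, max_docs):
    # Inverse document frequency: 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def tf(freq):
    # Term frequency factor: square root of the raw frequency
    return math.sqrt(freq)

def field_weight(freq, doc_freq, max_docs, field_norm):
    # fieldWeight = tf * idf * fieldNorm, as listed in the trees above
    return tf(freq) * idf(doc_freq, max_docs) * field_norm

# author_txt:watters occurs in 14 of 44218 documents
idf_watters = idf(14, 44218)
score_605 = field_weight(1.0, 14, 44218, 0.625)   # entry 1, fieldNorm 0.625
score_7290 = field_weight(1.0, 14, 44218, 0.5)    # entry 3, fieldNorm 0.5

print(round(idf_watters, 4), round(score_605, 4), round(score_7290, 4))
```

The only factor that differs between the entries is fieldNorm, which is why the first two documents score 5.62 and the remaining three score 4.49.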
    

Similar documents (content)

  1. Watters, C.R.; Shepherd, M.A.; Burkowski, F.J.: Electronic news delivery project (1998) 0.35
    0.34964338 = sum of:
      0.34964338 = product of:
        1.0926356 = sum of:
          0.006124337 = weight(abstract_txt:that in 444) [ClassicSimilarity], result of:
            0.006124337 = score(doc=444,freq=1.0), product of:
              0.041354895 = queryWeight, product of:
                1.1732916 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.014875406 = queryNorm
              0.1480922 = fieldWeight in 444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=444)
          0.07184822 = weight(abstract_txt:radio in 444) [ClassicSimilarity], result of:
            0.07184822 = score(doc=444,freq=1.0), product of:
              0.14804411 = queryWeight, product of:
                1.2816736 = boost
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.014875406 = queryNorm
              0.48531634 = fieldWeight in 444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7650614 = idf(docFreq=50, maxDocs=44218)
                0.0625 = fieldNorm(doc=444)
          0.028702311 = weight(abstract_txt:text in 444) [ClassicSimilarity], result of:
            0.028702311 = score(doc=444,freq=2.0), product of:
              0.080301754 = queryWeight, product of:
                1.3349327 = boost
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.014875406 = queryNorm
              0.3574307 = fieldWeight in 444, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.0438666 = idf(docFreq=2106, maxDocs=44218)
                0.0625 = fieldNorm(doc=444)
          0.09098759 = weight(abstract_txt:photos in 444) [ClassicSimilarity], result of:
            0.09098759 = score(doc=444,freq=1.0), product of:
              0.17328803 = queryWeight, product of:
                1.3866477 = boost
                8.401051 = idf(docFreq=26, maxDocs=44218)
                0.014875406 = queryNorm
              0.52506566 = fieldWeight in 444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.401051 = idf(docFreq=26, maxDocs=44218)
                0.0625 = fieldNorm(doc=444)
          0.02281453 = weight(abstract_txt:related in 444) [ClassicSimilarity], result of:
            0.02281453 = score(doc=444,freq=1.0), product of:
              0.08681567 = queryWeight, product of:
                1.3880206 = boost
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.014875406 = queryNorm
              0.26279277 = fieldWeight in 444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.0625 = fieldNorm(doc=444)
          0.097805195 = weight(abstract_txt:clips in 444) [ClassicSimilarity], result of:
            0.097805195 = score(doc=444,freq=1.0), product of:
              0.18183957 = queryWeight, product of:
                1.4204503 = boost
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.014875406 = queryNorm
              0.5378653 = fieldWeight in 444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.6058445 = idf(docFreq=21, maxDocs=44218)
                0.0625 = fieldNorm(doc=444)
          0.10009462 = weight(abstract_txt:video in 444) [ClassicSimilarity], result of:
            0.10009462 = score(doc=444,freq=2.0), product of:
              0.18466628 = queryWeight, product of:
                2.0243735 = boost
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.014875406 = queryNorm
              0.54202974 = fieldWeight in 444, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1323667 = idf(docFreq=260, maxDocs=44218)
                0.0625 = fieldNorm(doc=444)
          0.6742588 = weight(abstract_txt:news in 444) [ClassicSimilarity], result of:
            0.6742588 = score(doc=444,freq=12.0), product of:
              0.5227831 = queryWeight, product of:
                5.8995414 = boost
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.014875406 = queryNorm
              1.2897487 = fieldWeight in 444, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.0625 = fieldNorm(doc=444)
        0.32 = coord(8/25)
    
  2. Watters, C.; Wang, H.: Rating new documents for similarity (2000) 0.21
    0.20520484 = sum of:
      0.20520484 = product of:
        1.0260242 = sum of:
          0.0132595785 = weight(abstract_txt:that in 4856) [ClassicSimilarity], result of:
            0.0132595785 = score(doc=4856,freq=3.0), product of:
              0.041354895 = queryWeight, product of:
                1.1732916 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.014875406 = queryNorm
              0.320629 = fieldWeight in 4856, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.078125 = fieldNorm(doc=4856)
          0.04939491 = weight(abstract_txt:related in 4856) [ClassicSimilarity], result of:
            0.04939491 = score(doc=4856,freq=3.0), product of:
              0.08681567 = queryWeight, product of:
                1.3880206 = boost
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.014875406 = queryNorm
              0.56896305 = fieldWeight in 4856, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.078125 = fieldNorm(doc=4856)
          0.13190043 = weight(abstract_txt:event in 4856) [ClassicSimilarity], result of:
            0.13190043 = score(doc=4856,freq=1.0), product of:
              0.24099863 = queryWeight, product of:
                2.3126192 = boost
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.014875406 = queryNorm
              0.5473078 = fieldWeight in 4856, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.0055394 = idf(docFreq=108, maxDocs=44218)
                0.078125 = fieldNorm(doc=4856)
          0.23550305 = weight(abstract_txt:items in 4856) [ClassicSimilarity], result of:
            0.23550305 = score(doc=4856,freq=2.0), product of:
              0.3820775 = queryWeight, product of:
                4.604077 = boost
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.014875406 = queryNorm
              0.6163751 = fieldWeight in 4856, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.078125 = fieldNorm(doc=4856)
          0.5959662 = weight(abstract_txt:news in 4856) [ClassicSimilarity], result of:
            0.5959662 = score(doc=4856,freq=6.0), product of:
              0.5227831 = queryWeight, product of:
                5.8995414 = boost
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.014875406 = queryNorm
              1.1399876 = fieldWeight in 4856, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.078125 = fieldNorm(doc=4856)
        0.2 = coord(5/25)
    
  3. Lehmann, J.; Castillo, C.; Lalmas, M.; Baeza-Yates, R.: Story-focused reading in online news and its potential for user engagement (2017) 0.20
    0.1964441 = sum of:
      0.1964441 = product of:
        0.8185171 = sum of:
          0.034125876 = weight(abstract_txt:larger in 3529) [ClassicSimilarity], result of:
            0.034125876 = score(doc=3529,freq=1.0), product of:
              0.09012314 = queryWeight, product of:
                6.0585327 = idf(docFreq=280, maxDocs=44218)
                0.014875406 = queryNorm
              0.3786583 = fieldWeight in 3529, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.0585327 = idf(docFreq=280, maxDocs=44218)
                0.0625 = fieldNorm(doc=3529)
          0.015001501 = weight(abstract_txt:that in 3529) [ClassicSimilarity], result of:
            0.015001501 = score(doc=3529,freq=6.0), product of:
              0.041354895 = queryWeight, product of:
                1.1732916 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.014875406 = queryNorm
              0.36275032 = fieldWeight in 3529, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=3529)
          0.14712118 = weight(abstract_txt:story in 3529) [ClassicSimilarity], result of:
            0.14712118 = score(doc=3529,freq=6.0), product of:
              0.13137522 = queryWeight, product of:
                1.207365 = boost
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.014875406 = queryNorm
              1.1198548 = fieldWeight in 3529, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.314861 = idf(docFreq=79, maxDocs=44218)
                0.0625 = fieldNorm(doc=3529)
          0.06166581 = weight(abstract_txt:stories in 3529) [ClassicSimilarity], result of:
            0.06166581 = score(doc=3529,freq=1.0), product of:
              0.1337037 = queryWeight, product of:
                1.2180176 = boost
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.014875406 = queryNorm
              0.46121246 = fieldWeight in 3529, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3793993 = idf(docFreq=74, maxDocs=44218)
                0.0625 = fieldNorm(doc=3529)
          0.04562906 = weight(abstract_txt:related in 3529) [ClassicSimilarity], result of:
            0.04562906 = score(doc=3529,freq=4.0), product of:
              0.08681567 = queryWeight, product of:
                1.3880206 = boost
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.014875406 = queryNorm
              0.52558553 = fieldWeight in 3529, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.2046843 = idf(docFreq=1793, maxDocs=44218)
                0.0625 = fieldNorm(doc=3529)
          0.5149737 = weight(abstract_txt:news in 3529) [ClassicSimilarity], result of:
            0.5149737 = score(doc=3529,freq=7.0), product of:
              0.5227831 = queryWeight, product of:
                5.8995414 = boost
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.014875406 = queryNorm
              0.9850618 = fieldWeight in 3529, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.0625 = fieldNorm(doc=3529)
        0.24 = coord(6/25)
    
  4. Sela, M.; Lavie, T.; Inbar, O.; Oppenheim, I.; Meyer, J.: Personalizing news content : an experimental study (2015) 0.17
    0.17360783 = sum of:
      0.17360783 = product of:
        1.0850489 = sum of:
          0.13294376 = weight(abstract_txt:editions in 1604) [ClassicSimilarity], result of:
            0.13294376 = score(doc=1604,freq=6.0), product of:
              0.12279349 = queryWeight, product of:
                1.1672652 = boost
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.014875406 = queryNorm
              1.0826614 = fieldWeight in 1604, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.071914 = idf(docFreq=101, maxDocs=44218)
                0.0625 = fieldNorm(doc=1604)
          0.00866112 = weight(abstract_txt:that in 1604) [ClassicSimilarity], result of:
            0.00866112 = score(doc=1604,freq=2.0), product of:
              0.041354895 = queryWeight, product of:
                1.1732916 = boost
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.014875406 = queryNorm
              0.20943399 = fieldWeight in 1604, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3694751 = idf(docFreq=11241, maxDocs=44218)
                0.0625 = fieldNorm(doc=1604)
          0.2978904 = weight(abstract_txt:items in 1604) [ClassicSimilarity], result of:
            0.2978904 = score(doc=1604,freq=5.0), product of:
              0.3820775 = queryWeight, product of:
                4.604077 = boost
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.014875406 = queryNorm
              0.7796596 = fieldWeight in 1604, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.57879 = idf(docFreq=453, maxDocs=44218)
                0.0625 = fieldNorm(doc=1604)
          0.6455537 = weight(abstract_txt:news in 1604) [ClassicSimilarity], result of:
            0.6455537 = score(doc=1604,freq=11.0), product of:
              0.5227831 = queryWeight, product of:
                5.8995414 = boost
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.014875406 = queryNorm
              1.2348404 = fieldWeight in 1604, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.0625 = fieldNorm(doc=1604)
        0.16 = coord(4/25)
    
  5. O'Leary, M.: Reuters leads in current new photos (1996) 0.16
    0.16243999 = sum of:
      0.16243999 = product of:
        1.3536665 = sum of:
          0.22746898 = weight(abstract_txt:photos in 6635) [ClassicSimilarity], result of:
            0.22746898 = score(doc=6635,freq=1.0), product of:
              0.17328803 = queryWeight, product of:
                1.3866477 = boost
                8.401051 = idf(docFreq=26, maxDocs=44218)
                0.014875406 = queryNorm
              1.3126642 = fieldWeight in 6635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.401051 = idf(docFreq=26, maxDocs=44218)
                0.15625 = fieldNorm(doc=6635)
          0.43803504 = weight(abstract_txt:photo in 6635) [ClassicSimilarity], result of:
            0.43803504 = score(doc=6635,freq=1.0), product of:
              0.33793747 = queryWeight, product of:
                2.7385144 = boost
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.014875406 = queryNorm
              1.2962015 = fieldWeight in 6635, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.29569 = idf(docFreq=29, maxDocs=44218)
                0.15625 = fieldNorm(doc=6635)
          0.6881625 = weight(abstract_txt:news in 6635) [ClassicSimilarity], result of:
            0.6881625 = score(doc=6635,freq=2.0), product of:
              0.5227831 = queryWeight, product of:
                5.8995414 = boost
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.014875406 = queryNorm
              1.3163443 = fieldWeight in 6635, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.957094 = idf(docFreq=310, maxDocs=44218)
                0.15625 = fieldNorm(doc=6635)
        0.12 = coord(3/25)
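
The content scores combine a query-side weight and a field-side weight for each matching term, then scale the sum by the coordination factor. The sketch below recomputes entry 5 (document 6635) under the same assumed ClassicSimilarity definitions; the boost, docFreq, queryNorm, and fieldNorm values are copied from the tree, and the helper names are hypothetical.

```python
import math

MAX_DOCS = 44218
QUERY_NORM = 0.014875406
FIELD_NORM = 0.15625          # fieldNorm(doc=6635)

def idf(doc_freq, max_docs=MAX_DOCS):
    # 1 + ln(maxDocs / (docFreq + 1))
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def term_score(boost, doc_freq, freq, field_norm=FIELD_NORM):
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    # fieldWeight = sqrt(freq) * idf * fieldNorm
    fld_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * fld_weight

# (boost, docFreq, termFreq) for each matching term in document 6635
terms = {
    "photos": (1.3866477, 26, 1.0),
    "photo":  (2.7385144, 29, 1.0),
    "news":   (5.8995414, 310, 2.0),
}

term_scores = {t: term_score(*args) for t, args in terms.items()}
coord = len(terms) / 25       # 3 of 25 query terms matched
total_6635 = sum(term_scores.values()) * coord

print({t: round(s, 4) for t, s in term_scores.items()}, round(total_6635, 4))
```

Because coord scales with the fraction of query terms matched, a document that matches few terms strongly (as here, 3 of 25) can rank below one that matches many terms weakly (entry 1, 8 of 25).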