Document (#19550)

Author
Carrick, C.
Watters, C.
Title
Automatic association of news items
Source
Information processing and management. 33(1997) no.5, S.615-632
Year
1997
Abstract
Examines the problem of the association of related times of different media type, specifically photos and stories involved in the automatic generation of electronic editions. Determines to what degree any 2 news items refer to the same news event. This metric can be used: to link multimedia items that can be shown together, such as a video, photo, and text story related to a shipwreck or state visit; and to form clusters of very similar items from a variety of sources so that 1 or 2 can be chosen to represent that event in an edition. Discusses the specific assocoation of text and photo news items, although the approach applies to a larger domain of news including scripted news video clips and sripted radio broadcasts
Footnote
Contribution to a special issue devoted to electronic newspapers
Theme
Elektronisches Publizieren
Form
Zeitungen

Similar documents (author)

  1. Watters, C.: Extending the multimedia class hierarchy for hypermedia applications (1996) 5.62
    5.620886 = sum of:
      5.620886 = weight(author_txt:watters in 605) [ClassicSimilarity], result of:
        5.620886 = fieldWeight in 605, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.993418 = idf(docFreq=14, maxDocs=44421)
          0.625 = fieldNorm(doc=605)
    
  2. Watters, C.: Information retrieval and the virtual document (1999) 5.62
    5.620886 = sum of:
      5.620886 = weight(author_txt:watters in 5319) [ClassicSimilarity], result of:
        5.620886 = fieldWeight in 5319, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.993418 = idf(docFreq=14, maxDocs=44421)
          0.625 = fieldNorm(doc=5319)
    
  3. Watters, C.; Shepherd, M.A.: Shifting the information paradigm from data-centered to user-centered (1994) 4.50
    4.496709 = sum of:
      4.496709 = weight(author_txt:watters in 7289) [ClassicSimilarity], result of:
        4.496709 = fieldWeight in 7289, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.993418 = idf(docFreq=14, maxDocs=44421)
          0.5 = fieldNorm(doc=7289)
    
  4. Watters, C.; Wang, H.: Rating new documents for similarity (2000) 4.50
    4.496709 = sum of:
      4.496709 = weight(author_txt:watters in 5856) [ClassicSimilarity], result of:
        4.496709 = fieldWeight in 5856, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.993418 = idf(docFreq=14, maxDocs=44421)
          0.5 = fieldNorm(doc=5856)
    
  5. Watters, C.; Amoudi, A.: Geosearcher : location-based ranking of search engine results (2003) 4.50
    4.496709 = sum of:
      4.496709 = weight(author_txt:watters in 152) [ClassicSimilarity], result of:
        4.496709 = fieldWeight in 152, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.993418 = idf(docFreq=14, maxDocs=44421)
          0.5 = fieldNorm(doc=152)
    

Similar documents (content)

  1. Watters, C.R.; Shepherd, M.A.; Burkowski, F.J.: Electronic news delivery project (1998) 0.35
    0.34947845 = sum of:
      0.34947845 = product of:
        1.0921202 = sum of:
          0.0060908976 = weight(abstract_txt:that in 1444) [ClassicSimilarity], result of:
            0.0060908976 = score(doc=1444,freq=1.0), product of:
              0.041208047 = queryWeight, product of:
                1.1708449 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.014882072 = queryNorm
              0.14780845 = fieldWeight in 1444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
          0.07199549 = weight(abstract_txt:radio in 1444) [ClassicSimilarity], result of:
            0.07199549 = score(doc=1444,freq=1.0), product of:
              0.14826009 = queryWeight, product of:
                1.282212 = boost
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.014882072 = queryNorm
              0.48560262 = fieldWeight in 1444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.769642 = idf(docFreq=50, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
          0.02864678 = weight(abstract_txt:text in 1444) [ClassicSimilarity], result of:
            0.02864678 = score(doc=1444,freq=2.0), product of:
              0.08020559 = queryWeight, product of:
                1.3337212 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.014882072 = queryNorm
              0.3571669 = fieldWeight in 1444, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
          0.08998371 = weight(abstract_txt:photos in 1444) [ClassicSimilarity], result of:
            0.08998371 = score(doc=1444,freq=1.0), product of:
              0.17202702 = queryWeight, product of:
                1.3811666 = boost
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.014882072 = queryNorm
              0.5230789 = fieldWeight in 1444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.369263 = idf(docFreq=27, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
          0.022715105 = weight(abstract_txt:related in 1444) [ClassicSimilarity], result of:
            0.022715105 = score(doc=1444,freq=1.0), product of:
              0.08657129 = queryWeight, product of:
                1.3856376 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.014882072 = queryNorm
              0.2623861 = fieldWeight in 1444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
          0.0979887 = weight(abstract_txt:clips in 1444) [ClassicSimilarity], result of:
            0.0979887 = score(doc=1444,freq=1.0), product of:
              0.18208385 = queryWeight, product of:
                1.4209652 = boost
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.014882072 = queryNorm
              0.53815156 = fieldWeight in 1444, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.610425 = idf(docFreq=21, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
          0.09978743 = weight(abstract_txt:video in 1444) [ClassicSimilarity], result of:
            0.09978743 = score(doc=1444,freq=2.0), product of:
              0.18430535 = queryWeight, product of:
                2.0217698 = boost
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.014882072 = queryNorm
              0.54142445 = fieldWeight in 1444, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
          0.67491204 = weight(abstract_txt:news in 1444) [ClassicSimilarity], result of:
            0.67491204 = score(doc=1444,freq=12.0), product of:
              0.5231692 = queryWeight, product of:
                5.899897 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.014882072 = queryNorm
              1.2900454 = fieldWeight in 1444, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.0625 = fieldNorm(doc=1444)
        0.32 = coord(8/25)
    
  2. Watters, C.; Wang, H.: Rating new documents for similarity (2000) 0.21
    0.2051345 = sum of:
      0.2051345 = product of:
        1.0256724 = sum of:
          0.013187179 = weight(abstract_txt:that in 5856) [ClassicSimilarity], result of:
            0.013187179 = score(doc=5856,freq=3.0), product of:
              0.041208047 = queryWeight, product of:
                1.1708449 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.014882072 = queryNorm
              0.32001466 = fieldWeight in 5856, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=5856)
          0.049179643 = weight(abstract_txt:related in 5856) [ClassicSimilarity], result of:
            0.049179643 = score(doc=5856,freq=3.0), product of:
              0.08657129 = queryWeight, product of:
                1.3856376 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.014882072 = queryNorm
              0.5680826 = fieldWeight in 5856, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.078125 = fieldNorm(doc=5856)
          0.13117011 = weight(abstract_txt:event in 5856) [ClassicSimilarity], result of:
            0.13117011 = score(doc=5856,freq=1.0), product of:
              0.2401305 = queryWeight, product of:
                2.3077374 = boost
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.014882072 = queryNorm
              0.5462451 = fieldWeight in 5856, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.9919376 = idf(docFreq=110, maxDocs=44421)
                0.078125 = fieldNorm(doc=5856)
          0.23559195 = weight(abstract_txt:items in 5856) [ClassicSimilarity], result of:
            0.23559195 = score(doc=5856,freq=2.0), product of:
              0.38220912 = queryWeight, product of:
                4.6034484 = boost
                5.5789747 = idf(docFreq=455, maxDocs=44421)
                0.014882072 = queryNorm
              0.6163954 = fieldWeight in 5856, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5789747 = idf(docFreq=455, maxDocs=44421)
                0.078125 = fieldNorm(doc=5856)
          0.5965436 = weight(abstract_txt:news in 5856) [ClassicSimilarity], result of:
            0.5965436 = score(doc=5856,freq=6.0), product of:
              0.5231692 = queryWeight, product of:
                5.899897 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.014882072 = queryNorm
              1.1402498 = fieldWeight in 5856, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.078125 = fieldNorm(doc=5856)
        0.2 = coord(5/25)
    
  3. Lehmann, J.; Castillo, C.; Lalmas, M.; Baeza-Yates, R.: Story-focused reading in online news and its potential for user engagement (2017) 0.20
    0.19635129 = sum of:
      0.19635129 = product of:
        0.8181304 = sum of:
          0.03415275 = weight(abstract_txt:larger in 4529) [ClassicSimilarity], result of:
            0.03415275 = score(doc=4529,freq=1.0), product of:
              0.09017882 = queryWeight, product of:
                6.059561 = idf(docFreq=281, maxDocs=44421)
                0.014882072 = queryNorm
              0.37872255 = fieldWeight in 4529, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.059561 = idf(docFreq=281, maxDocs=44421)
                0.0625 = fieldNorm(doc=4529)
          0.014919592 = weight(abstract_txt:that in 4529) [ClassicSimilarity], result of:
            0.014919592 = score(doc=4529,freq=6.0), product of:
              0.041208047 = queryWeight, product of:
                1.1708449 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.014882072 = queryNorm
              0.3620553 = fieldWeight in 4529, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=4529)
          0.14668931 = weight(abstract_txt:story in 4529) [ClassicSimilarity], result of:
            0.14668931 = score(doc=4529,freq=6.0), product of:
              0.13113017 = queryWeight, product of:
                1.205866 = boost
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.014882072 = queryNorm
              1.1186541 = fieldWeight in 4529, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.0625 = fieldNorm(doc=4529)
          0.061465938 = weight(abstract_txt:stories in 4529) [ClassicSimilarity], result of:
            0.061465938 = score(doc=4529,freq=1.0), product of:
              0.13342701 = queryWeight, product of:
                1.216381 = boost
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.014882072 = queryNorm
              0.4606709 = fieldWeight in 4529, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.370734 = idf(docFreq=75, maxDocs=44421)
                0.0625 = fieldNorm(doc=4529)
          0.04543021 = weight(abstract_txt:related in 4529) [ClassicSimilarity], result of:
            0.04543021 = score(doc=4529,freq=4.0), product of:
              0.08657129 = queryWeight, product of:
                1.3856376 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.014882072 = queryNorm
              0.5247722 = fieldWeight in 4529, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.0625 = fieldNorm(doc=4529)
          0.5154726 = weight(abstract_txt:news in 4529) [ClassicSimilarity], result of:
            0.5154726 = score(doc=4529,freq=7.0), product of:
              0.5231692 = queryWeight, product of:
                5.899897 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.014882072 = queryNorm
              0.98528844 = fieldWeight in 4529, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.0625 = fieldNorm(doc=4529)
        0.24 = coord(6/25)
    
  4. Sela, M.; Lavie, T.; Inbar, O.; Oppenheim, I.; Meyer, J.: Personalizing news content : an experimental study (2015) 0.17
    0.17376561 = sum of:
      0.17376561 = product of:
        1.0860351 = sum of:
          0.13323933 = weight(abstract_txt:editions in 2604) [ClassicSimilarity], result of:
            0.13323933 = score(doc=2604,freq=6.0), product of:
              0.12298683 = queryWeight, product of:
                1.1678231 = boost
                7.0764947 = idf(docFreq=101, maxDocs=44421)
                0.014882072 = queryNorm
              1.0833626 = fieldWeight in 2604, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                7.0764947 = idf(docFreq=101, maxDocs=44421)
                0.0625 = fieldNorm(doc=2604)
          0.00861383 = weight(abstract_txt:that in 2604) [ClassicSimilarity], result of:
            0.00861383 = score(doc=2604,freq=2.0), product of:
              0.041208047 = queryWeight, product of:
                1.1708449 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.014882072 = queryNorm
              0.20903271 = fieldWeight in 2604, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=2604)
          0.2980029 = weight(abstract_txt:items in 2604) [ClassicSimilarity], result of:
            0.2980029 = score(doc=2604,freq=5.0), product of:
              0.38220912 = queryWeight, product of:
                4.6034484 = boost
                5.5789747 = idf(docFreq=455, maxDocs=44421)
                0.014882072 = queryNorm
              0.77968544 = fieldWeight in 2604, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                5.5789747 = idf(docFreq=455, maxDocs=44421)
                0.0625 = fieldNorm(doc=2604)
          0.6461791 = weight(abstract_txt:news in 2604) [ClassicSimilarity], result of:
            0.6461791 = score(doc=2604,freq=11.0), product of:
              0.5231692 = queryWeight, product of:
                5.899897 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.014882072 = queryNorm
              1.2351245 = fieldWeight in 2604, product of:
                3.3166249 = tf(freq=11.0), with freq of:
                  11.0 = termFreq=11.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.0625 = fieldNorm(doc=2604)
        0.16 = coord(4/25)
    
  5. Montalvo, S.; Martínez, R.; Fresno, V.; Delgado, A.: Exploiting named entities for bilingual news clustering (2015) 0.16
    0.16236968 = sum of:
      0.16236968 = product of:
        0.8118484 = sum of:
          0.053173095 = weight(abstract_txt:clusters in 2642) [ClassicSimilarity], result of:
            0.053173095 = score(doc=2642,freq=1.0), product of:
              0.10439396 = queryWeight, product of:
                1.0759335 = boost
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.014882072 = queryNorm
              0.5093503 = fieldWeight in 2642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.519684 = idf(docFreq=177, maxDocs=44421)
                0.078125 = fieldNorm(doc=2642)
          0.0076136217 = weight(abstract_txt:that in 2642) [ClassicSimilarity], result of:
            0.0076136217 = score(doc=2642,freq=1.0), product of:
              0.041208047 = queryWeight, product of:
                1.1708449 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.014882072 = queryNorm
              0.18476056 = fieldWeight in 2642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.078125 = fieldNorm(doc=2642)
          0.02839388 = weight(abstract_txt:related in 2642) [ClassicSimilarity], result of:
            0.02839388 = score(doc=2642,freq=1.0), product of:
              0.08657129 = queryWeight, product of:
                1.3856376 = boost
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.014882072 = queryNorm
              0.32798263 = fieldWeight in 2642, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.198178 = idf(docFreq=1813, maxDocs=44421)
                0.078125 = fieldNorm(doc=2642)
          0.23559195 = weight(abstract_txt:items in 2642) [ClassicSimilarity], result of:
            0.23559195 = score(doc=2642,freq=2.0), product of:
              0.38220912 = queryWeight, product of:
                4.6034484 = boost
                5.5789747 = idf(docFreq=455, maxDocs=44421)
                0.014882072 = queryNorm
              0.6163954 = fieldWeight in 2642, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.5789747 = idf(docFreq=455, maxDocs=44421)
                0.078125 = fieldNorm(doc=2642)
          0.48707584 = weight(abstract_txt:news in 2642) [ClassicSimilarity], result of:
            0.48707584 = score(doc=2642,freq=4.0), product of:
              0.5231692 = queryWeight, product of:
                5.899897 = boost
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.014882072 = queryNorm
              0.9310101 = fieldWeight in 2642, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                5.9584646 = idf(docFreq=311, maxDocs=44421)
                0.078125 = fieldNorm(doc=2642)
        0.2 = coord(5/25)