Document (#17765)

Author
Srihari, R.K.
Title
Using speech input for image interpretation, annotation, and retrieval
Source
Digital image access and retrieval: Proceedings of the 1996 Clinic on Library Applications of Data Processing, 24-26 Mar 1996. Ed.: P.B. Heidorn u. B. Sandore
Imprint
Urbana-Champaign, IL : Illinois University at Urbana-Champaign, Department of Library and Information Science
Year
1997
Pages
S.140-156
Abstract
Explores the interaction of textual and photographic information in an integrated text and image database environment and describes 3 different applications involving the exploitation of linguistic context in vision. Describes the practical application of these ideas in working systems. PICTION uses captions to identify human faces in a photograph, wile Show&Tell is a multimedia system for semi automatic image annotation. The system combines advances in speech recognition, natural language processing and image understanding to assist in image annotation and enhance image retrieval capabilities. Presents an extension of this work to video annotation and retrieval
Theme
Sprachretrieval
Form
Bilder
Object
PICTION
Show&Tell

Similar documents (content)

  1. Chen, J.; Wang, D.; Xie, I.; Lu, Q.: Image annotation tactics : transitions, strategies and efficiency (2018) 0.20
    0.19801249 = sum of:
      0.19801249 = product of:
        1.237578 = sum of:
          0.045668084 = weight(abstract_txt:interpretation in 46) [ClassicSimilarity], result of:
            0.045668084 = score(doc=46,freq=2.0), product of:
              0.099047035 = queryWeight, product of:
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.016613962 = queryNorm
              0.46107474 = fieldWeight in 46, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.0546875 = fieldNorm(doc=46)
          0.03549154 = weight(abstract_txt:involving in 46) [ClassicSimilarity], result of:
            0.03549154 = score(doc=46,freq=1.0), product of:
              0.105485514 = queryWeight, product of:
                1.0319904 = boost
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.016613962 = queryNorm
              0.33645892 = fieldWeight in 46, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1523914 = idf(docFreq=256, maxDocs=44421)
                0.0546875 = fieldNorm(doc=46)
          0.73035127 = weight(abstract_txt:annotation in 46) [ClassicSimilarity], result of:
            0.73035127 = score(doc=46,freq=12.0), product of:
              0.54923356 = queryWeight, product of:
                4.7096405 = boost
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.016613962 = queryNorm
              1.3297645 = fieldWeight in 46, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.0546875 = fieldNorm(doc=46)
          0.4260671 = weight(abstract_txt:image in 46) [ClassicSimilarity], result of:
            0.4260671 = score(doc=46,freq=9.0), product of:
              0.4831306 = queryWeight, product of:
                5.409874 = boost
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.016613962 = queryNorm
              0.8818881 = fieldWeight in 46, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.0546875 = fieldNorm(doc=46)
        0.16 = coord(4/25)
    
  2. ISLIP Media introduces a system for searching digital video and audio libraries (1998) 0.19
    0.18631881 = sum of:
      0.18631881 = product of:
        0.7763284 = sum of:
          0.0796229 = weight(abstract_txt:recognition in 2513) [ClassicSimilarity], result of:
            0.0796229 = score(doc=2513,freq=1.0), product of:
              0.10418063 = queryWeight, product of:
                1.0255876 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.016613962 = queryNorm
              0.7642774 = fieldWeight in 2513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.125 = fieldNorm(doc=2513)
          0.080065146 = weight(abstract_txt:video in 2513) [ClassicSimilarity], result of:
            0.080065146 = score(doc=2513,freq=1.0), product of:
              0.10456604 = queryWeight, product of:
                1.0274829 = boost
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.016613962 = queryNorm
              0.7656898 = fieldWeight in 2513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.125 = fieldNorm(doc=2513)
          0.026730502 = weight(abstract_txt:system in 2513) [ClassicSimilarity], result of:
            0.026730502 = score(doc=2513,freq=1.0), product of:
              0.06340299 = queryWeight, product of:
                1.1314858 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.016613962 = queryNorm
              0.42159688 = fieldWeight in 2513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.125 = fieldNorm(doc=2513)
          0.039076574 = weight(abstract_txt:describes in 2513) [ClassicSimilarity], result of:
            0.039076574 = score(doc=2513,freq=1.0), product of:
              0.081667505 = queryWeight, product of:
                1.2841593 = boost
                3.82787 = idf(docFreq=2626, maxDocs=44421)
                0.016613962 = queryNorm
              0.47848374 = fieldWeight in 2513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.82787 = idf(docFreq=2626, maxDocs=44421)
                0.125 = fieldNorm(doc=2513)
          0.22621067 = weight(abstract_txt:speech in 2513) [ClassicSimilarity], result of:
            0.22621067 = score(doc=2513,freq=1.0), product of:
              0.26329768 = queryWeight, product of:
                2.3057795 = boost
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.016613962 = queryNorm
              0.8591442 = fieldWeight in 2513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.125 = fieldNorm(doc=2513)
          0.32462257 = weight(abstract_txt:image in 2513) [ClassicSimilarity], result of:
            0.32462257 = score(doc=2513,freq=1.0), product of:
              0.4831306 = queryWeight, product of:
                5.409874 = boost
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.016613962 = queryNorm
              0.67191476 = fieldWeight in 2513, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.125 = fieldNorm(doc=2513)
        0.24 = coord(6/25)
    
  3. Starostenko, O.; Rodríguez-Asomoza, J.; Sénchez-López, S.E.; Chévez-Aragón, J.A.: Shape indexing and retrieval : a hybrid approach using ontological description (2008) 0.18
    0.17656359 = sum of:
      0.17656359 = product of:
        0.73568165 = sum of:
          0.046131734 = weight(abstract_txt:interpretation in 318) [ClassicSimilarity], result of:
            0.046131734 = score(doc=318,freq=1.0), product of:
              0.099047035 = queryWeight, product of:
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.016613962 = queryNorm
              0.46575582 = fieldWeight in 318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9616747 = idf(docFreq=310, maxDocs=44421)
                0.078125 = fieldNorm(doc=318)
          0.06534591 = weight(abstract_txt:textual in 318) [ClassicSimilarity], result of:
            0.06534591 = score(doc=318,freq=2.0), product of:
              0.099154085 = queryWeight, product of:
                1.0005403 = boost
                5.9648952 = idf(docFreq=309, maxDocs=44421)
                0.016613962 = queryNorm
              0.659034 = fieldWeight in 318, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.9648952 = idf(docFreq=309, maxDocs=44421)
                0.078125 = fieldNorm(doc=318)
          0.06298863 = weight(abstract_txt:combines in 318) [ClassicSimilarity], result of:
            0.06298863 = score(doc=318,freq=1.0), product of:
              0.12190357 = queryWeight, product of:
                1.1093982 = boost
                6.613871 = idf(docFreq=161, maxDocs=44421)
                0.016613962 = queryNorm
              0.5167087 = fieldWeight in 318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.613871 = idf(docFreq=161, maxDocs=44421)
                0.078125 = fieldNorm(doc=318)
          0.016706565 = weight(abstract_txt:system in 318) [ClassicSimilarity], result of:
            0.016706565 = score(doc=318,freq=1.0), product of:
              0.06340299 = queryWeight, product of:
                1.1314858 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.016613962 = queryNorm
              0.26349807 = fieldWeight in 318, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.078125 = fieldNorm(doc=318)
          0.047533914 = weight(abstract_txt:retrieval in 318) [ClassicSimilarity], result of:
            0.047533914 = score(doc=318,freq=3.0), product of:
              0.10104404 = queryWeight, product of:
                1.7494247 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.016613962 = queryNorm
              0.4704277 = fieldWeight in 318, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=318)
          0.49697486 = weight(abstract_txt:image in 318) [ClassicSimilarity], result of:
            0.49697486 = score(doc=318,freq=6.0), product of:
              0.4831306 = queryWeight, product of:
                5.409874 = boost
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.016613962 = queryNorm
              1.0286553 = fieldWeight in 318, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.078125 = fieldNorm(doc=318)
        0.24 = coord(6/25)
    
  4. Broadhurst, R.N.: Caere PageKeeper (1993) 0.17
    0.16932353 = sum of:
      0.16932353 = product of:
        1.0582721 = sum of:
          0.07947693 = weight(abstract_txt:input in 6303) [ClassicSimilarity], result of:
            0.07947693 = score(doc=6303,freq=1.0), product of:
              0.10405326 = queryWeight, product of:
                1.0249604 = boost
                6.110481 = idf(docFreq=267, maxDocs=44421)
                0.016613962 = queryNorm
              0.7638101 = fieldWeight in 6303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.110481 = idf(docFreq=267, maxDocs=44421)
                0.125 = fieldNorm(doc=6303)
          0.037802637 = weight(abstract_txt:system in 6303) [ClassicSimilarity], result of:
            0.037802637 = score(doc=6303,freq=2.0), product of:
              0.06340299 = queryWeight, product of:
                1.1314858 = boost
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.016613962 = queryNorm
              0.596228 = fieldWeight in 6303, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.372775 = idf(docFreq=4140, maxDocs=44421)
                0.125 = fieldNorm(doc=6303)
          0.48190686 = weight(abstract_txt:annotation in 6303) [ClassicSimilarity], result of:
            0.48190686 = score(doc=6303,freq=1.0), product of:
              0.54923356 = queryWeight, product of:
                4.7096405 = boost
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.016613962 = queryNorm
              0.877417 = fieldWeight in 6303, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.019336 = idf(docFreq=107, maxDocs=44421)
                0.125 = fieldNorm(doc=6303)
          0.45908564 = weight(abstract_txt:image in 6303) [ClassicSimilarity], result of:
            0.45908564 = score(doc=6303,freq=2.0), product of:
              0.4831306 = queryWeight, product of:
                5.409874 = boost
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.016613962 = queryNorm
              0.95023096 = fieldWeight in 6303, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.125 = fieldNorm(doc=6303)
        0.16 = coord(4/25)
    
  5. Wittbrock, M.J.; Hauptmann, A.G.: Speech recognition for a digital video library (1998) 0.17
    0.16520163 = sum of:
      0.16520163 = product of:
        0.6883402 = sum of:
          0.0796229 = weight(abstract_txt:recognition in 1873) [ClassicSimilarity], result of:
            0.0796229 = score(doc=1873,freq=4.0), product of:
              0.10418063 = queryWeight, product of:
                1.0255876 = boost
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.016613962 = queryNorm
              0.7642774 = fieldWeight in 1873, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.114219 = idf(docFreq=266, maxDocs=44421)
                0.0625 = fieldNorm(doc=1873)
          0.08951556 = weight(abstract_txt:video in 1873) [ClassicSimilarity], result of:
            0.08951556 = score(doc=1873,freq=5.0), product of:
              0.10456604 = queryWeight, product of:
                1.0274829 = boost
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.016613962 = queryNorm
              0.85606724 = fieldWeight in 1873, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.1255183 = idf(docFreq=263, maxDocs=44421)
                0.0625 = fieldNorm(doc=1873)
          0.019538287 = weight(abstract_txt:describes in 1873) [ClassicSimilarity], result of:
            0.019538287 = score(doc=1873,freq=1.0), product of:
              0.081667505 = queryWeight, product of:
                1.2841593 = boost
                3.82787 = idf(docFreq=2626, maxDocs=44421)
                0.016613962 = queryNorm
              0.23924187 = fieldWeight in 1873, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.82787 = idf(docFreq=2626, maxDocs=44421)
                0.0625 = fieldNorm(doc=1873)
          0.04390995 = weight(abstract_txt:retrieval in 1873) [ClassicSimilarity], result of:
            0.04390995 = score(doc=1873,freq=4.0), product of:
              0.10104404 = queryWeight, product of:
                1.7494247 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.016613962 = queryNorm
              0.4345625 = fieldWeight in 1873, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0625 = fieldNorm(doc=1873)
          0.22621067 = weight(abstract_txt:speech in 1873) [ClassicSimilarity], result of:
            0.22621067 = score(doc=1873,freq=4.0), product of:
              0.26329768 = queryWeight, product of:
                2.3057795 = boost
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.016613962 = queryNorm
              0.8591442 = fieldWeight in 1873, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.0625 = fieldNorm(doc=1873)
          0.22954282 = weight(abstract_txt:image in 1873) [ClassicSimilarity], result of:
            0.22954282 = score(doc=1873,freq=2.0), product of:
              0.4831306 = queryWeight, product of:
                5.409874 = boost
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.016613962 = queryNorm
              0.47511548 = fieldWeight in 1873, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.375318 = idf(docFreq=558, maxDocs=44421)
                0.0625 = fieldNorm(doc=1873)
        0.24 = coord(6/25)