Document (#31972)

Author
Rui, Y.
Ortega, M.
Huang, T.S.
Mehrotra, S.
Title
Information retrieval beyond the text document
Source
Library trends. 48(1999) no.2, S.455-474
Year
1999
Abstract
With the expansion of the Internet, searching for information goes beyond the boundary of physical libraries. Millions of documents of various media types-such as text, image, video, audio, graphics, and animation-are available around the world and linked by the Internet. Unfortunately, the state of the art of search engines for media types other than text lags far behind their text counterparts. To address this situation, we have developed the Multimedia Analysis and Retrieval System (MARS). This article reports some of the progress made over the years toward exploring information retrieval beyond the text domain. In particular, the following aspects of MARS are addressed in the article: visual feature extraction, retrieval models, query reformulation techniques, efficient execution speed performance, and user interface considerations. Extensive experimental results are reported to validate the proposed approaches.
Form
Bilder

Similar documents (author)

  1. Ortega, C.D.: Conceptual and procedural grounding of documentary systems (2012) 2.34
    2.3431525 = sum of:
      2.3431525 = product of:
        4.686305 = sum of:
          4.686305 = weight(author_txt:ortega in 1143) [ClassicSimilarity], result of:
            4.686305 = score(doc=1143,freq=1.0), product of:
              0.83975697 = queryWeight, product of:
                1.2436321 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.07562489 = queryNorm
              5.5805492 = fieldWeight in 1143, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.625 = fieldNorm(doc=1143)
        0.5 = coord(1/2)
    
  2. Ortega, J.L.: ¬The presence of academic journals on Twitter and its relationship with dissemination (tweets) and research impact (citations) (2017) 2.34
    2.3431525 = sum of:
      2.3431525 = product of:
        4.686305 = sum of:
          4.686305 = weight(author_txt:ortega in 410) [ClassicSimilarity], result of:
            4.686305 = score(doc=410,freq=1.0), product of:
              0.83975697 = queryWeight, product of:
                1.2436321 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.07562489 = queryNorm
              5.5805492 = fieldWeight in 410, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.625 = fieldNorm(doc=410)
        0.5 = coord(1/2)
    
  3. Ortega, J.L.: Classification and analysis of PubPeer comments : how a web journal club is used (2022) 2.34
    2.3431525 = sum of:
      2.3431525 = product of:
        4.686305 = sum of:
          4.686305 = weight(author_txt:ortega in 1545) [ClassicSimilarity], result of:
            4.686305 = score(doc=1545,freq=1.0), product of:
              0.83975697 = queryWeight, product of:
                1.2436321 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.07562489 = queryNorm
              5.5805492 = fieldWeight in 1545, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.625 = fieldNorm(doc=1545)
        0.5 = coord(1/2)
    
  4. Ortega, C. Dotta => Cristina Dotta Ortega, C.D.: 1.99
    1.9882308 = sum of:
      1.9882308 = product of:
        3.9764616 = sum of:
          3.9764616 = weight(author_txt:ortega in 706) [ClassicSimilarity], result of:
            3.9764616 = score(doc=706,freq=2.0), product of:
              0.83975697 = queryWeight, product of:
                1.2436321 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.07562489 = queryNorm
              4.735253 = fieldWeight in 706, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.375 = fieldNorm(doc=706)
        0.5 = coord(1/2)
    
  5. Ortega, J.L.; Aguillo, I.F.: Visualization of the Nordic academic web : link analysis using social network tools (2008) 1.87
    1.8745221 = sum of:
      1.8745221 = product of:
        3.7490442 = sum of:
          3.7490442 = weight(author_txt:ortega in 3114) [ClassicSimilarity], result of:
            3.7490442 = score(doc=3114,freq=1.0), product of:
              0.83975697 = queryWeight, product of:
                1.2436321 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.07562489 = queryNorm
              4.4644394 = fieldWeight in 3114, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.5 = fieldNorm(doc=3114)
        0.5 = coord(1/2)
    

Similar documents (content)

  1. Huang, T.; Mehrotra, S.; Ramchandran, K.: Multimedia Access and Retrieval System (MARS) project (1997) 0.25
    0.2530208 = sum of:
      0.2530208 = product of:
        2.1085067 = sum of:
          0.020471146 = weight(abstract_txt:information in 758) [ClassicSimilarity], result of:
            0.020471146 = score(doc=758,freq=2.0), product of:
              0.054713096 = queryWeight, product of:
                1.1719325 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.019300602 = queryNorm
              0.37415442 = fieldWeight in 758, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.109375 = fieldNorm(doc=758)
          0.08103155 = weight(abstract_txt:retrieval in 758) [ClassicSimilarity], result of:
            0.08103155 = score(doc=758,freq=2.0), product of:
              0.15068807 = queryWeight, product of:
                2.2457724 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019300602 = queryNorm
              0.5377437 = fieldWeight in 758, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.109375 = fieldNorm(doc=758)
          2.007004 = weight(title_txt:mars in 758) [ClassicSimilarity], result of:
            2.007004 = score(doc=758,freq=1.0), product of:
              0.56311804 = queryWeight, product of:
                3.0698068 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.019300602 = queryNorm
              3.5640912 = fieldWeight in 758, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.375 = fieldNorm(doc=758)
        0.12 = coord(3/25)
    
  2. Kowalski, G.J.; Maybury, M.T.: Information storage and retrieval systems : theory and implemetation (2000) 0.14
    0.13788252 = sum of:
      0.13788252 = product of:
        0.49243754 = sum of:
          0.08649904 = weight(abstract_txt:audio in 727) [ClassicSimilarity], result of:
            0.08649904 = score(doc=727,freq=1.0), product of:
              0.13844314 = queryWeight, product of:
                1.0762968 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.019300602 = queryNorm
              0.6247983 = fieldWeight in 727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.09375 = fieldNorm(doc=727)
          0.10027288 = weight(abstract_txt:graphics in 727) [ClassicSimilarity], result of:
            0.10027288 = score(doc=727,freq=1.0), product of:
              0.15277523 = queryWeight, product of:
                1.130636 = boost
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.019300602 = queryNorm
              0.6563425 = fieldWeight in 727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.000987 = idf(docFreq=109, maxDocs=44421)
                0.09375 = fieldNorm(doc=727)
          0.017546698 = weight(abstract_txt:information in 727) [ClassicSimilarity], result of:
            0.017546698 = score(doc=727,freq=2.0), product of:
              0.054713096 = queryWeight, product of:
                1.1719325 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.019300602 = queryNorm
              0.3207038 = fieldWeight in 727, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.09375 = fieldNorm(doc=727)
          0.030296529 = weight(abstract_txt:internet in 727) [ClassicSimilarity], result of:
            0.030296529 = score(doc=727,freq=1.0), product of:
              0.08667008 = queryWeight, product of:
                1.2043312 = boost
                3.7286568 = idf(docFreq=2900, maxDocs=44421)
                0.019300602 = queryNorm
              0.34956157 = fieldWeight in 727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.7286568 = idf(docFreq=2900, maxDocs=44421)
                0.09375 = fieldNorm(doc=727)
          0.052028045 = weight(abstract_txt:types in 727) [ClassicSimilarity], result of:
            0.052028045 = score(doc=727,freq=1.0), product of:
              0.12428888 = queryWeight, product of:
                1.4422065 = boost
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.019300602 = queryNorm
              0.4186058 = fieldWeight in 727, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.09375 = fieldNorm(doc=727)
          0.069455616 = weight(abstract_txt:retrieval in 727) [ClassicSimilarity], result of:
            0.069455616 = score(doc=727,freq=2.0), product of:
              0.15068807 = queryWeight, product of:
                2.2457724 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019300602 = queryNorm
              0.46092314 = fieldWeight in 727, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=727)
          0.13633871 = weight(abstract_txt:text in 727) [ClassicSimilarity], result of:
            0.13633871 = score(doc=727,freq=2.0), product of:
              0.2544818 = queryWeight, product of:
                3.2629445 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019300602 = queryNorm
              0.5357503 = fieldWeight in 727, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=727)
        0.28 = coord(7/25)
    
  3. Next generation search engines : advanced models for information retrieval (2012) 0.12
    0.11654314 = sum of:
      0.11654314 = product of:
        0.3641973 = sum of:
          0.036041263 = weight(abstract_txt:audio in 1357) [ClassicSimilarity], result of:
            0.036041263 = score(doc=1357,freq=1.0), product of:
              0.13844314 = queryWeight, product of:
                1.0762968 = boost
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.019300602 = queryNorm
              0.2603326 = fieldWeight in 1357, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.664515 = idf(docFreq=153, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1357)
          0.04227738 = weight(abstract_txt:goes in 1357) [ClassicSimilarity], result of:
            0.04227738 = score(doc=1357,freq=1.0), product of:
              0.15398443 = queryWeight, product of:
                1.1351016 = boost
                7.028639 = idf(docFreq=106, maxDocs=44421)
                0.019300602 = queryNorm
              0.27455622 = fieldWeight in 1357, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.028639 = idf(docFreq=106, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1357)
          0.0225344 = weight(abstract_txt:information in 1357) [ClassicSimilarity], result of:
            0.0225344 = score(doc=1357,freq=19.0), product of:
              0.054713096 = queryWeight, product of:
                1.1719325 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.019300602 = queryNorm
              0.41186482 = fieldWeight in 1357, product of:
                4.358899 = tf(freq=19.0), with freq of:
                  19.0 = termFreq=19.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1357)
          0.017852401 = weight(abstract_txt:internet in 1357) [ClassicSimilarity], result of:
            0.017852401 = score(doc=1357,freq=2.0), product of:
              0.08667008 = queryWeight, product of:
                1.2043312 = boost
                3.7286568 = idf(docFreq=2900, maxDocs=44421)
                0.019300602 = queryNorm
              0.20598114 = fieldWeight in 1357, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.7286568 = idf(docFreq=2900, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1357)
          0.02167835 = weight(abstract_txt:types in 1357) [ClassicSimilarity], result of:
            0.02167835 = score(doc=1357,freq=1.0), product of:
              0.12428888 = queryWeight, product of:
                1.4422065 = boost
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.019300602 = queryNorm
              0.17441908 = fieldWeight in 1357, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.4651284 = idf(docFreq=1388, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1357)
          0.04575791 = weight(abstract_txt:retrieval in 1357) [ClassicSimilarity], result of:
            0.04575791 = score(doc=1357,freq=5.0), product of:
              0.15068807 = queryWeight, product of:
                2.2457724 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019300602 = queryNorm
              0.3036598 = fieldWeight in 1357, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1357)
          0.097717255 = weight(abstract_txt:beyond in 1357) [ClassicSimilarity], result of:
            0.097717255 = score(doc=1357,freq=2.0), product of:
              0.3081409 = queryWeight, product of:
                2.781196 = boost
                5.7404623 = idf(docFreq=387, maxDocs=44421)
                0.019300602 = queryNorm
              0.31711873 = fieldWeight in 1357, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.7404623 = idf(docFreq=387, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1357)
          0.08033835 = weight(abstract_txt:text in 1357) [ClassicSimilarity], result of:
            0.08033835 = score(doc=1357,freq=4.0), product of:
              0.2544818 = queryWeight, product of:
                3.2629445 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019300602 = queryNorm
              0.3156939 = fieldWeight in 1357, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0390625 = fieldNorm(doc=1357)
        0.32 = coord(8/25)
    
  4. Hearst, M.A.: Search user interfaces (2009) 0.12
    0.116109304 = sum of:
      0.116109304 = product of:
        0.4837888 = sum of:
          0.078574575 = weight(abstract_txt:behind in 29) [ClassicSimilarity], result of:
            0.078574575 = score(doc=29,freq=1.0), product of:
              0.129853 = queryWeight, product of:
                1.0423709 = boost
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.019300602 = queryNorm
              0.6051041 = fieldWeight in 29, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4544435 = idf(docFreq=189, maxDocs=44421)
                0.09375 = fieldNorm(doc=29)
          0.0797523 = weight(abstract_txt:considerations in 29) [ClassicSimilarity], result of:
            0.0797523 = score(doc=29,freq=1.0), product of:
              0.13114733 = queryWeight, product of:
                1.0475531 = boost
                6.4865317 = idf(docFreq=183, maxDocs=44421)
                0.019300602 = queryNorm
              0.60811234 = fieldWeight in 29, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.4865317 = idf(docFreq=183, maxDocs=44421)
                0.09375 = fieldNorm(doc=29)
          0.021490227 = weight(abstract_txt:information in 29) [ClassicSimilarity], result of:
            0.021490227 = score(doc=29,freq=3.0), product of:
              0.054713096 = queryWeight, product of:
                1.1719325 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.019300602 = queryNorm
              0.3927803 = fieldWeight in 29, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.09375 = fieldNorm(doc=29)
          0.13811 = weight(abstract_txt:reformulation in 29) [ClassicSimilarity], result of:
            0.13811 = score(doc=29,freq=1.0), product of:
              0.1891243 = queryWeight, product of:
                1.2579691 = boost
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.019300602 = queryNorm
              0.73026043 = fieldWeight in 29, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.7894444 = idf(docFreq=49, maxDocs=44421)
                0.09375 = fieldNorm(doc=29)
          0.069455616 = weight(abstract_txt:retrieval in 29) [ClassicSimilarity], result of:
            0.069455616 = score(doc=29,freq=2.0), product of:
              0.15068807 = queryWeight, product of:
                2.2457724 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019300602 = queryNorm
              0.46092314 = fieldWeight in 29, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=29)
          0.09640603 = weight(abstract_txt:text in 29) [ClassicSimilarity], result of:
            0.09640603 = score(doc=29,freq=1.0), product of:
              0.2544818 = queryWeight, product of:
                3.2629445 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019300602 = queryNorm
              0.3788327 = fieldWeight in 29, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.09375 = fieldNorm(doc=29)
        0.24 = coord(6/25)
    
  5. Barrio, P.; Gravano, L.: Sampling strategies for information extraction over the deep web (2017) 0.09
    0.092579 = sum of:
      0.092579 = product of:
        0.46289498 = sum of:
          0.12140965 = weight(abstract_txt:extraction in 4412) [ClassicSimilarity], result of:
            0.12140965 = score(doc=4412,freq=9.0), product of:
              0.11951086 = queryWeight, product of:
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.019300602 = queryNorm
              1.015888 = fieldWeight in 4412, product of:
                3.0 = tf(freq=9.0), with freq of:
                  9.0 = termFreq=9.0
                6.192079 = idf(docFreq=246, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4412)
          0.020471146 = weight(abstract_txt:information in 4412) [ClassicSimilarity], result of:
            0.020471146 = score(doc=4412,freq=8.0), product of:
              0.054713096 = queryWeight, product of:
                1.1719325 = boost
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.019300602 = queryNorm
              0.37415442 = fieldWeight in 4412, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                2.4188995 = idf(docFreq=10748, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4412)
          0.13170971 = weight(abstract_txt:execution in 4412) [ClassicSimilarity], result of:
            0.13170971 = score(doc=4412,freq=2.0), product of:
              0.20831536 = queryWeight, product of:
                1.3202524 = boost
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.019300602 = queryNorm
              0.63226116 = fieldWeight in 4412, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4412)
          0.040515777 = weight(abstract_txt:retrieval in 4412) [ClassicSimilarity], result of:
            0.040515777 = score(doc=4412,freq=2.0), product of:
              0.15068807 = queryWeight, product of:
                2.2457724 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.019300602 = queryNorm
              0.26887184 = fieldWeight in 4412, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4412)
          0.1487887 = weight(abstract_txt:text in 4412) [ClassicSimilarity], result of:
            0.1487887 = score(doc=4412,freq=7.0), product of:
              0.2544818 = queryWeight, product of:
                3.2629445 = boost
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.019300602 = queryNorm
              0.5846733 = fieldWeight in 4412, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                4.040882 = idf(docFreq=2122, maxDocs=44421)
                0.0546875 = fieldNorm(doc=4412)
        0.2 = coord(5/25)