Document (#30048)

Author
Crestani, F.
Du, H.
Title
Written versus spoken queries : a qualitative and quantitative comparative analysis
Source
Journal of the American Society for Information Science and Technology. 57(2006) no.7, S.881-890
Year
2006
Abstract
The authors report on an experimental study on the differences between spoken and written queries. A set of written and spontaneous spoken queries are generated by users from written topics. These two sets of queries are compared in qualitative terms and in terms of their retrieval effectiveness. Written and spoken queries are compared in terms of length, duration, and part of speech. In addition, assuming perfect transcription of the spoken queries, written and spoken queries are compared in terms of their aptitude to describe relevant documents. The retrieval effectiveness of spoken and written queries is compared using three different information retrieval models. The results show that using speech to formulate one's information need provides a way to express it more naturally and encourages the formulation of longer queries. Despite that, longer spoken queries do not seem to significantly improve retrieval effectiveness compared with written queries.
Theme
Suchtaktik

Similar documents (author)

  1. Crestani, F.: Combination of similarity measures for effective spoken document retrieval (2003) 5.44
    5.4410844 = sum of:
      5.4410844 = weight(author_txt:crestani in 5690) [ClassicSimilarity], result of:
        5.4410844 = fieldWeight in 5690, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.705735 = idf(docFreq=19, maxDocs=44421)
          0.625 = fieldNorm(doc=5690)
    
  2. Crestani, F.; Lee, P.L.: Searching the web by constraining spreading activities (2000) 4.35
    4.3528676 = sum of:
      4.3528676 = weight(author_txt:crestani in 1394) [ClassicSimilarity], result of:
        4.3528676 = fieldWeight in 1394, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.705735 = idf(docFreq=19, maxDocs=44421)
          0.5 = fieldNorm(doc=1394)
    
  3. Tombros, T.; Crestani, F.: Users' perception of relevance of spoken documents (2000) 4.35
    4.3528676 = sum of:
      4.3528676 = weight(author_txt:crestani in 5996) [ClassicSimilarity], result of:
        4.3528676 = fieldWeight in 5996, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.705735 = idf(docFreq=19, maxDocs=44421)
          0.5 = fieldNorm(doc=5996)
    
  4. Crestani, F.; Wu, S.: Testing the cluster hypothesis in distributed information retrieval (2006) 4.35
    4.3528676 = sum of:
      4.3528676 = weight(author_txt:crestani in 1984) [ClassicSimilarity], result of:
        4.3528676 = fieldWeight in 1984, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.705735 = idf(docFreq=19, maxDocs=44421)
          0.5 = fieldNorm(doc=1984)
    
  5. Crestani, F.; Rijsbergen, C.J. van: Information retrieval by logical imaging (1995) 3.81
    3.8087592 = sum of:
      3.8087592 = weight(author_txt:crestani in 1827) [ClassicSimilarity], result of:
        3.8087592 = fieldWeight in 1827, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.705735 = idf(docFreq=19, maxDocs=44421)
          0.4375 = fieldNorm(doc=1827)
    

Similar documents (content)

  1. Sparck Jones, K.; Jones, G.J.F.; Foote, J.T.; Young, S.J.: Experiments in spoken document retrieval (1996) 0.19
    0.19048576 = sum of:
      0.19048576 = product of:
        1.190536 = sum of:
          0.09104977 = weight(abstract_txt:transcription in 2951) [ClassicSimilarity], result of:
            0.09104977 = score(doc=2951,freq=1.0), product of:
              0.109513946 = queryWeight, product of:
                1.375095 = boost
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.0089804595 = queryNorm
              0.83139884 = fieldWeight in 2951, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.868255 = idf(docFreq=16, maxDocs=44421)
                0.09375 = fieldNorm(doc=2951)
          0.1198885 = weight(abstract_txt:speech in 2951) [ClassicSimilarity], result of:
            0.1198885 = score(doc=2951,freq=2.0), product of:
              0.13156344 = queryWeight, product of:
                2.1314769 = boost
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.0089804595 = queryNorm
              0.91126 = fieldWeight in 2951, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                6.8731537 = idf(docFreq=124, maxDocs=44421)
                0.09375 = fieldNorm(doc=2951)
          0.04906097 = weight(abstract_txt:retrieval in 2951) [ClassicSimilarity], result of:
            0.04906097 = score(doc=2951,freq=5.0), product of:
              0.06731899 = queryWeight, product of:
                2.1562386 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0089804595 = queryNorm
              0.7287835 = fieldWeight in 2951, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.09375 = fieldNorm(doc=2951)
          0.93053675 = weight(abstract_txt:spoken in 2951) [ClassicSimilarity], result of:
            0.93053675 = score(doc=2951,freq=3.0), product of:
              0.71520215 = queryWeight, product of:
                9.939337 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0089804595 = queryNorm
              1.3010821 = fieldWeight in 2951, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.09375 = fieldNorm(doc=2951)
        0.16 = coord(4/25)
    
  2. Bacchin, M.; Ferro, N.; Melucci, M.: ¬A probabilistic model for stemmer generation (2005) 0.17
    0.16873878 = sum of:
      0.16873878 = product of:
        0.8436939 = sum of:
          0.0258574 = weight(abstract_txt:retrieval in 2001) [ClassicSimilarity], result of:
            0.0258574 = score(doc=2001,freq=2.0), product of:
              0.06731899 = queryWeight, product of:
                2.1562386 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0089804595 = queryNorm
              0.3841026 = fieldWeight in 2001, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=2001)
          0.043195248 = weight(abstract_txt:effectiveness in 2001) [ClassicSimilarity], result of:
            0.043195248 = score(doc=2001,freq=1.0), product of:
              0.1084931 = queryWeight, product of:
                2.3706076 = boost
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.0089804595 = queryNorm
              0.39813823 = fieldWeight in 2001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.078125 = fieldNorm(doc=2001)
          0.16804734 = weight(abstract_txt:written in 2001) [ClassicSimilarity], result of:
            0.16804734 = score(doc=2001,freq=1.0), product of:
              0.37215352 = queryWeight, product of:
                7.1697507 = boost
                5.779889 = idf(docFreq=372, maxDocs=44421)
                0.0089804595 = queryNorm
              0.45155382 = fieldWeight in 2001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.779889 = idf(docFreq=372, maxDocs=44421)
                0.078125 = fieldNorm(doc=2001)
          0.15888919 = weight(abstract_txt:queries in 2001) [ClassicSimilarity], result of:
            0.15888919 = score(doc=2001,freq=1.0), product of:
              0.39865586 = queryWeight, product of:
                8.701486 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0089804595 = queryNorm
              0.39856228 = fieldWeight in 2001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.078125 = fieldNorm(doc=2001)
          0.44770473 = weight(abstract_txt:spoken in 2001) [ClassicSimilarity], result of:
            0.44770473 = score(doc=2001,freq=1.0), product of:
              0.71520215 = queryWeight, product of:
                9.939337 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0089804595 = queryNorm
              0.6259835 = fieldWeight in 2001, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.078125 = fieldNorm(doc=2001)
        0.2 = coord(5/25)
    
  3. SARA (SGML Aware Retrieval Application) Workshop, 19th June 1994 (1994) 0.13
    0.13376684 = sum of:
      0.13376684 = product of:
        0.8360428 = sum of:
          0.008987989 = weight(abstract_txt:using in 824) [ClassicSimilarity], result of:
            0.008987989 = score(doc=824,freq=1.0), product of:
              0.033280466 = queryWeight, product of:
                1.0720319 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0089804595 = queryNorm
              0.27006802 = fieldWeight in 824, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.078125 = fieldNorm(doc=824)
          0.0258574 = weight(abstract_txt:retrieval in 824) [ClassicSimilarity], result of:
            0.0258574 = score(doc=824,freq=2.0), product of:
              0.06731899 = queryWeight, product of:
                2.1562386 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0089804595 = queryNorm
              0.3841026 = fieldWeight in 824, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.078125 = fieldNorm(doc=824)
          0.16804734 = weight(abstract_txt:written in 824) [ClassicSimilarity], result of:
            0.16804734 = score(doc=824,freq=1.0), product of:
              0.37215352 = queryWeight, product of:
                7.1697507 = boost
                5.779889 = idf(docFreq=372, maxDocs=44421)
                0.0089804595 = queryNorm
              0.45155382 = fieldWeight in 824, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.779889 = idf(docFreq=372, maxDocs=44421)
                0.078125 = fieldNorm(doc=824)
          0.6331501 = weight(abstract_txt:spoken in 824) [ClassicSimilarity], result of:
            0.6331501 = score(doc=824,freq=2.0), product of:
              0.71520215 = queryWeight, product of:
                9.939337 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0089804595 = queryNorm
              0.88527435 = fieldWeight in 824, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.078125 = fieldNorm(doc=824)
        0.16 = coord(4/25)
    
  4. Pilch, H.: Empirical linguistics (1976) 0.13
    0.12547448 = sum of:
      0.12547448 = product of:
        0.78421557 = sum of:
          0.010785587 = weight(abstract_txt:using in 7859) [ClassicSimilarity], result of:
            0.010785587 = score(doc=7859,freq=1.0), product of:
              0.033280466 = queryWeight, product of:
                1.0720319 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0089804595 = queryNorm
              0.32408163 = fieldWeight in 7859, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.09375 = fieldNorm(doc=7859)
          0.03452749 = weight(abstract_txt:terms in 7859) [ClassicSimilarity], result of:
            0.03452749 = score(doc=7859,freq=1.0), product of:
              0.09107801 = queryWeight, product of:
                2.5080419 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.0089804595 = queryNorm
              0.379098 = fieldWeight in 7859, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.09375 = fieldNorm(doc=7859)
          0.20165683 = weight(abstract_txt:written in 7859) [ClassicSimilarity], result of:
            0.20165683 = score(doc=7859,freq=1.0), product of:
              0.37215352 = queryWeight, product of:
                7.1697507 = boost
                5.779889 = idf(docFreq=372, maxDocs=44421)
                0.0089804595 = queryNorm
              0.54186463 = fieldWeight in 7859, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.779889 = idf(docFreq=372, maxDocs=44421)
                0.09375 = fieldNorm(doc=7859)
          0.5372457 = weight(abstract_txt:spoken in 7859) [ClassicSimilarity], result of:
            0.5372457 = score(doc=7859,freq=1.0), product of:
              0.71520215 = queryWeight, product of:
                9.939337 = boost
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.0089804595 = queryNorm
              0.7511802 = fieldWeight in 7859, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.0125885 = idf(docFreq=39, maxDocs=44421)
                0.09375 = fieldNorm(doc=7859)
        0.16 = coord(4/25)
    
  5. Srinivasan, P.: Query expansion and MEDLINE (1996) 0.12
    0.12015305 = sum of:
      0.12015305 = product of:
        0.6007652 = sum of:
          0.021794718 = weight(abstract_txt:using in 67) [ClassicSimilarity], result of:
            0.021794718 = score(doc=67,freq=3.0), product of:
              0.033280466 = queryWeight, product of:
                1.0720319 = boost
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.0089804595 = queryNorm
              0.65488017 = fieldWeight in 67, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.4568708 = idf(docFreq=3806, maxDocs=44421)
                0.109375 = fieldNorm(doc=67)
          0.05119504 = weight(abstract_txt:retrieval in 67) [ClassicSimilarity], result of:
            0.05119504 = score(doc=67,freq=4.0), product of:
              0.06731899 = queryWeight, product of:
                2.1562386 = boost
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.0089804595 = queryNorm
              0.7604844 = fieldWeight in 67, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.4765 = idf(docFreq=3732, maxDocs=44421)
                0.109375 = fieldNorm(doc=67)
          0.08552223 = weight(abstract_txt:effectiveness in 67) [ClassicSimilarity], result of:
            0.08552223 = score(doc=67,freq=2.0), product of:
              0.1084931 = queryWeight, product of:
                2.3706076 = boost
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.0089804595 = queryNorm
              0.78827345 = fieldWeight in 67, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                5.0961695 = idf(docFreq=738, maxDocs=44421)
                0.109375 = fieldNorm(doc=67)
          0.056967452 = weight(abstract_txt:terms in 67) [ClassicSimilarity], result of:
            0.056967452 = score(doc=67,freq=2.0), product of:
              0.09107801 = queryWeight, product of:
                2.5080419 = boost
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.0089804595 = queryNorm
              0.62547976 = fieldWeight in 67, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.043712 = idf(docFreq=2116, maxDocs=44421)
                0.109375 = fieldNorm(doc=67)
          0.3852858 = weight(abstract_txt:queries in 67) [ClassicSimilarity], result of:
            0.3852858 = score(doc=67,freq=3.0), product of:
              0.39865586 = queryWeight, product of:
                8.701486 = boost
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.0089804595 = queryNorm
              0.96646214 = fieldWeight in 67, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.1015973 = idf(docFreq=734, maxDocs=44421)
                0.109375 = fieldNorm(doc=67)
        0.2 = coord(5/25)