Document (#33032)

Author
Zhang, X.
Han, H.
Title
¬An empirical testing of user stereotypes of information retrieval systems
Source
Information processing and management. 41(2005) no.3, S.651-664
Year
2005
Abstract
Stereotyping is a technique used in many information systems to represent user groups and/or to generate initial individual user models. However, there has been a lack of evidence on the accuracy of their use in representing users. We propose a formal evaluation method to test the accuracy or homogeneity of the stereotypes that are based on users' explicit characteristics. Using the method, the results of an empirical testing on 11 common user stereotypes of information retrieval (IR) systems are reported. The participants' memberships in the stereotypes were predicted using discriminant analysis, based on their IR knowledge. The actual membership and the predicted membership of each stereotype were compared. The data show that "librarians/IR professionals" is an accurate stereotype in representing its members, while some others, such as "undergraduate students" and "social sciences/humanities" users, are not accurate stereotypes. The data also demonstrate that based on the user's IR knowledge a stereotype can be made more accurate or homogeneous. The results show the promise that our method can help better detect the differences among stereotype members, and help with better stereotype design and user modeling. We assume that accurate stereotypes have better performance in user modeling and thus the system performance. Limitations and future directions of the study are discussed.

Similar documents (author)

  1. Zhang, M.; Zhang, Y.: Professional organizations in Twittersphere : an empirical study of U.S. library and information science professional organizations-related Tweets (2020) 4.53
    4.5277104 = sum of:
      4.5277104 = weight(author_txt:zhang in 775) [ClassicSimilarity], result of:
        4.5277104 = score(doc=775,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.15617312 = queryNorm
          4.527711 = fieldWeight in 775, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.5 = fieldNorm(doc=775)
    
  2. Zhang, Y.; Zhang, C.: Enhancing keyphrase extraction from microblogs using human reading time (2021) 4.53
    4.5277104 = sum of:
      4.5277104 = weight(author_txt:zhang in 1238) [ClassicSimilarity], result of:
        4.5277104 = score(doc=1238,freq=2.0), product of:
          0.99999994 = queryWeight, product of:
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.15617312 = queryNorm
          4.527711 = fieldWeight in 1238, product of:
            1.4142135 = tf(freq=2.0), with freq of:
              2.0 = termFreq=2.0
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.5 = fieldNorm(doc=1238)
    
  3. Zhang, J.: TOFIR: A tool of facilitating information retrieval : introduce a visual retrieval model (2001) 4.00
    4.0019684 = sum of:
      4.0019684 = weight(author_txt:zhang in 7710) [ClassicSimilarity], result of:
        4.0019684 = score(doc=7710,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.15617312 = queryNorm
          4.001969 = fieldWeight in 7710, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.625 = fieldNorm(doc=7710)
    
  4. Zhang, A.: Multimedia file formats on the Internet : a beginner's guide for PC users (1995) 4.00
    4.0019684 = sum of:
      4.0019684 = weight(author_txt:zhang in 3280) [ClassicSimilarity], result of:
        4.0019684 = score(doc=3280,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.15617312 = queryNorm
          4.001969 = fieldWeight in 3280, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.625 = fieldNorm(doc=3280)
    
  5. Zhang, J.: ¬A representational analysis of relational information displays (1996) 4.00
    4.0019684 = sum of:
      4.0019684 = weight(author_txt:zhang in 6471) [ClassicSimilarity], result of:
        4.0019684 = score(doc=6471,freq=1.0), product of:
          0.99999994 = queryWeight, product of:
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.15617312 = queryNorm
          4.001969 = fieldWeight in 6471, product of:
            1.0 = tf(freq=1.0), with freq of:
              1.0 = termFreq=1.0
            6.40315 = idf(docFreq=199, maxDocs=44421)
            0.625 = fieldNorm(doc=6471)
    

Similar documents (content)

  1. Shapira, B.; Shoval, P.; Hanani, U.: Stereotypes in information filtering systems (1997) 0.57
    0.57260805 = sum of:
      0.57260805 = product of:
        2.0450287 = sum of:
          0.011147719 = weight(abstract_txt:based in 1157) [ClassicSimilarity], result of:
            0.011147719 = score(doc=1157,freq=1.0), product of:
              0.03735664 = queryWeight, product of:
                1.1763102 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.009976979 = queryNorm
              0.2984133 = fieldWeight in 1157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.09375 = fieldNorm(doc=1157)
          0.027440203 = weight(abstract_txt:systems in 1157) [ClassicSimilarity], result of:
            0.027440203 = score(doc=1157,freq=4.0), product of:
              0.042902444 = queryWeight, product of:
                1.2606049 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.009976979 = queryNorm
              0.6395953 = fieldWeight in 1157, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.09375 = fieldNorm(doc=1157)
          0.031382807 = weight(abstract_txt:users in 1157) [ClassicSimilarity], result of:
            0.031382807 = score(doc=1157,freq=4.0), product of:
              0.046919316 = queryWeight, product of:
                1.3182986 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.009976979 = queryNorm
              0.6688675 = fieldWeight in 1157, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.09375 = fieldNorm(doc=1157)
          0.010776229 = weight(abstract_txt:that in 1157) [ClassicSimilarity], result of:
            0.010776229 = score(doc=1157,freq=2.0), product of:
              0.034368556 = queryWeight, product of:
                1.4566089 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.009976979 = queryNorm
              0.31354907 = fieldWeight in 1157, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=1157)
          0.03447674 = weight(abstract_txt:user in 1157) [ClassicSimilarity], result of:
            0.03447674 = score(doc=1157,freq=1.0), product of:
              0.09990899 = queryWeight, product of:
                2.7205408 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.009976979 = queryNorm
              0.34508142 = fieldWeight in 1157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.09375 = fieldNorm(doc=1157)
          0.49459153 = weight(abstract_txt:stereotype in 1157) [ClassicSimilarity], result of:
            0.49459153 = score(doc=1157,freq=1.0), product of:
              0.5550829 = queryWeight, product of:
                5.853845 = boost
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.009976979 = queryNorm
              0.8910228 = fieldWeight in 1157, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.504243 = idf(docFreq=8, maxDocs=44421)
                0.09375 = fieldNorm(doc=1157)
          1.4352136 = weight(abstract_txt:stereotypes in 1157) [ClassicSimilarity], result of:
            1.4352136 = score(doc=1157,freq=5.0), product of:
              0.7017916 = queryWeight, product of:
                7.210361 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.009976979 = queryNorm
              2.045071 = fieldWeight in 1157, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.09375 = fieldNorm(doc=1157)
        0.28 = coord(7/25)
    
  2. Mooney, G.; John, R.: Intelligent information retrieval from the World Wide Web using fuzzy user modelling (1997) 0.19
    0.18588085 = sum of:
      0.18588085 = product of:
        0.7745036 = sum of:
          0.019642947 = weight(abstract_txt:show in 2175) [ClassicSimilarity], result of:
            0.019642947 = score(doc=2175,freq=1.0), product of:
              0.047608502 = queryWeight, product of:
                1.0842628 = boost
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.009976979 = queryNorm
              0.41259325 = fieldWeight in 2175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.09375 = fieldNorm(doc=2175)
          0.022720197 = weight(abstract_txt:performance in 2175) [ClassicSimilarity], result of:
            0.022720197 = score(doc=2175,freq=1.0), product of:
              0.052459177 = queryWeight, product of:
                1.1381593 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.009976979 = queryNorm
              0.43310243 = fieldWeight in 2175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.09375 = fieldNorm(doc=2175)
          0.013720102 = weight(abstract_txt:systems in 2175) [ClassicSimilarity], result of:
            0.013720102 = score(doc=2175,freq=1.0), product of:
              0.042902444 = queryWeight, product of:
                1.2606049 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.009976979 = queryNorm
              0.31979766 = fieldWeight in 2175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.09375 = fieldNorm(doc=2175)
          0.0076199444 = weight(abstract_txt:that in 2175) [ClassicSimilarity], result of:
            0.0076199444 = score(doc=2175,freq=1.0), product of:
              0.034368556 = queryWeight, product of:
                1.4566089 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.009976979 = queryNorm
              0.22171268 = fieldWeight in 2175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.09375 = fieldNorm(doc=2175)
          0.06895348 = weight(abstract_txt:user in 2175) [ClassicSimilarity], result of:
            0.06895348 = score(doc=2175,freq=4.0), product of:
              0.09990899 = queryWeight, product of:
                2.7205408 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.009976979 = queryNorm
              0.69016284 = fieldWeight in 2175, product of:
                2.0 = tf(freq=4.0), with freq of:
                  4.0 = termFreq=4.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.09375 = fieldNorm(doc=2175)
          0.64184695 = weight(abstract_txt:stereotypes in 2175) [ClassicSimilarity], result of:
            0.64184695 = score(doc=2175,freq=1.0), product of:
              0.7017916 = queryWeight, product of:
                7.210361 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.009976979 = queryNorm
              0.91458344 = fieldWeight in 2175, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.09375 = fieldNorm(doc=2175)
        0.24 = coord(6/25)
    
  3. Singh, V.K.; Chayko, M.; Inamdar, R.; Floegel, D.: Female librarians and male computer programmers? : gender bias in occupational images on digital media platforms (2020) 0.13
    0.12792927 = sum of:
      0.12792927 = product of:
        1.0660772 = sum of:
          0.009146734 = weight(abstract_txt:systems in 1007) [ClassicSimilarity], result of:
            0.009146734 = score(doc=1007,freq=1.0), product of:
              0.042902444 = queryWeight, product of:
                1.2606049 = boost
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.009976979 = queryNorm
              0.21319844 = fieldWeight in 1007, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.411175 = idf(docFreq=3984, maxDocs=44421)
                0.0625 = fieldNorm(doc=1007)
          0.008798753 = weight(abstract_txt:that in 1007) [ClassicSimilarity], result of:
            0.008798753 = score(doc=1007,freq=3.0), product of:
              0.034368556 = queryWeight, product of:
                1.4566089 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.009976979 = queryNorm
              0.25601172 = fieldWeight in 1007, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=1007)
          1.0481317 = weight(abstract_txt:stereotypes in 1007) [ClassicSimilarity], result of:
            1.0481317 = score(doc=1007,freq=6.0), product of:
              0.7017916 = queryWeight, product of:
                7.210361 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.009976979 = queryNorm
              1.4935086 = fieldWeight in 1007, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.0625 = fieldNorm(doc=1007)
        0.12 = coord(3/25)
    
  4. Crossan, G.; Burton, P.F.: Teleworking stereotypes : a case study (1993) 0.10
    0.10004886 = sum of:
      0.10004886 = product of:
        0.83374053 = sum of:
          0.07191338 = weight(abstract_txt:homogeneous in 6690) [ClassicSimilarity], result of:
            0.07191338 = score(doc=6690,freq=1.0), product of:
              0.0809926 = queryWeight, product of:
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.009976979 = queryNorm
              0.8879006 = fieldWeight in 6690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.117949 = idf(docFreq=35, maxDocs=44421)
                0.109375 = fieldNorm(doc=6690)
          0.013005672 = weight(abstract_txt:based in 6690) [ClassicSimilarity], result of:
            0.013005672 = score(doc=6690,freq=1.0), product of:
              0.03735664 = queryWeight, product of:
                1.1763102 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.009976979 = queryNorm
              0.34814885 = fieldWeight in 6690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.109375 = fieldNorm(doc=6690)
          0.7488215 = weight(abstract_txt:stereotypes in 6690) [ClassicSimilarity], result of:
            0.7488215 = score(doc=6690,freq=1.0), product of:
              0.7017916 = queryWeight, product of:
                7.210361 = boost
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.009976979 = queryNorm
              1.0670141 = fieldWeight in 6690, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.755557 = idf(docFreq=6, maxDocs=44421)
                0.109375 = fieldNorm(doc=6690)
        0.12 = coord(3/25)
    
  5. Hong, H.; Ye, Q.: Crowd characteristics and crowd wisdom : evidence from an online investment community (2020) 0.10
    0.09551034 = sum of:
      0.09551034 = product of:
        0.23877585 = sum of:
          0.013095298 = weight(abstract_txt:show in 763) [ClassicSimilarity], result of:
            0.013095298 = score(doc=763,freq=1.0), product of:
              0.047608502 = queryWeight, product of:
                1.0842628 = boost
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.009976979 = queryNorm
              0.27506217 = fieldWeight in 763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.400995 = idf(docFreq=1480, maxDocs=44421)
                0.0625 = fieldNorm(doc=763)
          0.037101924 = weight(abstract_txt:performance in 763) [ClassicSimilarity], result of:
            0.037101924 = score(doc=763,freq=6.0), product of:
              0.052459177 = queryWeight, product of:
                1.1381593 = boost
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.009976979 = queryNorm
              0.7072533 = fieldWeight in 763, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                4.619759 = idf(docFreq=1189, maxDocs=44421)
                0.0625 = fieldNorm(doc=763)
          0.007431812 = weight(abstract_txt:based in 763) [ClassicSimilarity], result of:
            0.007431812 = score(doc=763,freq=1.0), product of:
              0.03735664 = queryWeight, product of:
                1.1763102 = boost
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.009976979 = queryNorm
              0.1989422 = fieldWeight in 763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.1830752 = idf(docFreq=5005, maxDocs=44421)
                0.0625 = fieldNorm(doc=763)
          0.024221277 = weight(abstract_txt:help in 763) [ClassicSimilarity], result of:
            0.024221277 = score(doc=763,freq=2.0), product of:
              0.056937136 = queryWeight, product of:
                1.1857418 = boost
                4.8128953 = idf(docFreq=980, maxDocs=44421)
                0.009976979 = queryNorm
              0.42540386 = fieldWeight in 763, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.8128953 = idf(docFreq=980, maxDocs=44421)
                0.0625 = fieldNorm(doc=763)
          0.021621898 = weight(abstract_txt:empirical in 763) [ClassicSimilarity], result of:
            0.021621898 = score(doc=763,freq=1.0), product of:
              0.06650742 = queryWeight, product of:
                1.2815259 = boost
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.009976979 = queryNorm
              0.32510504 = fieldWeight in 763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.2016807 = idf(docFreq=664, maxDocs=44421)
                0.0625 = fieldNorm(doc=763)
          0.0071841525 = weight(abstract_txt:that in 763) [ClassicSimilarity], result of:
            0.0071841525 = score(doc=763,freq=2.0), product of:
              0.034368556 = queryWeight, product of:
                1.4566089 = boost
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.009976979 = queryNorm
              0.20903271 = fieldWeight in 763, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                2.3649352 = idf(docFreq=11344, maxDocs=44421)
                0.0625 = fieldNorm(doc=763)
          0.03244636 = weight(abstract_txt:accuracy in 763) [ClassicSimilarity], result of:
            0.03244636 = score(doc=763,freq=1.0), product of:
              0.08717358 = queryWeight, product of:
                1.4671847 = boost
                5.9552646 = idf(docFreq=312, maxDocs=44421)
                0.009976979 = queryNorm
              0.37220404 = fieldWeight in 763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.9552646 = idf(docFreq=312, maxDocs=44421)
                0.0625 = fieldNorm(doc=763)
          0.037513908 = weight(abstract_txt:testing in 763) [ClassicSimilarity], result of:
            0.037513908 = score(doc=763,freq=1.0), product of:
              0.096029006 = queryWeight, product of:
                1.5399036 = boost
                6.250429 = idf(docFreq=232, maxDocs=44421)
                0.009976979 = queryNorm
              0.39065182 = fieldWeight in 763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.250429 = idf(docFreq=232, maxDocs=44421)
                0.0625 = fieldNorm(doc=763)
          0.035174724 = weight(abstract_txt:better in 763) [ClassicSimilarity], result of:
            0.035174724 = score(doc=763,freq=2.0), product of:
              0.08358245 = queryWeight, product of:
                1.7595252 = boost
                4.7612453 = idf(docFreq=1032, maxDocs=44421)
                0.009976979 = queryNorm
              0.4208386 = fieldWeight in 763, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                4.7612453 = idf(docFreq=1032, maxDocs=44421)
                0.0625 = fieldNorm(doc=763)
          0.022984492 = weight(abstract_txt:user in 763) [ClassicSimilarity], result of:
            0.022984492 = score(doc=763,freq=1.0), product of:
              0.09990899 = queryWeight, product of:
                2.7205408 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.009976979 = queryNorm
              0.23005427 = fieldWeight in 763, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.0625 = fieldNorm(doc=763)
        0.4 = coord(10/25)