Document (#42672)

Author
Menkov, V.
Ginsparg, P.
Kantor, P.B.
Title
Recommendations and privacy in the arXiv system : a simulation experiment using historical data
Source
Journal of the Association for Information Science and Technology. 71(2020) no.3, p.300-313
Year
2020
Abstract
Recommender systems may accelerate knowledge discovery in many fields. However, their users may be competitors guarding their ideas before publication or for other reasons. We describe a simulation experiment to assess user privacy against targeted attacks, modeling recommendations based on co-access data. The analysis uses an unusually long (14 years) set of anonymized historical data on user-item accesses. We introduce the notions of "visibility" and "discoverability." We find, based on historical data, that the majority of the actions of arXiv users would be potentially "visible" under targeted attack. However, "discoverability," which incorporates the difficulty of actually seeing a "visible" effect, is very much lower for nearly all users. We consider the effect of changes to the settings of the recommender algorithm on the visibility and discoverability of user actions and propose mitigation strategies that reduce both measures of risk.
Content
Cf.: https://asistdl.onlinelibrary.wiley.com/doi/10.1002/asi.24236.
Object
arXiv

Similar documents (author)

  1. Ginsparg, P.: Winners and losers in the global research village (1998) 2.56
    2.5601478 = sum of:
      2.5601478 = product of:
        5.1202955 = sum of:
          5.1202955 = weight(author_txt:ginsparg in 2146) [ClassicSimilarity], result of:
            5.1202955 = score(doc=2146,freq=1.0), product of:
              0.8267119 = queryWeight, product of:
                1.2121809 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.06882178 = queryNorm
              6.1935673 = fieldWeight in 2146, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.625 = fieldNorm(doc=2146)
        0.5 = coord(1/2)
    
  2. Haque, A.-u.; Ginsparg, P.: Positional effects on citation and readership in arXiv (2009) 1.79
    1.7921036 = sum of:
      1.7921036 = product of:
        3.5842073 = sum of:
          3.5842073 = weight(author_txt:ginsparg in 147) [ClassicSimilarity], result of:
            3.5842073 = score(doc=147,freq=1.0), product of:
              0.8267119 = queryWeight, product of:
                1.2121809 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.06882178 = queryNorm
              4.3354974 = fieldWeight in 147, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.4375 = fieldNorm(doc=147)
        0.5 = coord(1/2)
    
  3. Haque, A.-ul; Ginsparg, P.: Last but not least : additional positional effects on citation and readership in arXiv (2010) 1.79
    1.7921036 = sum of:
      1.7921036 = product of:
        3.5842073 = sum of:
          3.5842073 = weight(author_txt:ginsparg in 110) [ClassicSimilarity], result of:
            3.5842073 = score(doc=110,freq=1.0), product of:
              0.8267119 = queryWeight, product of:
                1.2121809 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.06882178 = queryNorm
              4.3354974 = fieldWeight in 110, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.4375 = fieldNorm(doc=110)
        0.5 = coord(1/2)
    
  4. Collins, H.M.; Reyes-Galindo, L.; Ginsparg, P.: ¬A note concerning primary source knowledge (2017) 1.54
    1.5360888 = sum of:
      1.5360888 = product of:
        3.0721776 = sum of:
          3.0721776 = weight(author_txt:ginsparg in 4592) [ClassicSimilarity], result of:
            3.0721776 = score(doc=4592,freq=1.0), product of:
              0.8267119 = queryWeight, product of:
                1.2121809 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.06882178 = queryNorm
              3.7161405 = fieldWeight in 4592, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.375 = fieldNorm(doc=4592)
        0.5 = coord(1/2)
    
  5. Kantor, P.B.: ¬The Adaptive Network Library Interface : a historical overview and interim report (1993) 1.44
    1.4373509 = sum of:
      1.4373509 = product of:
        2.8747017 = sum of:
          2.8747017 = weight(author_txt:kantor in 6975) [ClassicSimilarity], result of:
            2.8747017 = score(doc=6975,freq=1.0), product of:
              0.5626254 = queryWeight, product of:
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.06882178 = queryNorm
              5.1094418 = fieldWeight in 6975, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.175107 = idf(docFreq=33, maxDocs=44421)
                0.625 = fieldNorm(doc=6975)
        0.5 = coord(1/2)
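
The score trees in this list all follow the same arithmetic: a term score is queryWeight (boost x idf x queryNorm) times fieldWeight (tf x idf x fieldNorm), and the result is scaled by the coord factor. The following is a minimal sketch of that calculation, not the search engine's own code. The function names are hypothetical, and the formulas tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)) are assumptions chosen because they are consistent with the figures shown in the trees. The inputs are copied from the first entry (doc 2146).

```python
import math


def classic_idf(doc_freq, max_docs):
    # Assumed idf form; it matches the listed 9.909708 (docFreq=5)
    # and 8.175107 (docFreq=33) for maxDocs=44421.
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def classic_term_score(freq, doc_freq, max_docs, boost, query_norm, field_norm):
    # queryWeight = boost * idf * queryNorm
    # fieldWeight = tf * idf * fieldNorm, with tf assumed to be sqrt(freq)
    idf = classic_idf(doc_freq, max_docs)
    query_weight = boost * idf * query_norm
    field_weight = math.sqrt(freq) * idf * field_norm
    return query_weight * field_weight


# Values taken from the tree for doc 2146 (author_txt:ginsparg).
term_score = classic_term_score(
    freq=1.0,
    doc_freq=5,
    max_docs=44421,
    boost=1.2121809,
    query_norm=0.06882178,
    field_norm=0.625,
)

# coord(1/2): one of the two query clauses matched.
score = 0.5 * term_score
print(round(score, 2))  # 2.56
```

Small differences in the last digits relative to the listed values are expected, since the trees appear to be computed in single precision while Python uses double precision.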
    

Similar documents (content)

  1. Can, O.; Yilmazer, D.: ¬A privacy-aware semantic model for provenance management (2014) 0.13
    0.13346884 = sum of:
      0.13346884 = product of:
        0.55612016 = sum of:
          0.12468204 = weight(abstract_txt:accesses in 2580) [ClassicSimilarity], result of:
            0.12468204 = score(doc=2580,freq=1.0), product of:
              0.17315896 = queryWeight, product of:
                1.0964746 = boost
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.017134737 = queryNorm
              0.72004384 = fieldWeight in 2580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.216561 = idf(docFreq=11, maxDocs=44421)
                0.078125 = fieldNorm(doc=2580)
          0.023826996 = weight(abstract_txt:user in 2580) [ClassicSimilarity], result of:
            0.023826996 = score(doc=2580,freq=1.0), product of:
              0.08285695 = queryWeight, product of:
                1.3137152 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.017134737 = queryNorm
              0.28756785 = fieldWeight in 2580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.078125 = fieldNorm(doc=2580)
          0.09189282 = weight(abstract_txt:actions in 2580) [ClassicSimilarity], result of:
            0.09189282 = score(doc=2580,freq=1.0), product of:
              0.17800823 = queryWeight, product of:
                1.5722121 = boost
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.017134737 = queryNorm
              0.51622796 = fieldWeight in 2580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                6.6077175 = idf(docFreq=162, maxDocs=44421)
                0.078125 = fieldNorm(doc=2580)
          0.062248237 = weight(abstract_txt:data in 2580) [ClassicSimilarity], result of:
            0.062248237 = score(doc=2580,freq=7.0), product of:
              0.09043039 = queryWeight, product of:
                1.5847592 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.017134737 = queryNorm
              0.6883553 = fieldWeight in 2580, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=2580)
          0.17040962 = weight(abstract_txt:privacy in 2580) [ClassicSimilarity], result of:
            0.17040962 = score(doc=2580,freq=3.0), product of:
              0.18629792 = queryWeight, product of:
                1.6084039 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.017134737 = queryNorm
              0.91471565 = fieldWeight in 2580, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.078125 = fieldNorm(doc=2580)
          0.08306043 = weight(abstract_txt:historical in 2580) [ClassicSimilarity], result of:
            0.08306043 = score(doc=2580,freq=1.0), product of:
              0.19049294 = queryWeight, product of:
                1.9919398 = boost
                5.58117 = idf(docFreq=454, maxDocs=44421)
                0.017134737 = queryNorm
              0.4360289 = fieldWeight in 2580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.58117 = idf(docFreq=454, maxDocs=44421)
                0.078125 = fieldNorm(doc=2580)
        0.24 = coord(6/25)
    
  2. Ghosh, I.; Singh, V.: "Not all my friends are friends" : audience-group-based nudges for managing location privacy (2022) 0.09
    0.090705656 = sum of:
      0.090705656 = product of:
        0.37794024 = sum of:
          0.038798053 = weight(abstract_txt:users in 1562) [ClassicSimilarity], result of:
            0.038798053 = score(doc=1562,freq=5.0), product of:
              0.077822655 = queryWeight, product of:
                1.2731799 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.017134737 = queryNorm
              0.49854442 = fieldWeight in 1562, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.0625 = fieldNorm(doc=1562)
          0.040552992 = weight(abstract_txt:effect in 1562) [ClassicSimilarity], result of:
            0.040552992 = score(doc=1562,freq=1.0), product of:
              0.119731285 = queryWeight, product of:
                1.2894216 = boost
                5.419201 = idf(docFreq=534, maxDocs=44421)
                0.017134737 = queryNorm
              0.33870006 = fieldWeight in 1562, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.419201 = idf(docFreq=534, maxDocs=44421)
                0.0625 = fieldNorm(doc=1562)
          0.04826623 = weight(abstract_txt:recommendations in 1562) [ClassicSimilarity], result of:
            0.04826623 = score(doc=1562,freq=1.0), product of:
              0.13446872 = queryWeight, product of:
                1.3664751 = boost
                5.743043 = idf(docFreq=386, maxDocs=44421)
                0.017134737 = queryNorm
              0.35894018 = fieldWeight in 1562, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.743043 = idf(docFreq=386, maxDocs=44421)
                0.0625 = fieldNorm(doc=1562)
          0.018822098 = weight(abstract_txt:data in 1562) [ClassicSimilarity], result of:
            0.018822098 = score(doc=1562,freq=1.0), product of:
              0.09043039 = queryWeight, product of:
                1.5847592 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.017134737 = queryNorm
              0.20813909 = fieldWeight in 1562, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=1562)
          0.1363277 = weight(abstract_txt:privacy in 1562) [ClassicSimilarity], result of:
            0.1363277 = score(doc=1562,freq=3.0), product of:
              0.18629792 = queryWeight, product of:
                1.6084039 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.017134737 = queryNorm
              0.73177254 = fieldWeight in 1562, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.0625 = fieldNorm(doc=1562)
          0.09517318 = weight(abstract_txt:visible in 1562) [ClassicSimilarity], result of:
            0.09517318 = score(doc=1562,freq=1.0), product of:
              0.21144727 = queryWeight, product of:
                1.7135317 = boost
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.017134737 = queryNorm
              0.4501036 = fieldWeight in 1562, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.201658 = idf(docFreq=89, maxDocs=44421)
                0.0625 = fieldNorm(doc=1562)
        0.24 = coord(6/25)
    
  3. Smets, A.; Vannieuwenhuyze, J.; Ballon, P.: Serendipity in the city : user evaluations of urban recommender systems (2022) 0.08
    0.07812424 = sum of:
      0.07812424 = product of:
        0.3906212 = sum of:
          0.018928407 = weight(abstract_txt:however in 1459) [ClassicSimilarity], result of:
            0.018928407 = score(doc=1459,freq=1.0), product of:
              0.07204465 = queryWeight, product of:
                1.0002118 = boost
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.017134737 = queryNorm
              0.2627316 = fieldWeight in 1459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.0625 = fieldNorm(doc=1459)
          0.017351015 = weight(abstract_txt:users in 1459) [ClassicSimilarity], result of:
            0.017351015 = score(doc=1459,freq=1.0), product of:
              0.077822655 = queryWeight, product of:
                1.2731799 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.017134737 = queryNorm
              0.22295584 = fieldWeight in 1459, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.0625 = fieldNorm(doc=1459)
          0.026957167 = weight(abstract_txt:user in 1459) [ClassicSimilarity], result of:
            0.026957167 = score(doc=1459,freq=2.0), product of:
              0.08285695 = queryWeight, product of:
                1.3137152 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.017134737 = queryNorm
              0.32534587 = fieldWeight in 1459, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.0625 = fieldNorm(doc=1459)
          0.08359955 = weight(abstract_txt:recommendations in 1459) [ClassicSimilarity], result of:
            0.08359955 = score(doc=1459,freq=3.0), product of:
              0.13446872 = queryWeight, product of:
                1.3664751 = boost
                5.743043 = idf(docFreq=386, maxDocs=44421)
                0.017134737 = queryNorm
              0.6217026 = fieldWeight in 1459, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                5.743043 = idf(docFreq=386, maxDocs=44421)
                0.0625 = fieldNorm(doc=1459)
          0.24378507 = weight(abstract_txt:recommender in 1459) [ClassicSimilarity], result of:
            0.24378507 = score(doc=1459,freq=3.0), product of:
              0.27446693 = queryWeight, product of:
                1.9522531 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.017134737 = queryNorm
              0.8882129 = fieldWeight in 1459, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.0625 = fieldNorm(doc=1459)
        0.2 = coord(5/25)
    
  4. Huang, Z.; Chung, Z.W.; Chen, H.: ¬A graph model for e-commerce recommender systems (2004) 0.08
    0.076748304 = sum of:
      0.076748304 = product of:
        0.3837415 = sum of:
          0.023660507 = weight(abstract_txt:however in 1501) [ClassicSimilarity], result of:
            0.023660507 = score(doc=1501,freq=1.0), product of:
              0.07204465 = queryWeight, product of:
                1.0002118 = boost
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.017134737 = queryNorm
              0.3284145 = fieldWeight in 1501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.078125 = fieldNorm(doc=1501)
          0.060332783 = weight(abstract_txt:recommendations in 1501) [ClassicSimilarity], result of:
            0.060332783 = score(doc=1501,freq=1.0), product of:
              0.13446872 = queryWeight, product of:
                1.3664751 = boost
                5.743043 = idf(docFreq=386, maxDocs=44421)
                0.017134737 = queryNorm
              0.44867522 = fieldWeight in 1501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.743043 = idf(docFreq=386, maxDocs=44421)
                0.078125 = fieldNorm(doc=1501)
          0.04075104 = weight(abstract_txt:data in 1501) [ClassicSimilarity], result of:
            0.04075104 = score(doc=1501,freq=3.0), product of:
              0.09043039 = queryWeight, product of:
                1.5847592 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.017134737 = queryNorm
              0.45063436 = fieldWeight in 1501, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=1501)
          0.17593673 = weight(abstract_txt:recommender in 1501) [ClassicSimilarity], result of:
            0.17593673 = score(doc=1501,freq=1.0), product of:
              0.27446693 = queryWeight, product of:
                1.9522531 = boost
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.017134737 = queryNorm
              0.6410125 = fieldWeight in 1501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.20496 = idf(docFreq=32, maxDocs=44421)
                0.078125 = fieldNorm(doc=1501)
          0.08306043 = weight(abstract_txt:historical in 1501) [ClassicSimilarity], result of:
            0.08306043 = score(doc=1501,freq=1.0), product of:
              0.19049294 = queryWeight, product of:
                1.9919398 = boost
                5.58117 = idf(docFreq=454, maxDocs=44421)
                0.017134737 = queryNorm
              0.4360289 = fieldWeight in 1501, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                5.58117 = idf(docFreq=454, maxDocs=44421)
                0.078125 = fieldNorm(doc=1501)
        0.2 = coord(5/25)
    
  5. Vishwanath, A.; Xu, W.; Ngoh, Z.: How people protect their privacy on facebook : a cost-benefit view (2018) 0.08
    0.07531053 = sum of:
      0.07531053 = product of:
        0.37655264 = sum of:
          0.018928407 = weight(abstract_txt:however in 223) [ClassicSimilarity], result of:
            0.018928407 = score(doc=223,freq=1.0), product of:
              0.07204465 = queryWeight, product of:
                1.0002118 = boost
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.017134737 = queryNorm
              0.2627316 = fieldWeight in 223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.203706 = idf(docFreq=1803, maxDocs=44421)
                0.0625 = fieldNorm(doc=223)
          0.09069387 = weight(abstract_txt:attacks in 223) [ClassicSimilarity], result of:
            0.09069387 = score(doc=223,freq=1.0), product of:
              0.16251782 = queryWeight, product of:
                1.0622497 = boost
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.017134737 = queryNorm
              0.5580549 = fieldWeight in 223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                8.928879 = idf(docFreq=15, maxDocs=44421)
                0.0625 = fieldNorm(doc=223)
          0.017351015 = weight(abstract_txt:users in 223) [ClassicSimilarity], result of:
            0.017351015 = score(doc=223,freq=1.0), product of:
              0.077822655 = queryWeight, product of:
                1.2731799 = boost
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.017134737 = queryNorm
              0.22295584 = fieldWeight in 223, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.5672934 = idf(docFreq=3408, maxDocs=44421)
                0.0625 = fieldNorm(doc=223)
          0.026957167 = weight(abstract_txt:user in 223) [ClassicSimilarity], result of:
            0.026957167 = score(doc=223,freq=2.0), product of:
              0.08285695 = queryWeight, product of:
                1.3137152 = boost
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.017134737 = queryNorm
              0.32534587 = fieldWeight in 223, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.6808684 = idf(docFreq=3042, maxDocs=44421)
                0.0625 = fieldNorm(doc=223)
          0.2226222 = weight(abstract_txt:privacy in 223) [ClassicSimilarity], result of:
            0.2226222 = score(doc=223,freq=8.0), product of:
              0.18629792 = queryWeight, product of:
                1.6084039 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.017134737 = queryNorm
              1.1949795 = fieldWeight in 223, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.0625 = fieldNorm(doc=223)
        0.2 = coord(5/25)
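
For the content-based matches, each tree sums the scores of the matching abstract terms and then multiplies by coord(matched/total), so a document that matches only a few of the 25 query terms is scaled down accordingly. The sketch below illustrates that final step for the first entry (doc 2580), using the per-term scores listed in its tree. The helper name `combine_term_scores` is hypothetical and is not part of any library.

```python
def combine_term_scores(term_scores, total_clauses):
    # coord = matched clauses / total query clauses
    coord = len(term_scores) / total_clauses
    return coord * sum(term_scores)


# Per-term scores copied from the tree for doc 2580:
# accesses, user, actions, data, privacy, historical.
term_scores_2580 = [
    0.12468204,
    0.023826996,
    0.09189282,
    0.062248237,
    0.17040962,
    0.08306043,
]

# Six of the 25 query terms matched, i.e. coord(6/25) = 0.24.
total_2580 = combine_term_scores(term_scores_2580, total_clauses=25)
print(round(total_2580, 2))  # 0.13
```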