Document (#39494)

Author
Sánchez, D.
Batet, M.
Title
C-sanitized : a privacy model for document redaction and sanitization
Source
Journal of the Association for Information Science and Technology. 67(2016) no.1, p.148-163
Year
2016
Abstract
Vast amounts of information are daily exchanged and/or released. The sensitive nature of much of this information creates a serious privacy threat when documents are uncontrollably made available to untrusted third parties. In such cases, appropriate data protection measures should be undertaken by the responsible organization, especially under the umbrella of current legislation on data privacy. To do so, human experts are usually requested to redact or sanitize document contents. To relieve this burdensome task, this paper presents a privacy model for document redaction/sanitization, which offers several advantages over other models available in the literature. Based on the well-established foundations of data semantics and information theory, our model provides a framework to develop and implement automated and inherently semantic redaction/sanitization tools. Moreover, contrary to ad-hoc redaction methods, our proposal provides a priori privacy guarantees which can be intuitively defined according to current legislations on data privacy. Empirical tests performed within the context of several use cases illustrate the applicability of our model and its ability to mimic the reasoning of human sanitizers.
Content
Cf.: http://onlinelibrary.wiley.com/doi/10.1002/asi.23363/abstract.

Similar documents (author)

  1. Sánchez, M.F.: Semantically enhanced Information Retrieval : an ontology-based approach (2006) 5.07
    5.073718 = sum of:
      5.073718 = weight(author_txt:sánchez in 327) [ClassicSimilarity], result of:
        5.073718 = fieldWeight in 327, product of:
          1.0 = tf(freq=1.0), with freq of:
            1.0 = termFreq=1.0
          8.117949 = idf(docFreq=35, maxDocs=44421)
          0.625 = fieldNorm(doc=327)
    
  2. Sánchez, R. Rodriguez- -> Rodriguez-Sánchez, R.: 4.31
    4.305192 = sum of:
      4.305192 = weight(author_txt:sánchez in 4567) [ClassicSimilarity], result of:
        4.305192 = fieldWeight in 4567, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.117949 = idf(docFreq=35, maxDocs=44421)
          0.375 = fieldNorm(doc=4567)
    
  3. Sánchez, R. Rodríguez -> Rodríguez-Sánchez, R.: 4.31
    4.305192 = sum of:
      4.305192 = weight(author_txt:sánchez in 1501) [ClassicSimilarity], result of:
        4.305192 = fieldWeight in 1501, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.117949 = idf(docFreq=35, maxDocs=44421)
          0.375 = fieldNorm(doc=1501)
    
  4. Casabón, A.I. Sánchez- => Sánchez-Casabón, A.I.: 4.31
    4.305192 = sum of:
      4.305192 = weight(author_txt:sánchez in 787) [ClassicSimilarity], result of:
        4.305192 = fieldWeight in 787, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.117949 = idf(docFreq=35, maxDocs=44421)
          0.375 = fieldNorm(doc=787)
    
  5. Sánchez, J.A. Pastor => Pastor Sánchez, J.A.: 4.31
    4.305192 = sum of:
      4.305192 = weight(author_txt:sánchez in 791) [ClassicSimilarity], result of:
        4.305192 = fieldWeight in 791, product of:
          1.4142135 = tf(freq=2.0), with freq of:
            2.0 = termFreq=2.0
          8.117949 = idf(docFreq=35, maxDocs=44421)
          0.375 = fieldNorm(doc=791)
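
The author-match scores above can be reproduced by hand. The sketch below is only an illustration, assuming the classic TF-IDF definitions that the listed figures are consistent with: tf = sqrt(freq) and idf = 1 + ln(maxDocs / (docFreq + 1)). The function names are hypothetical, and fieldNorm is copied from the listing rather than derived.

```python
import math

def classic_tf(freq: float) -> float:
    # Term-frequency factor: square root of the raw term frequency.
    return math.sqrt(freq)

def classic_idf(doc_freq: int, max_docs: int) -> float:
    # Inverse document frequency, matching idf(docFreq=..., maxDocs=...) in the listing.
    return 1.0 + math.log(max_docs / (doc_freq + 1))

def field_weight(freq: float, doc_freq: int, max_docs: int, field_norm: float) -> float:
    # fieldWeight = tf * idf * fieldNorm (fieldNorm taken from the listing as-is).
    return classic_tf(freq) * classic_idf(doc_freq, max_docs) * field_norm

# Hit 1: termFreq=1.0, docFreq=35, maxDocs=44421, fieldNorm=0.625
hit1 = field_weight(1.0, 35, 44421, 0.625)
# Hits 2 to 5: termFreq=2.0, same idf, fieldNorm=0.375
hit2 = field_weight(2.0, 35, 44421, 0.375)

print(round(hit1, 2), round(hit2, 2))  # 5.07 4.31
```

The listing shows single-precision values, so a double-precision recomputation may differ in the last printed digits.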
    

Similar documents (content)

  1. Wu, Z.; Xie, J.; Pan, J.; Su, X.: ¬An effective approach for the protection of user privacy in a digital library (2019) 0.18
    0.18317962 = sum of:
      0.18317962 = product of:
        1.1448727 = sum of:
          0.02299261 = weight(abstract_txt:provides in 782) [ClassicSimilarity], result of:
            0.02299261 = score(doc=782,freq=1.0), product of:
              0.08735178 = queryWeight, product of:
                1.1527265 = boost
                4.211497 = idf(docFreq=1789, maxDocs=44421)
                0.017993225 = queryNorm
              0.26321855 = fieldWeight in 782, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                4.211497 = idf(docFreq=1789, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.25941318 = weight(abstract_txt:untrusted in 782) [ClassicSimilarity], result of:
            0.25941318 = score(doc=782,freq=3.0), product of:
              0.24181907 = queryWeight, product of:
                1.3561904 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.017993225 = queryNorm
              1.0727574 = fieldWeight in 782, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.055693615 = weight(abstract_txt:data in 782) [ClassicSimilarity], result of:
            0.055693615 = score(doc=782,freq=6.0), product of:
              0.109238595 = queryWeight, product of:
                1.8230284 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.017993225 = queryNorm
              0.5098346 = fieldWeight in 782, product of:
                2.4494898 = tf(freq=6.0), with freq of:
                  6.0 = termFreq=6.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
          0.80677325 = weight(abstract_txt:privacy in 782) [ClassicSimilarity], result of:
            0.80677325 = score(doc=782,freq=8.0), product of:
              0.6751356 = queryWeight, product of:
                5.5506845 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.017993225 = queryNorm
              1.1949795 = fieldWeight in 782, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.0625 = fieldNorm(doc=782)
        0.16 = coord(4/25)
    
  2. Yao, M.Z.; Rice, R.E.; Wallis, K.: Predicting user concerns about online privacy (2007) 0.16
    0.16314153 = sum of:
      0.16314153 = product of:
        1.3595128 = sum of:
          0.0401934 = weight(abstract_txt:data in 1205) [ClassicSimilarity], result of:
            0.0401934 = score(doc=1205,freq=2.0), product of:
              0.109238595 = queryWeight, product of:
                1.8230284 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.017993225 = queryNorm
              0.3679414 = fieldWeight in 1205, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=1205)
          0.08420523 = weight(abstract_txt:model in 1205) [ClassicSimilarity], result of:
            0.08420523 = score(doc=1205,freq=3.0), product of:
              0.15624347 = queryWeight, product of:
                2.1802502 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.017993225 = queryNorm
              0.538936 = fieldWeight in 1205, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.078125 = fieldNorm(doc=1205)
          1.2351142 = weight(abstract_txt:privacy in 1205) [ClassicSimilarity], result of:
            1.2351142 = score(doc=1205,freq=12.0), product of:
              0.6751356 = queryWeight, product of:
                5.5506845 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.017993225 = queryNorm
              1.8294313 = fieldWeight in 1205, product of:
                3.4641016 = tf(freq=12.0), with freq of:
                  12.0 = termFreq=12.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.078125 = fieldNorm(doc=1205)
        0.12 = coord(3/25)
    
  3. Wu, Z.; Li, R.; Zhou, Z.; Guo, J.; Jiang, J.; Su, X.: ¬A user sensitive subject protection approach for book search service (2020) 0.15
    0.14840002 = sum of:
      0.14840002 = product of:
        0.9275002 = sum of:
          0.084915 = weight(abstract_txt:sensitive in 617) [ClassicSimilarity], result of:
            0.084915 = score(doc=617,freq=2.0), product of:
              0.13147682 = queryWeight, product of:
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.017993225 = queryNorm
              0.64585525 = fieldWeight in 617, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.0625 = fieldNorm(doc=617)
          0.14977227 = weight(abstract_txt:untrusted in 617) [ClassicSimilarity], result of:
            0.14977227 = score(doc=617,freq=1.0), product of:
              0.24181907 = queryWeight, product of:
                1.3561904 = boost
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.017993225 = queryNorm
              0.61935675 = fieldWeight in 617, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                9.909708 = idf(docFreq=5, maxDocs=44421)
                0.0625 = fieldNorm(doc=617)
          0.05500263 = weight(abstract_txt:model in 617) [ClassicSimilarity], result of:
            0.05500263 = score(doc=617,freq=2.0), product of:
              0.15624347 = queryWeight, product of:
                2.1802502 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.017993225 = queryNorm
              0.35203153 = fieldWeight in 617, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.0625 = fieldNorm(doc=617)
          0.6378103 = weight(abstract_txt:privacy in 617) [ClassicSimilarity], result of:
            0.6378103 = score(doc=617,freq=5.0), product of:
              0.6751356 = queryWeight, product of:
                5.5506845 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.017993225 = queryNorm
              0.9447143 = fieldWeight in 617, product of:
                2.236068 = tf(freq=5.0), with freq of:
                  5.0 = termFreq=5.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.0625 = fieldNorm(doc=617)
        0.16 = coord(4/25)
    
  4. Can, O.; Yilmazer, D.: ¬A privacy-aware semantic model for provenance management (2014) 0.13
    0.13384965 = sum of:
      0.13384965 = product of:
        0.8365603 = sum of:
          0.075054966 = weight(abstract_txt:sensitive in 2580) [ClassicSimilarity], result of:
            0.075054966 = score(doc=2580,freq=1.0), product of:
              0.13147682 = queryWeight, product of:
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.017993225 = queryNorm
              0.5708608 = fieldWeight in 2580, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                7.3070183 = idf(docFreq=80, maxDocs=44421)
                0.078125 = fieldNorm(doc=2580)
          0.07519497 = weight(abstract_txt:data in 2580) [ClassicSimilarity], result of:
            0.07519497 = score(doc=2580,freq=7.0), product of:
              0.109238595 = queryWeight, product of:
                1.8230284 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.017993225 = queryNorm
              0.6883553 = fieldWeight in 2580, product of:
                2.6457512 = tf(freq=7.0), with freq of:
                  7.0 = termFreq=7.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=2580)
          0.06875329 = weight(abstract_txt:model in 2580) [ClassicSimilarity], result of:
            0.06875329 = score(doc=2580,freq=2.0), product of:
              0.15624347 = queryWeight, product of:
                2.1802502 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.017993225 = queryNorm
              0.4400394 = fieldWeight in 2580, product of:
                1.4142135 = tf(freq=2.0), with freq of:
                  2.0 = termFreq=2.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.078125 = fieldNorm(doc=2580)
          0.6175571 = weight(abstract_txt:privacy in 2580) [ClassicSimilarity], result of:
            0.6175571 = score(doc=2580,freq=3.0), product of:
              0.6751356 = queryWeight, product of:
                5.5506845 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.017993225 = queryNorm
              0.91471565 = fieldWeight in 2580, product of:
                1.7320508 = tf(freq=3.0), with freq of:
                  3.0 = termFreq=3.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.078125 = fieldNorm(doc=2580)
        0.16 = coord(4/25)
    
  5. Chen, H.; Beaudoin, C.E.; Hong, H.: Teen online information disclosure : empirical testing of a protection motivation and social capital model (2016) 0.13
    0.13026042 = sum of:
      0.13026042 = product of:
        1.0855036 = sum of:
          0.028421026 = weight(abstract_txt:data in 4203) [ClassicSimilarity], result of:
            0.028421026 = score(doc=4203,freq=1.0), product of:
              0.109238595 = queryWeight, product of:
                1.8230284 = boost
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.017993225 = queryNorm
              0.26017386 = fieldWeight in 4203, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.3302255 = idf(docFreq=4320, maxDocs=44421)
                0.078125 = fieldNorm(doc=4203)
          0.048615914 = weight(abstract_txt:model in 4203) [ClassicSimilarity], result of:
            0.048615914 = score(doc=4203,freq=1.0), product of:
              0.15624347 = queryWeight, product of:
                2.1802502 = boost
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.017993225 = queryNorm
              0.31115484 = fieldWeight in 4203, product of:
                1.0 = tf(freq=1.0), with freq of:
                  1.0 = termFreq=1.0
                3.9827821 = idf(docFreq=2249, maxDocs=44421)
                0.078125 = fieldNorm(doc=4203)
          1.0084666 = weight(abstract_txt:privacy in 4203) [ClassicSimilarity], result of:
            1.0084666 = score(doc=4203,freq=8.0), product of:
              0.6751356 = queryWeight, product of:
                5.5506845 = boost
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.017993225 = queryNorm
              1.4937245 = fieldWeight in 4203, product of:
                2.828427 = tf(freq=8.0), with freq of:
                  8.0 = termFreq=8.0
                6.759825 = idf(docFreq=139, maxDocs=44421)
                0.078125 = fieldNorm(doc=4203)
        0.12 = coord(3/25)
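
The content-match scores combine several query terms. As a hedged sketch under the same assumptions as the listing suggests (tf = sqrt(freq), idf = 1 + ln(maxDocs / (docFreq + 1))), each term contributes queryWeight * fieldWeight, and the sum is scaled by coord (matched terms / query terms). The helper names are hypothetical; boost, queryNorm and fieldNorm are copied from the listing, not derived.

```python
import math

MAX_DOCS = 44421
QUERY_NORM = 0.017993225  # copied from the listing

def idf(doc_freq: int) -> float:
    return 1.0 + math.log(MAX_DOCS / (doc_freq + 1))

def term_score(freq: float, doc_freq: int, boost: float, field_norm: float) -> float:
    # queryWeight = boost * idf * queryNorm
    query_weight = boost * idf(doc_freq) * QUERY_NORM
    # fieldWeight = tf * idf * fieldNorm
    field_weight = math.sqrt(freq) * idf(doc_freq) * field_norm
    return query_weight * field_weight

def hit_score(terms, field_norm: float, query_terms: int = 25) -> float:
    # Sum the per-term scores, then apply coord = matched / query_terms.
    total = sum(term_score(f, df, b, field_norm) for f, df, b in terms)
    return total * (len(terms) / query_terms)

# First content hit (doc=782): (freq, docFreq, boost) per matched term.
terms_782 = [
    (1.0, 1789, 1.1527265),  # provides
    (3.0, 5, 1.3561904),     # untrusted
    (6.0, 4320, 1.8230284),  # data
    (8.0, 139, 5.5506845),   # privacy
]
score_782 = hit_score(terms_782, field_norm=0.0625)
print(round(score_782, 2))  # 0.18
```

Terms listed without a boost line (for example "sensitive" in hits 3 and 4) behave as if the boost were 1.0 in this sketch.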