Abstract
Social media platforms, such as Facebook and TikTok, have triggered debates on privacy. The recent transformation of social media into an increasingly centralized service, exemplified by TikTok, only exacerbates the matter. While aggregation has been deemed an effective way to combat privacy infringement, a high degree of centralization can make aggregation ineffective. We present a randomized article-push algorithm and a message-passing reconstruction algorithm that enable social media platforms to infer user preferences from only the publicly available aggregate data of article-reads, without storing any individual users’ actions. Its O(n) complexity allows the reconstruction algorithm to scale to a large population, as is typical of social media platforms. Moreover, the feasibility of the privacy attack depends on the algorithm using as few articles as possible. We determine the minimum number of articles needed for high probability inference. Given the proportion of users, 0<ϵ<1, who prefer a given topic, we design a push algorithm and a reconstruction algorithm that achieve an article-to-user ratio β=ϵ(1−ϵ), at which phase transition occurs. The analysis of the algorithm departs from the classic density evolution due to the lack of monotonicity in the per-iteration error probability, which makes it surprising that phase transition takes place. By formulating the inference problem as a compressed sensing problem, we show that our phase transition threshold ϵ(1−ϵ) is extremely close to that of compressed sensing, even when the latter algorithm is of a worst-case O(n3) complexity and uses a dense Gaussian measurement matrix.
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.