Cohort-Based Identity in Digital Advertising
Advertisers shift to cohort-based identity to protect privacy while personalizing ads.
― 4 min read
Table of Contents
In the world of online Advertising, reaching the right audience is crucial. As digital advertising continues to grow, companies must balance user Privacy with the need for personalized ads. To address this challenge, advertisers are moving from tracking individual users to grouping users into Cohorts. This approach allows advertisers to target groups of users with shared interests without revealing the identities of those users.
What Are User Cohorts?
A cohort is a group of users who share similar traits or behaviors. For example, a cohort may include users interested in technology or outdoor activities. Advertising campaigns can then target these cohorts rather than individual users. This method helps protect user privacy while still allowing advertisers to deliver relevant messages.
The Shift to Cohort-Based Identity
Recent changes in privacy regulations have prompted companies to rethink how they track and target users. For instance, Apple introduced new features requiring app developers to obtain consent before tracking users across apps. Similarly, Google is implementing privacy measures that restrict how advertisers access user data.
These developments have led to a growing interest in cohort-based identity. By grouping users into cohorts, advertisers can still engage with their audiences without compromising individual privacy. To create effective cohorts, advertisers need an efficient way to group users based on their characteristics.
The Challenge of Building Cohorts
Building user cohorts comes with its own set of challenges. Two primary goals must be met: maintaining user privacy and ensuring the cohorts are large enough to protect individual identities. The concept of K-anonymity is essential here. A cohort is considered K-anonymous if it contains at least K users, making it impossible to identify any single user within the group.
When creating cohorts, advertisers face the challenge of using traditional clustering methods, which may not be suitable for this purpose. Common clustering algorithms can be inefficient and may not ensure that cohorts are large enough to meet K-anonymity. For this reason, there is a need for new methods to build cohorts effectively.
Introducing a New Cohort-Building Method
To meet these challenges, a new method called consecutive consistent weighted sampling (CCWS) has been proposed. This method combines clustering techniques and hashing strategies to create cohorts that meet the K-anonymity requirement while also being efficient and scalable.
How CCWS Works
CCWS uses a process that begins by placing all users into a single initial cohort. As the algorithm runs, it continuously splits the cohort based on user similarities. This hierarchical approach allows for better control over cohort sizes and ensures that each group remains K-anonymous.
The method also employs hashing strategies to group users based on their characteristics. These hash values allow for quick comparisons and clustering, making the entire process faster and more efficient.
Evaluating the Effectiveness of CCWS
The effectiveness of the CCWS method has been tested using a large dataset from a leading professional networking platform. The results showed significant improvements in identifying relevant user cohorts compared to traditional methods. For instance, CCWS was able to increase the accuracy of matching users to advertising campaigns, meaning more people received ads relevant to their interests.
Comparison with Other Methods
Several methods were compared to evaluate CCWS, including random grouping and other hashing algorithms. While random grouping simply assigns users to cohorts without any consideration for similarities, the other hashing techniques also struggled to maintain user privacy and K-anonymity.
CCWS outperformed these methods, demonstrating a marked increase in the accuracy of ad targeting. This success highlights the potential of CCWS as a valuable tool for advertisers looking to meet both user privacy requirements and personalized advertising goals.
The Benefits of Using CCWS
Adopting CCWS for cohort building provides several benefits:
User Privacy: By grouping users into cohorts, CCWS helps safeguard individual user identities while still allowing advertisers to understand users' interests.
Increased Accuracy: The method leads to better ad targeting, meaning users receive more relevant advertisements.
Scalability: CCWS is designed to handle large datasets, making it suitable for companies with millions of users.
Flexibility: The method can be tailored to specific advertising needs, allowing for the consideration of important user features.
Conclusion
The shift towards cohort-based identity is transforming how digital advertising operates. By using methods like CCWS, advertisers can balance the necessity for personalized ads with the imperative of protecting user privacy. This approach not only meets current demands but also sets the stage for future advancements in advertising technology.
As the landscape of digital advertising continues to evolve, methods that prioritize user privacy while ensuring effective targeting will become increasingly critical. CCWS represents a significant step forward in this regard, providing a practical solution for advertisers navigating the complexities of privacy regulations.
Title: Building K-Anonymous User Cohorts with\\ Consecutive Consistent Weighted Sampling (CCWS)
Abstract: To retrieve personalized campaigns and creatives while protecting user privacy, digital advertising is shifting from member-based identity to cohort-based identity. Under such identity regime, an accurate and efficient cohort building algorithm is desired to group users with similar characteristics. In this paper, we propose a scalable $K$-anonymous cohort building algorithm called {\em consecutive consistent weighted sampling} (CCWS). The proposed method combines the spirit of the ($p$-powered) consistent weighted sampling and hierarchical clustering, so that the $K$-anonymity is ensured by enforcing a lower bound on the size of cohorts. Evaluations on a LinkedIn dataset consisting of $>70$M users and ads campaigns demonstrate that CCWS achieves substantial improvements over several hashing-based methods including sign random projections (SignRP), minwise hashing (MinHash), as well as the vanilla CWS.
Authors: Xinyi Zheng, Weijie Zhao, Xiaoyun Li, Ping Li
Last Update: 2023-04-26 00:00:00
Language: English
Source URL: https://arxiv.org/abs/2304.13677
Source PDF: https://arxiv.org/pdf/2304.13677
Licence: https://creativecommons.org/licenses/by/4.0/
Changes: This summary was created with assistance from AI and may have inaccuracies. For accurate information, please refer to the original source documents linked here.
Thank you to arxiv for use of its open access interoperability.