How science is finding different ways to improve targeting

The application of customer data science (analysing customers’ purchase behaviour) is the backbone of modern retailing. Knowing more about your customers’ needs, wants and shopping habits helps deliver a better shopping experience, leading to an increase in loyalty, and growth in sales, through mechanics such as the right range in stores, meaningful and relevant communications, optimised pricing on product lines and promotional offers.

There is much debate about what is the ‘right type of science’ to identify the right customers. Which are the best methods for determining who will buy your product, who will respond to your promotional offer? Marketing budgets are being squeezed for retailers and for brands, so it’s more important than ever to get it right with your targeting, and at dunnhumby we’re constantly looking for new ways to help retailers and brands delight shoppers and improve their performance by putting the customer at the heart of decision making.

Current targeting methods rely heavily on a rich purchase history. Take a typical promotional coupon campaign: one way to select your audience would be to identify customers who have frequently made purchases relating to the category or product on offer. While patterns of past behaviour can be a good indicator of future likelihood to purchase, they do not account for potential new shoppers who might never have purchased that brand or category before but might be motivated to trial it.

As part of my PhD, I set out to investigate if network science could be successfully used as an alternative approach to targeting that might yield better results. Network science focuses on the interaction and relationship between things. Twitter is a good example of network analysis in practice, seeing for example who are the key users in spreading a viral news story.

Applying network modelling in a retail environment allows analysis of connections between customers and products where there was a purchase. Densely connected clusters or ‘communities’ that are detected algorithmically in these networks reveal customers with similar preferences and the products that they buy the most. ‘Missing links’ within these communities then form the basis of new product recommendations. This approach differs from supervised machine learning in that there is no requirement to define any features about customers, for example you don’t have to look at what they spend in various categories. It picks up the patterns in the data in a more hands-off way, without having to define the inputs into the model.

An example of a network showing how customers and products relate to each other in ‘communities’.

Networks use transaction data directly to infer loose connections between customers and products, and these connections are the key to a new approach to targeting.

To test how this network modelling method compares to traditional targeting, we applied the approach to a promotional campaign in the yoghurt category. The control was an existing coupon campaign which used a targeting approach based on a combination of previous engagement with the product on promotion and possible engagement with competitor products. Customers were then prioritised according to how much they spend in the yoghurts category overall. We wanted to see if using a networks’ approach can better pinpoint those customers who were most likely to redeem the coupon.

The original campaign involved a promotional offer mailed to 100,000 customers. We used this data to compare two ways to rank customers in terms of how likely they are to participate in the promotion. If higher-ranked customers redeem coupons more often than lower-ranked customers, then the ranking has predictive power.

The results showed that the ranking based on the network model was more useful than category spend at picking out relevant customers to target, as it yielded a higher redemption rate. The gap between the two rankings was especially pronounced for ‘lapsed’ customers, whose purchasing of the promotional product declined in the time period leading up to the campaign. More testing is required, but the initial results are encouraging.

While there are pros and cons to using network science (one of the downsides is the amount of computing time/power required to find meaningful communities in networks of millions of customers and tens of thousands of products), developments in technology platforms are enabling data scientists to work with bigger and more varied data sets, so this may be less of an issue in the future.

In the race to understand customers’ needs, wants and purchase behaviours to create a better shopping experience, the scope for use of network science in retail presents the industry with some exciting possibilities.

Roxana Pamfil is a former PhD student at the University of Oxford, working in collaboration with dunnhumby as part of the EPSRC Centre for Doctoral Training in Industrially Focused Mathematical Modelling. In her research, she developed network algorithms for analysing consumer behaviour. She is currently a Knowledge Transfer Ambassador at dunnhumby, funded jointly by the University of Oxford and dunnhumby, implementing research findings within the business.

customer first data science analytics & machine learning services

Ready to get started?

Speak to a member of our team for more information

Cookie	Description
cli_user_preference	The cookie is set by the GDPR Cookie Consent plugin and is used to store the yes/no selection the consent given for cookie usage. It does not store any personal data.
cookielawinfo-checkbox-advertisement	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category .
cookielawinfo-checkbox-analytics	Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Analytics" category .
cookielawinfo-checkbox-necessary	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
CookieLawInfoConsent	The cookie is set by the GDPR Cookie Consent plugin and is used to store the summary of the consent given for cookie usage. It does not store any personal data.
viewed_cookie_policy	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.
wsaffinity	Set by the dunnhumby website, that allows all subsequent traffic and requests from an initial client session to be passed to the same server in the pool. Session affinity is also referred to as session persistence, server affinity, server persistence, or server sticky.

Cookie	Description
passster	Set by Passster to remember that a visitor has entered a correct password, so they don’t have to re-enter it across protected pages.
wordpress_test_cookie	WordPress cookie to read if cookies can be placed, and lasts for the session.
wp_lang	This cookie is used to remember the language chosen by the user while browsing.

Cookie	Description
fs_cid	Set by FullStory to correlate sessions for diagnostics and session consistency; not always set.
fs_lua	Set by FullStory to record the time of the user’s last activity, helping manage session timeouts.
fs_session	Set by FullStory to manage session flow and recording. Not always visible or applicable across all implementations.
fs_uid	Set by FullStory to uniquely identify a user’s browser. Used for session replay and user analytics. Does not contain personal data directly.
VISITOR_INFO1_LIVE	Set by YouTube to estimate user bandwidth and improve video quality by adjusting playback speed.
VISITOR_PRIVACY_METADATA	Set by YouTube to store privacy preferences and metadata related to user consent and settings.
vuid	Vimeo installs this cookie to collect tracking information by setting a unique ID to embed videos to the website.
YSC	Set by YouTube to track user sessions and maintain video playback state during a browser session.
_ga	The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognise unique visitors.
_ga_*	Set by Google Analytics to persist session state.
_gid	Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously.
_lfa	This cookie is set by the provider Leadfeeder to identify the IP address of devices visiting the website, in order to retarget multiple users routing from the same IP address.
__Secure-ROLLOUT_TOKEN	YouTube sets this cookie via embedded videos to manage feature rollouts.

Cookie	Description
aam_uuid	Set by LinkedIn, for ID sync for Adobe Audience Manager.
AEC	Set by Google, ‘AEC’ cookies ensure that requests within a browsing session are made by the user, and not by other sites. These cookies prevent malicious sites from acting on behalf of a user without that user’s knowledge.
AMCVS_14215E3D5995C57C0A495C55%40AdobeOrg	Set by LinkedIn, indicates the start of a session for Adobe Experience Cloud.
AMCV_14215E3D5995C57C0A495C55%40AdobeOrg	Set by LinkedIn, Unique Identifier for Adobe Experience Cloud.
AnalyticsSyncHistory	Set by LinkedIn, used to store information about the time a sync with the lms_analytics cookie took place for users in the Designated Countries (which LinkedIn determines as European Union (EU), European Economic Area (EEA), and Switzerland).
bcookie	LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognise browser ID.
bscookie	LinkedIn sets this cookie to store performed actions on the website.
DV	Set by Google, used for the purpose of targeted advertising, to collect information about how visitors use our site.
gpv_pn	Set by LinkedIn, used to retain and fetch previous page visited in Adobe Analytics.
lang	Session-based cookie, set by LinkedIn, used to set default locale/language.
lidc	Set by LinkedIn, used for routing from Share buttons and ad tags.
lidc	LinkedIn sets the lidc cookie to facilitate data center selection.
li_gc	Set by LinkedIn to store consent of guests regarding the use of cookies for non-essential purposes.
li_sugr	Set by LinkedIn, used to make a probabilistic match of a user's identity outside the Designated Countries (which LinkedIn determines as European Union (EU), European Economic Area (EEA), and Switzerland).
lms_analytics	Set by LinkedIn to identify LinkedIn Members in the Designated Countries (which LinkedIn determines as European Union (EU), European Economic Area (EEA), and Switzerland) for analytics.
lpv[AccountID]	This cookie is set by Salesforce Marketing Cloud Account Engagement. Prevents counting multiple page views within a short window to avoid duplicate tracking.
NID	Set by Google, registers a unique ID that identifies a returning user’s device. The ID is used for targeted ads.
OGP / OGPC	Set by Google, cookie enables the functionality of Google Maps.
OTZ	Set by Google, used to support Google’s advertising services. This cookie is used by Google Analytics to provide an analysis of website visitors in aggregate.
s_cc	Set by LinkedIn, used to determine if cookies are enabled for Adobe Analytics.
s_ips	Set by LinkedIn, tracks percent of page viewed.
s_plt	Set by LinkedIn, this cookie tracks the time that the previous page took to load.
s_pltp	Set by LinkedIn, this cookie provides page name value (URL) for use by Adobe Analytics.
s_ppv	Set by LinkedIn, used by Adobe Analytics to retain and fetch what percentage of a page was viewed.
s_sq	Set by LinkedIn, used to store information about the previous link that was clicked on by the user by Adobe Analytics.
s_tp	Set by LinkedIn, this cookie measures a visitor’s scroll activity to see how much of a page they view before moving on to another page.
s_tslv	Set by LinkedIn, used to retain and fetch time since last visit in Adobe Analytics.
test_cookie	Set by doubleclick.net (part of Google), the purpose of the cookie is to determine if the users' browser supports cookies.
U	Set by LinkedIn, Browser Identifier for users outside the Designated Countries (which LinkedIn determines as European Union (EU), European Economic Area (EEA), and Switzerland).
UserMatchHistory	LinkedIn sets this cookie for LinkedIn Ads ID syncing.
UserMatchHistory	This cookie is used by LinkedIn Ads to help dunnhumby measure advertising performance. More information can be found in their cookie policy.
visitor_id[AccountID]	This cookie is set by Salesforce Marketing Cloud Account Engagement. Unique visitor identifier used to recognize returning visitors and track their behavior.
visitor_id[AccountID]-hash	This cookie is set by Salesforce Marketing Cloud Account Engagement. Secure hash of the visitor ID to validate the visitor and prevent tampering.
yt-remote-connected-devices	YouTube sets this cookie to store the video preferences of the user using embedded YouTube video.
_gcl_au	Set by Google Tag Manager to store and track conversion events. It is typically associated with Google Ads, but may be set even if no active ad campaigns are running, especially when GTM is configured with default settings. The cookie helps measure the effectiveness of ad clicks in relation to site actions.

How science is finding different ways to improve targeting

An example of a network showing how customers and products relate to each other in ‘communities’.

Get in touch

The latest insights from our experts around the world