As discussed on this blog before, one of the wonderful things about data science is that it never stands still. No matter how much progress we might make, there’s always something new coming down the line that can help us do more for customers.
With that in mind, and with a new year upon us, we asked dunnhumby’s Chief Data Science Officer Sandra Stanley for her thoughts on what 2023 could hold from a data science perspective. Here’s what she had to say.
“Every day, all around the world, data science is changing the way that we do things for the better. From risk evaluation in financial services to self-driving cars in automotive, personalised care to price optimisation, data science is being used across industries to deliver better outcomes, and create smarter, more customer-centric products and services.
“While the data science industry isn’t subject to the same kind of moment-to-moment shifts that something like media and advertising is, we do still see clear overarching trends when it comes to progress and innovation. And, right now, there are three themes in particular that I believe will continue to change the face of data science in 2023 and beyond.
Artificial Intelligence (AI) is a term we use to classify machines that mimic human intelligence. Amongst other things, AI can be used to predict, automate, and optimise tasks that have historically been carried out by humans, things like decision making, speech and facial recognition, and translation. Augmented Intelligence is based on the principle of using AI to enhance human thinking.
Decision making is of particular relevance here. Over the past decade, we’ve gone from providing “descriptive” insights that can support decisions, to “predictive” insights that actually influence those choices. The next stage in that evolution is “prescriptive” insights that automate the decision making process, with data science working hard to calculate an optimal solution to a problem.
Naturally, there are challenges here. Without due care, data science can actually amplify any bias that exists in a dataset; one academic computer vision algorithm recently determined that all “cooking” images must feature women, for instance. And anyone with an interest in driverless cars will already know about the ethical issue of whether to put driver or pedestrian safety first.
All of this is driving us to think long and hard about how we can get value out of Augmented Intelligence without those negative consequences. Transparency, accountability, and fairness are all key here – ensuring that we understand how data is being used, what the potential results may be, and whether anyone will be disadvantaged as a result.
The events of 2022 have made us all stop and think. In the face of extreme weather, forest fires, and a global pandemic, how can we help our planet? The results of our Shopper Thoughts panel from September confirms how important those issues now are to consumers, with 43% saying that they’re conscious about the environment, and 28% wanting to buy sustainable products but believing they’re too expensive.
Data science has a sizeable role to play here. Whether by combining data from different sources in order to understand some of the psychological aspects of consumer decision making, or codifying sustainability by creating a single source of truth for product information, data science innovations give us the ability to better support customers with their sustainable choices.
There are any number of possibilities for what that might look like in practice. From sustainability-based customer segmentations through to algorithms that nudge shoppers towards environmentally-friendly or meat-free substitutions, there is a huge amount of potential here for quick wins that benefit both customers and planet in equal measure.
From terabytes to petabytes and now exabytes: the amount of data that we produce on a daily basis continues to grow. Volume isn’t the only factor at play here, though; so too are things like variety and velocity, with data now being generated in a greater number of formats, and near-real time analytical capabilities beginning to emerge.
While some might look at this influx of information as a problem, the advance of technology actually turns it into an opportunity. An innovation like Deep Learning for instance – a practice that trains computers to learn by example like a human brain – has profound implications for personalisation, because it can take vast datasets and find patterns that would previously have been much harder to identify.
Other developments in this space hold similar promise. Foundation models, which are incredibly adept at working with images and text, could revolutionise product substitutions, taking us beyond the use of EPOS data to make relevant recommendations. From a UI perspective, these models might also make it possible for us query data vocally rather than having to manually select different parameters.
One emerging technology that I’m particularly excited about is Quantum Computing. In collaboration with Durham University, we’re exploring how this emerging technology could be applied to retail issues like optimisation. Optimisation has implications for everything from delivery routes to promotional strategies, and Quantum Computing could prove to be genuinely game changing here.
No matter how far we go in terms of technology, however, data will continue to be the foundation of everything that we do. While much of the industry discussion over the last few years has focused on the need for more sophisticated models to deliver greater value from data, that dialogue is quickly shifting towards the need for higher quality data to realise the gains we all want.
So long as we can maintain the right balance here – bigger data can also lead to increased costs, slower results, and privacy concerns, after all – the potential is abundantly clear.”
A look at dunnhumby’s unique Customer Data Science, which is at the core of everything we do.
Data Science solutionsCookie | Description |
---|---|
cli_user_preference | The cookie is set by the GDPR Cookie Consent plugin and is used to store the yes/no selection the consent given for cookie usage. It does not store any personal data. |
cookielawinfo-checkbox-advertisement | Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Advertisement" category . |
cookielawinfo-checkbox-analytics | Set by the GDPR Cookie Consent plugin, this cookie is used to record the user consent for the cookies in the "Analytics" category . |
cookielawinfo-checkbox-necessary | This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary". |
CookieLawInfoConsent | The cookie is set by the GDPR Cookie Consent plugin and is used to store the summary of the consent given for cookie usage. It does not store any personal data. |
viewed_cookie_policy | The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data. |
wsaffinity | Set by the dunnhumby website, that allows all subsequent traffic and requests from an initial client session to be passed to the same server in the pool. Session affinity is also referred to as session persistence, server affinity, server persistence, or server sticky. |
Cookie | Description |
---|---|
wordpress_test_cookie | WordPress cookie to read if cookies can be placed, and lasts for the session. |
wp_lang | This cookie is used to remember the language chosen by the user while browsing. |
Cookie | Description |
---|---|
CONSENT | YouTube sets this cookie via embedded youtube-videos and registers anonymous statistical data. |
vuid | Vimeo installs this cookie to collect tracking information by setting a unique ID to embed videos to the website. |
_ga | The _ga cookie, installed by Google Analytics, calculates visitor, session and campaign data and also keeps track of site usage for the site's analytics report. The cookie stores information anonymously and assigns a randomly generated number to recognise unique visitors. |
_gat_gtag_UA_* | This cookie is installed by Google Analytics to store the website's unique user ID. |
_ga_* | Set by Google Analytics to persist session state. |
_gid | Installed by Google Analytics, _gid cookie stores information on how visitors use a website, while also creating an analytics report of the website's performance. Some of the data that are collected include the number of visitors, their source, and the pages they visit anonymously. |
_hjSessionUser_{site_id} | This cookie is set by the provider Hotjar to store a unique user ID for session tracking and analytics purposes. |
_hjSession_{site_id} | This cookie is set by the provider Hotjar to store a unique session ID, enabling session recording and behavior analysis. |
_hp2_id_* | This cookie is set by the provider Hotjar to store a unique visitor identifier for tracking user behavior and session information. |
_hp2_props.* | This cookie is set by the provider Hotjar to store user properties and session information for behavior analysis and insights. |
_hp2_ses_props.* | This cookie is set by the provider Hotjar to store session-specific properties and data for tracking user behavior during a session. |
_lfa | This cookie is set by the provider Leadfeeder to identify the IP address of devices visiting the website, in order to retarget multiple users routing from the same IP address. |
Cookie | Description |
---|---|
aam_uuid | Set by LinkedIn, for ID sync for Adobe Audience Manager. |
AEC | Set by Google, ‘AEC’ cookies ensure that requests within a browsing session are made by the user, and not by other sites. These cookies prevent malicious sites from acting on behalf of a user without that user’s knowledge. |
AMCVS_14215E3D5995C57C0A495C55%40AdobeOrg | Set by LinkedIn, indicates the start of a session for Adobe Experience Cloud. |
AMCV_14215E3D5995C57C0A495C55%40AdobeOrg | Set by LinkedIn, Unique Identifier for Adobe Experience Cloud. |
AnalyticsSyncHistory | Set by LinkedIn, used to store information about the time a sync with the lms_analytics cookie took place for users in the Designated Countries (which LinkedIn determines as European Union (EU), European Economic Area (EEA), and Switzerland). |
bcookie | LinkedIn sets this cookie from LinkedIn share buttons and ad tags to recognise browser ID. |
bscookie | LinkedIn sets this cookie to store performed actions on the website. |
DV | Set by Google, used for the purpose of targeted advertising, to collect information about how visitors use our site. |
ELOQUA | This cookie is set by Eloqua Marketing Automation Tool. It contains a unique identifier to recognise returning visitors and track their visit data across multiple visits and multiple OpenText Websites. This data is logged in pseudonymised form, unless a visitor provides us with their personal data through creating a profile, such as when signing up for events or for downloading information that is not available to the public. |
gpv_pn | Set by LinkedIn, used to retain and fetch previous page visited in Adobe Analytics. |
lang | Session-based cookie, set by LinkedIn, used to set default locale/language. |
lidc | LinkedIn sets the lidc cookie to facilitate data center selection. |
lidc | Set by LinkedIn, used for routing from Share buttons and ad tags. |
li_gc | Set by LinkedIn to store consent of guests regarding the use of cookies for non-essential purposes. |
li_sugr | Set by LinkedIn, used to make a probabilistic match of a user's identity outside the Designated Countries (which LinkedIn determines as European Union (EU), European Economic Area (EEA), and Switzerland). |
lms_analytics | Set by LinkedIn to identify LinkedIn Members in the Designated Countries (which LinkedIn determines as European Union (EU), European Economic Area (EEA), and Switzerland) for analytics. |
NID | Set by Google, registers a unique ID that identifies a returning user’s device. The ID is used for targeted ads. |
OGP / OGPC | Set by Google, cookie enables the functionality of Google Maps. |
OTZ | Set by Google, used to support Google’s advertising services. This cookie is used by Google Analytics to provide an analysis of website visitors in aggregate. |
s_cc | Set by LinkedIn, used to determine if cookies are enabled for Adobe Analytics. |
s_ips | Set by LinkedIn, tracks percent of page viewed. |
s_plt | Set by LinkedIn, this cookie tracks the time that the previous page took to load. |
s_pltp | Set by LinkedIn, this cookie provides page name value (URL) for use by Adobe Analytics. |
s_ppv | Set by LinkedIn, used by Adobe Analytics to retain and fetch what percentage of a page was viewed. |
s_sq | Set by LinkedIn, used to store information about the previous link that was clicked on by the user by Adobe Analytics. |
s_tp | Set by LinkedIn, this cookie measures a visitor’s scroll activity to see how much of a page they view before moving on to another page. |
s_tslv | Set by LinkedIn, used to retain and fetch time since last visit in Adobe Analytics. |
test_cookie | Set by doubleclick.net (part of Google), the purpose of the cookie is to determine if the users' browser supports cookies. |
U | Set by LinkedIn, Browser Identifier for users outside the Designated Countries (which LinkedIn determines as European Union (EU), European Economic Area (EEA), and Switzerland). |
UserMatchHistory | LinkedIn sets this cookie for LinkedIn Ads ID syncing. |
UserMatchHistory | This cookie is used by LinkedIn Ads to help dunnhumby measure advertising performance. More information can be found in their cookie policy. |
VISITOR_INFO1_LIVE | A cookie set by YouTube to measure bandwidth that determines whether the user gets the new or old player interface. |
YSC | YSC cookie is set by YouTube and is used to track the views of embedded videos on YouTube pages. |
yt-remote-connected-devices | YouTube sets this cookie to store the video preferences of the user using embedded YouTube video. |
yt-remote-device-id | YouTube sets this cookie to store the video preferences of the user using embedded YouTube video. |
yt.innertube::nextId | This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen. |
yt.innertube::requests | This cookie, set by YouTube, registers a unique ID to store data on what videos from YouTube the user has seen. |
_gcl_au | Set by Google Analytics, to take information in advert clicks and store it in a 1st party cookie so that conversions can be attributed outside of the landing page. |