PyData Amsterdam 2023

Revealing the True Motives of News Readers
09-14, 15:10–15:40 (Europe/Amsterdam), Qux

Every news consumer has needs and in order to build a true bond with your customer it is vital to meet these, sometimes, diverse needs. To achieve this, first of all, it is important to identify the overarching needs of users; the reason why they read news. The BBC conducted research to determine these needs and identified six distinct categories: Update me, Keep me on trend, Give me perspective, Educate me, Divert me, and Inspire me. Their research showed that an equal distribution of content across these user needs will lead to higher customer engagement and loyalty. To apply this concept within DPG Media, we started building our own user needs model. Through various iterations of text labelling, text preparation, model building, fine-tuning and evaluation, we have arrived at a BERT model that is capable of determining the associated user needs based solely on the article text.


We would like to take the audience through all the steps that we have taken to get to the point where we are right now. During this process we had to find solutions to many obstacles and we are happy to share these lessons with the audience. Furthermore, we want to discuss all the tools and techniques that we used in order to arrive at the current phase.

The focus of the talk is on preparing the datasets and building the models, so a background in data science, engineering and/or machine learning is usefull.

The time breakdown will be the following:
Minutes 0-5: introducing the topic and explaining why it is important
Minutes 5-10: discussing the tools that we used and prior decisions we made
Minutes 10-20: going through the labelling process and different models we build
Minutes 20-25: sharing results and lessons learnt
Minutes 25-30: giving insights into next steps and future applications


Prior Knowledge Expected

Previous knowledge expected

Data Scientist with 15+ years experience in getting value out of data for various companies in different branches. Love to apply the right ML/ AI techniques to answer business questions and actually make a difference. Aside from hands on consultant i'm also trainer in various ML techniques. Last few years strong focus on textual data / NLP and transformer models / LLMs.

Hi, my name is Vincent Smeets. I am one of the data scientists within the Data And Customer Analytics department at DPG Media. I am responsible for generating insights from structured and semi-structured data to support decision making within the B2C Marketing organisation. In my freetime I love skateboarding, tennis and running.