PyData Amsterdam 2023

Innovation in the Age of Regulation: Federated Learning with Flower
09-15, 10:00–10:30 (Europe/Amsterdam), Qux

With the rise of data privacy concerns around AI in the EU, how can we innovate using AI capabilities despite regulations around consumer data? What tools and features are available to help us build AI in regulated industries? This talk will discuss how we can leverage diverse datasets to build better AI models without ever having to touch the datasets by using a Python library called Flower.


In this talk, we’ll review the importance of data privacy concerns, particularly in the EU, and address how we can build AI using sensitive data. We'll discuss a few machine learning techniques (classical, distributed and federated learning), and show how federated learning can help us train AI models without ever touching the sensitive data.

Then, we'll evaluate a few main open source Python packages that help engineers get started with federated learning and why Flower is a valuable option to consider for your next project. We'll review the core features of Flower; most notably, it's ease of use.

After that, we’ll jump into a demo and show how, with minimal code, a Python engineer can orchestrate a training job with multiple data sources using federated learning. We’ll walk through different parameters that give engineers the power to control and fine tune the server without the hassle of knowing infrastructure or cloud architecture.

By the end of this talk, you’ll be able to:

  • Understand the role of federated learning in a landscape with increasing regulation around AI, particularly in the EU with the proposed Artificial Intelligence Act
  • Differentiate between federated learning and classical machine learning
  • Design your project so that it is in compliance with current and future legislation passed on how to use personal data
  • Build and fine tune a server that hosts the model weights for a model trained without seeing personal data
  • Understand options available to increase the privacy around the data that is used to train the model

There will be a link to a Github repo at the end of the talk that contains all the code used in the demo in order to help you get started with your first federated learning project.


Prior Knowledge Expected

No previous knowledge expected

Krishi Sharma is a software developer at KUNGFU.AI where she builds software applications that power machine learning models and deliver data for a broad range of services. As a former data scientist and machine learning engineer, she is passionate about building tools that ease the infrastructure dependencies and reduce potential technical debt around handling data. She helped build and maintains an internal Python tool, Potluck, which allows machine learning engineers the ability to bootstrap a containerized, production ready application with data pipelining templates so that her team can focus on the data and metrics without squandering too much time finagling with deployment and software