PyData Amsterdam 2023

Survival Analysis: a deep dive
09-15, 14:20–14:50 (Europe/Amsterdam), Qux

Survival analysis was initially introduced to handle the data analysis required in use cases revolving death and treatment in health care. Due to its merit, this method has spread to many other domains for analyzing and modeling the data where the outcome is the time until an event of interest occurs. Domains such as finance, economy, sociology and engineering.

This talk aims at unraveling the potential of survival analysis with examples from different domains. A taxonomy of the existing descriptive and predictive analytics algorithms in survival analysis are demonstrated. The concept of some candidate algorithms from each group are explained in detail, along with an example and implementation guideline using the right open source framework.


This talk aims at introducing the tools and techniques within the survival analysis domain for analyzing the time until an event of interest occurs. Examples of such event are rehospitalization after being discharged from hospital (healthcare), device needing maintenance after (re)commissioning (manufacturing), finding a job after unemployment (economy), an asset being sold after listing for sale (real-estate/finance), getting rearrested after being released from prison (criminology/sociology), and many other examples.

The potential of survival analysis tools, in both descriptive and predictive analytics, are hidden to the data science community. As a result of this, such problems are often formulated as classification or regression, where this also comes with its own caveats and pitfalls.

The aim of the talk is to simplify methods and algorithms in survival analysis with some shallow mathematical focus and starts by raising awareness about survival analysis and its potential and applications for the general audience. The descriptive and predictive algorithms within survival analysis address the data scientists with basic statistics and machine learning background, as the main audience of the talk.

  • Introduction to Survival Analysis
  • Applications in different domains
  • Formulating Survival Analysis Problem
  • Taxonomy of Descriptive & Predictive Methods with python packages
  • Overview of Descriptive Methods
    - Kaplan-Meier [3 slide]
    - Nelson-Aalen & Weibull [half slide]
  • Overview of Predictive Methods
    - Cox Proportional Hazard [3 slide]
    - Survival Tree & Forrest [1 slide]
    - Deep Survival Analysis [1 slide]
  • Conclusion

At the end of the talk, the audience becomes aware of what survival analysis can do and which algorithms, with their corresponding python package, are the low hanging fruit in a data scientist toolbox. In addition, the audience will gain a structured overview on the topic so that any need for further knowledge acquisition could be independently followed in the future.


Prior Knowledge Expected

No previous knowledge expected

I am a data scientist with a background in applied mathematics (systems & control). In my career as data scientist, I have experienced different sectors, i.e. manufacturing, cybersecurity, healthcare, and finance. Currently, I am contributing to data-driven solutions that improve our clients’ experience and satisfaction within ABN AMRO.