JupyterCon 2023

MLOps made easy and reproducible with Jupyter and containers
05-10, 12:30–13:00 (Europe/Paris), Louis Armand 2

While training machine learning (ML) models is the subject of countless MOOCs and web tutorials, productionizing and operating ML models is usually left to the big commercial players or expert users. By leveraging the Jupyter environment and ecosystem, we describe a method to democratize the productionization of ML models while making the process transparent for the casual user.

ML models have become an essential tool for organizations across almost all industries, providing valuable insights and predictions based on data. However, deploying and maintaining ML models can be challenging due to the complex and often dynamic nature of the compute environments they require.

This talk will discuss the benefits of using containerization to manage the compute environments of ML models in JupyterLab and MLFlow, and how it can help organizations make their ML operations more democratic, efficient, and reliable over the long term.

Containerization offers a powerful solution for managing the compute environments of ML models and, combined with tools such as Cookiecutters and MLFlow, making it easier for organizations to deploy and maintain their ML operations over time. By adopting containerization both during model development, training and deployment and integrating open-source tools and services, organizations can better manage and trust the ML models used in their business.

My name is Richard, I come from Slovakia and currently live in Denmark. I work for a company called Adamatics as a Senior ML Engineer.

With mathematical background, my career started off 8 years ago as a Data Scientist but very soon ventured into Data and Software Engineering, which finally combined into Machine Learning Engineering. I've worked in multiple different industries across multiple EU countries, gathering knowledge, tips, and tricks for many different aspects of engineering.

In the last 2 years, I've been mainly focusing on developing and maintaining applications in Kubernetes from data science and data engineering perspectives.

I am a Partner & Principal Data Scientist at ADAMATICS. With a background in computational chemistry, I now work as a consultant in the field of data science. I have worked with clients from various industries, including pharmaceutical companies and banks, where we have helped implement machine-learning models in real-world applications.

At ADAMATICS I use my data analysis and machine learning expertise to help clients extract valuable insights from their data and lower the barrier for citizen data scientists to contribute their domain expertise.

Engineer at heart. Consultant by profession.

Working with data analytics, machine learning and E2E solution delivery since 2010. I have a background in computer science and a masters degree in computer security. Love seeing young professionals grow, enthusiastic about being a catalyst in their growth.