JupyterCon 2023

Richard Nemeth

My name is Richard, I come from Slovakia and currently live in Denmark. I work for a company called Adamatics as a Senior ML Engineer.

With mathematical background, my career started off 8 years ago as a Data Scientist but very soon ventured into Data and Software Engineering, which finally combined into Machine Learning Engineering. I've worked in multiple different industries across multiple EU countries, gathering knowledge, tips, and tricks for many different aspects of engineering.

In the last 2 years, I've been mainly focusing on developing and maintaining applications in Kubernetes from data science and data engineering perspectives.

The speaker's profile picture

Sessions

05-10
12:30
30min
MLOps made easy and reproducible with Jupyter and containers
Richard Nemeth, Sune Askjaer, Subramaniam

While training machine learning (ML) models is the subject of countless MOOCs and web tutorials, productionizing and operating ML models is usually left to the big commercial players or expert users. By leveraging the Jupyter environment and ecosystem, we describe a method to democratize the productionization of ML models while making the process transparent for the casual user.

ML models have become an essential tool for organizations across almost all industries, providing valuable insights and predictions based on data. However, deploying and maintaining ML models can be challenging due to the complex and often dynamic nature of the compute environments they require.

This talk will discuss the benefits of using containerization to manage the compute environments of ML models in JupyterLab and MLFlow, and how it can help organizations make their ML operations more democratic, efficient, and reliable over the long term.

Containerization offers a powerful solution for managing the compute environments of ML models and, combined with tools such as Cookiecutters and MLFlow, making it easier for organizations to deploy and maintain their ML operations over time. By adopting containerization both during model development, training and deployment and integrating open-source tools and services, organizations can better manage and trust the ML models used in their business.

Data Science
Louis Armand 2