Kubeflow is an open-source platform for machine learning and MLOps on Kubernetes, introduced by Google. It helps people build, train, and use machine learning models more easily by organizing the different steps of machine learning into separate tools.
Kubeflow includes tools for writing code, training models, and using models to make predictions. Some of these tools are Kubeflow Notebooks for writing and running code, Kubeflow Pipelines for organizing the steps of training a model, and KServe for using trained models to make decisions.
One special feature of Kubeflow is that you donβt have to use all of its tools at once. Each part can be used on its own, so you can pick just the tools that fit your project. This makes Kubeflow very flexible for different kinds of machine learning work.
History
The Kubeflow project was first announced in 2017 at KubeCon + CloudNativeCon North America by Google engineers. It started as a way for Google to share how they used TensorFlow internally.
The first version of Kubeflow was released in 2018, and by March 2020, many of its parts were ready for production use. In 2022, Google applied to join the Cloud Native Computing Foundation with Kubeflow, and in 2023, the foundation accepted Kubeflow as an incubating project.
Components
Kubeflow has several tools to help build and use machine learning models. Kubeflow Notebooks lets users develop models using web-based environments like Jupyter Notebook, Visual Studio Code, and RStudio.
Kubeflow Pipelines helps train models by creating workflows using Docker containers. KServe is used to deploy trained models for different frameworks such as TensorFlow and PyTorch. Katib automates parts of machine learning, like testing many settings to find the best one.
Release timeline
Kubeflow has gone through several updates since its beginning, adding new tools to help with different parts of working with machine learning. Each update brings improvements and new features to make using machine learning on Kubernetes easier and more powerful. The timeline shows when these updates happened and what new capabilities were added.