Last update: Nov 27, 2022

Related tags

Machine Learning mlops_project

Overview

MLOps

MLOps is the practice of applying continuous integration and continuous delivery to Machine Learning artifacts, treating them as a software product and keeping them inside a loop of Design, Model Development, and Operations.

In this paradigm, teams can easily collaborate on models, with clear tracking of the data throughout cleaning, processing, and feature creation. Automating every repetitive process helps avoid human error and reduces delivery time, so the team can stay focused on the business problem.

Some benefits:

Data and code are versioned, making models auditable and reproducible.
Automated tests and builds ensure that artifacts work as intended and are available to the delivery pipelines.
An automated cycle makes deploying new models easier and faster.
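
As an illustration of the first benefit, one simple way to make a dataset auditable is to record a content hash for every data file, so that any change to the data shows up as a changed digest. The sketch below is only an assumption about how this could be done with the Python standard library; the function names `file_sha256` and `snapshot` are hypothetical and are not part of mlops_project.

```python
import hashlib
from pathlib import Path
from typing import Dict


def file_sha256(path: Path, chunk_size: int = 65536) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large data files do not fill memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def snapshot(data_dir: Path) -> Dict[str, str]:
    """Map every file under data_dir (relative POSIX path) to its digest."""
    return {
        p.relative_to(data_dir).as_posix(): file_sha256(p)
        for p in sorted(data_dir.rglob("*"))
        if p.is_file()
    }
```

Storing the result of `snapshot(Path("data"))` next to a trained model would let a later run check that it used the same data. Dedicated data versioning tools offer far more than this; the sketch only shows the underlying idea.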

The MLOps Project

The MLOps project is a learning path: it implements a case study designed to be testable and reproducible under a CI/CD methodology, following good programming practices.

The scope of this project is delimited as you can see in the image below.

We will select the best tool to implement every step, integrate them, and build a Machine Learning Orchestrator. In the end, running and delivering a new ML experiment will be as simple as typing a terminal command or clicking a button!

Prerequisites

For mlops_project to work correctly, install the prerequisites first.

Contributing

Have an idea for improving this project but don't know where to start? Try contributing.

You can understand the project organization here

How to use?

If you are interested just in using this package, follow the steps below.

Clone the repository

Open a terminal (on Windows, make sure to use Git Bash), navigate to the desired destination folder, and clone the repository:
```
git clone https://github.com/Schots/mlops_project.git
```
The Makefile in the root folder defines a set of targets that automate repetitive tasks in this project. Type "make" in the terminal to see the available targets.

Create an environment & Install requirements

Create a Python virtual environment for the MLOps project on your local machine, using any tool you like. Activate the environment and install the requirements using make:
```
make requirements
```
Download data

To download the raw dataset, use the get_data target:
```
make get_data
```
Type the dataset name when prompted. The zip file with the data will be downloaded and unzipped under the data/raw folder.
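
For readers curious about what the unzip step amounts to, here is a minimal sketch using only the Python standard library. It is not the project's actual get_data implementation, which this page does not show: the function name `unzip_dataset` is hypothetical, and the sketch assumes the archive has already been downloaded to a local path.

```python
import zipfile
from pathlib import Path
from typing import List


def unzip_dataset(zip_path: Path, raw_dir: Path = Path("data/raw")) -> List[str]:
    """Extract a downloaded archive into raw_dir and return its member names."""
    # Create data/raw (and any missing parents) if it does not exist yet.
    raw_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(raw_dir)
        return sorted(archive.namelist())
```

Calling `unzip_dataset(Path("dataset.zip"))` from the repository root would place the extracted files under data/raw, mirroring the layout described above; `dataset.zip` is a placeholder file name.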

Project based on the cookiecutter data science project template. #cookiecutterdatascience

Owner

Maykon Schots
