A workshop with several modules to help learn Feast, an open-source feature store

Last update: Jan 05, 2023

Related tags

Text Data & NLP feast-workshop

Overview

Workshop: Learning Feast

This workshop aims to teach users about Feast, an open-source feature store.

We explain concepts & best practices by example, and also showcase how to address common use cases.

What is Feast?

Feast is an operational system for managing and serving machine learning features to models in production. It can serve features from a low-latency online store (for real-time prediction) or from an offline store (for batch scoring).

Why Feast?

Feast solves several common challenges teams face:

Lack of feature reuse across teams
Complex point-in-time-correct data joins for generating training data
Difficulty operationalizing features for online inference while minimizing training / serving skew

Pre-requisites

This workshop assumes you have the following installed:

A local development environment that supports running Jupyter notebooks (e.g. VSCode with Jupyter plugin)
Python 3.7+
Java 11 (for Spark, e.g. brew install java11)
pip
Docker & Docker Compose (e.g. brew install docker docker-compose)
Terraform (docs)
AWS CLI
An AWS account setup with credentials via aws configure (e.g see AWS credentials quickstart)

Since we'll be learning how to leverage Feast in CI/CD, you'll also need to fork this workshop repository.

Caveats

M1 Macbook development is untested with this flow. See also How to run / develop for Feast on M1 Macs.
Windows development has only been tested with WSL. You will need to follow this guide to have Docker play nicely.

Modules

These are meant mostly to be done in order, with examples building on previous concepts.

Time (min)	Description	Module
30-45	Setting up Feast projects & CI/CD + powering batch predictions	Module 0
15-20	Streaming ingestion & online feature retrieval with Kafka, Spark, Redis	Module 1
10-15	Real-time feature engineering with on demand transformations	Module 2
TBD	Feature server deployment (embed, as a service, AWS Lambda)	TBD
TBD	Versioning features / models in Feast	TBD
TBD	Data quality monitoring in Feast	TBD
TBD	Batch transformations	TBD
TBD	Stream transformations	TBD

A workshop with several modules to help learn Feast, an open-source feature store

Related tags

Overview

Workshop: Learning Feast

What is Feast?

Why Feast?

Pre-requisites

Modules

Owner

Feast

Curso práctico: NLP de cero a cien 🤗

小布助手对话短文本语义匹配的一个baseline

Transformers and related deep network architectures are summarized and implemented here.

TalkNet: Audio-visual active speaker detection Model

Blender addon - Scrub timeline from viewport with a shortcut

AMUSE - financial summarization

Analyse japanese ebooks using MeCab to determine the difficulty level for japanese learners

This repository contains the code, models and datasets discussed in our paper "Few-Shot Question Answering by Pretraining Span Selection"

Finally, some decent sample sentences

Negative sampling for solving the unlabeled entity problem in NER. ICLR-2021 paper: Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition.

Switch spaces for knowledge graph embeddings

MRC approach for Aspect-based Sentiment Analysis (ABSA)

Visual Automata is a Python 3 library built as a wrapper for Caleb Evans' Automata library to add more visualization features.

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

Utilities for preprocessing text for deep learning with Keras

Labelling platform for text using distant supervision

NLP techniques such as named entity recognition, sentiment analysis, topic modeling, text classification with Python to predict sentiment and rating of drug from user reviews.

NeoDays-based tileset for the roguelike CDDA (Cataclysm Dark Days Ahead)

voice2json is a collection of command-line tools for offline speech/intent recognition on Linux

Training code of Spatial Time Memory Network. Semi-supervised video object segmentation.