How to Design and Build a Recommendation System Pipeline in Python

by Jill Cates

data science machine learning open source

Want to know how Spotify, Amazon, and Netflix generate recommendations for their users? This talk walks through the steps involved in building a recommendation pipeline, from data cleaning, hyperparameter tuning, model training and evaluation.

Personalized recommendation systems play an integral role in e-commerce platforms, with the goal of driving user engagement. While there is extensive literature on the theory behind recommendation systems, there is limited material that describes the underlying infrastructure of a recommendation system pipeline. In this talk, we will walk through the process of designing and building a recommendation system pipeline. We will specifically discuss techniques for data cleaning and normalization, hyperparameter tuning, model training and fitting, as well as quantitative and qualitative model evaluation. By the end of this talk, you will learn how to design your own recommendation system pipeline from scratch.

About the Author

Jill is a data scientist at BioSymetrics, where she applies machine learning techniques to biomedical data. Outside of work, Jill is working on an open-source toolkit for implicit feedback recommendation systems. She is a member of PyLadies and Women Who Code.

Author website: