Full-stack ETL pipeline and API for scraping, processing, and serving professional hockey statistics.
The Hockey-ETL API is a system designed to scrape, clean, and serve professional hockey data. It consists of a nightly Python ETL pipeline that extracts raw data, cleans it using Pandas, and loads it into a central SQLite3 database. The data is then exposed via a high-performance ExpressJS API with endpoints for standings, matchups, players, and teams. The platform also includes a subscription service that uses Resend to email game scores to subscribers every morning.
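
To illustrate the clean-and-load half of the nightly pipeline, here is a minimal sketch of how scraped rows might be cleaned with Pandas and written to SQLite. It is not the project's actual code: the `standings` table, its column names, the sample rows, and the `clean`, `load`, and `run_pipeline` helpers are all assumptions made for the example, and the scraping step is replaced by a hard-coded list.

```python
import sqlite3

import pandas as pd

# Hypothetical raw rows standing in for whatever the scraper returns.
RAW_ROWS = [
    {"team": " Bruins ", "wins": "10", "losses": "3"},
    {"team": "Leafs", "wins": "8", "losses": "5"},
    {"team": "Leafs", "wins": "8", "losses": "5"},  # duplicate row
    {"team": None, "wins": "2", "losses": "1"},     # missing team name
]


def clean(raw_rows):
    """Drop unusable rows, trim whitespace, cast counts, and de-duplicate."""
    return (
        pd.DataFrame(raw_rows)
        .dropna(subset=["team"])
        .assign(
            team=lambda d: d["team"].str.strip(),
            wins=lambda d: d["wins"].astype(int),
            losses=lambda d: d["losses"].astype(int),
        )
        .drop_duplicates()
        .reset_index(drop=True)
    )


def load(df, conn, table="standings"):
    """Replace the table contents with the cleaned frame (assumed table name)."""
    df.to_sql(table, conn, if_exists="replace", index=False)


def run_pipeline(conn, raw_rows=RAW_ROWS):
    """Run the clean and load steps against an open SQLite connection."""
    load(clean(raw_rows), conn)


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")  # a real run would open a database file
    run_pipeline(conn)
    query = "SELECT team, wins, losses FROM standings ORDER BY wins DESC"
    for row in conn.execute(query):
        print(row)
```

Replacing the whole table on each run keeps the sketch idempotent, so a rerun after a failed night does not duplicate rows. A real pipeline might prefer appending or upserting if it needs to keep history.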