Alec James McCarthy


Web Scraping, Machine Learning, and Web-App in Python

Pitchers are clustered on more than 50 pitch metrics. Each hitter's statistics are then compared against each cluster of pitchers, and Bayesian data analysis estimates how much better or worse a hitter performs against each cluster.
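
As a rough illustration of the Bayesian comparison, the sketch below assumes a Beta-Binomial model of hits per at-bat. The project does not state which model it uses, so the model choice, the function name `prob_better_vs_cluster`, the flat prior, and the sample numbers are all assumptions.

```python
import random


def prob_better_vs_cluster(hits_cluster, ab_cluster, hits_other, ab_other,
                           prior_a=1.0, prior_b=1.0, draws=20000, seed=0):
    """Estimate P(hit rate vs. one cluster > hit rate vs. all other pitchers).

    Assumes a Beta(prior_a, prior_b) prior and a binomial likelihood for hits.
    This model is an assumption for illustration, not the project's method.
    """
    rng = random.Random(seed)
    wins = 0
    for _ in range(draws):
        # With a Beta prior and binomial data, the posterior for the hit rate
        # is Beta(prior_a + hits, prior_b + at_bats - hits).
        p_cluster = rng.betavariate(prior_a + hits_cluster,
                                    prior_b + ab_cluster - hits_cluster)
        p_other = rng.betavariate(prior_a + hits_other,
                                  prior_b + ab_other - hits_other)
        wins += p_cluster > p_other
    return wins / draws


# Hypothetical numbers: 18 hits in 40 at-bats vs. one cluster,
# 102 hits in 440 at-bats vs. everyone else.
print(prob_better_vs_cluster(18, 40, 102, 440))
```

A value near 1 suggests the hitter does better against that cluster, a value near 0 suggests worse, and a value near 0.5 means the data cannot separate the two.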

Extract

Scrape RotoWire for the statistics of each hitter-vs-pitcher matchup. Loop through all active players on a team and write the scraped data to JSON files.
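
The loop-and-write step might look roughly like the sketch below. The actual RotoWire request and parsing code is not shown here, so it is represented by a hypothetical `fetch_matchups` callable; `scrape_team`, the file naming, and the record fields are likewise assumptions.

```python
import json
import tempfile
from pathlib import Path
from typing import Callable, Iterable


def scrape_team(players: Iterable[str],
                fetch_matchups: Callable[[str], list],
                out_dir: str = "data") -> list:
    """Fetch matchup stats for each player and write one JSON file per player."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    for player in players:
        # Hypothetical: fetch_matchups returns one dict per opposing pitcher.
        rows = fetch_matchups(player)
        path = out / f"{player.replace(' ', '_')}.json"
        path.write_text(json.dumps(rows, indent=2), encoding="utf-8")
        written.append(path)
    return written


# Demo with a stub fetcher and a temporary directory, so no network is needed.
def fake_fetch(player):
    return [{"pitcher": "Example Pitcher", "ab": 10, "hits": 3}]


with tempfile.TemporaryDirectory() as tmp:
    paths = scrape_team(["Player One", "Player Two"], fake_fetch, out_dir=tmp)
    print([p.name for p in paths])
```

Passing the fetcher in as an argument keeps the file-writing loop testable without hitting the site.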

View Script

Fit and Evaluate

Fit the KMeans clustering model on the training data and evaluate its accuracy.
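
A minimal sketch of this step with scikit-learn follows. The data is synthetic, and the number of clusters and the feature scaling are assumptions. Because KMeans is unsupervised, the sketch uses the silhouette score as a stand-in quality measure; the project may evaluate the model differently.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for the pitcher table: 60 pitchers x 5 metrics drawn
# around three centers (the real data has 50+ metrics per pitcher).
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(loc=c, scale=0.5, size=(20, 5))
               for c in (0.0, 3.0, 6.0)])

# Scale features so metrics in different units contribute comparably.
X_scaled = StandardScaler().fit_transform(X)

# n_clusters=3 is an assumption chosen to match the synthetic data.
model = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = model.fit_predict(X_scaled)

# Silhouette ranges from -1 to 1; higher means tighter, better-separated clusters.
score = silhouette_score(X_scaled, labels)
print(f"silhouette = {score:.2f}")
```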

Data Manipulation

Organize and aggregate the data in Python for easier interpretation.
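
One way to do this kind of aggregation is a pandas group-by, sketched below. The column names and the rows are hypothetical; the project's actual schema is not shown here.

```python
import pandas as pd

# Hypothetical matchup rows; column names are illustrative only.
matchups = pd.DataFrame({
    "hitter": ["A", "A", "A", "B", "B"],
    "pitcher_cluster": [0, 0, 1, 0, 1],
    "ab": [10, 5, 8, 12, 6],
    "hits": [3, 2, 1, 4, 3],
})

# Sum at-bats and hits per hitter and pitcher cluster.
summary = (
    matchups.groupby(["hitter", "pitcher_cluster"], as_index=False)[["ab", "hits"]]
    .sum()
)

# Derive batting average from the aggregated totals.
summary["avg"] = summary["hits"] / summary["ab"]
print(summary)
```

Summing counts first and dividing afterwards avoids averaging per-matchup averages, which would overweight small samples.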

Load

Deploy the Streamlit app.
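
For orientation, a very small Streamlit app could look like the sketch below. The layout, the `filter_hitter` helper, and the sample table are hypothetical and are not taken from the project's app.

```python
import pandas as pd


def filter_hitter(summary: pd.DataFrame, hitter: str) -> pd.DataFrame:
    """Return the per-cluster rows for one hitter."""
    return summary[summary["hitter"] == hitter].reset_index(drop=True)


def main() -> None:
    # Imported here so the helper above can be used without Streamlit.
    import streamlit as st

    # Hypothetical per-cluster summary; a real app would load saved results.
    summary = pd.DataFrame({
        "hitter": ["A", "A", "B", "B"],
        "pitcher_cluster": [0, 1, 0, 1],
        "avg": [0.333, 0.125, 0.333, 0.500],
    })

    st.title("Hitter vs. pitcher cluster")
    hitter = st.selectbox("Hitter", sorted(summary["hitter"].unique()))
    st.dataframe(filter_hitter(summary, hitter))


if __name__ == "__main__":
    main()
```

Saved as `app.py` (a hypothetical filename), this can be started locally with `streamlit run app.py`. Where the app is hosted is not specified here.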
