Use case on the Moneyball dataset

Dataset Moneyball

Machine Learning Task: Regression

This is the Moneyball database. This dataset contains some of the information that was available to Billy Beane and Paul DePodesta, who worked for the Oakland Athletics in the early 2000s and changed the game of baseball. It can be used to understand their statistical methods better. The database contains such information as team, league, year, runs scored, wins.

Available at OpenML: https://openml.org/d/41021

Category: Sport

# Rows: 1,232 # Columns: 14

Target: RS

Features

Numeric: Year, RA, W, OBP, SLG, BA, OOBP, OSLG

Nominal: Team, League, Playoffs, RankSeason, RankPlayoffs, G

Machine Learning Use Case Sport

Root Mean Square Error (RMSE)

Moneyball Rmse

Mean Absolute Error (MAE)

Moneyball Mae

Coefficient of Determination (R2)

Moneyball R2

« Back to Machine Learning Algorithms Comparison