Sunday, February 14, 2016

Anybody Want Some Data?

As we all know, Mack has been the success story of the league.  His teams have dominated the league almost since the beginning.

One of the things that he's done is use data to identify players that will be high-performers in the league.  I've always felt that the best way to try to bridge the gap between Mack and the rest of the league is through education and giving other players the ability to try to replicate what he's done and develop their own insights between data and player performance.

To that extent, I've created a test league and generated data for 86 years.  I'll be adding additional years to the data set as time goes on, but, for now, we have 86 years, running from 2014-2099.  I'll be running analyses on the data as time goes on but, for now, I just wanted to make the data available to everyone.

I created a page where you can download the data.  I'll add this page to the Links section.  As time goes on, I will add additional raw data and aggregations to this page.  Analyses of the data will be posted here on the blog.

For now, you can download the raw data dumps for the league for every year from 2014 to 2099 (grouped together by decades).  These are in csv format and can easily be imported into Excel or Google docs.

In addition, I've upload an aggregation with the lifetime stats of all players (pitchers and hitters on separate tabs), the major talent (potential) ratings of those players in their draft years, their draft position and the lifetime major league stats of those players.  This data is in Excel format.  If you don't have Excel, you can always upload it to Google docs and play with the data there.  If you'd like it in some other format, please let me know and, if possible, I'll upload it for you.

Please bear in mind that, for now, I just aggregated the data.  I have not (yet) run any analyses on it.  I'll do that in time and, in the mean time, if you're so inclined, you can do so as well.  I encourage everyone to share whatever discoveries they make.

If there are any aggregations or data relationships that you'd like me to explore, please feel free to ask.

As always, if you have any questions, please feel free to ask.

Zev

2 comments:

Note: Only a member of this blog may post a comment.