This is the promised update to my table of pretty good averages for
everyday baseball players at the major league level. As before, I consider an everyday ball player to be one who was eligible for a league batting title in a given year, which from 1996-2010 means that he made at least 502 plate appearances in a given year.
The raw data for this study was taken from the mySQL database thoughtfully provided by the folks at Baseball-DataBank.org. As outlined in yesterday's post, I used mySQL to pull out the appropriate data, this time using the command:
select b.yearID as Year, m.nameLast as Last, m.nameFirst as First, b.teamID as TEAM, b.G, b.AB+b.BB+b.HBP+b.SH+b.SF as PA, b.AB, b.R, b.H, b.2B, b.3B, b.HR, b.RBI, b.SB, b.CS, b.BB, b.SO, b.IBB, b.HBP, b.SH, b.SF, b.GIDP from Batting b inner join Master m where b.playerID=m.playerID and b.yearID>1995 and b.AB+b.BB+b.HBP+b.SH+b.SF > 502 order by b.yearID ASC, m.nameLast, m.nameFirst;
to get a list of all players from 1996 on who were eligible for a batting title. I then used mysql-query-browser to export all of the data into a spreadsheet, and there computed all of the averages and standard deviations. All the calculations are the same as in my original post, except:
- I added the batting data for the 2009 and 2010 seasons.
- My original post fraked up David Smyth's Base Runs statistic. I used the right formula (the second one on the page), but miscalculated total bases by forgetting that doubles, triples, and home runs are already counted as hits. So the numbers found in my earlier study are too high.
- I dropped 1995 from the study this time because only 144 games were scheduled for each team, so the batting eligibility criterion was 144 × 3.1 = 447 plate appearances. Just lazy.
Here's how it works. For, say, Batting Average in 1996, I computed the batting average for each of the 137 players eligible for batting awards that year. I then computed the average of that average, which is the entry in the BA column for 1996. I also computed the standard deviation of those averages for the year, and put that number in the σ column. I did this for each year and each statistic listed. Finally, I found the average of all the batting averages over the 2168 player-years, and put that under All Years, along with its standard deviation. So, to use another example, we can see that from 1996-2010 the average everyday major league batter hit someplace between 11 and 31 home runs in a year. Obviously there were some notable (and ignoble) players who hit more than that, and many who hit less. But in general, if you can hit 20 home runs in a year off big-league pitching, this table tells you that you've probably got a job. I hear it pays pretty well.
|Games||Batting Average||Home Runs||Runs Batted In|
The next table has the
modern statistics: On Base Percentage (OBP), Slugging
Average (SLG), and what's now a standard statistic for overall batting prowess, On Base Plus Slugging (OPS = OBP + SLG). Looks like if you can keep your OPS above about 0.750 won't have to worry about the kid's college tuition:
|On Base %||Slugging||OPB + SLG|
And finally, this table has two other batting ability figures of merit. Bill Jame's Runs Created, using the
technical version and David Smyth's Base Runs, using the version that includes stolen base and grounded into double play data. For each of these I also included a per Game category, where a batter is considered to have played a game every time he makes twenty-four (24) outs. Outs are defined as
Outs = At Bats - Hits + Caught Stealing + Sacrifice Hits + Sacrifice Flies + Times Grounded into Double Plays
even though Smyth's formula does not include Sacrifice data.
|Runs Created||Runs Created/Game||Base Runs||Base Runs/Game|
So there you have it. From now on, if someone says that a player has an OBP/SLG/OPS of 0.350/0.425/0.775, or some other string of statistics, you can use these tables to see how that fits in with the average (i.e. really, really good, compared to you or me) every-day Major League batter.