K Files

« Scrappy Win | Main | FREED! »

Before I get started, Neifi Perez sucks. Let's get that out of the way.

In case you hadn't noticed, in Mark Mulder's last three outings he's thrown 22 innings, giving up only two runs. He also struck out only seven batters while also walking seven. At this point we're thinking about the post-season -- that's the only way I can maintain my composure when Hector Luna is batting 6th or when the Cardinals are getting beaten by Henry "176 / 202 / 282" Blanco -- so the question is whether the low strikeout/low run games will turn from gold into lead come October. On one side you have the party line that Mulder's a solid #3 starter and on the other side you have the Bill James ideas expressed well by L Boros, which says you gotta have some strikeouts. Like L Boros, I'm also not comfortable with the concept of Mark Mulder pitching a big game, so I ran some numbers to check on my stat-head prejudices.

Anyway, I first tried regressing post-season ERA on regular season ERA and K-rates. I did that by season and by career for individual pitchers, and in both cases all I got was noise. As it turns out, there have been 299 pitcher-seasons (e.g., Matt Morris-2002 and Matt Morris-2004) among post-season starters, so I decided to group post-season starters over the last ten years into three K-rate groups of 100, 99 and 100. Here's what I got:

Class    RegERA   RegK          PostERA  PostStarts PostIP
Low K     4.24   13.60% (5.3)    4.42       205      1181
Mid K     3.79   17.02% (6.4)    4.32       208      1234
HighK     3.36   23.41% (8.7)    3.51       247      1608 
TOTAL     3.77   18.26% (6.9)    4.03       660      4023

Note that these are overall post-season numbers for pitchers who started at least one post-season game. This means, for example, that I included Pedro's strange little relief stint against the Yankees in last year's ALCS in his 2004 post-season stats.

My K-rate groupings are based on strikeouts per batters faced ("RegK") during the regular season, as opposed to the traditional K per nine innings. However, I've included each group's regular season K-rates per-nine in parentheses for familiarity's sake. Mulder's K-rate stands at 13.3% in 2005, so he fits comfortably in that Low-K group. Each group experienced a decline in the post-season, which is what you'd expect since there aren't a lot of bad offenses in the post-season. High-K pitchers experience a bit less decline (note that Rick Ankiel-2000 is in that group) than the Low-K group, but not enough to get excited about. Actually it's the in-between guys who take the biggest hit.

Whether or not this is statistically significant, it does answer my question on Mulder. For now I'll bag my theory that low-K pitchers have wider bad team/good team splits than high-K starters.

Originally I had planned on just showing the Low and High K pitchers and leaving it that, but it was strange to me that the Mid Ks lost more value than anyone. So being the inquisitive guy that I am, I decided to see what happens when you sort by walks:

Class    RegERA   RegBB         PostERA  PostStarts PostIP 
Low BB    3.51    5.09% (1.9)    3.49       231      1458
Mid BB    3.91    7.62% (2.9)    4.42       229      1386
HighBB    3.91   10.07% (3.9)    4.23       200      1179  
TOTAL     3.77    7.48% (2.8)    4.03       660      4023

Well, that's interesting. Low BB starters don't lose lot a value in the post-season, or at least they haven't in the past ten years. Once again the middle group took a bigger hit than the "bad" group. One last table, this one by K/BB:

Class    RegERA   RegK/BB       PostERA  PostStarts PostIP
Low K/BB  4.22     1.59          4.46       206      1197
Mid K/BB  3.77     2.31          4.57       211      1230
HighK/BB  3.38     3.90          3.28       243      1596 
TOTAL     3.77     2.44          4.03       660      4023

Again the guys in the middle have been dropping the most, to the point that they've been worse than the low K/BBers in the post-season. There is probably some sort of selection bias at work here or maybe I need to control for the opponent. Whatever the case, there isn't anything in the aggregate data to suggest Mulder is particularly vulnerable to collapse during the post-season. My gut still feels otherwise, but the gut feelings of stat-heads aren't especially useful.

[UPDATE: Now included post-season IP. And Mulder looked terrible last night.]

Posted by Rob at July 25, 2005 09:25 PM
TrackBacks (Trackback URL: http://www.thebirdwatch.com/mt/bw-tb.pl/932)

cool study ---- like you i'm surprised that the low-k guys seem to hold their value. more surprising to me is the fact that all three groups had higher era's postseason than reg season ---- aren't postseason games supposed to be lower-scoring than reg. season ones? aren't they supposed to prove the old adage about good pitching beating good hitting?

if that's not the case then i may have even larger assumptions to question . . . . .

Posted by: lboros at July 26, 2005 06:59 AM

Doesn't Mulder do much better at night and/or when it's cool?

Isn't October cooler, and the games played at night?

I'm no longer worried.

Posted by: Vincent at July 26, 2005 09:55 AM

Doesn't Mulder do much better at night and/or when it's cool?

Isn't October cooler, and the games played at night?

I'm no longer worried.

Posted by: Vincent at July 26, 2005 09:55 AM

yeah, good idea, interesting findings.

I guess I'm confused as to what the low/mid/high ranges are for the K's and BB's and also where Mulder would fall in those....

Could it be that low K pitchers must be really good at getting batters out in other ways (and they know it) but mid K pitchers are just confused junk throwers or imagine themselves to be overpowering K guys and then they make mistakes?

I'm still trying to qualify those findings...

Oh, and could you show your sample sizes for each category?

Posted by: Ryan at July 26, 2005 10:00 AM

L Boros, I'm excluding the Jason Simontacchis of the world, so that's why the "before/after" inflates post-season run scoring.

Ryan, I'm not sure the samples are that helpful since we're talking about 300 pitcher seasons, probably around 180 IP per pitcher in the regular season and about 13 IP per pitcher in the post-season. I'll post them when I get home though.

I suspect what's going on here is that good teams succeed against "typical" pitchers. There's not much point to building a team that's great against Tim Wakefield or BK Kim, but not so great against a vanilla pitcher. That's a WAG.

Vincent, it's true Mulder has a wide day/night split in 2005, but that's a recent phenomenon. ESPN's splits page says from 2002 to 2004 Mulder's daytime ERA was 3.28 versus 4.01 at night. I suspect that's noise.

Posted by: Rob at July 26, 2005 12:51 PM

But: has he played in the hot/humid St. Louis summer before?

I just know his lone great day start this year (10 inning shutout) was when it was still cool out.

Posted by: Vincent at July 26, 2005 01:33 PM