Ah, spring training! The spin? The Yanks are enraptured with Tanaka’s poise and polish, CC’s transformation, and the much-ballyhooed collective 2.80 ERA of their top eight starting pitcher candidates. Jeter and Tex are back, free from injuries, McCann has taken charge, and Beltran is an RBI machine. And Ellsbury’s calf woes will be gone by Opening Day. Ah, shades of 2009! A retooled Yankee team, and hopes springs eternal for Championship #28.
All perhaps true enough, but if you are going to view spring training stats as credible, you have to take equal note of the collective .231 batting average of the nine starting hitters, all of whom are either new or returning from serious injuries (with the solitary exception of Brett Gardner). Not exactly proving the case for optimism.
Better to ignore most of spring training and take a look at what the facts indicate...or at least what my model indicates.
For those of you new to this, back in 1992 I developed a regression model to predict Yankee wins, based on such statistics on "OPS" (On Base Plus Slugging Percentage) that are all the rage now but were only known to a few stat freaks that were ahead of their time. Trouble is, as visionary as the model was, it’s, um, not that terrific.
For the past three years I have put the model up against what I thought was a reasonable test. I asked a bunch of my baseball friends to forecast wins for all 30 major league teams off the top of their heads, pitted against my regression model, in a competition I called "Man Versus Machine." They have made a collective 54 guesses in that time, and have been off by an average of 7.5 wins per team. The Machine’s record? Off 6.9 wins per team, or 8% better. That is to say, the Machine, for all its fancy statistics, is not much better than a good "guess-timate" by a baseball fan.
And I’m not sure I can make the model much better. Over the 22 years doing the Yanks, my average miss has been 6.4 wins. In the last 13 years it’s been better, at 5.7. But that’s simply not good enough to push some big chips to the middle of the table in Vegas. Not yet.
2013 YANKEES (in review)
I predicted the Yanks would win 90 games last year and they won 85. That sounds pretty good, considering the injuries that decimated the squad, which are the bane of any such prediction method.
But truth be told, the Yanks actually achieved an OPS of .693 and an ERA of 3.94, and thatshould have translated to only 74 wins. In other words, if I had known IN ADVANCE – perfect information -- that the Yankees were going to have a .693/3.94, I would have predicted they would win 74 games. No other team was off by 11 games. When I put in their actual final stats, the 29 teams were only off by only 3 wins on average. The Yankees were a true outlier.
Why? Well, the basic fact is that the Yankees tended to win games by small margins (and thus have relatively poor stats when they won relative to other teams) and lose by larger margins (and thus have even worse stats than a losing team might). Some theories on this? Excellent bullpen work (which they had) probably resulted in winning more close games than other teams. And perhaps other factors, such as great managing, clutch hitting and just plain luck came into play. For whatever reason, they certainly played above their level, and ended up being in the hunt until virtually the end of the season.
Going into the season, I had thought Teixeira, Jeter and Granderson would be out for only a month, and A. Rod to be back by midseason. I could not know that Cervelli would go down quickly, as would ARod’s replacement, Kevin Youkilis. And that Tex, Jeter and Granderson would all be hurt again. The Yankee starting lineup (using Youkilis at third) ended up with only 45% of the total team plate appearances; the American League average for starters was about 70%.
And…the substitutes were simply terrible. The third basemen, collectively, including ARod and Yuke (and David Adams, Chris Nelson, et al), had an OPS of .633, with 52 RBI. The shortstops (do I hear Reid Brignac? Luis Cruz?) were even worse, .598 with 46 RBI. But even more amazing was the performance of the DH’s, who exist only to hit. They had a combined OPS of .583 – even worse than the shortstops! Ben Francisco was the DH on Opening Day, an omen if there ever was one.
My highlight was the pitching. I forecast the Yanks would have a 3.86 ERA and they came in at 3.94. But the hitting (.683 versus .757) more than did me in.
But I actually did better than I deserved in being off only 5 wins.
The Yanks will have a good year this year:
What, only 89 wins? For a team that won 85 with a bunch of no-names in the lineup? After signing all those great players and getting several more back from injuries?
Yes, only 89.
Remember, the 85 wins last year was a fluke. They should have won only 74. And whatever explained that aberration – great relief pitching (goodbye, Mariano!), great managing, clutch hitting, luck…what are the odds of all that happening again?
Last year I asked the question, is 2013 going to be 1965 all over again, the year the aging Yanks fell apart and finished sixth? Turns out my answer – "no" – was basically right. It was a tough year, but they found a way to prevent an utter collapse.
This year I ask the question, is 2014 going to be 2009 all over again, when the Yanks "retooled" after missing the playoffs for the first time since 1993 (’94 was a strike year), signing Teixiera, Sabathia, Burnett and trading for Swisher? And my answer is, again, ‘No."
The 2009 team was an incredible hitting juggernaut: the lowest OPS among the starting 9 hitters was Johnny Damon’s .854. Not one Yankee starter this year achieved that level in 2013, and only one, Carlos Beltran did it even once in the past three year. This is simply not the same team. A better team than 2012, for sure, but not a great hitting team at all.
I’m calling Beltran at .830; Tex, McCann, Ellsbury and Soriano all at .800; Gardner at .770; with Jeter (.725), Kelly Johnson (.715) and Brian Roberts (.700) filling out the infield.
The starting pitching could be a real plus. I have CC coming back to a 4.00 ERA, Pineda in at 4.00 also, and Tanaka, Kuroda and Nova all at 3.50. The bullpen is a bit suspect, particularly until Robertson establishes himself as the closer and an 8th inning guy emerges.
Everyone assumes that the key to that 2009 team was the four new guys, and they sure played a huge role. But what people forget is that each of Posada, Cano, Jeter and Matsui had relative off-years in 2008 and rebounded resoundingly in 2009. When 40% of your offense improves its OPS by .120 points, that translates to 10 more wins. The Yanks improved by 14 wins that year and the team ERA did not change. So give a few wins to Texand Swisher as offensive upgrades, but most of the improvement related to returns to form by those four guys.
The 2014 Yanks will need more of the same. Tex, Jeter and CC are three huge question marks, and this team will go only as far as the comebacks of those three take them, even if McCann, Ellsbury, Beltran and Tanaka deliver as advertised.
One other thing about the 2014 Yankees…there is no big left-handed bat off the bench. Every Yankee era has featured such a stick, from Johnny Mize to Johnny Blanchard to Jim Spencer to Daryl Strawberry to David Justice to Raul Ibanez (except for the Ruth/Gehrig years and 2009 team, neither of which needed one). This team needs one and does not have one.
So…the Yanks will win 89, which will put them in the thick of the AL East title as well as the wild card. There is no dominant AL East team this year, but all of them are good (though the Jays are not up with the rest.) Here are my individual player predictions, which sum, via the wonder of weighted averages, up to the total team.
As I said, the model has two variables, one for hitting, one for pitching. The hitting variable is OPS, possibly the one true, best measure of overall batting prowess. The pitching variable is more straightforward: ERA. Basically, I come up with a prediction for each team’s overall OPS and ERA and plug those numbers into the regression equation I developed (using many years of historical data) and voila, a forecast for Team Wins.
The difficult part is to actually come up with the forecast for OPS and ERA for each team. Here it gets a bit "granular": I make a prediction for OPS (or ERA) for each player on the team roster, and then also predict their number of plate appearances (or innings pitched). Then I multiply the OPS (or ERA) by that player’s percentage of the team’s total plate appearances (or innings pitched), and then add up all the players to get to the total team. Ah, the wonders of weighted averages!
So Brett Gardner had an OPS of .759 in 2013. It is reasonable to conclude he will do about the same this year. And I expect him to have about 625 plate appearances this year, which is exactly 10.0% of the Yankees 6,250 expected team plate appearances. I multiply the .759 times 10.0% to get .076, and then do the same thing for the other Yankees hitter, and add them all up to get the team OPS. That process typically yields a team OPS number between .630 (say, for Miami) and .780 (say, for the Tigers). A brute force method, for sure.
And I do the same thing to predict team ERA….CC could have a modest rebound and achieve a 4.00 ERA in 200 innings, and I do the same math for pitchers. I end up with a Team OPS and a Team ERA which I plug into my equation and out pops Team Predicted Wins.
This year I made a refinement, to try to improve the impact of injuries and replacements. This involved adjusting all plate appearance so that starters achieved roughly 70% of the plate appearances in the American League and 60% in the National League, and similar adjustments for starting pitchers and opening day relievers. Hopefully this will help!
If you like this, check out www.borntorunthenumbers.com