The Future of Pitching

Once I completed my study on predictive stats for hitters, I executed the same research for Starting Pitchers.

The stats we want to predict: future Strikeouts, Wins, Losses, ERA and WHIP. The goal of this exercise is to complete this table.

  2012 2012 2012 2012 2012   2011
R-Squared K W L ERA WHIP   ?
K 1.00 0.78 0.53 0.01 0.00    
W   1.00 0.44 0.01 0.00    
L     1.00 0.01 0.04    
ERA       1.00 0.69    
WHIP         1.00    

Again, the goal of this study is to determine what stat, or set of stats, is/are the best suited to fill in the blank above. R-Squared tells you how related each stat is, where 1.00 is perfectly related, and 0.00 is totally unrelated. Also, I used five years of performance data, not just 2011-12.

Here are the most predictive stats, with some commentary.

WAR, or Wins Above Replacement. Before this study, I always thought that the pitcher’s main duty was to prevent runs. This tells me that it is not. A pitcher’s main duty is to get outs. What I mean is that a pitcher will give up a run to get an out (or two outs) in certain situations, without a thought.

xFIP. Similar to Runs v Average for hitters, this is a counting stat v rate stat issue. If a pitcher is called up from Triple-A to pitch the final two months of the season, he would not be able to accumulate much WAR, but his xFIP could still be very low.

BB. Walks are positively correlated to future performance; the more walks a pitcher issues, the better his future performance will be. This should cause you to stop and think. It takes a lot of Innings to accumulate a high Walk total, which means that pitcher has his manager’s confidence, despite struggling in at least one statistic. This is probably more predictive of playing time than anything else. Remember, too, that this is a tie-breaker of sorts after WAR and xFIP. If a pitcher’s WAR and xFIP are poor, then you would not be considering them anyway.

Do what you will with that, but here are the Top Ten SPs in Predictive Stats to this point in 2013.

Rank Name Team WAR BB xFIP
1 Felix Hernandez Mariners 2.3 12 2.56
2 Max Scherzer Tigers 2.3 13 2.57
3 Yu Darvish Rangers 1.8 22 2.59
4 Anibal Sanchez Tigers 2.4 14 2.40
5 Adam Wainwright Cardinals 2.8 6 2.41
6 Matt Harvey Mets 2.1 14 2.91
7 Alex Cobb Rays 0.8 14 3.02
8 A.J. Burnett Pirates 1.7 23 2.68
9 Clay Buchholz Red Sox 2.4 24 3.10
10 Clayton Kershaw Dodgers 2.1 18 3.10

Since none of these SPs are remotely available, let me give you one more name – the highest rated SP by this study that is possibly available in your league.

Rank Name Team WAR BB xFIP
28 Rick Porcello Tigers 0.4 8 3.42

It is scary to ‘recommend’ a pitcher with a 6.28 ERA. But if these metrics can be trusted, he could look better going forward. Your league probably is not going to be in a hurry to acquire Porcello, so maybe just keep an eye on him over his next few starts instead of adding him to your roster.

The Future

As fantasy players, all we are really trying to do is to predict the future. No big deal, right?

The question we want answered is this: which player will score more Runs, hit more Home Runs and RBIs, get more Stolen Bases, and hit for a higher Average tomorrow? or next week? or next year? depending on the design of your league.

So what we really need to know is ‘Which stats are the most forward-looking?’ What stats point (as well as can be expected) to better future performance?

I did some research that went something like this.

  2012 2012 2012 2012 2012   2011
R-Squared R HR RBI SB AVG   ?
R 1.00 0.94 0.82 0.29 0.40    
HR   1.00 0.92 0.19 0.41    
RBI     1.00 0.09 0.36    
SB       1.00 0.08    
AVG         1.00    

In case you are unfamiliar, R-Squared tells you how related each stat is, where 1.00 is perfectly related, and 0.00 is totally unrelated. Also, I used five years of performance data, not just 2011-12.

The goal is to replace the ‘?’ with a single statistic, or a set of stats. I hit up Fangraphs.com‘s stat library, matched up one year’s production to the next’s, and ran the numbers. Here are the most predictive stats, with some commentary.

Runs. It makes sense that these stats are somewhat self-predictive, but why are Runs more predictive than RBI? I think because a player cannot fake Runs. He cannot come up to the plate and score four times. A player can come up to the plate and knock in four RBI. Additionally, scoring Runs requires several skillsets. Hitting Home Runs requires a big bat, Stolen Bases requires speed. Players who score a lot of Runs can probably do more than one of those things well. Runs is far and away the best predictive stat.

Average. One of the biggest problems with Runs is that to be at the top of the MLB leaderboards, you have to play all season. Rate stats do not require that. If next year’s must-own is not getting playing time now, he will not score many Runs. But he might hit for a good Average. St. Louis’ Matt Adams can hit. He should be on your radar for 2014. In 2013, he is a member of the Cardinals’ bench, and not playing much. He has scored 5 Runs so far this season, and he is hitting for a .400 Average in 40 AB. His Runs total says he probably is not going to get playing time in the immediate future, but his Average says if he did, he may score a helpful amount of Runs.

wFA/C (pfx). You may not know what this one is. Pitch F/X is a video system installed in each Major League stadium that tracks the type, velocity, movement and location of every pitch thrown. wFA/C answers the question, ‘For every 100 Fastballs a player hit, how much WAR did he generate?’ or ‘How well did this player his Fastballs?’ which is a necessary skill for any player to have if he wants to stick around.

Pinch Hits. This stat is negatively correlated to future performance, meaning, the more Pinch hits a guy has, the worse you can expect his future performance to be. I had to think about this one a bit, but I concluded that this is all about playing time. Every time a player Pinch Hits, it means he was healthy and on the 25-Man Roster, but did not start the game. If a player sits, that hurts his fantasy production.

These are the four most predictive stats for position players. I was careful to not select any redundant stats. For instance, Total Bases are extremely predictive of future fantasy performance, but are too closely correlated to Runs to be useful. It would almost be like counting Runs twice. With these four stats, there is as little overlap as possible.

Here is a Top Ten Leaderboard of 2013′s Predictive stats. Maybe see if Daniel Nava is available in your league and try him out for a week or two.

Rank Name Team R AVG wFA/C (pfx) PH
1 Joey Votto Reds 35 0.353 2.41 0
2 Starling Marte Pirates 33 0.309 2.96 0
3 Paul Goldschmidt Diamondbacks 31 0.323 2.17 1
4 Troy Tulowitzki Rockies 25 0.338 4.89 4
5 Shin-Soo Choo Reds 36 0.303 3.46 0
6 Manny Machado Orioles 32 0.318 4.42 0
7 Ryan Braun Brewers 25 0.322 2.70 0
8 Carlos Gonzalez Rockies 37 0.298 1.66 1
9 Daniel Nava Red Sox 24 0.289 2.26 2
10 Ian Kinsler Rangers 24 0.302 2.46 0

Pitching Leaderboards – Last 30 Days

Here is the Pitching Leaderboard, updated with the just the last 30 days’ stats.

This file is downloadable to Excel, and includes Fangraph.com‘s ‘playerid’ field, in case you want to tack on your own research.

And here are the last 30 days’ SP and RP leaders. Both of the performance stats, wArsenal+ and CK:B+, are set with a mean of 1.00. So, it would be correct to say that Fernandez’s wArsenal and CK:B are both 42% above the mean.

Rank Name Team GS IP/S wArsenal+ CK:B+
5 Jose Fernandez Marlins 6 5.5 1.42 1.42
7 Shelby Miller Cardinals 6 6.5 1.41 1.38
13 Josh Stinson Orioles 1 5.7 1.07 1.94
16 Phil Hughes Yankees 6 5.7 1.27 1.37
23 Clay Buchholz Red Sox 5 7.3 1.16 1.49
24 David Price Rays 6 6.3 1.31 1.23
  Burch Smith Padres 1 1.0 1.21 1.33
30 Matt Harvey Mets 5 6.9 1.55 1.12
33 Max Scherzer Tigers 6 7.2 1.15 1.40
35 Chris Sale White Sox 5 7.5 1.20 1.25
36 Hector Santiago White Sox 3 6.0 1.18 1.26

 

Rank Name Team RA wArsenal+ CK:B+
  Alex Burnett Orioles 1 1.03 8.32
  Alex Torres Rays 1 1.74 2.43
1 Sean Doolittle Athletics 11 1.70 1.62
  Will Harris Diamondbacks 3 1.41 1.85
2 Joba Chamberlain Yankees 5 1.44 1.69
3 Jonathan Papelbon Phillies 11 1.42 1.59
4 David Robertson Yankees 13 1.45 1.54
  Luke Putkonen Tigers 2 1.36 1.70
  Josh Lueke Rays 3 1.34 1.76
6 Jason Grilli Pirates 14 1.44 1.39
  Vinnie Pestano Indians 4 1.20 1.91
8 Jim Johnson Orioles 13 1.32 1.53
9 LaTroy Hawkins Mets 11 1.26 1.67
10 Addison Reed White Sox 11 1.37 1.36
11 Yoervis Medina Mariners 9 1.63 1.26

 

Thanks for reading. Follow me on Twitter at @OutliarBaseball.

Travel Schedules

I had been setting one of my fantasy rosters on a theory this season. That theory went something like this: if a team has a brutal travel schedule during a week, I should bench those players, since the excessive travel may wear them out.

For instance, the White Sox were in Chicago on Sunday, opened a three-game series in Minnesota on Monday, and then are off to Los Angeles to start a four-game series on Thursday. That is a lot of travel with zero off-days.

While the theory makes sense, I had to check the numbers on it. I looked at 2012 travel schedules by time zone. For example: how do Eastern Time Zone teams do when travelling to Central Time or Pacific Time, and how does that compare to road games within the Eastern Time Zone. I did the same for Central Time Zone teams and Pacific Time Zone teams. The Rockies are the only team in the Mountain Time Zone (during Daylight Savings), and with the added Coors Field effect, I left them out of this.

Here is the breakdown:

Eastern Time Zone (14 teams)

Travel To G BA OBP SLG
Other Eastern 5137 0.246 0.307 0.392
Central 1971 0.261 0.327 0.420
Pacific 1695 0.239 0.298 0.371
Home Games 8273 0.255 0.320 0.415

 

Central Time Zone (7 teams)

Travel To G BA OBP SLG
Eastern 283 0.252 0.305 0.388
Other Central 202 0.255 0.317 0.406
Pacific 150 0.225 0.285 0.347
Home Games 648 0.268 0.330 0.431

 

Pacific Time Zone (8 teams)

Travel To G BA OBP SLG
Eastern 181 0.256 0.313 0.405
Central 98 0.246 0.309 0.401
Other Pacific 241 0.250 0.311 0.391
Home Games 565 0.252 0.316 0.387
Japan 4 0.181 0.208 0.326

You may remember that the A’s and Mariners opened the season in Tokyo last year. You can see that the cross-hemispheric travel was probably tough on the offenses.

To summarize, travelling to the Pacific Time Zone is tough. This week when the Nationals visit Chavez Ravine and San Diego, they will be playing games until 1am Eastern Time, their local time. But to their opponents, the game will only run until 10pm, who did not have to travel for the game.

It does not work in reverse, though, because when the Dodgers travel to Atlanta this week, the games will only run until 7pm Pacific Time.

I suspect this would have a bigger effect on older players. I never had any trouble staying up until 1 or 2am when I was in my early twenties. Now that I am older, that is not the case.

The takeaway is to avoid (older) players who are visiting the West Coast, or to expect a bit of an offensive drop. Be sure to look at it the other way for pitchers.

Fanduel.com Guest Post – Trying to Figure Out David Price

It may only be six starts into the season, but David Price is looking like anything but the fantasy baseball stud and American League Cy Young Award winner he was a season ago. With a 1-4 record and a 6.25 ERA, neither him or his Tampa Bay Rays are having the type of season they envisioned. The question is, will Price bounce back?

 

Six starts seems like a small sample size, and for the most part, it is. However, there seems to be a major issue with Price right now when it comes to locating within the strike zone. He is actually on pace to allow the fewest amount of walks per nine innings in his career, so location isn’t that far off, but too many of his offspeed pitching are being left up and over the plate.

 

Two of his outings this year have been downright dreadful, and that includes giving eight earned runs and 10 hits in five innings to Cleveland, and nine earned runs and 11 hits in 6.2 innings to Colorado. Guys were not just getting lucky against him either. He is giving up hard hits, and his fastball is not there yet where he can rely on it to just blow hitters away.

 

Perhaps the biggest concern right now is that he is showing signs of this slow start getting to him. Against Chicago, he had a spat with umpire Tom Hallion. In his last start, he complained a few different times about his inability to get a good grip on the ball. Price has always been a laid back, stay out of the spotlight kind of pitcher. Now, he seems to be letting distractions get the better of him.

Before his Cy Young season, he was 12-13 with a 3.49 ERA in 2011. Wins and losses are a bad way to judge a pitcher, but it was obvious that he was merely a slightly above average pitcher then. Might 2012 prove to just be one magical season? No, he’s better than that. However, as far as fantasy baseball goes, he is not one of the elite daily fantasy sports pitchers in the game just yet. He is simply on the fringe until he can be overall more consistent.

Pitching Leaderboards

Here are the pitching leaderboards, updated with Games Started, Innings Pitched per Start, and Relief Appearances.

In case you don’t already know, Weighted Arsenal is a summary stat of a pitcher’s velocity and movement. Called Strike-to-Ball Ratio answers the question: how often does a pitcher hit the strike zone when the batter does not swing?

You can download this leaderboard directly to Microsoft’s Excel.

Also, here are the Top Ten SPs & RPs.

Name Team GS IP/GS RA wArsenal+ CK:B+
Jose Fernandez Marlins 6 5.2 0 1.42 1.30
Shelby Miller Cardinals 6 6.1 0 1.37 1.30
David Price Rays 7 6.4 0 1.33 1.27
Corey Kluber Indians 2 6.3 2 1.21 1.48
Justin Verlander Tigers 7 6.6 0 1.45 1.15
Phil Hughes Yankees 6 5.8 0 1.25 1.20
Alex Cobb Rays 6 7.0 0 1.12 1.44
Hector Santiago White Sox 2 6.2 7 1.18 1.25
Matt Harvey Mets 7 7.0 0 1.60 1.05
Clay Buchholz Red Sox 7 7.2 0 1.13 1.37

 

Name Team GS IP/GS RA wArsenal+ CK:B+
Aroldis Chapman Reds 0   16 2.02 1.46
David Robertson Yankees 0   12 1.43 2.02
Bobby Parnell Mets 0   14 1.59 1.47
Craig Kimbrel Braves 0   14 1.57 1.41
Mike Dunn Marlins 0   16 1.42 1.48
Jonathan Papelbon Phillies 0   13 1.29 1.70
Addison Reed White Sox 0   15 1.40 1.31
Tony Watson Pirates 0   15 1.32 1.35
Sean Doolittle Athletics 0   14 1.59 1.24
Jason Grilli Pirates 0   15 1.47 1.27

.

 

Pitching Matchups

Here is an updated pitching matchup table, with recommendations of who to target, and who to avoid. These stats are from the last 30 days. The teams are sorted by general matchupness.

Team H/9 K/H K/9 K/BB BB/9
Astros 9.0 1.16 10.4 3.83 2.7
White Sox 7.9 1.12 8.9 3.67 2.4
Blue Jays 7.9 1.09 8.5 2.78 3.1
Braves 8.6 1.15 9.9 2.71 3.6
Nationals 8.0 1.08 8.6 2.78 3.1
Pirates 8.7 1.03 9.0 3.02 3.0
Phillies 8.5 0.96 8.2 2.93 2.8
Cubs 8.5 0.90 7.6 3.10 2.5
Marlins 7.8 0.91 7.1 2.69 2.6
Mariners 8.7 0.97 8.4 2.67 3.2
Angels 9.3 0.90 8.4 2.98 2.8
Mets 8.3 1.03 8.6 2.34 3.7
Padres 8.9 0.96 8.5 2.56 3.3
Brewers 9.6 0.86 8.3 2.80 3.0
Rays 8.8 0.89 7.9 2.38 3.3
Twins 8.6 0.95 8.1 2.16 3.8
Diamondbacks 9.0 0.86 7.8 2.31 3.4
Cardinals 9.3 0.78 7.2 2.51 2.9
Red Sox 9.9 0.94 9.3 2.25 4.1
Reds 8.6 0.95 8.2 1.94 4.2
Indians 10.3 0.84 8.6 2.40 3.6
Royals 10.0 0.72 7.2 2.54 2.8
Orioles 9.5 0.77 7.3 2.26 3.2
Yankees 9.5 0.76 7.2 2.16 3.3
Giants 9.6 0.72 7.0 2.19 3.2
Dodgers 9.3 0.81 7.5 1.88 4.0
Athletics 9.1 0.86 7.9 1.63 4.8
Rockies 10.8 0.73 7.9 2.06 3.9
Rangers 9.8 0.67 6.6 1.91 3.4
Tigers 10.7 0.66 7.0 1.84 3.8

Actively target:

  • Astros
  • White Sox
  • Braves
  • Blue Jays
  • Nationals
  • Cubs
  • Marlins

Actively avoid:

  • Athletics
  • Tigers
  • Rockies
  • Rangers
  • Dodgers
  • Reds
  • Giants

Here are the average H/9, K/9, BB/9 of each group

  • Target: 8.2 / 8.7 / 2.9
  • Avoid: 9.7 / 7.4 / 3.9

 

The Fantasy Impact of Infield Hits

We know that getting Infield Hits (IFH) is a skill that peaks early in a career (around age 23-24) and goes away. This is due to the speed required to beat out a ground ball.

For your education, here is how I calculate Infield Average (IFA):

[ Infield Hits + Bunt Hits ] / [ At-bats - Strikeouts ]

How often has a player collected an infield hit on contact?

But how impactful is the IFA skill to fantasy baseball 5×5 stats? I have done some legwork to answer that question. Here is a table in which all counting stats have been converted to a 150-game-pace.

IFA   R HR RBI SB AVG
Elite   85 6 46 35 0.287
Top Third   86 14 67 21 0.280
Average   84 20 81 10 0.279
Low Third   80 24 88 5 0.278

You can see the speed aspect show through in the Stolen Bases.

Other than that, this skill takes away (in HR & RBI) more than it adds (in R & AVG). Players who are Elite IFHers are putting the ball on the ground often, and essentially stealing first base.

When I first started looking into this, I assumed that an elite IFHer would be able to inflate his AVG, but (as a group) they are not able to add more than a few points.

One important result of investigating IFA is that it forced me to also look at Outfield Average, or OFA. How often does a player get an Outfield Hit (OFH) on contact?

[ Hits - (Infield Hits + Bunt Hits) ] / [ At-bats - Strikeouts ]

The fantasy 5×5 impact of this stat is dramatic.

OFA   R HR RBI SB AVG
Elite   100 36 110 6 0.318
Top Third   93 28 97 9 0.298
Average   82 20 80 10 0.279
Low Third   75 12 61 16 0.261
Non-Elite   69 5 42 24 0.246

Far more important than barely inflating AVG with IFH is driving the ball to the outfield. When a player can do that well, they are a championship caliber fantasy player.

The result of this study, I would say, is to understand that we often combine two events (IFH & OFH) when looking at AVG. They are two separate events caused by two different skills, speed and hitting. And hitting is much more impactful and important.

Infield Hits & Aging

This is a continuation of the conversation on Infield Hits, and the effect they have on offensive performance.

My first assumption is that a player’s ability to collect IFH is heavily tied to how fast he can run. And speed is a skill players lose with age.

I worked up an aging curve for IFH.

Aging IFH

The blue line is a trendline meant to smooth out the actual data.

You can see that getting IFH is a skill that only decreases with time. In fact, this particular data set says that after the Age 23 season, the best a player can hope to be is below average.

The IFH Avg+ (set to a mean of 1.00) looks like this:

  • Through Age 23, 1.10
  • After Age 23, 0.88

This is a skill that goes away, and does so early in a career.

Even so, this age-related decline has a very small impact on Batting Average. The below-23s added about .011 points of AVG because of this phenomenon.

That said, elite IFHers can add around .050 AVG points over players who do not possess this skill.

Infield Hits

A very common sabermetric line of thinking is ‘Ground balls are good for pitchers.’

If that is true, then ground balls are bad for hitters. Following my recent posts on non-zero hitters, I began thinking about near-zero events, and have begun investigating ground balls for hitters.

Ground balls can be productive. They can move baserunners up, produce RBI, even generate hits. That last part calls for some investigation.

If a player got enough infield hits, his batted ground balls (which should not bring much production) may add a significant amount of production to his yearly totals.

If you buy this line of thinking, and I will have to do some more research to determine if I do, then Outfield Hits are driven, productive hits, and Infield Hits are rolling, relatively unproductive hits.

Since the start of 2012, the MLB leader in IFH is Nori Aoki, with 52. Over that time period, he has hit .285 and slugged .426. Without those 52 hits he averaged .199 and slugged .340. Now, .199 / .340 is not reflective of any skill level Aoki possesses, but I do wonder if collecting infield hits is a repeatable or regressive skill. More to come on that later.

For now, here is a list of hitters who have heavily relied on IFH for the production, listed with their Outfield Average and Slugging. Again, I still need to figure if this is significant; just consider it descriptive.

  • Nori Aoki – .199 / .340
  • Ben Revere – .206 / .251
  • Erick Aybar – .213 / .336
  • Emilio Bonifacio – .160 / .235
  • Jean Segura – .213 / .300

On the flip side, there is a player who has just one IFH in the last year-plus.

  • Alex Avila – .218 / .380

You can see that Avila’s OF AVG/SLG is better than everyone’s above. In fact, both numbers are league average. Avila has hit just .231 / .335 / .366 since the start of 2012. He has done that with a league-average ability to drive the ball, and a pathetic inability to sneak out more than a single infield hit.

What does that mean? I will let you know.