## Fielding Independent Offense: Part 1

IT’S SO *** **** HARD TO THINK

WITH ALL THESE DUCKS EVERYWHERE!

In August of 2011, I introduced **Should Hit** (in three iterations: ShH, SHAP!, and Complete SHAP!). Should Hit is essentially a simple regression of walk rates, strikeout rates, home run rates, and BABIP on weighted runs created plus (wRC+). In both its calculation and its simplicity, it is very similar to FIP — but its uses and impact are quite unlike FIP.

Like FIP with groundball pitchers, the formula has some biases — known, accepted (by me, at least) biases. For instance, because it ignores doubles and triples completely, Should Hit naturally undervalues players who excel at the extra bags and overvalues to the sluggers stuck at first. It presumes a certain number of doubles and triples for every player based on their home run rate and other peripherals — all poor proxies for something that is a verifiable skill or weakness in many players.

Ultimately, though, the tools (ShH and its brethren) work rather well. For the curious thinker, ShH can admirably predict what a player might hit with a normal/career BABIP or if their BB% or K% or HR% changes. However, at the time of its uncovering, I was wrongly under the impression that the current FanGraphs iteration of the wRC+ formula **did not include stolen bases**. It mattered little to me at the time — the only reason I thought the uncovering was so interesting to begin with was that only four peripherals could explain almost 93% of the variation within wRC+ (and that is still amazing to me!)

But today, we are going to add in SBs and stand back with a decanter of thought and ask ourselves: “What the hell did we just make here?”

Studying economics in graduate school, I learned the one, core truth about the art of research. Allow me to relay the basic gist of my conversation with the professor:

“So [this economist] came up witd* [this idea], but afterwards, when nations and international bodies tried to implement his idea and it didn’t work, what did the economist do?” asked Turkish Professor.

“Write a book about it?” I said.

“Yeah, he went out and wrote a bunch of papers explaining why it didn’t work and why everybody didn’t understand it. And that’s how the field works: You come witd and idea; you publish some papers on it; and then you spend the rest of your career saying it’s right.”

**He was Turkish. That’s how said “with” — he also said “turd” instead of “third.” And that’s why we loved him.*

So, I’m going to break this — shall we say — Turkish rule of American economics by admitting my first impressions were off. As noted above, I initially thought wRC+ (here at FanGraphs, not as a whole) did not include steals, so by disregarding steals I was only doing myself a favor.

I tried to look for a suitable proxy for doubles and triples — a proxy specifically because I felt including *actual* doubles and triples would sully the defensive independent ideals of the tool. Unfortunately, as I observed here, the most logical proxies — steals and speed scores — were terrible predictors of doubles and triples.

So, I (and I still think rightly, given those circumstances) disregarded steals as a proxy for extra base hits. But let us pause a moment to consider some facts about the very nature of defensive independent statistics — more importantly, why? Why should it have been so important to me to keep the metric defensive independent?

- •Defensive independent stats became important following the earth-sundering study by Voros McCraken in ancient America, some 100 sabermetric years ago, wherein he discovered and concluded:
**Pitchers cannot control where the ball goes.**This led eventually to Tom Tango’s FIP, which to this day remains as one of the absolute best pitching evaluation metrics available. - •
**Hitters — unlike pitchers — actually**Not only do players normalize in a wider range of BABIPs, they also can have differentiating skill levels with regards to hitting doubles and triples (which is essentially a product of gap power and foot speed) as well as infield singles and RBOE.*do*have some control over a pitch’s in-play result. - •
**Why, then, would you need a defensive independent hitting metric?**Defense plays into a hitter’s stats*much*less than it does a pitcher — hitters are interacting with (as in hitting against) a different defense at least every four games, while pitchers are interacting with (as in playing in front of) virtually the same defense throughout the season. - •Defensive independent hitting, therefore, is not going to give us any clearer information than wOBA or wRC+ if it comes to us as a
**static metric**.

You, dearest reader, will notice I have carefully avoided referring to ShH and the like as metrics or statistics — because, in truth, they are not. ShH, SHAP!, and Complete SHAP! are *tools* at best. They can give a nice static number, but they are best served in helping us identify fluctuations and their causes.

But let’s save that bit for Part 2. ;)

As promised, I have once again run my Regression Machine, with the dial set to “Include Steals This Time.” More specifically, I regressed BB%, K%, HR%, BABIP, and SB-rate (as in: SB divided by PA — I’ll explain! Just wait a second!) on wRC+. Which gives us approximately this, what I call **Fielding Independent Offense** (FIO, pronounced *fye-oh*):

**FIO** = -51.57 + 275.21(**BB%**) – 180.52(**K%**) +

1184.34(**HR%**) + 151.75(**SB/PA**) + 422.14(**BABIP**)

I call this Fielding Independent Offense instead of Batting or something similar because it includes

Here is what it looks like compared to wRC+:

And here is what ShH looks like compared to the wRC+ of that same group:

So we gain a sliver of R-squared, which is nice, but more importantly, you will notice the plotting is much, shall we say?, tighter in the first graph — fewer dots scattered away from the line like a salt spilled from a diner salt shaker. Also, the standard deviation of the difference from wRC+ is considerably lower, and the median and average both equal exactly 0 — unlike ShH, where they equal 1 and slightly more than 0, respectively.

**Important to note here:** My Regression Machine only included — brace yourself — three years of data. Yeah, I’m crazy, right?

Here’s the deal, and I looked at this a LOT with ShH; the simple fact is that the MLB run environment is *really* rapidly changing. Using data from 2000 to present (or even larger) would essentially be like writing an article about the life on top of Mount Everest whilst I’m actively scaling down its side. We do not know quite where, or if, the league will bottom out (or if it will go back up). Run production has gone down two years in a row and trying to use more than three years of data — as I did with my very first incarnations of ShH — just creates an upward, steroid-fueled bias.

So instead, I am using 2009 to 2011 combined seasons of “qualified” hitters, while just happily saying, “When this whole run environment thing calms down, I’ll go ahead and recalculate everything, so shut up.”

Anyway, we’ll get to the testing of this puppy in a little bit, but for now, let us end by taking one more look at the doubles and triples problem. To do this, I tried a couple of different method until I arrived at this: I took the doubles and triples for each player, divided them by their PAs (working with this 2009 to 2011 dataset again). Using these rates, I multiplied them by their *The Book* run expectancy, and then added them together, effectively created a wRC for doubles and triples per plate appearance. Comparing that to FIO minus wRC+, we get this:

Yeah, the bias is still very much there, but at least it only explains about 14% of the deviations away from wRC+. In an interesting, if not frustrating, twist, the R-squared here is larger than in the same study using ShH — meaning the doubles and triples bias explains more in FIO than in ShH. But when we consider that FIO uses SB and is thus explaining more overall — and that the coefficient is less steep in the FIO bias study (-127 versus -141 with ShH) — then it pretty much comes out more of a wash.

Ideally, the R-squared would be near zero, but **FIO is not meant to replace wRC+**, so this problem is less of a THING, more of a thing.

If we look at the specific places where FIO and wRC+ deviate from each other, we can actually notice a really interesting and important bias:

Tell me if you can spot some common traits in the 11 players FIO overvalued the most:

*Player, difference between wRC+ and FIO*

Ian Stewart, +13

Willie Bloomquist, +10

Todd Helton, +10

Ryan Spilborghs, +9

Elvis Andrus, +9

David Murphy, +9

Alexei Ramirez, +8

Scott Podsednik, +8

Troy Tulowitzki, +8

Pedro Feliz, +8

Carlos Gonzalez, +7

That’s right, the majority of them play for — or have played for — the Colorado Rockies, the Chicago White Sox, or Texas Rangers. We see a similar, though not as dramatic, bias on the other side of the spectrum:

Josh Willingham, -8

Kevin Kouzmanoff, -8

Shane Victorino, -8

Ryan Braun, -8

Will Venable, -8

Rickie Weeks, -8

David Eckstein, -9

Carlos Quentin, -9

Chase Utley, -10

Jose Reyes, -10

Andres Torres, -11

A number of speedy guys who play in pitcher’s parks. It makes sense. FanGraph’s wRC+ accounts for parks, FIO does not (and frankly does not need to).

So FIO has some bias — as did wRC+ — but overall, it comes *even closer* to predicting a player’s offensive output than ShH does. Does it completely ignore fielding? No. Heavens no. BABIP has a LOT to do with fielding (which is one of the reason FIO makes a better tool than a stat) as well as SBs. In fact, but ignoring caught stealing — which merely helps to keep the calculations simple — we are assuming each of these base-thieves is a steal master and that the defenses they play against are all equal, neither of which is true.

Let’s get to Part 2 already, and use this freakin’ thing!

For digestive and giggling purposes, here is the same order of players as the most recent graphs, except set to the tune of ShH minus wRC+ — which helps illustrate how the average change from switching from FIO to ShH does not fully represent the real change, on a player-by-player basis, it is making:

(ALSO: Note the difference in scale used here and the preceding graph.)

*Go here for Part 2.*

Print This Post

Bradley you make the best graphs.

Thanks Jon!

Why are you graphing FIO (a stat that’s clearly not park-adjusted, as it includes HR% and BABIP) against a park-adjusted stat like RC+? Why not just use wOBA (the not park adjusted version of RC+)? That way, when looking at the biggest differences, we’d be seeing where FIO is “wrong” instead of just seeing a bunch of Rockies and Padres players.

Then again, I’m not sure what use FIO really has in the first place.

Great stuff as usual.

Will FIO be available on google doc’s as well?

Yeah, I may try to throw one into Part 2 if I have enough space.

Keep this up, Bradley, and we’re going to have certain baseball sites using FIO to calculate WAR. I mean, we’ve already got one using FIP for………well, you know.

Why not include the R-squared from the original regression? Also, the fact that alpha is not 0 and beta is not one suggest that this measure is biased. Why would it be biased?

Sorry, this looks to me like just a great deal of work for little or no value.

I will wholeheartedly agree its value does not compare to FIP, but it certainly has it’s uses — which I will offer in Part the Second.

How well does the accuracy of FIO compare with FIP? Please tell me this will go on the player’s pages!

Well this looks like a colossal waste of time for both you and me. For the record, upon what degree did your parents waste their money?

No degree. I “wasted” a LOT of Future Brad’s money on an economics degree.

Thanks for your input.

I hate it when people say “how much of your parents money did you waste”. Some of us actually pay our own way through school and aren’t drunk, adderall takin’ pieces of shit who are only going to school so that daddy can justify hiring you.

Yea, that’s a topic that I get fired up about. It’s because virtually every kid I know who’s parents pay his/her way is a brat and I wish their seat in class would be given to a more driven human being.

You do have to admit that Wet Blanket’s name was perfect for his comment.

When you complete this, I think you should write a very concise explanation of when to use FIO versus any other metric.

Good idea.

First of all, this is amazing work as always Bradley. I find your mix of wit, graphs, and clear language very enjoyable.

Second of all, why not use wOBA instead of wRC+? Since you’re not trying to normalize for park and league factors, which is the main difference between wOBA and wRC+, then it seems like you might as well just use the former, right?

THIRD of all, I commend you for responding to almost every comment, even the trolls, with thoughtfulness and without snark.

Thanks Matt. I’m waiting for a bus, so I’ve got the time. :)

Well, one of the main reasons for using wRC+ was the + helped with the changing run environments. It doesn’t cancel it out entirely (as noted above), but since the league wOBA has gone from .332 to .316 in just five years –while wRC+ remains at 96 — wRC+ invites less problem on that front.

http://www.fangraphs.com/leaders.aspx?pos=all&stats=bat&lg=all&qual=0&type=8&season=2011&month=0&season1=2001&ind=0&team=0,ss&rost=0&players=0

But your question strikes a chord in my research ligaments, so I’ll take a gander back to my charts and give it another look.

Could you personally just create wRCBRAD+ in which you standardize wOBAs without regard to park? I mean, I’d bet wOBAs are pretty normally distributed, so you could just give everyone Z-scores and WOOHOO!

Right?

Could you bring Carl Crawford, Jason Werth and Adam Dunn’s FIO results to the front? It would seem that FIO disagrees with the + in wRC+

Interesting suggestions! I already had plans to show Werth and Dunn. I’ll throw in Crawford as well.

REGRESS ALL THE BABIPS PLZ

Hey! HEY! Fangraphs Staff! You posted this NotGraphs article in the wrong place! I SAAAAID… YOU

If you divide your coefficients for SO, BB, HR by 91, you get:

-2

3

13

Which are, of course, the FIP coefficients.

I’m not sure why you have BABIP in there, other than to show its impact. Obviously, in a FIO equation, you have to take that out.

So, you are advocating justing turning this into FIP? You know if you use the FIP equation for batters, Ian Kinsler’s FIO (FIB? FIH?) would be .748.

I had this discussion somewhere, somehow. You use wOBA, and in the place of the in-park batted balls, you set them to “.300″. And the rest, you use the wOBA equation.

So…..

0.0 SO

0.3 everything else

0.7 BB, HB

2.0 HR

Yeah, I feel to see how this is “defense independent” whatsoever when it has BABIP in it.

It just seems like a sloppy way to approximate wOBA. The wRC+ thing confuses me too, because “FIO” isn’t park-adjusted.

Hey! HEY! Fangraphs Staff! You posted this NotGraphs article in the wrong place! Get this thing out of here!

“In fact, but ignoring caught stealing — which merely helps to keep the calculations simple — we are assuming each of these base-thieves is a steal master and that the defenses they play against are all equal, neither of which is true.”

I’m I was shaking my head in disapproval every once in a while, and then I read this and punched my f***** monitor.

Dude, put down the tequila bottle.

This is largely unreadable.

strong agree.

Wouldn’t it make more sense to make a FIH stat over TYPES of balls in play? Wouldn’t it make more sense find some general rules for BIP and go from there? For example, batters are likely to be able to control their BA on ground balls. But what about LDs and FBs in play? Esp if you were to adjust for park. If so, what would happen if you took IFFBs out of the equation? Or just as SIERA says that pitchers that strike out more batters produce lower BABIPS and HR/FB, what if batters who strike out more/less have higher/lower BABIPs? A SIERA type FIH would be interesting.

I don’t know, I’m no expert. But it just seems to me like doing FIH based off the knowledge of “what batted balls- as a rule do- what” would make more sense. Esp since we know batters have more control over batted ball type than pitchers.

Bradley,

just out of curiosity, would you save bad eyes some effort, and maybe mark the switch hitters?

Great article, waiting eagarly on Part duo.

I enjoyed this thoroughly and look forward to part duex, but was that a coke reference?

What if you took a similar equation as the tERA and applied it to hitters, then had some sort of “speed” coefficient. For example, a league average speed guy gets the same worth out of a line drive, a groundball, a flyball, etc. A slow guy probably gets a few less bases on a GB, FB, LD, but a fast guy probably gets more bases.

You don’t need to take fielding out, just adjust for how hard the ball is hit and in which direction and then tweak it based on how fast the guy is.

So…FIP can be recalculated very easily to adjust for different run environments. Do you think you can get FIO to the point where you can say…

The basic formula for FIO is X and if you think the run environment is different than the default setting Y, then you can easily adjust the formula by doing _______.

O Hit f/x, where art thou?

Wether this is useful or not, I whole heatedly appreciate the humble attitude.