FanGraphs Logo

THT Projections: A (Quick) Closer Look

Earlier this week the much anticipated Hardball Times 2007 Season Preview was released, and with it a brand new projection system. I recently took a look at Bill James, CHONE, ZiPS, and the Marcel projection systems to see how they differed. Let’s throw THT into the mix and see where it has its major differences.

First off, let’s see how THT fares against the other projection systems in OPS and ERA as a whole when compared to the Marcel projection system (the simplest of the five).

System        ERA-R^2    OPS-R^2
ZiPS             .725       .908
Bill James       .714       .875
CHONE            .699       .865
THT              .681       .837

And in English, when comparing the other projection systems to the Marcel projection system, THT’s system is the least similar. (When look at batters with 300+ at-bats and pitchers with 100+ innings.)

So which batters does THT disagree on the most in terms of OPS?

Name            Bill James    CHONE   Marcel     THT    ZiPS
Frank Thomas          .939     .853     .874    .982    .892
Hanley Ramirez        .801     .791     .843    .714    .777
Robinson Cano         .860     .842     .852    .766    .836
Chris Duncan          .862     .776     .891    .753    .803
Melky Cabrera         .766     .796     .787    .715    .800

Except for Frank Thomas, who THT projects is going to have a phenomenal season, they’re the low point for the other four players. It’s interesting to note that those four are also first or second year major league players. There’s generally a lot of disagreement about Chris Duncan and Hanley Ramirez, but the THT projections for Robinson Cano and Melky Cabrera appear to be the sole point of difference. Let’s look at the pitchers:

Name            Bill James    CHONE   Marcel     THT    ZiPS
Tony Armas Jr.        4.85     4.64     4.96    5.81    4.88
Carlos Zambrano       3.40     3.47     3.48    2.77    3.46
Cliff Lee             4.43     4.20     4.48    5.04    4.55
James Shields         4.03     4.29     4.72    5.03    4.70
Brandon Webb          3.53     3.60     3.65    3.07    3.85
Randy Johnson         4.31     3.77     4.33    3.43    3.63

THT clearly hates Tony Armas Jr. (more) with his ERA about a point higher than the others, while they love Carlos Zambrano who they have at about a .75 lower ERA than the other systems. I threw in Randy Johnson since he was next on the list. It looks like the projections are pretty well divided for him between the 4.30-ish ERA, and the 3.50-ish ERA.

Anyway, the THT projections are certainly similar to the others, but there are clearly a number of key differences which are definitely worth a look. There’s also a lot more to projections than ERA and OPS, so I’m sure you’ll find many other unique aspects to THT’s projection system. Like with any projection system, we’ll have to wait and see which one happens to be the most accurate for 2007.


Print This Post Print This Post
David Appelman is the creator of FanGraphs.

5 Responses to “THT Projections: A (Quick) Closer Look”

You can follow any responses to this entry through the RSS 2.0 feed.
Click here to view comments in a non-threaded output.
  1. tangotiger says:

    I would drop the threshhold for pitchers to 60 IP, or even 50 IP. Otherwise, you are essentially only looking at starters.

    Or, you can not have a threshhold at all, and simply do:
    sum(PA * abs(Marcel minus x) ) / sum(PA)
    where PA is the PA or IP in system X, and “x” is OPS or ERA of system X

    This will neatly bypass the issue of selective sampling.

    You should further, if you want to be fair, do this:
    The OPS of Jeter under Marcel would be: his OPS minus the league OPS that Marcel expects. Repeat for each of the systems. And to figure the league OPS, you would take the forecasted OPS and PA of each system in question.

    Vote -1 Vote +1

  2. I don’t understand what sum(PA * abs(Marcel OPS – x OPS) / sum(PA) is doing, and what am I’m supposed to apply it to. I took a look at doing it for each player and that doesn’t look right to me.

    Just for kicks I looked at every player that the 5 projection systems all had projections, and then I ranked the players from 1 to n by ERA/OPS. I thought this might be another way to solve the projection context issue.

    System        ERA-R^2    OPS-R^2     ERA-Rank     OPS-Rank
    ZiPS             .584       .803         .569         .758
    Bill James       .547       .795         .540         .746
    CHONE            .523       .793         .502         .710
    THT              .496       .742         .478         .661

    The correlation for pitchers really takes a dive. I guess it might be better to look at starters and relievers in two separate groups?

    Vote -1 Vote +1

  3. tangotiger says:

    Suppose you have:
    player, Marcel OPS, THT OPS, PA
    Jeter, .900, .800, 600
    ARod, .950, .900, 500
    Matsui, .700, .800, 400

    First you take the absolute difference:
    Jeter, .100
    ARod, .050
    Matsui, .100

    You multiply each of those by the PA:
    .100 x 600
    + .050 x 500
    + .100 x 400
    = 125

    The total weights is: 600+500+400 = 1500

    The average difference is: 125/1500 = .083

    ***

    At the end of the season, you would simply use the actual PA. Before the season, I would use the THT PA when comparing THT to Marcel, and ZIPS PA when compared ZIPS to Marcel, etc.

    However, that may not make for an easy comparison. You could instead use Marcel PA for each comparison.

    Vote -1 Vote +1

  4. bob says:

    Question: I already bought the THT 2007 Preview book, does anyone know if the THT projections are available in a nice Excel file like Marcel/CHONE/ZiPS/PECOTA/James?

    Vote -1 Vote +1

  5. David says:

    Yep, they’re available in a password protected directory that you can find halfway down the second column on page 11 of the book.

    Vote -1 Vote +1

Leave a Reply


Player Linker - Contact Us - Terms of Service - Privacy Policy