​
​
Sign In
  • Support FanGraphs
    FanGraphs Membership
    FanGraphs Shirts
    FanGraphs Mugs
    Gift a Membership
    Donate to FanGraphs
  • Fantasy
    Fantasy Tools
    Fantasy Player Rater
    Auction Calculator
    Ottoneu Fantasy Baseball
    Signup, FAQ, Blog Posts
  • Blogs
    Blog Roll

    FanGraphs
    • Andrew Abbott Merits More Attention (And He’s Getting It Here)
    • The Brewers Are What We Expected, but Also Better
    • 2025 Trade Value: Nos. 21-30
    • Catching up With the ZiPS Top 100 Prospects, 2025
    Podcasts: Effectively Wild

    FanGraphs Prospects

    RotoGraphs
    • Paul Sporer's Baseball Chat - July 23rd, 2025
    • Position Player Playing Time Changes: July 23, 2025
    Podcasts: The Sleeper and The Bust | Field of Streams | Beat the Shift

    Community Research
    • Effectively Wild's Preseason Predictions Game Update: Ben Clemens

    Archived Blogs: The Hardball Times | NotGraphs | TechGraphs | FanGraphs+
    Archived THT: THT Live | Dispatch | Fantasy | ShysterBall
    Archived Podcasts: FanGraphs Audio | Chin Music | UMP: The Untitled McDongenhagen Project | Stealing Home | Doing It For Bartolo | OttoGraphs |
  • Projections
    2025 Pre-Season Projections
    ZiPS, ZiPS DC
    Steamer
    Depth Charts
    ATC
    THE BAT, THE BAT X
    OOPSY
    2025 600 PA / 200 IP Projections
    Steamer600, Steamer600 (Update)
    2025 Updated In-Season Projections
    ZiPS (RoS), ZiPS (Update), ZiPS DC (RoS)
    Steamer (RoS), Steamer (Update)
    Depth Charts (RoS)
    ATC DC (RoS)
    THE BAT (RoS), THE BAT X (RoS)
    OOPSY DC (RoS)
    3-Year Projections
    ZiPS 2026, ZiPS 2027
    On-Pace Leaders
    Every Game Played, Games Played %
    Cy Young Award Projections

    Auction Calculator
  • Scores
    Today
    Live Scoreboard, Probable Pitchers
    Live Daily Leaderboards
    Win Probability & Box Scores
    2025, 2024, 2023, 2022, 2021, 2020, 2019
    AL Games
    MIL (10) @ SEA (2)Final
    BAL (2) @ CLE (3)Final
    NYY (4) @ TOR (8)Final
    CHW (11) @ TBR (9)Final
    ATH (1) @ TEX (2)Final
    NL Games
    CIN (5) @ WSN (0)Final
    SDP (2) @ MIA (3)Final
    SFG (9) @ ATL (3)Final
    DET (1) @ PIT (6)Final
    LAA (3) @ NYM (6)Final
    KCR (8) @ CHC (4)Final
    STL (0) @ COL (6)Final
    HOU (4) @ ARI (3)Final
    MIN (3) @ LAD (4)Final
    BOS (9) @ PHI (8)Final/11
  • Standings
    2025 Projected Standings
    2025 Playoff Odds, Playoff Odds Graphs
    2024 ZiPS Postseason Game-By-Game Odds
    AL East
    Blue Jays60420.0
    Yankees56464.0
    Red Sox55496.0
    Rays53507.5
    Orioles445715.5
    AL Central
    Tigers60430.0
    Guardians51508.0
    Royals505310.0
    Twins495310.5
    White Sox376623.0
    AL West
    Astros60420.0
    Mariners54486.0
    Rangers53507.5
    Angels495311.0
    Athletics426219.0
    NL East
    Mets59440.0
    Phillies58440.5
    Marlins485310.0
    Braves445714.0
    Nationals416117.5
    NL Central
    Brewers61410.0
    Cubs60421.0
    Reds53508.5
    Cardinals52519.5
    Pirates426119.5
    NL West
    Dodgers60430.0
    Padres55474.5
    Giants54496.0
    D-backs505310.0
    Rockies267633.5
  • Leaders
    Major League Leaders
    Batting: 2025, 2024, 2023, 2022, 2021, Career
    Pitching: 2025, 2024, 2023, 2022, 2021, Career
    Fielding: 2025, 2024, 2023, 2022, 2021, Career
    Major League Leaders - Rank
    Batting: Ranking Grid, Compare Players, Compare Stats
    Pitching: Ranking Grid, Compare Players, Compare Stats
    Splits Leaderboards
    Pitch-Type Splits Leaderboards
    Season Stat Grid

    Postseason Leaders
    Batting: 2024, (WS), (LCS), (LDS), (WCS), Career
    Pitching: 2024, (WS), (LCS), (LDS), (WCS), Career

    Spring Training Leaders
    Batting: 2025, 2024, 2023
    Pitching: 2025, 2024, 2023

    KBO Leaders
    Batting, Pitching
    NPB Leaders
    Batting, Pitching

    Minor League Leaders
    AAA: International League, Pacific Coast League
    AA: Eastern League, Southern League, Texas League
    A+: Midwest League, South Atlantic League, Northwest League
    A: California League, Carolina League, Florida State League
    CPX: Arizona, Florida
    R: Dominican Summer League
    College Leaders
    Batting, Pitching

    WAR Tools
    Combined WAR Leaderboards
    WAR Graphs
    WPA Tools
    WPA Inquirer
    Rookie Leaders
    Batters 2025, Pitchers 2025
    Splits Leaders
    Batters: vs L, vs R, Home, Away
    Pitchers: vs L, vs R, Home, Away
  • Teams
    Team Batting Stats
    2025, 2024, 2023, 2022, 2021, 2020
    Team Pitching Stats
    2025, 2024, 2023, 2022, 2021, 2020
    Team WAR Totals (RoS)
    AL East
    Blue Jays  |  DC
    Orioles  |  DC
    Rays  |  DC
    Red Sox  |  DC
    Yankees  |  DC
    AL Central
    Guardians  |  DC
    Royals  |  DC
    Tigers  |  DC
    Twins  |  DC
    White Sox  |  DC
    AL West
    Angels  |  DC
    Astros  |  DC
    Athletics  |  DC
    Mariners  |  DC
    Rangers  |  DC
    NL East
    Braves  |  DC
    Marlins  |  DC
    Mets  |  DC
    Nationals  |  DC
    Phillies  |  DC
    NL Central
    Brewers  |  DC
    Cardinals  |  DC
    Cubs  |  DC
    Pirates  |  DC
    Reds  |  DC
    NL West
    D-backs  |  DC
    Dodgers  |  DC
    Giants  |  DC
    Padres  |  DC
    Rockies  |  DC
    Positional Depth Charts
    Batters: C, 1B, 2B, SS, 3B, LF, CF, RF, DH
    Pitchers: SP, RP
  • RosterResource
    Current Depth Charts
    AL East
    Blue Jays
    Orioles
    Rays
    Red Sox
    Yankees
    AL Central
    Guardians
    Royals
    Tigers
    Twins
    White Sox
    AL West
    Angels
    Astros
    Athletics
    Mariners
    Rangers
    NL East
    Braves
    Marlins
    Mets
    Nationals
    Phillies
    NL Central
    Brewers
    Cardinals
    Cubs
    Pirates
    Reds
    NL West
    D-backs
    Dodgers
    Giants
    Padres
    Rockies
    In-Season Tools
    2025 Closer Depth Chart
    2025 Injury Report
    2025 Payroll Pages
    2025 Transaction Tracker
    2025 Schedule Grid
    2025 Probables Grid
    2025 Lineup Tracker
    2025 Minor League Power Rankings
    Offseason Tools
    2025 Free Agent Tracker
    2025 Offseason Tracker
    2025 Opening Day Tracker
  • Prospects
    Prospects Home
    The Board
    The Board: Scouting + Stats!
    How To Use The Board: A Tutorial
    Farm System Rankings

    Top Prospects List
    20252024
    AL
    BALCHWATH
    BOSCLEHOU
    NYYDETLAA
    TBRKCRSEA
    TORMINTEX
    NL
    ATLCHCARI
    MIACINCOL
    NYMMILLAD
    PHIPITSDP
    WSNSTLSFG
    2025 Preseason Top 100
  • Glossary
    Library
    Batting Stats
    wOBA, wRC+, ISO, K% & BB%, more...
    Pitching Stats
    FIP, xFIP, BABIP, K/9 & BB/9, more...
    Defensive Stats
    UZR Primer, DRS, FSR, TZ & TZL, more...
    More
    WAR, UBR Primer, WPA, LI, Clutch
    Guts!
    Seasonal Constants
    Park Factors
    Park Factors by Handedness
  • Sign In
Crowdsourced Trade Value: Pick between player matchups to create your own trade value list!

Nate Silver and Imperfect Modeling

by Dave Cameron
November 7, 2012

If you’re reading FanGraphs, you’re probably familiar with Nate Silver. He’s known nationally now for his political projections at Five Thirty Eight, but of course he made his name on the internet writing about baseball, creating the PECOTA projections, and penning some of the best articles about the economics of baseball written over the last decade.

Even if you’re not a political junkie, it was hard to get away from discussions about Nate Silver over the last few weeks. The final few weeks of the election saw a Nate vs Pundits fight that looked like something straight out of Moneyball. Last night, my Twitter feed probably had more references to Nate Silver than either Barack Obama or Mitt Romney. Needless to say, the performance of his model was a major storyline during last night’s election, especially if you were following the election through the eyes of people who write about baseball for a living.

If you haven’t already heard, Nate’s model did pretty darn well. As in, he got every projection right, going 49 for 49 in states that have projected winners and nailing the fact that Florida was basically a coin flip. But, I’m not writing this post to talk about how Nate Silver is a witch or to bleed political discussion over into yet another area of your life, but instead, I think that there’s an important takeaway from this that applies to baseball and what we do here at FanGraphs: the fact that imperfect models with questionable inputs can still be quite useful.

Nate’s model was similar in structure to many other polling aggregators, including one from Princeton that was even more aggressive with its conclusions. In general, the argument against these models is that the inputs they were using — the polls themselves — were of questionable value and that they were essentially guessing at things like voter turnout based on assumptions that might not hold true anymore.

Even Nate acknowledge the truth in some of these criticisms, as polling data can problematic, and the people collecting the data can have biases that skew the results one way or another. No one thinks polling data is perfect, nor should we think Nate’s model perfectly corrects for these biases because of the results of the voting last night. The “it’s a projection, not a prediction” line cuts both ways – we can’t note that the model could have still been right had the results been different last night while believing that the results prove that the model was clearly right to begin with. The critiques of the model that were true a few days ago are still true today. Criticisms of Nate’s methodology are still valid, as a perfect result in one election does not prove that the model is without flaw.

But, hopefully, we can note that a model does not have to be perfect to be useful, and perhaps we can move away from the idea that imperfect — and even biased — data should be discarded until it can be perfected. In baseball, we deal with a lot of biased data and imperfect models. Colorado is a perfect example. The raw numbers from games a mile high can’t be taken at face value because of the atmosphere, and changes to the environment — such as the introduction of the humidor — make applying park factors to that data a bit of a guessing game. We’ve seen offensive levels in Denver shift back and forth over the years, and we certainly don’t have a perfect way of explaining or accounting for those shifts. If we were to project the 2013 run environment in Coors Field, we’d have to deal with a lot of moving parts, many of which require assumptions that we can’t test, and there’s a decent amount of uncertainty that would surround that projection. But that doesn’t mean we shouldn’t try.

Whether it’s FIP, UZR, ZIPS, the Fan’s Scouting Report, or especially WAR, pretty much every statistical model that we host here on FanGraphs contains some inputs that can be legitimately questioned and requires some assumptions that don’t always hold. These models are imperfect, and the data that goes into them can be biased. But, that doesn’t mean that the alternative of discarding them and just accepting any conclusion as equally valid is an improvement.

That’s essentially where the pundits went wrong with Nate’s model. They didn’t like the conclusions, and some of them raised valid concerns about polling data and whether Nate’s adjustments added or subtracted from simpler, more transparent techniques. But to discard the model entirely was silly, and to pretend like the race was a toss-up was simply wrong. Throwing out the imperfect model with biased data was worse than taking it at face value.

In reality, we shouldn’t do either. The models showed their usefulness last night, but they’re still not perfect, and we shouldn’t just blindly accept every conclusion they spit out in the future. But, we don’t need to discard these models simply because we’ve figured out where their weak points are either. It’s not an either/or situation. We can be informed by imperfect models without being slaves to them.

WAR can inform our opinion of Mike Trout’s value relative to Miguel Cabrera without us turning the MVP Award into the Whoever Has The Highest WAR Award. We can acknowledge the shortcomings of defensive metrics and park factors while also applying the lessons they can teach us in an intelligent way. We can note that FIP doesn’t work very well for Jim Palmer without using that as a reason to keep evaluating pitchers by ERA instead.

Last night was undoubtedly a win for data-based analysis, but let’s be honest, the results don’t always turn out that well. Just as we shouldn’t have discarded Nate’s model had the results been different, we shouldn’t believe his model is perfect because the results did line up with what he projected. His model is still imperfect, but it’s also still useful.

Let’s not let the perfect be the enemy of the good. If we want a takeaway from the Nate Silver vs Pundits argument, let’s note that the pundits went wrong when they discarded his insights because they didn’t like the results and because they assumed the data was too biased to be useful. If a model doesn’t occasionally challenge our preconceived notions of what’s true, it’s not helpful to begin with, and even a model with problematic datasets can still provide useful information that can help inform our decisions.

The takeaway from last night shouldn’t be “always trust Nate Silver” or “always trust the data”. The takeaway should be that even mediocre data is often better than no data, and when you put mediocre data in the hands of smart people who understand its limitations and adjust accordingly, it can become quite useful indeed.





Daily Notes: Three Notable Minor-League FAs of Note
 
FanGraphs Chat – 11/7/12

Dave is the Managing Editor of FanGraphs.

84 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
Steve 1
12 years ago

Too bad the mainstream ‘baseball pundits’ won’t read a word of this.

4
Jack Weiland
12 years ago
Reply to  Steve 1

The correct term is “lamestream.”

29
Gary York
12 years ago
Reply to  Steve 1

Also it’s too bad that the mainstream “political pundits” won’t read a word of this.

5
jason B
12 years ago
Reply to  Gary York

I think the mainstream punidtry wasn’t questioning the model or the results – they were pretty firmly entrenched in the Obama camp (this isn’t meant to be controversial or inflammatory, most members of the media looked favorably on Obama). But you’re correct in that the right-leaning punditry who should read this and glean some lessons from this whole episode likely won’t (read, or learn).

-1
David
12 years ago
Reply to  Gary York

Of ALL the asinine lazy language that’s made it into political discourse over the last decade (and that’s obviously a tremendously large pool in which to fish), “mainstream media” has got to be the stupidest.
In the last week alone, I have read columns by people who are paid to write about politics by the five largest circulation newspapers in the country, as well as columnists/opinion writers/foisters of drivel from papers in three other of the 10 largest cities in the nation assert in various pieces that the very topic they are addressing will not be addressed in the “mainstream media.”
Newsflash… the fact that your sentence was published in a major newspaper of record by definition means you are wrong. And whiny.
[/offtopicrant]

-1
channelclemente
12 years ago
Reply to  Steve 1

Their Least Coast media’s motto, beware of geeks bearing gifts.

1
the fluMember since 2016
12 years ago
Reply to  channelclemente

Given that this Fangraphs, shouldn’t that be “bearing GIFs.”

12
  • Alex Chamberlain
    Post Count: 13
  • Ben Clemens
    Post Count: 1392
  • Ben Lindbergh
    Post Count: 2328
  • Dan Szymborski
    Post Count: 1131
  • David Appelman
    Post Count: 907
  • David Laurila
    Post Count: 2081
  • Davy Andrews
    Post Count: 285
  • Eric Longenhagen
    Post Count: 952
  • Eric Longenhagen and James Fegan
    Post Count: 9
  • Eric Longenhagen and Travis Ice
    Post Count: 15
  • Esteban Rivera
    Post Count: 119
  • FanGraphs Staff
    Post Count: 29
  • Jake Mailhot
    Post Count: 324
  • Jason Martinez
    Post Count: 109
  • Jay Jaffe
    Post Count: 1709
  • Jon Becker
    Post Count: 80
  • Kiri Oler
    Post Count: 44
  • Kyle Kishimoto
    Post Count: 69
  • Leo Morgenstern
    Post Count: 145
  • Matt Martell
    Post Count: 21
  • Meg Rowley
    Post Count: 685
  • Michael Baumann
    Post Count: 508
  • Michael Rosen
    Post Count: 48
  • Paul Sporer
    Post Count: 13
  • Sean Dolinar
    Post Count: 112
  • Tess Taruskin
    Post Count: 47
  • Travis Ice
    Post Count: 5
  • 2024 Postseason
  • 2024 Trade Deadline
  • 2024 Trade Value
  • 2025 BBWAA Ballot
  • 2025 Classic Baseball Ballot
  • 2025 MLB Draft
  • 2025 Positional Power Rankings
  • 2025 Replacement-Level Killers
  • 2025 Trade Deadline
  • 2025 ZiPS Projections
  • Best of 2024
  • Chat
  • College
  • Contract Crowdsourcing 2024-25
  • Crowdsourcing
  • Effectively Wild
  • Effectively Wild 2025 Season Preview
  • Extension
  • Featured
  • Featured Prospects
  • Five Things
  • Free Agent Signing
  • Hall of Fame
  • Idle Thoughts
  • JAWS
  • Job Postings
  • Mailbag
  • Matrix Reloaded
  • MLB Draft Week 2025
  • Old Scouting Reports Revisited
  • Power Rankings
  • Prospect List
  • Prospect Week 2025
  • Prospects
  • Prospects Report 2025
  • Q&As
  • Research
  • Site News
  • Talks Hitting
  • Angels
  • Astros
  • Athletics
  • Blue Jays
  • Braves
  • Brewers
  • Cardinals
  • Cubs
  • Diamondbacks
  • Dodgers
  • Giants
  • Guardians
  • Mariners
  • Marlins
  • Mets
  • Nationals
  • Orioles
  • Padres
  • Phillies
  • Pirates
  • Rangers
  • Rays
  • Red Sox
  • Reds
  • Rockies
  • Royals
  • Tigers
  • Twins
  • White Sox
  • Yankees
  • Top of the Order
  • Trade
  • We Tried
You are going to send email to

Move Comment

Updated: Tuesday, July 22, 2025 11:02 AM ETUpdated: 7/22/2025 11:02 AM ET
@fangraphs - Contact Us - Advertise - Terms of Service - Privacy Policy
sis_logo
All major league baseball data including pitch type, velocity, batted ball location, and play-by-play data provided by Sports Info Solutions.
mlb logo
Major League and Minor League Baseball data provided by Major League Baseball.
Mitchel Lichtman
All UZR (ultimate zone rating) calculations are provided courtesy of Mitchel Lichtman.
TangoTiger.com
All Win Expectancy, Leverage Index, Run Expectancy, and Fans Scouting Report data licenced from TangoTiger.com
Retrosheet.org
Play-by-play data prior to 2002 was obtained free of charge from and is copyrighted by Retrosheet.

Support FanGraphs
Become a Member

Please consider becoming a FanGraphs Member. All the great work that you've come to rely on is made possible by Member support, including analysis, stats, projections, RosterResource, prospect coverage, and podcasts.

Membership starts at $.16 a day.

Already a Member: Log In

Sign Me Up