## Alphabetism in Baseball

You may already be aware of this, dear reader, but alphabet discrimination exists. People with surnames near the beginning of the alphabet own a slight but noticeable advantage over their late-alphabet colleagues. They appear earlier in directories, leading to more phone calls. They receive more applause at awards ceremonies and graduations, because people tend to get tired of clapping by the time the T’s roll around. They are even more likely to receive tenure and Nobel Prizes, according to a study by Liran Einav and Leeat Yariv, because authors of collaborative work in certain fields tend to be credited in alphabetical order.

The alphabet is important in baseball, too. David Aardsma, despite the success he’s found in an eight-year career, is still best known for supplanting Hank Aaron as the first name in the alphabetical list of players. This fact is the second sentence in his Wikipedia article. People are still upset by this.

But is there alphabet discrimination in baseball? I collected the performances of every hitter in baseball history (an activity that sounds far more impressive than it actually is), grouped them by the first letter of their surname, and averaged each group’s hitting ability, as represented by FanGraphs’ own wRC+. The stunning and aesthetically pleasing result:

(Note: Each player’s career wRC+ is counted once, no matter how many seasons they played. Since a superior player is more likely to last multiple seasons than an inferior player, the graph doesn’t average out at 100 even if the average player does.)
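For the curious, the grouping described above can be sketched in a few lines of Python. This is only an illustration of the idea, not the actual spreadsheet work behind the graph: the surnames and wRC+ values are invented, and `average_by_initial` is a hypothetical helper name.

```python
from collections import defaultdict
from statistics import mean

# Hypothetical (surname, career wRC+) pairs, invented purely for illustration.
players = [
    ("Abbot", 110),
    ("Adler", 90),
    ("Baker", 80),
    ("Quill", 70),
    ("Quade", 80),
]

def average_by_initial(rows):
    """Average career wRC+ per surname initial, counting each player once."""
    groups = defaultdict(list)
    for surname, wrc_plus in rows:
        groups[surname[0].upper()].append(wrc_plus)
    # Sort by letter so the result reads like the x-axis of a bar chart.
    return {letter: mean(values) for letter, values in sorted(groups.items())}

by_letter = average_by_initial(players)
```

Because every career counts once regardless of length, a ten-game cup of coffee carries the same weight as a Hall of Fame career, which is why the bars need not center on 100.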

From this beautiful and concise graph we can draw several conclusions:

• The next player whose last name starts with X will be the greatest player whose last name starts with X… of all time.
• Having a last name beginning with a Q is the kiss of death. In fact, the letter Q owes its recent success to the performance of Carlos Quentin; without him, the average wRC+ would be 76.
• Other than that, not much.

But why stop there? Why not examine hitting ability based on something even more arbitrary, such as the length of a player’s last name?

Bringing up the rear there is America’s favorite Saltalamacchia, proud owner of a career .699 OPS. But what’s surprising is the strength of the correlation. For you kids at home with the graphing calculators, the data sports an r-squared of .69, and it jumps to .78 if we boot out a certain busted catching prospect.
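For those without a graphing calculator handy, here is a minimal sketch of how an r-squared like that could be computed, assuming a simple straight-line fit (where r-squared is just the square of the Pearson correlation). The data points are invented for illustration and will not reproduce the .69 and .78 figures; `r_squared` is a hypothetical helper.

```python
from math import fsum

def r_squared(xs, ys):
    """Square of the Pearson correlation between xs and ys."""
    n = len(xs)
    mean_x = fsum(xs) / n
    mean_y = fsum(ys) / n
    sxy = fsum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = fsum((x - mean_x) ** 2 for x in xs)
    syy = fsum((y - mean_y) ** 2 for y in ys)
    return sxy * sxy / (sxx * syy)

# Hypothetical (name length, average wRC+) points, invented for illustration.
points = [(3, 100), (4, 96), (5, 95), (6, 93), (7, 92), (14, 70)]

lengths, averages = zip(*points)
with_everyone = r_squared(lengths, averages)

# Drop the lone 14-letter group and refit, as the text does.
short_lengths, short_averages = zip(*points[:-1])
without_longest = r_squared(short_lengths, short_averages)
```

A single point far from the rest can move the value in either direction, so comparing the fit with and without it is a reasonable sanity check.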

The causes of this, if any, lie in obscurity. Perhaps players lose confidence when the PA announcer botches their name at home games; perhaps scouts are more likely to remember short names when scanning for talent. Who can say? The world is full of biases, swirling and eddying around us all.


Patrick Dubuque is a wastrel and a general layabout. Many of the sites he has written for are now dead. Follow him on Twitter @euqubud.

Guest
Theo
5 years 2 months ago

As you can see from the first graph, children, the average player is, statistically, below average.

Member
Friedman
5 years 2 months ago

I definitely thought the same thing.

The only explanation that I can think of is that he didn’t weight wRC+ by PA. Players with high wRC+ will stay around much longer and for every Albert Pujols, there will be multiple replacement level players with subpar wRC+s. If they’re given equal weight, this might result in the below-average error.
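A rough sketch of the difference, using invented careers rather than real data (the function names are hypothetical, and the numbers are chosen so the PA-weighted figure lands exactly on 100):

```python
# Hypothetical (career PA, career wRC+) pairs, invented for illustration.
careers = [(6000, 120), (2000, 80), (1000, 70), (1000, 50)]

def unweighted_mean(rows):
    """Every career counts once, however short it was."""
    return sum(wrc for _, wrc in rows) / len(rows)

def pa_weighted_mean(rows):
    """Each career counts in proportion to its plate appearances."""
    total_pa = sum(pa for pa, _ in rows)
    return sum(pa * wrc for pa, wrc in rows) / total_pa

per_player = unweighted_mean(careers)   # 80.0
per_pa = pa_weighted_mean(careers)      # 100.0
```

One long, productive career and three short, poor ones average out to league average per plate appearance but well below it per player.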

Guest
5 years 2 months ago

Damn! Going by this I would be an effing terrible ML Player… 8 Letters beginning with R…

Member
AustinRHL
5 years 2 months ago

Not really. 8 letters and R are both slightly below average, but not too far. By contrast, I have a hyphenated last name that comes out to twelve letters (thirteen total characters with the hyphen), which is decidedly below average. It does at least start with H, which is an average-ish letter.

Guest
Andrew
5 years 2 months ago

4 letters beginning with J, second best possible outcome! (Or maybe Y is higher? Can’t tell)

Member
TheGrandslamwich
5 years 2 months ago

Whoa whoa whoa! Graphs in Notgraphs? Not cool.

Also, interesting stuff.

Member
AustinRHL
5 years 2 months ago

The general trend for wRC+ versus length of name shouldn’t be surprising. There are more people with shorter last names, which means that they comprise a larger pool of people from whom the best MLB players end up being selected. It’s sort of like how it’s harder to find left-handed pitchers who throw hard. I’m too lazy to plot the frequency of letters beginning a last name versus wRC+, but there should be a positive correlation there, too.

Guest
Rob
5 years 2 months ago

I’d imagine it’s more of a bell curve, with 5-6 letter last names being the most common. Your theory doesn’t explain why 3 letter names are the most successful with fewer players.

And given that wRC+ is a weighted average, a higher number of players to comprise a “pool” would not imply a higher average… It would only increase the likelihood that that category’s average is closer to the league-wide average. Smaller groups are more likely to produce outlier results, such as Salty (the only player in history with 14 letters) being the one and only player to yield a much lower wRC+ than the league average. He could have just as easily been that much higher.

It appears that a 93 is the average player’s wRC+ (without correcting for PA). If you look at the two graphs, the more common groups hover around 93, whereas those that are significantly higher or lower tend to be less common (last names that begin with S, T, and M are at least very common last names in my cell phone’s address book, whereas I know no one with the last name beginning with a Y and very few with an E).
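A quick simulation sketch of the small-group point. The center of 93 comes from the graphs; the bell-curve shape and the spread of 25 are assumptions made only for illustration, and `spread_of_group_means` is a hypothetical helper.

```python
import random
from statistics import mean, pstdev

def spread_of_group_means(group_size, trials=200, seed=0):
    """Standard deviation of the group average across many simulated groups."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    means = [
        # Assumed distribution: normal, centered on 93 with a spread of 25.
        mean(rng.gauss(93, 25) for _ in range(group_size))
        for _ in range(trials)
    ]
    return pstdev(means)

small_groups = spread_of_group_means(5)
large_groups = spread_of_group_means(500)
```

The averages of five-player groups wander much further from 93 than the averages of five-hundred-player groups, which is why the rare letters and lengths are the ones that stick out.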

Member
Kyle
5 years 2 months ago

What we really need here are error bars!

Then again this is /not/ graphs.

Guest
5 years 2 months ago

“Your theory doesn’t explain why 3 letter names are the most successful with fewer players.”

Mel Ott may have something to do with it.

Guest
John R.
5 years 2 months ago

I had the same thought. Just how many players with three-letter names have there been? Is it few enough that Mel Ott could single-handedly pull up an otherwise-average group?

Member
Kyle
5 years 2 months ago

The Q’s must pitch better than they hit. Quisenberry, Qualls, Quantrill?

Guest
Chris
5 years 2 months ago

I dunno, last year Qualls would probably have been a better hitter than pitcher had he been given the opportunity.