<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
xmlns:rawvoice="http://www.rawvoice.com/rawvoiceRssModule/"
	>
<channel>
	<title>Comments on: My Small Sample Size Plea</title>
	<atom:link href="http://www.fangraphs.com/blogs/index.php/my-small-sample-size-plea/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.fangraphs.com/blogs/index.php/my-small-sample-size-plea/</link>
	<description>Daily baseball statistical analysis and commentary</description>
	<lastBuildDate>Sun, 12 Feb 2012 23:34:21 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: John</title>
		<link>http://www.fangraphs.com/blogs/index.php/my-small-sample-size-plea/#comment-71068</link>
		<dc:creator>John</dc:creator>
		<pubDate>Wed, 08 Apr 2009 23:43:20 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4063#comment-71068</guid>
		<description>Jayhawk Bill -

If Cash lived in a vacuum ur point wuld be a good but if you have a league with 270 starters and a bunch of bench players, 5 or 10 of them will be hitting 150+ pts over their head in April in BA, so Cash&#039;s start is just probably an anomoly caused by having hundreds of guys playing April. If it was the end of May, its certainly a good sign for Cash at that point cuz it goes from like 1% to like .1%.</description>
		<content:encoded><![CDATA[<p>Jayhawk Bill -</p>
<p>If Cash lived in a vacuum ur point wuld be a good but if you have a league with 270 starters and a bunch of bench players, 5 or 10 of them will be hitting 150+ pts over their head in April in BA, so Cash&#8217;s start is just probably an anomoly caused by having hundreds of guys playing April. If it was the end of May, its certainly a good sign for Cash at that point cuz it goes from like 1% to like .1%.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Rally</title>
		<link>http://www.fangraphs.com/blogs/index.php/my-small-sample-size-plea/#comment-70984</link>
		<dc:creator>Rally</dc:creator>
		<pubDate>Wed, 08 Apr 2009 13:21:52 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4063#comment-70984</guid>
		<description>10 team league, 23 players per team, so only the top 3rd of players are worthy of roster spots in the league.  At the end of 2007, Lee was not one of them.</description>
		<content:encoded><![CDATA[<p>10 team league, 23 players per team, so only the top 3rd of players are worthy of roster spots in the league.  At the end of 2007, Lee was not one of them.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kincaid</title>
		<link>http://www.fangraphs.com/blogs/index.php/my-small-sample-size-plea/#comment-70869</link>
		<dc:creator>Kincaid</dc:creator>
		<pubDate>Tue, 07 Apr 2009 21:36:42 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4063#comment-70869</guid>
		<description>The problem with that is that there are enough players and seasons in MLB that aberrations will happen.  Things that have 1% probabilities will happen, as we&#039;d expect them to with hundreds or thousands of chances.  So when you wait for them to happen and then pick out the aberrations, you can&#039;t really assign much statistical meaning there because of the sample bias.  If you randomly selected a sample, or if you pick a player beforehand and say you will look at his next 40 ABs, you could analyze the meaning with binomial theory, but by specifically selecting an anomaly out of the thousands of possible samples you could have picked after the fact, you are introducing a bias that makes the statistical implications meaningless.  99% doesn&#039;t mean all that much when your selection process heavily favours the 1%.</description>
		<content:encoded><![CDATA[<p>The problem with that is that there are enough players and seasons in MLB that aberrations will happen.  Things that have 1% probabilities will happen, as we&#8217;d expect them to with hundreds or thousands of chances.  So when you wait for them to happen and then pick out the aberrations, you can&#8217;t really assign much statistical meaning there because of the sample bias.  If you randomly selected a sample, or if you pick a player beforehand and say you will look at his next 40 ABs, you could analyze the meaning with binomial theory, but by specifically selecting an anomaly out of the thousands of possible samples you could have picked after the fact, you are introducing a bias that makes the statistical implications meaningless.  99% doesn&#8217;t mean all that much when your selection process heavily favours the 1%.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Eric Seidman</title>
		<link>http://www.fangraphs.com/blogs/index.php/my-small-sample-size-plea/#comment-70842</link>
		<dc:creator>Eric Seidman</dc:creator>
		<pubDate>Tue, 07 Apr 2009 19:36:00 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4063#comment-70842</guid>
		<description>We&#039;re somewhat talking about different things here, Jayhawk, but let me be clear that you hit the nail on the head.  It&#039;s just that the nail belongs to another topic.  What you&#039;re discussing is how a small sample can change a player&#039;s true talent level.  This is definitely true, and I wrote an article last year discussing Cliff Lee and CC Sabathia, using The Hardball Times&#039; marcel in-season estimator to show the effects.  Basically, you couldn&#039;t have had a better first month than Lee or a worst first month than Sabathia, but even with those opposite extremes, their in-season adjusted projections were not drastically different.

What I&#039;m talking about here is that people should not go nuts over someone getting off to a 6-10 start.  Adam Lind could very well be hitting .400/.500/.900 with 12 rbis through 4 games... my point is - who cares!?  It&#039;s 4 games.</description>
		<content:encoded><![CDATA[<p>We&#8217;re somewhat talking about different things here, Jayhawk, but let me be clear that you hit the nail on the head.  It&#8217;s just that the nail belongs to another topic.  What you&#8217;re discussing is how a small sample can change a player&#8217;s true talent level.  This is definitely true, and I wrote an article last year discussing Cliff Lee and CC Sabathia, using The Hardball Times&#8217; marcel in-season estimator to show the effects.  Basically, you couldn&#8217;t have had a better first month than Lee or a worst first month than Sabathia, but even with those opposite extremes, their in-season adjusted projections were not drastically different.</p>
<p>What I&#8217;m talking about here is that people should not go nuts over someone getting off to a 6-10 start.  Adam Lind could very well be hitting .400/.500/.900 with 12 rbis through 4 games&#8230; my point is &#8211; who cares!?  It&#8217;s 4 games.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Chris</title>
		<link>http://www.fangraphs.com/blogs/index.php/my-small-sample-size-plea/#comment-70827</link>
		<dc:creator>Chris</dc:creator>
		<pubDate>Tue, 07 Apr 2009 18:32:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4063#comment-70827</guid>
		<description>I agree Jayhawk.  I think the point is don&#039;t do anything too hasty based on  one hot streak from an out of nowhere free agent.

Your point about Cash is well taken, but by the same token anybody who went out and picked him up from the waiver wire didn&#039;t exactly reap dividends post-May 11th.

Most of us are pretty savvy drafters and owners.  Our presence on this very site indicates we do our research. If you&#039;ve already researched your drafted players to death already, I think it&#039;s a mistake to scrap countless hours of analysis based on one sexy box score.</description>
		<content:encoded><![CDATA[<p>I agree Jayhawk.  I think the point is don&#8217;t do anything too hasty based on  one hot streak from an out of nowhere free agent.</p>
<p>Your point about Cash is well taken, but by the same token anybody who went out and picked him up from the waiver wire didn&#8217;t exactly reap dividends post-May 11th.</p>
<p>Most of us are pretty savvy drafters and owners.  Our presence on this very site indicates we do our research. If you&#8217;ve already researched your drafted players to death already, I think it&#8217;s a mistake to scrap countless hours of analysis based on one sexy box score.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: JayhawkBill</title>
		<link>http://www.fangraphs.com/blogs/index.php/my-small-sample-size-plea/#comment-70808</link>
		<dc:creator>JayhawkBill</dc:creator>
		<pubDate>Tue, 07 Apr 2009 17:48:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4063#comment-70808</guid>
		<description>Small sample sizes do have meaning, though.

I like to apply binomial theorem to players&#039; small sample sizes to see how significant they are.  For instance, last May 11 Kevin Cash was batting .375.  Obviously he wasn&#039;t a true .375 hitter, but a quick check of probabilities indicated that it was 99% likely that he was a true .207 hitter or better.  Entering 2008, Kevin Cash looked more like a true .180 hitter, so that .375 batting average was still good news, even when discounted appropriately for sample size.

Nay sayers will comment that he batted only .167 the rest of the way.  I&#039;d counter that catchers always tire later in the season, and that .167 was higher than his previous career split for June-September...the outstanding hitting in April and May was an accurate signal that he&#039;d improved as a hitter.

Other small sample sizes can have similar meaning, if they are dramatically far from the player&#039;s previous norm and if they&#039;re appropriately discounted for their size.</description>
		<content:encoded><![CDATA[<p>Small sample sizes do have meaning, though.</p>
<p>I like to apply binomial theorem to players&#8217; small sample sizes to see how significant they are.  For instance, last May 11 Kevin Cash was batting .375.  Obviously he wasn&#8217;t a true .375 hitter, but a quick check of probabilities indicated that it was 99% likely that he was a true .207 hitter or better.  Entering 2008, Kevin Cash looked more like a true .180 hitter, so that .375 batting average was still good news, even when discounted appropriately for sample size.</p>
<p>Nay sayers will comment that he batted only .167 the rest of the way.  I&#8217;d counter that catchers always tire later in the season, and that .167 was higher than his previous career split for June-September&#8230;the outstanding hitting in April and May was an accurate signal that he&#8217;d improved as a hitter.</p>
<p>Other small sample sizes can have similar meaning, if they are dramatically far from the player&#8217;s previous norm and if they&#8217;re appropriately discounted for their size.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Chris</title>
		<link>http://www.fangraphs.com/blogs/index.php/my-small-sample-size-plea/#comment-70802</link>
		<dc:creator>Chris</dc:creator>
		<pubDate>Tue, 07 Apr 2009 17:44:43 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4063#comment-70802</guid>
		<description>Personally, I&#039;d prefer a cool 324 out of Felipe Lopez.</description>
		<content:encoded><![CDATA[<p>Personally, I&#8217;d prefer a cool 324 out of Felipe Lopez.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Chris</title>
		<link>http://www.fangraphs.com/blogs/index.php/my-small-sample-size-plea/#comment-70800</link>
		<dc:creator>Chris</dc:creator>
		<pubDate>Tue, 07 Apr 2009 17:43:35 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4063#comment-70800</guid>
		<description>I have no problem with it, and I&#039;m a Yankee fan.  These guys are trying to squeeze a story out of each and every game of the year -- this is the angle they chose for an Opening Day Yankee loss.

However banal and unimaginative it may be, it is valid.</description>
		<content:encoded><![CDATA[<p>I have no problem with it, and I&#8217;m a Yankee fan.  These guys are trying to squeeze a story out of each and every game of the year &#8212; this is the angle they chose for an Opening Day Yankee loss.</p>
<p>However banal and unimaginative it may be, it is valid.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: David Appelman</title>
		<link>http://www.fangraphs.com/blogs/index.php/my-small-sample-size-plea/#comment-70798</link>
		<dc:creator>David Appelman</dc:creator>
		<pubDate>Tue, 07 Apr 2009 17:17:42 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4063#comment-70798</guid>
		<description>Weird.  That&#039;s exactly what they said on the Seattle postgame show!  (they might have been serious)</description>
		<content:encoded><![CDATA[<p>Weird.  That&#8217;s exactly what they said on the Seattle postgame show!  (they might have been serious)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: rwildernessr</title>
		<link>http://www.fangraphs.com/blogs/index.php/my-small-sample-size-plea/#comment-70796</link>
		<dc:creator>rwildernessr</dc:creator>
		<pubDate>Tue, 07 Apr 2009 17:16:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4063#comment-70796</guid>
		<description>Small sample size, schmample size. I think it is obvious to everyone that Ken Griffey Jr. is clearly going to hit 162 home runs this year.</description>
		<content:encoded><![CDATA[<p>Small sample size, schmample size. I think it is obvious to everyone that Ken Griffey Jr. is clearly going to hit 162 home runs this year.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

