<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
	>
<channel>
	<title>Comments on: Small Sample Usefulness</title>
	<atom:link href="http://www.fangraphs.com/blogs/index.php/small-sample-usefulness/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.fangraphs.com/blogs/index.php/small-sample-usefulness/</link>
	<description>Daily baseball statistical analysis and commentary</description>
	<lastBuildDate>Mon, 22 Mar 2010 09:08:44 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: kris</title>
		<link>http://www.fangraphs.com/blogs/index.php/small-sample-usefulness/#comment-74255</link>
		<dc:creator>kris</dc:creator>
		<pubDate>Sat, 02 May 2009 06:06:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4613#comment-74255</guid>
		<description>I enjoyed reading your posts earlier, and I&#039;m glad you mathematized my point about std-devs far beyond the 4 words I wrote.

There is a disjoint between the two types of baseball stat geeks.  I refer to what I practice as Sabr-magics for a damn good reason:  I&#039;m fairly well versed in most statistical analysis, but when it comes to realistically predicting the immediate future (that&#039;s what fantasy baseball is,) you can often throw math out the window.

I hate to bring this back to cliches, but 1) steroids 2) injuries 3) off-season work, are all things you cannot predict.  If you had information, you could factor them in but unfortunately you don&#039;t have any *trustworthy* information.

e.g. Maybe Ankiel Worked Hard, Ludwick took HGH, and Pujols had a minor injury.  There&#039;s nothing you can do to predict the outcome of the season, all you have is incomplete information! Regardless of how poorly Pujols performs, you&#039;re viewing each AB as an independent event.  

I&#039;m not sure what Dave claims to be, but there&#039;s a huge difference between those that want accuracy, and those that want to be ahead of the curve in their fantasy baseball league.

I still maintain these early stats are the most important in pointing you in the right direction.

Someone brought up Andruw Jones, he was predicted by most models to hit 35-40 HR last year ( i believe)

Did you keep him on your squad for 200 PA?  I doubt it.</description>
		<content:encoded><![CDATA[<p>I enjoyed reading your posts earlier, and I&#8217;m glad you mathematized my point about std-devs far beyond the 4 words I wrote.</p>
<p>There is a disjoint between the two types of baseball stat geeks.  I refer to what I practice as Sabr-magics for a damn good reason:  I&#8217;m fairly well versed in most statistical analysis, but when it comes to realistically predicting the immediate future (that&#8217;s what fantasy baseball is,) you can often throw math out the window.</p>
<p>I hate to bring this back to cliches, but 1) steroids 2) injuries 3) off-season work, are all things you cannot predict.  If you had information, you could factor them in but unfortunately you don&#8217;t have any *trustworthy* information.</p>
<p>e.g. Maybe Ankiel Worked Hard, Ludwick took HGH, and Pujols had a minor injury.  There&#8217;s nothing you can do to predict the outcome of the season, all you have is incomplete information! Regardless of how poorly Pujols performs, you&#8217;re viewing each AB as an independent event.  </p>
<p>I&#8217;m not sure what Dave claims to be, but there&#8217;s a huge difference between those that want accuracy, and those that want to be ahead of the curve in their fantasy baseball league.</p>
<p>I still maintain these early stats are the most important in pointing you in the right direction.</p>
<p>Someone brought up Andruw Jones, he was predicted by most models to hit 35-40 HR last year ( i believe)</p>
<p>Did you keep him on your squad for 200 PA?  I doubt it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Wally</title>
		<link>http://www.fangraphs.com/blogs/index.php/small-sample-usefulness/#comment-73997</link>
		<dc:creator>Wally</dc:creator>
		<pubDate>Thu, 30 Apr 2009 21:21:09 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4613#comment-73997</guid>
		<description>David, thanks again.

I&#039;d like to further emphasize this sample size issue, but leave baseball for a second.  It is often the exact right thing to do to just flat out ignore some data when doing statistical analysis if there are sufficient problems with that data.  This problem may not appear in baseball very often, given that those problems usually come from design or sampling issues.  But that is the basic reason we don&#039;t trust pre-WWII statistics very much.  

As for something more baseball related, one of the major problems is going to be a skewed sample or nesting.  So, say you are researching Lake Trout, but only sample 3 lakes.  Even if you achieve statistical significance in your study in those 3 lakes, you can only makes conclusions about those 3 lakes (and may even only have achieved significance thanks to pseudoreplication, but I digress).  Obviously, 3 lakes isn&#039;t going to be representative of all Lake Trout.  A similar problem is faced in baseball and is hinted on with Andruw Jones.  He&#039;s faced a higher portion of lefties than we would expect if he played everyday.  Victor Martinez, and all other players have a slightly different issue.  They have only faced a small handful of pitchers or teams.  Currently he&#039;s had 6 PAs against Sidney Ponson, 4 against Kevin Slowey and doesn&#039;t have any more than 3 against any other pitcher, with 46 total pitchers faced (last year he faced 114 pitchers, against 18 he had 4 or more PAs).  He&#039;s had 27 PAs against the Royals in general, with a 1.222 in OPS against them.  And has only faced 6 teams, getting just 9 PAs against the Red Sox.  So 90% of Victor&#039;s sample is nested in 5 teams.  2 of them have the highest team ERA in the game outside of the team Martinez plays for.  Minnasota is 24th, Toronto 11th, KC 3rd (who Martinez has hit the best).  Does that sound like a representative sample of the whole league?  

Now, I&#039;m not saying ignore these April numbers all together (nice straw man), but you have to recognize they have problems.  These are small samples, even 100 PAs is pretty small.  And because of the scheduling and nature of baseball, those small samples are nested within just a few teams or pitchers.

As for being an ass, how&#039;s this: You and Dave Cameron talk about statistics like you read about it on wikipedia, and didn&#039;t get it.  Someone who wants to have a serious conversation about statistics and the misuse of statistical principles would not casually throw around the word significant, nor think its up for interpretation.  They then would certainly not resort to such asinine comments, as Dave has, after being faced with criticism.

Being able to react to and answer criticism rationally and logically, even if you don&#039;t know the answer or made a mistake, is absolutely necessary to maintain credibility.  Obviously, I&#039;m not terribly concerned with this, I&#039;m an anonymous commenter.  But Dave has a great deal of interest in maintaining his credibility....</description>
		<content:encoded><![CDATA[<p>David, thanks again.</p>
<p>I&#8217;d like to further emphasize this sample size issue, but leave baseball for a second.  It is often the exact right thing to do to just flat out ignore some data when doing statistical analysis if there are sufficient problems with that data.  This problem may not appear in baseball very often, given that those problems usually come from design or sampling issues.  But that is the basic reason we don&#8217;t trust pre-WWII statistics very much.  </p>
<p>As for something more baseball related, one of the major problems is going to be a skewed sample or nesting.  So, say you are researching Lake Trout, but only sample 3 lakes.  Even if you achieve statistical significance in your study in those 3 lakes, you can only makes conclusions about those 3 lakes (and may even only have achieved significance thanks to pseudoreplication, but I digress).  Obviously, 3 lakes isn&#8217;t going to be representative of all Lake Trout.  A similar problem is faced in baseball and is hinted on with Andruw Jones.  He&#8217;s faced a higher portion of lefties than we would expect if he played everyday.  Victor Martinez, and all other players have a slightly different issue.  They have only faced a small handful of pitchers or teams.  Currently he&#8217;s had 6 PAs against Sidney Ponson, 4 against Kevin Slowey and doesn&#8217;t have any more than 3 against any other pitcher, with 46 total pitchers faced (last year he faced 114 pitchers, against 18 he had 4 or more PAs).  He&#8217;s had 27 PAs against the Royals in general, with a 1.222 in OPS against them.  And has only faced 6 teams, getting just 9 PAs against the Red Sox.  So 90% of Victor&#8217;s sample is nested in 5 teams.  2 of them have the highest team ERA in the game outside of the team Martinez plays for.  Minnasota is 24th, Toronto 11th, KC 3rd (who Martinez has hit the best).  Does that sound like a representative sample of the whole league?  </p>
<p>Now, I&#8217;m not saying ignore these April numbers all together (nice straw man), but you have to recognize they have problems.  These are small samples, even 100 PAs is pretty small.  And because of the scheduling and nature of baseball, those small samples are nested within just a few teams or pitchers.</p>
<p>As for being an ass, how&#8217;s this: You and Dave Cameron talk about statistics like you read about it on wikipedia, and didn&#8217;t get it.  Someone who wants to have a serious conversation about statistics and the misuse of statistical principles would not casually throw around the word significant, nor think its up for interpretation.  They then would certainly not resort to such asinine comments, as Dave has, after being faced with criticism.</p>
<p>Being able to react to and answer criticism rationally and logically, even if you don&#8217;t know the answer or made a mistake, is absolutely necessary to maintain credibility.  Obviously, I&#8217;m not terribly concerned with this, I&#8217;m an anonymous commenter.  But Dave has a great deal of interest in maintaining his credibility&#8230;.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Davidceisen</title>
		<link>http://www.fangraphs.com/blogs/index.php/small-sample-usefulness/#comment-73974</link>
		<dc:creator>Davidceisen</dc:creator>
		<pubDate>Thu, 30 Apr 2009 18:16:53 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4613#comment-73974</guid>
		<description>The definition of significant is not subjective.  Sabermetrics labels itself as a scientific analysis of baseball, so those who claim to be practicing advanced analysis of baseball should use the well-defined and accepted definitions of words.  A significant change is something that falls out side either one or two standard deviations (it depends on the data sample and the claims put forth by the study).  Without a complete statistical analysis it is impossible to make the claims that Dave is making.  Basically he is making empirical claims, not statistical one--which is, I believe, the opposite of what he thinks he is doing.  I think that people that are reading this site are interested in seeing real statistical analysis, not some kind of pseudoscience that Dave is doing.  

&quot;Second, you’re asking Dave to take out the noise for every player in a response to your post. It can’t be done.&quot;

I won&#039;t speak for Nacho, I don&#039;t know him at all and don&#039;t know what he believes, but I disagree with your assessment.  I&#039;m certainly not asking for the &#039;noise&#039; to be taken out.  I&#039;m asking that the standard deviation for these data sets to be compared.  I don&#039;t have them and I don&#039;t claim to be a baseball blogger (not even close!), so I&#039;m not going to do this.  My guess is that the standard deviations of the two projections cross, which means simply that there is no significant change.  This has nothing to do with getting rid of noise, but has everything to do with doing serious, real analysis.  This is not ignoring data, but instead considering all data available and conducting an actual comparison.  

No one is being an ass but Dave and his blind supporters.  He refuses to answer legitimate concerns with his writing, instead he prefers to insult people and look down.

In sum:  Significance has a well-defined definition.  Use it, or else you aren&#039;t doing statistical analysis.</description>
		<content:encoded><![CDATA[<p>The definition of significant is not subjective.  Sabermetrics labels itself as a scientific analysis of baseball, so those who claim to be practicing advanced analysis of baseball should use the well-defined and accepted definitions of words.  A significant change is something that falls out side either one or two standard deviations (it depends on the data sample and the claims put forth by the study).  Without a complete statistical analysis it is impossible to make the claims that Dave is making.  Basically he is making empirical claims, not statistical one&#8211;which is, I believe, the opposite of what he thinks he is doing.  I think that people that are reading this site are interested in seeing real statistical analysis, not some kind of pseudoscience that Dave is doing.  </p>
<p>&#8220;Second, you’re asking Dave to take out the noise for every player in a response to your post. It can’t be done.&#8221;</p>
<p>I won&#8217;t speak for Nacho, I don&#8217;t know him at all and don&#8217;t know what he believes, but I disagree with your assessment.  I&#8217;m certainly not asking for the &#8216;noise&#8217; to be taken out.  I&#8217;m asking that the standard deviation for these data sets to be compared.  I don&#8217;t have them and I don&#8217;t claim to be a baseball blogger (not even close!), so I&#8217;m not going to do this.  My guess is that the standard deviations of the two projections cross, which means simply that there is no significant change.  This has nothing to do with getting rid of noise, but has everything to do with doing serious, real analysis.  This is not ignoring data, but instead considering all data available and conducting an actual comparison.  </p>
<p>No one is being an ass but Dave and his blind supporters.  He refuses to answer legitimate concerns with his writing, instead he prefers to insult people and look down.</p>
<p>In sum:  Significance has a well-defined definition.  Use it, or else you aren&#8217;t doing statistical analysis.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: CaR</title>
		<link>http://www.fangraphs.com/blogs/index.php/small-sample-usefulness/#comment-73969</link>
		<dc:creator>CaR</dc:creator>
		<pubDate>Thu, 30 Apr 2009 18:03:04 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4613#comment-73969</guid>
		<description>Excellent point... looks to have shot down an argument.  Also, the ability to forecast season totals higher based on a hot first month seems to be filed under &quot;duh.&quot;  

The observations made about Andrew Jones (seems to be tying in) if true, are provable from an observational standpoint, and not likely a statistical one.  The Rangers appear to have protected him some to this point, and if he now plays every day for a month, we will see if his numbers have seen a remarkable jump.</description>
		<content:encoded><![CDATA[<p>Excellent point&#8230; looks to have shot down an argument.  Also, the ability to forecast season totals higher based on a hot first month seems to be filed under &#8220;duh.&#8221;  </p>
<p>The observations made about Andrew Jones (seems to be tying in) if true, are provable from an observational standpoint, and not likely a statistical one.  The Rangers appear to have protected him some to this point, and if he now plays every day for a month, we will see if his numbers have seen a remarkable jump.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Scott</title>
		<link>http://www.fangraphs.com/blogs/index.php/small-sample-usefulness/#comment-73967</link>
		<dc:creator>Scott</dc:creator>
		<pubDate>Thu, 30 Apr 2009 17:54:21 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4613#comment-73967</guid>
		<description>Seriously Nacho...what are you arguing here?  Are you the semantic police?  First, Dave thinks that a 34 point increase in OPS is significant, and obviously you do not.  I happen to agree with Dave that it is significant, but the larger point to the article is that you have to use all the information available, even if it is a smaller sample size, and not stick you&#039;re head in the sand like you&#039;re doing, and just demand an answer to the exact projection of every MLB player.  Baseball is a game of neverending changes and adjustments.  The players are always getting better, aging and getting worse, all at different rates.  Any projection should take any additional information you can get.    

Second, you&#039;re asking Dave to take out the noise for every player in a response to your post.  It can&#039;t be done.  But Dave gave Victor Martinez of an example where small sample performance can have an overall effect on projections.  Could it be noise or error?  Sure, but based upon his success so far, a projection system certainly has to take those numbers into account.  Andruw&#039;s success has come in significantly less AB&#039;s and mostly against leftys.  Should it have as much effect?  No, it&#039;s a smaller sample and in what should be favorable matchups for him.  Modify your projections upward, but not nearly as significantly.  It&#039;s a relative calculation.  Regardless, to ignore any information you can lay your hands on is ignorant.  

Meanwhile, stop being such an ass.</description>
		<content:encoded><![CDATA[<p>Seriously Nacho&#8230;what are you arguing here?  Are you the semantic police?  First, Dave thinks that a 34 point increase in OPS is significant, and obviously you do not.  I happen to agree with Dave that it is significant, but the larger point to the article is that you have to use all the information available, even if it is a smaller sample size, and not stick you&#8217;re head in the sand like you&#8217;re doing, and just demand an answer to the exact projection of every MLB player.  Baseball is a game of neverending changes and adjustments.  The players are always getting better, aging and getting worse, all at different rates.  Any projection should take any additional information you can get.    </p>
<p>Second, you&#8217;re asking Dave to take out the noise for every player in a response to your post.  It can&#8217;t be done.  But Dave gave Victor Martinez of an example where small sample performance can have an overall effect on projections.  Could it be noise or error?  Sure, but based upon his success so far, a projection system certainly has to take those numbers into account.  Andruw&#8217;s success has come in significantly less AB&#8217;s and mostly against leftys.  Should it have as much effect?  No, it&#8217;s a smaller sample and in what should be favorable matchups for him.  Modify your projections upward, but not nearly as significantly.  It&#8217;s a relative calculation.  Regardless, to ignore any information you can lay your hands on is ignorant.  </p>
<p>Meanwhile, stop being such an ass.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Fresh Hops</title>
		<link>http://www.fangraphs.com/blogs/index.php/small-sample-usefulness/#comment-73962</link>
		<dc:creator>Fresh Hops</dc:creator>
		<pubDate>Thu, 30 Apr 2009 16:55:17 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4613#comment-73962</guid>
		<description>For those who are curious, &quot;relatively stable&quot; here means the number quoted (150PA or whatever) has an r-squared value of .5 or greater for a random sample of performance that size during the season with the rest of the season performance. Note that the early season is not a random sample, so we cannot assume r-squared .5 for that. So we won&#039;t have a correlation as strong as that, but it&#039;s shouldn&#039;t be horribly far off.</description>
		<content:encoded><![CDATA[<p>For those who are curious, &#8220;relatively stable&#8221; here means the number quoted (150PA or whatever) has an r-squared value of .5 or greater for a random sample of performance that size during the season with the rest of the season performance. Note that the early season is not a random sample, so we cannot assume r-squared .5 for that. So we won&#8217;t have a correlation as strong as that, but it&#8217;s shouldn&#8217;t be horribly far off.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Wally</title>
		<link>http://www.fangraphs.com/blogs/index.php/small-sample-usefulness/#comment-73958</link>
		<dc:creator>Wally</dc:creator>
		<pubDate>Thu, 30 Apr 2009 15:51:19 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4613#comment-73958</guid>
		<description>Well said David.</description>
		<content:encoded><![CDATA[<p>Well said David.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: DavidCEisen</title>
		<link>http://www.fangraphs.com/blogs/index.php/small-sample-usefulness/#comment-73956</link>
		<dc:creator>DavidCEisen</dc:creator>
		<pubDate>Thu, 30 Apr 2009 15:38:31 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4613#comment-73956</guid>
		<description>I actually wonder if Dave agree with this post.</description>
		<content:encoded><![CDATA[<p>I actually wonder if Dave agree with this post.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: DavidCEisen</title>
		<link>http://www.fangraphs.com/blogs/index.php/small-sample-usefulness/#comment-73952</link>
		<dc:creator>DavidCEisen</dc:creator>
		<pubDate>Thu, 30 Apr 2009 15:03:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4613#comment-73952</guid>
		<description>I was in a hurry to leave my apartment this morning and didn&#039;t quite make the exact point that I wanted.

When I said there wasn&#039;t a huge difference in performance, I meant on field performance, not statistical performance.

But the main point I was trying to make, and never really got around to stating, is that what would be really interesting is when projections are made in the beginning of the season that information on what one and two Standard Deviations are for each projection.  Each player should have a different deviation based on sample size of performance, and knowing the size of the bell curve would make these discussions on sample size much more scientifically sound.  With the updated ZIPS it would be extremely informative to know whether or not the updated projection falls outside of one or two standard deviations, at which point you can make claims of statistical significance, if the update doesn&#039;t then there is no significance in the data.  Error bars are part of science, so if you want to make claims of scientific analysis, do science. 

Further it would be absolutely amazing to know the size of the standard deviation for 100 AB, 200 AB, 300 AB, 400 AB, 500 AB, and 600 AB.   This way we could look at a 100 AB sample and know whether or not it is a statistically significant (from the actual mathematical definition) deviation from expectations.

I think this is the same point being made by others and their issue with your use of the term significant.  No one is arguing that Martinez is a bad player or that the first 100 AB of a season are meaningless, but without doing real statistical analysis your post isn&#039;t telling anyone anything too groundbreaking.</description>
		<content:encoded><![CDATA[<p>I was in a hurry to leave my apartment this morning and didn&#8217;t quite make the exact point that I wanted.</p>
<p>When I said there wasn&#8217;t a huge difference in performance, I meant on field performance, not statistical performance.</p>
<p>But the main point I was trying to make, and never really got around to stating, is that what would be really interesting is when projections are made in the beginning of the season that information on what one and two Standard Deviations are for each projection.  Each player should have a different deviation based on sample size of performance, and knowing the size of the bell curve would make these discussions on sample size much more scientifically sound.  With the updated ZIPS it would be extremely informative to know whether or not the updated projection falls outside of one or two standard deviations, at which point you can make claims of statistical significance, if the update doesn&#8217;t then there is no significance in the data.  Error bars are part of science, so if you want to make claims of scientific analysis, do science. </p>
<p>Further it would be absolutely amazing to know the size of the standard deviation for 100 AB, 200 AB, 300 AB, 400 AB, 500 AB, and 600 AB.   This way we could look at a 100 AB sample and know whether or not it is a statistically significant (from the actual mathematical definition) deviation from expectations.</p>
<p>I think this is the same point being made by others and their issue with your use of the term significant.  No one is arguing that Martinez is a bad player or that the first 100 AB of a season are meaningless, but without doing real statistical analysis your post isn&#8217;t telling anyone anything too groundbreaking.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: NadavT</title>
		<link>http://www.fangraphs.com/blogs/index.php/small-sample-usefulness/#comment-73947</link>
		<dc:creator>NadavT</dc:creator>
		<pubDate>Thu, 30 Apr 2009 13:50:23 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=4613#comment-73947</guid>
		<description>This is really useful information -- thanks for posting it!</description>
		<content:encoded><![CDATA[<p>This is really useful information &#8212; thanks for posting it!</p>
]]></content:encoded>
	</item>
</channel>
</rss>
