<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd"
	>
<channel>
	<title>Comments on: When Samples Become Reliable</title>
	<atom:link href="http://www.fangraphs.com/blogs/index.php/when-samples-become-reliable/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.fangraphs.com/blogs/index.php/when-samples-become-reliable/</link>
	<description>Daily baseball statistical analysis and commentary</description>
	<lastBuildDate>Sun, 21 Mar 2010 07:16:16 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: intricatenick</title>
		<link>http://www.fangraphs.com/blogs/index.php/when-samples-become-reliable/#comment-86837</link>
		<dc:creator>intricatenick</dc:creator>
		<pubDate>Tue, 21 Jul 2009 23:02:34 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=5035#comment-86837</guid>
		<description>Since you are using terminology such as &quot;stabilize&quot; wouldn&#039;t it be prudent to display a scatter of the correlation vs PAs for all the metrics? Some may not &quot;stabilize&quot; at the number by which i mean the slope of the scatter at the point at which it hits .7 may be important. Of course, this is my attack on everything - &quot;Yes, that may be the case, but what is the derivative doing at that point?&quot; 

I&#039;m guessing the marginal benefit of extra plate appearances to the stability would be equal to it&#039;s fraction of the number needed to hit 0.7.

I won&#039;t give too much crap about the 0.7. I do understand that sometimes you have to pick an alpha and stick with it, if only for consistencies sake. For lab standards we&#039;ve gotta go with 0.99</description>
		<content:encoded><![CDATA[<p>Since you are using terminology such as &#8220;stabilize&#8221; wouldn&#8217;t it be prudent to display a scatter of the correlation vs PAs for all the metrics? Some may not &#8220;stabilize&#8221; at the number by which i mean the slope of the scatter at the point at which it hits .7 may be important. Of course, this is my attack on everything &#8211; &#8220;Yes, that may be the case, but what is the derivative doing at that point?&#8221; </p>
<p>I&#8217;m guessing the marginal benefit of extra plate appearances to the stability would be equal to it&#8217;s fraction of the number needed to hit 0.7.</p>
<p>I won&#8217;t give too much crap about the 0.7. I do understand that sometimes you have to pick an alpha and stick with it, if only for consistencies sake. For lab standards we&#8217;ve gotta go with 0.99</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bob</title>
		<link>http://www.fangraphs.com/blogs/index.php/when-samples-become-reliable/#comment-82068</link>
		<dc:creator>Bob</dc:creator>
		<pubDate>Wed, 24 Jun 2009 23:48:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=5035#comment-82068</guid>
		<description>oops

Those SLGs can&#039;t be true. I will recalculate when I have time.</description>
		<content:encoded><![CDATA[<p>oops</p>
<p>Those SLGs can&#8217;t be true. I will recalculate when I have time.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Bob</title>
		<link>http://www.fangraphs.com/blogs/index.php/when-samples-become-reliable/#comment-82063</link>
		<dc:creator>Bob</dc:creator>
		<pubDate>Wed, 24 Jun 2009 23:20:17 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=5035#comment-82063</guid>
		<description>I wonder if a similar method can be applied to evaluate a hitter&#039;s PA numbers necessary for &quot;vs a specific pitcher&quot; stats to stabilize. The reason why I am bringing this up is, we often hear announcers and commentators say &quot;this hitter hits well against this pitcher through his career&quot;, and if you look at the actual stats, they are often talking about the sample sizes of 30 or so.  That may be laughable to some of us, but maybe not. The way how I see it, THE biggest variable in batting stats for a hitter is the opponent pitcher. If you eliminate that variable, maybe the sample size required for his stats to stabilize become dramatically reduced. 

Let me take an example of Eric Chavez against Jamie Moyer. Despite being known to be completely hapless against most other lefties, Chavez somehow managed to hit Moyer to the tune of 323/397/646 in 72 PA. I then did odd-even year split, and found them to be 366/387/766 (31 PA) in the odd years and 285/405/662 (42 PA) in the even years. Except for BAs, which is not surprising, other numbers look pretty good. So, are the OBG and SLG vs a specific pitcher meaningful after 72 PA? 

By increasing the number of individual cases (and maybe using Pizza Cutter&#039;s method of splitting into halves), one may be able to determine the sample size needed for &quot;vs a specific pitcher&quot; stats to stabilize. Although some caveats exist (such as these career stats covering many years), this sort of general information could be quite useful.</description>
		<content:encoded><![CDATA[<p>I wonder if a similar method can be applied to evaluate a hitter&#8217;s PA numbers necessary for &#8220;vs a specific pitcher&#8221; stats to stabilize. The reason why I am bringing this up is, we often hear announcers and commentators say &#8220;this hitter hits well against this pitcher through his career&#8221;, and if you look at the actual stats, they are often talking about the sample sizes of 30 or so.  That may be laughable to some of us, but maybe not. The way how I see it, THE biggest variable in batting stats for a hitter is the opponent pitcher. If you eliminate that variable, maybe the sample size required for his stats to stabilize become dramatically reduced. </p>
<p>Let me take an example of Eric Chavez against Jamie Moyer. Despite being known to be completely hapless against most other lefties, Chavez somehow managed to hit Moyer to the tune of 323/397/646 in 72 PA. I then did odd-even year split, and found them to be 366/387/766 (31 PA) in the odd years and 285/405/662 (42 PA) in the even years. Except for BAs, which is not surprising, other numbers look pretty good. So, are the OBG and SLG vs a specific pitcher meaningful after 72 PA? </p>
<p>By increasing the number of individual cases (and maybe using Pizza Cutter&#8217;s method of splitting into halves), one may be able to determine the sample size needed for &#8220;vs a specific pitcher&#8221; stats to stabilize. Although some caveats exist (such as these career stats covering many years), this sort of general information could be quite useful.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: radiosurgery</title>
		<link>http://www.fangraphs.com/blogs/index.php/when-samples-become-reliable/#comment-80894</link>
		<dc:creator>radiosurgery</dc:creator>
		<pubDate>Tue, 16 Jun 2009 16:13:49 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=5035#comment-80894</guid>
		<description>Is this not in the glossary? I think this is super useful information, and maybe you guys could either put a link to this article, or just the data in the glossary?</description>
		<content:encoded><![CDATA[<p>Is this not in the glossary? I think this is super useful information, and maybe you guys could either put a link to this article, or just the data in the glossary?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: CJS</title>
		<link>http://www.fangraphs.com/blogs/index.php/when-samples-become-reliable/#comment-76988</link>
		<dc:creator>CJS</dc:creator>
		<pubDate>Sat, 23 May 2009 09:20:56 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=5035#comment-76988</guid>
		<description>I received a brief intro to Bayesian statistics in a class once, and it really clicked for me when I put it into the context of baseball statistics.  Basically, I framed it in my mind the way you position what you&#039;ve written here.  I don&#039;t remember the specifics of the modeling, but the gist that I do remember is that a guy hitting .350 in April is not a .350 hitter if he hits .275 lifetime.  The correct answer is somewhere in between.  Pure common sense, but Bayesian statisticians make the effort to build a theoretical framework around the idea, with both subjective and objective (formulaic) underpinnings.  This likely isn&#039;t news to you.  But your point was interesting to me in that it reminded me of how how baseball brought the abstract theory to life at the time.</description>
		<content:encoded><![CDATA[<p>I received a brief intro to Bayesian statistics in a class once, and it really clicked for me when I put it into the context of baseball statistics.  Basically, I framed it in my mind the way you position what you&#8217;ve written here.  I don&#8217;t remember the specifics of the modeling, but the gist that I do remember is that a guy hitting .350 in April is not a .350 hitter if he hits .275 lifetime.  The correct answer is somewhere in between.  Pure common sense, but Bayesian statisticians make the effort to build a theoretical framework around the idea, with both subjective and objective (formulaic) underpinnings.  This likely isn&#8217;t news to you.  But your point was interesting to me in that it reminded me of how how baseball brought the abstract theory to life at the time.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pizza Cutter</title>
		<link>http://www.fangraphs.com/blogs/index.php/when-samples-become-reliable/#comment-76947</link>
		<dc:creator>Pizza Cutter</dc:creator>
		<pubDate>Sat, 23 May 2009 04:08:55 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=5035#comment-76947</guid>
		<description>Confidence intervals are nice, and they are important.  The problem with confidence intervals is the problem of when they cross that line to being &quot;small enough.&quot;  There&#039;s not a really good way to tell.  Admittedly, my selection of .70 as my cutoff is an arbitrary line in the sand (although I think it has more of a reason than other lines that I might draw.)  But decisions are denominated in yes and no.  A confidence interval doesn&#039;t give you that.</description>
		<content:encoded><![CDATA[<p>Confidence intervals are nice, and they are important.  The problem with confidence intervals is the problem of when they cross that line to being &#8220;small enough.&#8221;  There&#8217;s not a really good way to tell.  Admittedly, my selection of .70 as my cutoff is an arbitrary line in the sand (although I think it has more of a reason than other lines that I might draw.)  But decisions are denominated in yes and no.  A confidence interval doesn&#8217;t give you that.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: fgelias</title>
		<link>http://www.fangraphs.com/blogs/index.php/when-samples-become-reliable/#comment-76943</link>
		<dc:creator>fgelias</dc:creator>
		<pubDate>Sat, 23 May 2009 03:42:19 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=5035#comment-76943</guid>
		<description>cool stuff.  i understand why this might be particularly useful for &quot;complicated&quot; statistics (specifically, those with hard to compute variances), but for the simpler rate statistics, what does this tell us that just computing a confidence interval can&#039;t?

for example dan uggla hit 32 HR in 521 AB (for this discussion i ignore the fact that PA!=AB) last season.  this is a rate of about 0.06 HR/AB.  using the simple formula for the variance of a binary random variable (p*(1-p))/N) i can compute a 95% confidence interval of his &quot;true&quot; rate as (0.040, 0.081).

if i were to use his rate for the current year (7HR in 141AB), the the confidence interval would be a much less informative (0.014, 0.086) because the sample size is so much smaller.  doesn&#039;t this provide a much more informative measure of how &quot;reliable&quot; the estimate is?

what additional benefit does a more formal reliability study provide?</description>
		<content:encoded><![CDATA[<p>cool stuff.  i understand why this might be particularly useful for &#8220;complicated&#8221; statistics (specifically, those with hard to compute variances), but for the simpler rate statistics, what does this tell us that just computing a confidence interval can&#8217;t?</p>
<p>for example dan uggla hit 32 HR in 521 AB (for this discussion i ignore the fact that PA!=AB) last season.  this is a rate of about 0.06 HR/AB.  using the simple formula for the variance of a binary random variable (p*(1-p))/N) i can compute a 95% confidence interval of his &#8220;true&#8221; rate as (0.040, 0.081).</p>
<p>if i were to use his rate for the current year (7HR in 141AB), the the confidence interval would be a much less informative (0.014, 0.086) because the sample size is so much smaller.  doesn&#8217;t this provide a much more informative measure of how &#8220;reliable&#8221; the estimate is?</p>
<p>what additional benefit does a more formal reliability study provide?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Scott</title>
		<link>http://www.fangraphs.com/blogs/index.php/when-samples-become-reliable/#comment-76868</link>
		<dc:creator>Scott</dc:creator>
		<pubDate>Fri, 22 May 2009 20:43:57 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=5035#comment-76868</guid>
		<description>Cutter,
Because you used PA instead of BIP, it has a quicker, more practical application than it would otherwise.  BIP is accessible, but junkie baseball fanatics know how many PA&#039;s batters have off the top of their head, or at least can estimate.  I for one am glad you used PA.</description>
		<content:encoded><![CDATA[<p>Cutter,<br />
Because you used PA instead of BIP, it has a quicker, more practical application than it would otherwise.  BIP is accessible, but junkie baseball fanatics know how many PA&#8217;s batters have off the top of their head, or at least can estimate.  I for one am glad you used PA.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Pizza Cutter</title>
		<link>http://www.fangraphs.com/blogs/index.php/when-samples-become-reliable/#comment-76862</link>
		<dc:creator>Pizza Cutter</dc:creator>
		<pubDate>Fri, 22 May 2009 20:20:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=5035#comment-76862</guid>
		<description>I&#039;ve actually done some similar work looking at those sorts of trend lines.  Worth a shot in this case.</description>
		<content:encoded><![CDATA[<p>I&#8217;ve actually done some similar work looking at those sorts of trend lines.  Worth a shot in this case.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Peter Jensen</title>
		<link>http://www.fangraphs.com/blogs/index.php/when-samples-become-reliable/#comment-76856</link>
		<dc:creator>Peter Jensen</dc:creator>
		<pubDate>Fri, 22 May 2009 19:52:18 +0000</pubDate>
		<guid isPermaLink="false">http://www.fangraphs.com/blogs/?p=5035#comment-76856</guid>
		<description>Pizza - Wouldn&#039;t time series analysis be one way to approach a problem like Coco&#039;s walk rate?  Split his career into 200 PA segments and measure his walk rate in each segment and the standard deviation between segments and then test for a relationship varying with time?</description>
		<content:encoded><![CDATA[<p>Pizza &#8211; Wouldn&#8217;t time series analysis be one way to approach a problem like Coco&#8217;s walk rate?  Split his career into 200 PA segments and measure his walk rate in each segment and the standard deviation between segments and then test for a relationship varying with time?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
