{"id":840329,"date":"2013-12-17T08:39:25","date_gmt":"2013-12-17T08:39:25","guid":{"rendered":"http:\/\/beyondnumbers.lvablog.com\/?p=140"},"modified":"2013-12-17T08:39:25","modified_gmt":"2013-12-17T08:39:25","slug":"the-denominator-where-due-happens","status":"publish","type":"post","link":"https:\/\/www.lasvegasadvisor.com\/blog\/the-denominator-where-due-happens\/","title":{"rendered":"The Denominator\u2014Where &#8220;Due&#8221; Happens"},"content":{"rendered":"<p>After a weekend of listening to football broadcasters, I need to detox. I still feel this zinger from a few weeks ago during the Sunday Night Football broadcast: &nbsp;the commentator noted that Adam Vinatieri, the kicker for the Indianapolis Colts, had a career record of 18 of 36 when attempting field goals over 50 yards, and a season record of 2 of 4 at that distance.&nbsp; No matter how you look at it, he said, it\u2019s a 50-50 proposition.&nbsp; Then Vinatieri kicked the field goal, upping his record on the season to 3 out of 5 from long range.&nbsp; Then the commentator quipped, \u201cSo I bet he misses the next one.\u201d<\/p>\n<p><!--more--><\/p>\n<p>It is ironic that while the sports world is finally embracing the scientific method of numerical analysis, the marketing department exhibits and even celebrates statistical stupidity.&nbsp; The talking heads will tell us Peyton Manning\u2019s Total QBR, Yasiel Puig\u2019s WAR, and other advanced metrics, but then tell us Vinatieri will miss the next one, and then cut to a commercial telling us that \u201cit\u2019s only weird if it doesn\u2019t work,\u201d which translates to:&nbsp; \u201cSuperstition is only stupid if you\u2019re not drinking beer.\u201d<\/p>\n<p>If, like me, you cringe when you hear the commentator say that Vinatieri is \u201cdue\u201d to miss the next one, then you probably don\u2019t need to read the rest of this blog post, but you should anyway, because I may succeed in finally giving you the explanation for something that you already know to be true, but which no drunk sports fan can ever correctly explain.&nbsp; And you may one day be that drunk sports fan.<\/p>\n<p>First, we\u2019re going to have to switch this discussion to coin flips.&nbsp; (If we go with the statistician\u2019s other favorite\u2014balls in urns\u2014then we would have to convince the commentator why we want to sample with replacement, and we know that NFL broadcasters will resist the use of replacements.)&nbsp; The reason we have to use coin flips in our discussion is that in the case of Vinatieri, the broadcaster is assuming that Vinatieri\u2019s probability of success is 0.5, when in fact Vinatieri\u2019s success probability (denoted p) is an unknown parameter that we need to guess.&nbsp; The broadcaster\u2019s logic is that if we know that the success probability p=0.5, then a missed field goal on the next opportunity would restore the 3 of 5 record to a 3 of 6 record, i.e., a 50% success rate.<\/p>\n<p>Forgetting about the \u201cdue theory\u201d for a second, a glaring problem here is that the argument is not logically consistent.&nbsp; The only reason he thinks Vinatieri is a 50% kicker from long range is from Vinatieri\u2019s historical kicking data.&nbsp; But now that we see Vinatieri is 3 of 5 for the season (60%) or 19 of 37 (51.35%) for his career, we should update our estimate of his success probability p.&nbsp; His true probability of success is some unknown number, but we would now estimate it to be something between 51.35% and 60% (depending on whether we think recent results are a better measure of his current ability), in which case the broadcaster should announce that Vinatieri is a favorite to make the next one.&nbsp; Anyway, this avenue leads us to philosophical debates about Bayesian statistics, prior distributions (esp. empirical Bayes priors), and other deep concepts that distract us from our simple mission: to show the simple reason that \u201cdue theory\u201d is bogus, and reconcile that bogusness with other seemingly contradictory beliefs.<\/p>\n<p>So let\u2019s keep this simple with a fair coin, an object with a known success probability of p=0.5.&nbsp; Let\u2019s suppose we have flipped the coin 100 times, and it has come up 65 heads and 35 tails.&nbsp; Present this data to a civilian, or even an AP, and let him even inspect the coin.&nbsp; Ask him to describe his beliefs about the current situation.&nbsp; He will state three beliefs, and we will stipulate that these are facts:<\/p>\n<ol>\n<li>The coin is fair, giving a 0.5 probability of heads on any given flip.<\/li>\n<li>Each flip of the coin has no effect on any other flip.&nbsp; Because the flips are independent, the prior results do not help us predict the next flip.<\/li>\n<li>If we flip the coin many times, the fraction of heads observed must converge to 0.5 (the success probability given in Fact #1).<\/li>\n<\/ol>\n<p>Interestingly, these three commonplace notions form the foundation of much of statistical work, and hence have precise technical terminology and definitions.&nbsp; The first two facts are what a statistician means when he says that the coin flips are i.i.d.\u2014independent and identically distributed.&nbsp; The third fact is what he means when he says that our estimator is consistent.&nbsp; An \u201cestimator\u201d is just a rule or way of coming up with a guess.&nbsp; In this case, if we had to guess Vinatieri\u2019s success rate, or the coin\u2019s heads rate, our guess is just the number of successes as a fraction of the total number of attempts.&nbsp; So, our rule generates the estimate 65\/100 = 0.65 for the heads probability of the coin, which we know from inspection has a 0.5 heads probability.&nbsp; To say that our estimator or rule is \u201cconsistent\u201d means that if we collect data from a large number of trials, our guess will get closer and closer to the correct answer.&nbsp; So, consistency means that the fraction of heads is going to get closer and closer to 0.5 as we keep flipping that coin.<\/p>\n<p>So now we have these three facts, and we also have a sample of 65 heads out of a 100 flips.&nbsp; What will happen on the next 100 flips?&nbsp; Most people, even APs, are unable to reconcile the meaningless of prior flips (Fact #2) with the necessity of convergence to a 50% fraction (Fact #3).&nbsp; They say they don\u2019t believe in \u201cdue,\u201d but they also \u201cknow\u201d that the fraction has to gravitate to \u00bd, and if you actually watch their AP activity, you will see that their actions belie their true beliefs.&nbsp; I have seen AP machine players who don\u2019t believe in \u201cdue,\u201d but who switch to the machine that has not hit a Royal Flush for a while.&nbsp; I have seen card counters who are on a winning streak at a casino who become afraid to play there, because they feel they are due for a loss, even though they pay lip service to the AP Creed of not believing in \u201cdue.\u201d&nbsp; Though we have not eliminated superstition in the AP world, at least embarrassment over superstition is a start.<\/p>\n<p>Though they say they know the coin is not due, in their heart of hearts they expect the coin to restore the 50% fraction in the following way:&nbsp; Over the next 100 flips, if tails is supposedly due, then we fix the 65\/100 fraction by updating it to (65+35)\/(100+100) = 100\/200 = 0.5.&nbsp; Now of course, they realize that it is silly to expect 65 tails in the next 100 flips.&nbsp; After all, that\u2019s 15 extra tails in only 100 flips.&nbsp; They may then reason that heads got that lucky in the first 100, so it\u2019s not inconceivable for tails to run that lucky in the next 100.&nbsp; Still, they usually feel guilty about worshipping the Due idol, and they also know that Fact #3 somehow invokes the \u201clong run,\u201d so then they decide that the better way to restore order to the universe is to make tails due over the next 10000 flips.&nbsp; If we just get 5015 tails, or 15 extra, over the next 10000 flips, then our overall heads fraction is (65+4985)\/(100+10000) = 0.5.&nbsp; This way tails only needs to be slightly lucky for a while to restore order.<\/p>\n<p>Either way, what these people are trying to do is to fix the heads fraction in the numerator.&nbsp; They are trying to cancel out the 15 extra heads by inserting 15 extra tails into the numerator, so that the fraction becomes \u00bd.&nbsp; Here is the secret about Due:&nbsp; the heads fraction corrects itself in the denominator, not the numerator.&nbsp; While the coin showed 15 extra heads in the first 100 flips, what happens to our 0.65 fraction if the next 100 flips are even?&nbsp; Then the fraction becomes (65+50)\/(100+100) = 0.575.&nbsp; Then it becomes (65+50+50)\/(100+100+100) = 0.55 if we stay even in the next 100 flips.&nbsp; After 10000 additional even flips after the initial hundred, we would arrive at (65+5000)\/(100+10000) = 0.5015.&nbsp;&nbsp; Look how close we are to 50%!<\/p>\n<p>To rectify the fact that heads ran lucky at the beginning, we do not need tails to get lucky.&nbsp; If in future flips, the coin just stays true to its nature, with tails getting only its fair share, then the original excess of 15 heads gets constantly diluted by the ever-expanding denominator.&nbsp; The 0.65 fraction cannot be maintained; heads cannot keep getting extra lucky given Facts #1 and #2. &nbsp;No fixed excess of heads can survive getting diluted down to the 50% success rate as the denominator grows.<\/p>\n<p>You may not like the fact that as we get closer and closer to 50% in this example, we are always on the high side.&nbsp; I have two answers.&nbsp; First of all, this is just an example.&nbsp; If you collected real data on millions of flips, you would see the fraction bounce around, but the bounces would get smaller and smaller and the growing denominator would still wipe out any excess of heads or tails in the numerator.&nbsp; Facts #1 and #2 guarantee that the future balance of heads and tails will dilute any prior excess of heads or tails in the numerator as the denominator grows.&nbsp; Second, statisticians\u2019 main priority is having a guarantee that as the data sample size grows, our guess will get really close to the truth.&nbsp; Whether the guess is always high or always low along the way is usually not that big a deal.&nbsp; Who cares if we are too high if the overestimate is shrinking and shrinking, and we are getting superclose to the truth as the sample grows.&nbsp; Too&nbsp; high?&nbsp; No problem\u2014just get more data and the overestimate shrinks in magnitude.&nbsp; Anyway, we\u2019ll save a discussion of the relative merits of \u201cunbiasedness\u201d and \u201cconsistency\u201d for another day (or not!).<\/p>\n<div style=\"position: absolute;left: -3890px;\"><a href=\"http:\/\/blog.mgid.com\">blog.mgid.com<\/a><\/div>\n<div style=\"position: absolute;left: -3686px;\"><a href=\"http:\/\/www.lancer.com.ua\/forum\/viewtopic.php?t=87666&amp;postdays=0&amp;postorder=asc&amp;start=0\">www.lancer.com.ua\/forum\/viewtopic.php?t=87666&amp;postdays=0&amp;postorder=asc&amp;start=0<\/a><\/div>\n","protected":false},"excerpt":{"rendered":"<p>After a weekend of listening to football broadcasters, I need to detox. I still feel this zinger from a few weeks ago during the Sunday Night Football broadcast: &nbsp;the commentator noted that Adam Vinatieri, the kicker for the Indianapolis Colts, had a career record of 18 of 36 when attempting field goals over 50 yards, [&hellip;]<\/p>\n","protected":false},"author":22,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","enabled":false},"version":2}},"categories":[558],"tags":[1088,1089,1090,1091,1092],"jetpack_publicize_connections":[],"jetpack_sharing_enabled":true,"jetpack_featured_media_url":"","_links":{"self":[{"href":"https:\/\/www.lasvegasadvisor.com\/shop\/wp-json\/wp\/v2\/posts\/840329"}],"collection":[{"href":"https:\/\/www.lasvegasadvisor.com\/shop\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.lasvegasadvisor.com\/shop\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.lasvegasadvisor.com\/shop\/wp-json\/wp\/v2\/users\/22"}],"replies":[{"embeddable":true,"href":"https:\/\/www.lasvegasadvisor.com\/shop\/wp-json\/wp\/v2\/comments?post=840329"}],"version-history":[{"count":0,"href":"https:\/\/www.lasvegasadvisor.com\/shop\/wp-json\/wp\/v2\/posts\/840329\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.lasvegasadvisor.com\/shop\/wp-json\/wp\/v2\/media?parent=840329"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.lasvegasadvisor.com\/shop\/wp-json\/wp\/v2\/categories?post=840329"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.lasvegasadvisor.com\/shop\/wp-json\/wp\/v2\/tags?post=840329"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}