Day: May 31, 2017

Why I’m applauding Buster Posey’s decision to sit out brawlWhy I’m applauding Buster Posey’s decision to sit out brawl

The Internet jokes, memes and insults flowed, as these things are wont to do nowadays. No one likes a good overkill on Twitter of lame puns and pop-culture references more than I do. But I’m not talking right now about the late-night presidential slip of the fingers that has consumed us and entertained

Click here to read the full article

How difficult is it to follow up a low score with another low score?How difficult is it to follow up a low score with another low score?

You hear the traditional wisdom all the time – it’s difficult to follow a great round of golf with another great round. While this notion is intuitively appealing, it could also be a product of confirmation bias: whenever a golfer plays poorly following a low round, golf analysts and observers are quick to use that wisdom as an explanation, thus confirming the theory. However, when a player goes low on consecutive rounds, it’s not used as evidence to discount the theory. As usual, the truth reveals itself in the data.  Using round-level data on the PGA TOUR from 2000-2016, we first calculate each player’s baseline relative-to-field scoring average on a 3-year rolling basis (this will be the measure of each player’s typical performance at any given point in time). Then, for each round played, we calculate the difference between a player’s relative-to-field score that day, and their baseline relative-to-field score; let’s refer to this as “personal strokes-gained.â€� Example: If Phil Mickelson typically beats the field by 1 shot, and then plays a round where he beats the field score by 4 shots, his personal strokes-gained would be 3. We want to look at the relationship between a player’s performance in a given round to his performance in the round that follows. For our purposes, we classify rounds into different “bins”; for instance, one bin is defined as the set of rounds where a player had a personal strokes gained of 8 or more. The other bins are defined similarly — rounds with personal strokes gained of 6-8 shots, 4-6 shots and so on down to -8 or worse shots. We’ll then assess the distribution of personal strokes gained depending on which bin a player’s previous round belonged to. Sticking with the Mickelson example, where we have assumed Phil typically beats the field by 1 shot, suppose he beats the field by 5 shots following a round when he beat the field by 4 shots? In the first of these two rounds, Phil’s personal strokes gained was 3 shots, so he enters the “2-4 shots” bin, and the object of interest to us is then how Phil plays in the next round (in this case above, he had a personal strokes gained of 4). We do this for all players and rounds and are able to obtain “conditional distributions” of personal strokes gained, where we are conditioning on how a player played in his previous round (as defined by the bins). In lay terms, we are simply looking at the personal strokes-gained of the set of players who all fell into the same bin in their previous round, and looking for any differences between these sets of players (ex: do those who had 2-4 personal strokes gained in their previous round play better than those who had 0-2 personal strokes-gained in their previous round?). To start, the average personal strokes-gained for each bin is shown below: A very clean relationship emerges; if you’ve played well in the previous round, it is more likely, on average, you will play well in the following round. The analogous statement holds for poor play. This is not all that surprising; it simply means that form, good or bad, tends to last more than a single round. Although, there clearly is a tendency to regress to the mean at work here as well. The next figure gives a better sense of the entire distribution of personal strokes-gained conditional on playing at a certain level in the previous round: The horizontal bar in each box is the median of the data, the upper and lower bounds of each box is the 75th percentile and 25th percentile respectively, and the ends of the lines are the maximum and minimum values excluding outliers (where an outlier is defined as 1.5* the 75th percentile, and 1.5* the 25th percentile). Note that the most extreme rounds are contained in the middle bins; this is expected as these bins contain by far the largest number of rounds, and consequently there is the greatest potential for an outlier to emerge. In this final figure we report a transition matrix, reporting the probability of moving from one bin to the next in consecutive rounds. There is a lot of interesting information here, so take a long look: To ensure you are interpreting this correctly, the top left box states the following: given a player’s personal strokes gained was 8 or better in their previous round, there is a 1.35 percent chance that their personal strokes-gained is negative 8 or worse in the next round. It is surprising to us how well-defined the relationship is between a player’s performance in one round to the next. For example, consider the column for bin (2-4); as you move from the bottom row to the top row we are looking at the set of players who played increasingly better in their previous round, and the probability of entering the (2-4) bin is increasing monotonically, just as expected. Finally, the answer to the initial question: there is no evidence supporting the claim that it is more difficult to follow up a great round with a good round. A big reason this claim makes sense intuitively is that it is simply very unlikely to shoot a really good round (say, 6 or more strokes better than usual). Therefore, it’s unlikely that an exceptional round will be followed by another exceptional round — but this is always the case irrespective of a player’s performance in the previous round. Brothers Matt and Will Courchene grew up in a Canadian household full of golf fanatics. In 2016, they launched a DataGolf blog in hopes of contributing fresh and unbiased insights to the sport. Matt, a PhD student at the Vancouver School of Economics, focuses on applied econometrics and causal inference, while Will, who has a Masters of Economics from the University of Toronto, focuses on statistical modeling and data visualization.

Click here to read the full article