Background

Many basketball players and fans alike believe in the "hot hand" phenomenon: the idea that making several shots in a row increases a player's chance of making the next shot. Is there really a hot hand in basketball? More generally, in success/failure trials, is there a tendency for trials following streaks of successes to be more likely to result in success? This app can be used to perform a statistical test for "hot hand" type behavior in sequences of success/failure trials, such as the shot attempts of a basketball player.

Data

The data consist of a fixed number of success/failure trials with the outcomes recorded in sequence. (The probability that a trial results in success is assumed to be the same for all trials.) The method used to input the observed data is controlled by the radio buttons:

  1. Input results sequence: Input the observed sequence of trial outcomes itself, with 1 representing success and 0 failure, separated by commas. For example, an entry of 0,1,1,1,0 corresponds to failure on the first trial, success on the second, third, and fourth trials, and failure on the fifth trial. (Click on the References tab for information about some sample data that can be copied into the app.)
  2. Input summary statistics: Rather than entering the results of each individual trial, the user can enter three summary statistics.
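
As an illustration of the results-sequence format, here is a minimal Python sketch that reads a comma-separated string of 0s and 1s and derives the number of trials and number of successes from it. The function name parse_results is hypothetical and is not part of the app; the app's own input handling may differ.

```python
from typing import List


def parse_results(text: str) -> List[int]:
    """Parse a comma-separated string such as "0,1,1,1,0" into a list of 0/1 outcomes.

    Hypothetical helper for illustration only; not the app's own parser.
    """
    tokens = [t.strip() for t in text.split(",") if t.strip()]
    bad = [t for t in tokens if t not in ("0", "1")]
    if bad:
        raise ValueError(f"entries must be 0 or 1, got: {bad}")
    return [int(t) for t in tokens]


seq = parse_results("0,1,1,1,0")
n_trials = len(seq)      # number of trials
n_successes = sum(seq)   # number of successes (1 = success, 0 = failure)
```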

After entering the results sequence or the summary statistics (number of trials and number of successes), as well as the streak length (see below), click the Accept inputs and initialize simulation button. The initialization process might take a minute; when the "please wait" message disappears, the app is ready to perform repetitions of the simulation (see Permutation test below). Note that this button must be pressed again whenever these inputs change.

A "streak statistic" that measures the hot hand?

While there is no consensus definition of what constitutes the "hot hand", the term generally refers to a tendency for trials following streaks of successes to be more likely to result in success. Several statistics are used in practice; those included in the app are described below.

For statistics 1 through 4 below, the user must define the streak length: How many successes must be observed in a row in order to consider it a hot streak? The minimum possible streak length is 1; the maximum allowed streak length in the app is 7. Note: in applications concerning the hot hand in basketball, 3 is commonly used for the streak length.

In the app, the user can choose a streak statistic from the following. 

  1. Proportion of S after streaks of S. The proportion of success on those trials that are immediately preceded by a streak of successes. For example, if the streak length is 3 and the sequence is 0,1,1,1,1,0,1,1,1,1, trials 5, 6, and 10 are preceded by a streak of 3 successes, and the proportion of success on these trials is 2/3 = 0.6667. 
  2. Difference in proportion of S (after streaks of S - other trials). The difference between the statistic in item 1 and the proportion of success on the remaining trials. In the previous example, 0,1,1,1,1,0,1,1,1,1, the value of the statistic is 2/3 - 6/7 = -0.1905.
  3. Difference in proportion of S (after streaks of S - after streaks of F). The difference between the statistic in item 1 and the proportion of success on those trials that are immediately preceded by a streak of failures. In the previous example, this statistic cannot be computed since there are no trials which follow a streak of 3 failures. In the sequence 1,0,0,0,0,1,1,1,1,1, with streak length 3, the value of the statistic is 2/2 - 1/2 = 0.5. 
  4. Frequency of S streaks. The proportion of trials that are immediately preceded by a streak of successes. In the example 0,1,1,1,1,0,1,1,1,1, the value of the statistic is 3/7 = 0.4286. Note, with a streak length of 3 the first 3 trials are not counted in determining the frequency of trials preceded by a streak of successes, and similarly for other values of streak length. 
  5. Longest run of S. The largest number of successes in a row in the observed sequence. In the example 0,1,1,1,1,0,1,1,1,1, the value of the statistic is 4.
  6. Total number of runs. The total number of runs, of any length, of both success and failure. In the example 0,1,1,1,1,0,1,1,1,1, the value of the statistic is 4. (This statistic is equivalent to the total number of "switches" or "alternations" between S and F if the first trial is counted as the first switch.)
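
The six statistics above can be sketched in Python as follows. This is an illustrative sketch based only on the definitions and worked examples in this section, not the app's actual code; all function names are hypothetical, and returning None when a statistic cannot be computed is an assumption about how undefined cases might be represented.

```python
from typing import List, Optional


def trials_after_streak(seq: List[int], k: int, outcome: int = 1) -> List[int]:
    """0-based indices of trials whose k preceding trials all equal `outcome`."""
    return [i for i in range(k, len(seq))
            if all(x == outcome for x in seq[i - k:i])]


def success_rate(seq: List[int], indices: List[int]) -> Optional[float]:
    """Proportion of successes on the given trials; None if there are none."""
    return sum(seq[i] for i in indices) / len(indices) if indices else None


def prop_after_s(seq: List[int], k: int) -> Optional[float]:
    """Statistic 1: proportion of S after streaks of S."""
    return success_rate(seq, trials_after_streak(seq, k, 1))


def diff_vs_other(seq: List[int], k: int) -> Optional[float]:
    """Statistic 2: statistic 1 minus the proportion of S on all other trials."""
    after = trials_after_streak(seq, k, 1)
    after_set = set(after)
    other = [i for i in range(len(seq)) if i not in after_set]
    a, b = success_rate(seq, after), success_rate(seq, other)
    return None if a is None or b is None else a - b


def diff_vs_after_f(seq: List[int], k: int) -> Optional[float]:
    """Statistic 3: statistic 1 minus the proportion of S after streaks of F."""
    a = prop_after_s(seq, k)
    b = success_rate(seq, trials_after_streak(seq, k, 0))
    return None if a is None or b is None else a - b


def streak_frequency(seq: List[int], k: int) -> Optional[float]:
    """Statistic 4: share of trials (excluding the first k) preceded by a streak of S."""
    eligible = len(seq) - k
    if eligible <= 0:
        return None
    return len(trials_after_streak(seq, k, 1)) / eligible


def longest_run_of_s(seq: List[int]) -> int:
    """Statistic 5: largest number of successes in a row."""
    best = current = 0
    for x in seq:
        current = current + 1 if x == 1 else 0
        best = max(best, current)
    return best


def total_runs(seq: List[int]) -> int:
    """Statistic 6: number of runs, i.e. one plus the number of S/F changes."""
    return 1 + sum(a != b for a, b in zip(seq, seq[1:])) if seq else 0


example = [0, 1, 1, 1, 1, 0, 1, 1, 1, 1]
other_example = [1, 0, 0, 0, 0, 1, 1, 1, 1, 1]
```

With a streak length of 3, these functions reproduce the worked values in the list: 2/3, -0.1905, 3/7, 4, and 4 for the first sequence, an undefined statistic 3 for the first sequence, and 0.5 for statistic 3 on the second sequence.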

Permutation test

The app can be used to perform a randomization-based hypothesis test of the hot hand phenomenon. The null hypothesis is that there is no hot hand; that is, that the trials are independent. Under the null hypothesis (together with the assumption of constant probability of success on all trials), given the number of successes, any possible ordering of the successes and failures in the sequence is equally likely. Therefore, one hypothetical value of the streak statistic under the null hypothesis of no hot hand can be simulated by randomly shuffling the order of the observed successes and failures and computing the streak statistic for the shuffled sequence.

The null distribution of the streak statistic can be simulated by repeating this process many times: set the desired number of shuffles and press the Simulate shuffles button in the app. Checking the box for show most recent shuffle will illustrate the process. Note: the observed value of the streak statistic is not needed to simulate the null distribution.
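
The shuffling procedure can be sketched in Python as below. This is a minimal illustration under stated assumptions, not the app's implementation: the function names, the seed parameter, and the choice of the total number of runs as the example statistic are all hypothetical.

```python
import random
from itertools import groupby
from typing import Callable, List


def count_runs(seq: List[int]) -> int:
    """Total number of runs: groupby yields one group per run of equal values."""
    return sum(1 for _ in groupby(seq))


def simulate_null(seq: List[int],
                  statistic: Callable[[List[int]], float],
                  reps: int,
                  seed: int = 0) -> List[float]:
    """Shuffle the observed outcomes `reps` times and record the statistic each time.

    Shuffling keeps the number of successes fixed and makes every ordering
    equally likely, which matches the null hypothesis described above.
    """
    rng = random.Random(seed)   # seeded only so the sketch is reproducible
    shuffled = list(seq)        # copy, so the observed sequence is left untouched
    values = []
    for _ in range(reps):
        rng.shuffle(shuffled)
        values.append(statistic(shuffled))
    return values


observed = [0, 1, 1, 1, 1, 0, 1, 1, 1, 1]
null_values = simulate_null(observed, count_runs, reps=1000)
```

Only the number of trials and the number of successes affect the simulated values, which is why the observed value of the statistic is not needed at this stage.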

An approximate p-value can be computed by comparing the observed value of the streak statistic with its null distribution, by checking the box for Compute p-value. Because we are interested in whether there is evidence of the hot hand, one-sided p-values are computed.

As illustrated by the example for streak statistic 3 above, for streak statistics 1 through 4 the value of the statistic cannot be computed for a permutation in which there are no streaks of the specified length. Thus the app distinguishes between the number of repetitions performed and the number of simulated values of the statistic. The latter count is the denominator of the simulated p-value.
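
The role of the denominator can be illustrated with a short Python sketch. It assumes that undefined simulated values are represented as None, that the p-value is the upper-tail proportion, and that ties count as at least as extreme (>=); none of these details is confirmed by the app's description, and the function name is hypothetical.

```python
from typing import List, Optional


def one_sided_p_value(simulated: List[Optional[float]],
                      observed: float) -> Optional[float]:
    """Upper-tail p-value using only the shuffles where the statistic was defined."""
    defined = [v for v in simulated if v is not None]
    if not defined:
        return None  # no usable simulated values, so no p-value
    at_least_as_large = sum(v >= observed for v in defined)
    # The denominator is the number of simulated values, not the number of repetitions.
    return at_least_as_large / len(defined)


# Six repetitions, of which two produced no value of the statistic.
simulated = [0.5, None, 0.75, 1.0, None, 0.25]
p = one_sided_p_value(simulated, observed=0.75)
```

In this made-up example, 2 of the 4 defined values are at least 0.75, so the p-value is 2/4 rather than 2/6. For a statistic such as the total number of runs, where clustering would presumably show up as small values, the comparison direction would need to be reversed.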

Note: The clear plot button will simply clear the plot. To rerun the simulation the Accept inputs and initialize simulation button must be pressed first.

A note on analyzing the "cold hand" phenomenon

The hot hand refers to a tendency for successes to cluster together. The "cold" hand refers to a similar tendency for failures. The app can be used to analyze the cold hand by simply interchanging the roles of success and failure. For example, to assess evidence of the cold hand based on the observed sequence 0,0,0,1,0,0,1, enter the data as 1,1,1,0,1,1,0 and perform a "hot hand" analysis.

KEVIN ROSS