Chapter 14 Prime Charts for Count Data with Very Large Subgroups

With count data involving very large subgroup sizes, SPC charts often produce very tight control limits with many data points outside the limits.

Figure 14.1 demonstrates this phenomenon, known as overdispersion, using data from from Mohammed A. Mohammed et al. (2013) (example data are tabulated at the end of this chapter).

P chart of percent emergency attendances seen within 4 hours

Figure 14.1: P chart of percent emergency attendances seen within 4 hours

This can suggest that the process is highly unstable, even when such a conclusion may not be warranted. Overdispersion occurs when the natural, common cause variation in the data exceeds what is expected under the assumed theoretical distribution – binomial for proportions and Poisson for rates.

To diagnose this issue, we can plot the data on an I chart, which bases its control limits on the natural variation between successive subgroups rather than on theoretical distributional assumptions. Figure 14.2 is an I chart of the data from Figure 14.1 suggesting only common cause variation.

I chart of percent emergency attendances seen within 4 hours

Figure 14.2: I chart of percent emergency attendances seen within 4 hours

As a general rule of thumb, when the control limits from an I chart differ significantly from those of the P or U chart, this suggests that the observed variation is inconsistent with the assumed binomial or Poisson models underlying the P and U charts, respectively.

Several solution to the problem with overdispersion and tight control limits have been proposed. The obvious pragmatic solution is to simply use I charts for count data as well as for (individual) measurements. However, I charts have straight control limits that do not account for varying subgroup sizes. If the subgroup sizes only vary little, this may not be a problem. But sometimes – not the least in healthcare – varying subgroup sizes matter.

Laney proposed a solution that adjust the within subgroup variation in count data (\(\sigma_{pi}\)) by a factor based on the moving ranges of successive standardised data points (\(\sigma_z\)) (Laney 2002).

14.1 Laney’s prime chart

As shown earlier in Table 6.1, the control limits for a traditional P chart are calculated as:

\[\bar{p}\pm3\sigma_{pi}\]

where \(\bar{p}\) is the average proportion and \(\sigma_{pi}\) is the within-subgroup standard deviation for the i-th subgroup:

\[\sigma_{pi}=\sqrt{\bar{p}(1-\bar{p})/{n_{i}}}\]

where \(n_i\) is the sample size of the i-th subgroup.

The P′ chart (pronounced P-prime chart) accounts for both within and between subgroup variation. The control limits are given by:

\[\bar{p}\pm3\sigma_{pi}\sigma_z\]

where \(\sigma_z\) represents the between-subgroup variation, calculated using the moving ranges of the standardised proportions defined as:

\[z_i=(p_i - \bar{p})/\sigma_{pi}\]

The moving ranges are:

\[MR_i=|z_i-z_{i-1}|\]

Then, the estimated between-subgroup variation \(\sigma_z\) is the average moving range devided by the constant 1.128:

\[\sigma_z=\overline{MR_z}/1.128\] Thus, the control limits for the P′ chart are:

\[\bar{p}\pm3\sigma_{pi}\sigma_z \]

P' chart of percent emergency attendances seen within 4 hours

Figure 14.3: P’ chart of percent emergency attendances seen within 4 hours

The control limits in Figure 14.3 are very close to the I chart limits from Figure 14.3, but vary slightly due to varying subgroup sizes.

Similarly, control limits for the U’ chart are:

\[\bar{u}\pm3\sigma_{ui}\sigma_z\]

where \(\sigma_{ui}=\sqrt{\bar{u}/n_i}\) and \(\sigma_{z}=\overline{MR_z}/1.128\); and the moving ranges are computed from the standardised rates defined as \(z_i=(u_i-\bar{u})/\sigma_{ui}\).

Note that in qicharts2, the moving ranges of the standardized values are, by default, screened for extreme values – following the same approach used for I charts, as described in Chapter 12.

14.2 When to use prime charts

Laney’s prime charts were developed to handle overdispersion (and underdispersion), that is, when data show more (or less) variation than expected under a binomial or Poisson distribution, which is often seen with very large subgroup sizes. But how do we know when to suspect overdispersion?

A red flag is when a traditional P or U chart shows unusually tight control limits that seem unnatural. If plotting the same data on an I chart produces significantly wider control limits, it is a strong indication that the assumptions behind the P or U chart may not hold. In such cases, switching to Laney’s P’ or U’ charts is typically the better choice.

Note that when the adjustment factor (\(\sigma_z\)) is close to one – indicating little to no overdispersion – the Laney P′ or U′ chart closely resembles the traditional P or U chart. For this reason, Laney recommends using prime charts as a generally safe and robust default.

In the next chapter, we will explore a modified I chart – the I’ (I prime) chart – which generates control limits for count data that closely align with Laney’s prime charts, while also accommodating measurement data with varying subgroup sizes.


Example data for P prime charts

Table 14.1: The number of attendances to major accident and emergency hospital departments in the NHS that were seen within 4 hours of arrival over twenty weeks. Source: (Mohammed A. Mohammed et al. 2013)
Week (i) Attendances seen within 4 hours (r) Number of attendances (n)
1 266,501 280,443
2 264,225 276,823
3 276,532 291,681
4 281,461 296,155
5 269,071 282,343
6 261,215 275,888
7 270,409 283,867
8 279,778 295,251
9 270,483 284,468
10 270,320 282,529
11 267,923 279,618
12 271,478 283,932
13 255,353 266,629
14 256,820 268,091
15 261,835 276,803
16 259,144 271,578
17 255,910 266,005
18 260,863 273,520
19 264,465 278,574
20 260,989 273,772

References

Laney, David B. 2002. “Improved Control Charts for Attributes.” Quality Engineering 14: 531–37.
Mohammed, Mohammed A, Jagdeep S Panesar, David B Laney, and Richard Wilson. 2013. “Statistical Process Control Charts for Attribute Data Involving Very Large Sample Sizes: A Review of Problems and Solutions.” BMJ Quality & Safety 22 (4): 362–68. https://doi.org/10.1136/bmjqs-2012-001373.