Chapter 14 Prime Charts for Count Data with Very Large Subgroups

With count data involving very large subgroup sizes, SPC charts often produce very tight control limits with many data points outside the limits. Figure 14.1 demonstrates this phenomenon, known as overdispersion, using data from from Mohammed A. Mohammed et al. (2013) included in qicharts2 (data are tabulated at the end of this chapter).

Figure 14.1: P chart of percent emergency attendances seen within 4 hours

This can suggest that the process is highly unstable, even when such a conclusion may not be warranted. Overdispersion occurs when the natural, common cause variation in the data exceeds what is expected under the assumed theoretical distribution – binomial for proportions and Poisson for rates.

To diagnose this issue, we can plot the data on an I chart, which bases its control limits on the natural variation between successive subgroups rather than on theoretical distributional assumptions. Figure 14.2 is an I chart of the data from Figure 14.1 suggesting only common cause variation.

Figure 14.2: I chart of percent emergency attendances seen within 4 hours

As a general rule of thumb, when the control limits from an I chart differ significantly from those of the P or U chart, this suggests that the observed variation is inconsistent with the assumed binomial or Poisson models underlying the P and U charts, respectively.

Several solution to the problem with overdispersion and tight control limits have been proposed. The obvious pragmatic solution is to simply use I charts for count data as well as for (individual) measurements. However, I charts have straight control limits that do not account for varying subgroup sizes. If the subgroup sizes only vary little, this may not be a problem. But sometimes – not the least in healthcare – varying subgroup sizes matter.

Laney proposed a solution that adjusts the assumed within subgroup variation in count data by a factor based on the moving ranges of successive standardised data points (\(\sigma_z\)) (Laney 2002).

14.1 Laney’s prime chart

As shown earlier in Table 6.1, the control limits for a traditional P chart are calculated as:

\[\bar{p}\pm3\sigma_{pi}\]

where \(\bar{p}\) is the average proportion and \(\sigma_{pi}\) is the within-subgroup standard deviation for the i-th subgroup:

\[\sigma_{pi}=\sqrt{\bar{p}(1-\bar{p})/{n_{i}}}\]

where \(n_i\) is the sample size of the i-th subgroup.

The P′ chart (pronounced P prime chart) accounts for both within and between subgroup variation. The control limits are given by:

\[\bar{p}\pm3\sigma_{pi}\sigma_z\]

where \(\sigma_z\) represents the between-subgroup variation, calculated using the moving ranges of the standardised proportions defined as:

\[z_i=(p_i - \bar{p})/\sigma_{pi}\]

The moving ranges are:

\[MR_i=|z_i-z_{i-1}|\]

Then, the estimated between-subgroup variation \(\sigma_z\) is the average moving range devided by the constant 1.128:

\[\sigma_z=\overline{MR_z}/1.128\] Thus, the control limits for the P′ chart are:

\[\bar{p}\pm3\sigma_{pi}\sigma_z \]

Figure 14.3: P’ chart of percent emergency attendances seen within 4 hours

The control limits in Figure 14.3 are very close to the I chart limits from Figure 14.3, but vary slightly due to varying subgroup sizes.

Similarly, control limits for the U’ chart are:

\[\bar{u}\pm3\sigma_{ui}\sigma_z\]

where \(\sigma_{ui}=\sqrt{\bar{u}/n_i}\) and \(\sigma_{z}=\overline{MR_z}/1.128\); and the moving ranges are computed from the standardised rates defined as \(z_i=(u_i-\bar{u})/\sigma_{ui}\).

Note that in qicharts2, the moving ranges of the standardized values are, by default, screened for extreme values – following the same approach used for I charts, as described in Chapter 12.

14.2 When to use prime charts

Laney’s prime charts were developed to handle overdispersion (and underdispersion), that is, when data show more (or less) variation than expected under a binomial or Poisson distribution, which is often seen with very large subgroup sizes. But how do we know when to suspect overdispersion?

A red flag is when a traditional P or U chart shows unusually tight control limits that seem unnatural. If plotting the same data on an I chart produces significantly wider control limits, it is a strong indication that the assumptions behind the P or U chart may not hold. In such cases, switching to Laney’s P’ or U’ charts is typically the better choice.

Note that when the adjustment factor (\(\sigma_z\)) is close to one – indicating little to no overdispersion – the Laney P′ or U′ chart closely resembles the traditional P or U chart. For this reason, Laney recommends using prime charts as a generally safe and robust default.

In the next chapter, we will explore a modified I chart – the I’ (I prime) chart – which generates control limits for count data that closely align with Laney’s prime charts, while also accommodating measurement data with varying subgroup sizes.

Example data for P prime charts

Table 14.1: The number of attendances to major accident and emergency hospital departments in the NHS that were seen within 4 hours of arrival over twenty weeks. Source: (Mohammed A. Mohammed et al. 2013)
Week (i)	Attendances seen within 4 hours (r)	Number of attendances (n)
1	266,501	280,443
2	264,225	276,823
3	276,532	291,681
4	281,461	296,155
5	269,071	282,343
6	261,215	275,888
7	270,409	283,867
8	279,778	295,251
9	270,483	284,468
10	270,320	282,529
11	267,923	279,618
12	271,478	283,932
13	255,353	266,629
14	256,820	268,091
15	261,835	276,803
16	259,144	271,578
17	255,910	266,005
18	260,863	273,520
19	264,465	278,574
20	260,989	273,772

References

Laney, David B. 2002. “Improved Control Charts for Attributes.” Quality Engineering 14: 531–37.

Mohammed, Mohammed A, Jagdeep S Panesar, David B Laney, and Richard Wilson. 2013. “Statistical Process Control Charts for Attribute Data Involving Very Large Sample Sizes: A Review of Problems and Solutions.” BMJ Quality & Safety 22 (4): 362–68. https://doi.org/10.1136/bmjqs-2012-001373.