A free online reference for statistical process control, process capability analysis, measurement systems analysis, control chart interpretation, and other quality metrics.
Don’t miss out! Book a demo of our specialized SPC software and unlock immediate improvements in your processes.
A u-chart is an attributes control chart used with data collected in subgroups of varying sizes. U-charts show how the process, measured by the number of nonconformities per item or group of items, changes over time. Nonconformities are defects or occurrences found in the sampled subgroup. They can be described as any characteristic that is present but should not be, or any characteristic that is not present but should be. For example, a scratch, dent, bubble, blemish, missing button, and a tear are all nonconformities. U-charts are used to determine if the process is stable and predictable, as well as to monitor the effects of process improvement theories. U-charts can be created using software programs like SQCpack.
The u-chart shows the proportion of nonconformities units in subgroups of varying sizes.
A histogram can be created using software such as SQCpack. How would you describe the shape of the histogram?
Bell-shaped: A bell-shaped picture, shown below, usually presents a normal distribution.
Bimodal: A bimodal shape, shown below, has two peaks. This shape may show that the data has come from two different systems. If this shape occurs, the two sources should be separated and analyzed separately.
Skewed right: Some histograms will show a skewed distribution to the right, as shown below. A distribution skewed to the right is said to be positively skewed. This kind of distribution has a large number of occurrences in the lower value cells (left side) and few in the upper value cells (right side). A skewed distribution can result when data is gathered from a system with has a boundary such as zero. In other words, all the collected data has values greater than zero.
Skewed left: Some histograms will show a skewed distribution to the left, as shown below. A distribution skewed to the left is said to be negatively skewed. This kind of distribution has a large number of occurrences in the upper value cells (right side) and few in the lower value cells (left side). A skewed distribution can result when data is gathered from a system with a boundary such as 100. In other words, all the collected data has values less than 100.
Uniform: A uniform distribution, as shown below, provides little information about the system. An example would be a state lottery, in which each class has about the same number of elements. It may describe a distribution which has several modes (peaks). If your histogram has this shape, check to see if several sources of variation have been combined. If so, analyze them separately. If multiple sources of variation do not seem to be the cause of this pattern, different groupings can be tried to see if a more useful pattern results. This could be as simple as changing the starting and ending points of the cells, or changing the number of cells. A uniform distribution often means that the number of classes is too small.
Random: A random distribution, as shown below, has no apparent pattern. Like the uniform distribution, it may describe a distribution that has several modes (peaks). If your histogram has this shape, check to see if several sources of variation have been combined. If so, analyze them separately. If multiple sources of variation do not seem to be the cause of this pattern, different groupings can be tried to see if a more useful pattern results. This could be as simple as changing the starting and ending points of the cells, or changing the number of cells. A random distribution often means there are too many classes.
Follow these steps to interpret histograms.
There are several statistics which are useful to describe and analyze a histogram. They are calculated to describe the area under the curve formed by its shape. These descriptive statistics can be calculated using software such as SQCpack.
Central location
The central location of a set of data points is where (on what value) the middle of the data set is located. Central location is commonly described by the mean, the median, and/or the mode. The mean is the average value of the data points. The median is the middle number in the data set when the data points are arranged from low to high. The mode is the value in the data set that occurs most often.
Spread
Both range and the standard deviation illustrate data spread. Range is calculated by subtracting the minimum data value from the maximum data value. The standard deviation is a measure that indicates how different the values are from each other and from the mean. There are two methods of calculating standard deviation using individual data points or using a samples average range. Both formulas are available here.
Skewness
Skewness is the measure of the asymmetry of a histogram (frequency distribution). A histogram with normal distribution is symmetrical. In other words, the same amount of data falls on both sides of the mean. A normal distribution will have a skewness of 0. The direction of skewness is “to the tail.” The larger the number, the longer the tail. If skewness is positive, the tail on the right side of the distribution will be longer. If skewness is negative, the tail on the left side will be longer. The formula for skewness is available here.
Kurtosis
Kurtosis is a measure of the combined weight of the tails in relation to the rest of the distribution. As the tails of a distribution become heavier, the kurtosis value will increase. As the tails become lighter the kurtosis value will decrease. A histogram with a normal distribution has a kurtosis of 0. If the distribution is peaked (tall and skinny), it will have a kurtosis greater than 0 and is said to be leptokurtic. If the distribution is flat, it will have a kurtosis value less than zero and is said to be platykurtic. The formula for kurtosis is available here.
Coefficient of variance
The coefficient of variance is a measure of how much variation exists in relation to the mean. It may also be described as a measure of the significance of the sigma in relation to the mean. The larger the coefficient of variance, the more significant the sigma, relative to the mean. For example, if the standard deviation is 10, what does it mean? If the process average (mean) is 1000, a sigma value of 10 is not very significant. However, if the average is 15, a standard deviation of 10 is VERY significant. The formula for coefficient of variance is available here.
Chi-square
In SPC, the chi-square statistic is used to determine how well the actual distribution fits the expected distribution. Chi-square compares the number of observations found in each cell in a histogram (actual) to the number of observations that would be found in an expected distribution. If the differences are small, the distribution fits the theoretical distribution. If the difference are large, the distribution probably does not fit the expected distribution.
Using Chi-square with the assumption of a normal distribution
Note: Theoretical percent outside of specifications may be misleading.
The formula for chi-square is available here along with the degrees of freedom table.
Is the shape of the histogram normal? The following characteristics of normal distributions will help in studying your histogram, which you can create using software like SQCpack.
An affinity diagram is the organized output from a brainstorming session. It is one of the seven management tools for planning. The diagram was created in the 1960s by Kawakita Jiro and is also known as the KJ method.
The purpose of an affinity diagram is to generate, organize, and consolidate information concerning a product, process, complex issue, or problem. Constructing an affinity diagram is a creative process that expresses ideas without quantifying them.
The affinity diagram helps a group to develop its own system of thought about a complex issue or problem. A group can use an affinity diagram at any stage where it needs to generate and organize a large amount of information. For example, members of a leadership team may use the diagram during strategic planning to organize their thoughts and ideas. Alternatively an improvement team can use the diagram to analyze the common causes of variation in its project. The diagram is flexible in its application and is easy to use.
A completed affinity diagram is shown below. In the example, a bakery has recently expanded its business and opened a chain of retail outlets. A number of problems have arisen and the management team, involved with the retail outlets, has met to discuss the problems. The issues are complex so they have decided to complete an affinity diagram.
Use an affinity diagram when you can answer “yes” to all of the following questions:
The above article is an excerpt from the “Operational Definition” chapter of Practical Tools for Continuous Improvement Volume 2 Statistical Tools.
Attributes data is data that can be classified and counted. There are two types of attributes data: counts of defects per item or group of items (nonconformities ) and counts of defective items (nonconforming).
Attributes data is analyzed in control charts that show how a system changes over time. There are two chart options for each type of attributes data. These attributes control charts, and more, can be created easily using software packages such as SQCpack.
Nonconforming data is a count of defective units. It is often described as go/no go, pass/fail, or yes/no, since there are only two possible outcomes to any given check. It is also referred to as a count of defective or rejected units. For example, a light bulb either works or it does not. Track either the number failing or the number passing.
Nonconforming data is analyzed in p-charts and np-charts. Chart selection is based on the consistency of the subgroup size:
Nonconformities data is a count of defects per unit or group of units. Nonconformities can refer to defects or occurrences that should not be present but are. It also refers to any characteristic that should be present but is not. Examples of nonconformities are dents, scratches, bubbles, cracks, and missing buttons.
Nonconformities data is analyzed in u-charts and c-charts. Chart selection is based on the consistency of the subgroup size:
A process that is in control is not necessarily producing an output that meets customer or engineering requirements. To find out if your process is capable of producing outputs that are in spec, you should perform capability analysis.
Capability analysis is a set of calculations used to assess whether a system is statistically able to meet a set of specifications or requirements. To complete the calculations, a set of data is required, usually generated by a control chart; however, data can be collected specifically for this purpose. Easily create control charts and perform capability analysis using software like SQCpack.
Specifications or requirements are the numerical values within which the system is expected to operate, that is, the minimum and maximum acceptable values. Occasionally there is only one limit, a maximum or minimum. Customers, engineers, or managers usually set specifications. Specifications are numerical requirements, goals, aims, or standards. It is important to remember that specifications are not the same as control limits. Control limits come from control charts and are based on the data. Specifications are the numerical requirements of the system.
All methods of capability analysis require that the data is statistically stable, with no special causes of variation present. To assess whether the data is statistically stable, a control chart should be completed. If special causes exist, data from the system will be changing. If capability analysis is performed, it will show approximately what happened in the past, but cannot be used to predict capability in the future. It will provide only a snapshot of the process at best. If, however, a system is stable, capability analysis shows not only the ability of the system in the past, but also, if the system remains stable, predicts the future performance of the system.
ttributes capability measures are taken directly from attributes control charts. No additional calculations are required.
The capability for a p-chart is the average proportion of nonconforming items (p-bar). The capability for an np-chart is the average number of nonconforming items generated by the system (np-bar). The capability for a c-chart is the average number of nonconformities per subgroup (c-bar). The capability for a u-chart is the average number of nonconformities per unit (u-bar).
A weakness in capability estimates for attributes data is that they do not suggest why a system is either capable or not. For instance, there is no way of knowing whether the system is incapable because it is not centered, it is too close to a specification limit, or it exhibits too much unit-to-unit variation. Further studies must be done to learn how to improve the system.
This is the quickest and easiest test for system stability. Look above the upper control limit and below the lower control limit to see whether any points fall in those regions of the chart. If you are looking at a chart pair (X-bar and R, X-bar and s, or X and MR), look at both charts.
Points falling outside the control limits may be the result of a special cause that was corrected quickly, either intentionally or unintentionally. It may also point to an intermittent problem. The chart below shows two points outside the control limits.
See also: >> Analyze for special causes of variation >> Any point lying outside the control limits >> 7 or more points in a row above or below the center line >> 7 or more points in one direction >> Any nonrandom pattern