Types of Distributions in Six Sigma

This article discusses the common types of probability distributions used in Six Sigma Black Belt Projects. Many statistical approaches you’ll learn later are built on the assumption that the data is normally distributed, so it’s important to gain an early understanding of the normal distribution. We’ll also cover some less common statistical distributions used in Six Sigma Black Belt Projects and show why they matter.

The Normal Distribution

The most common distribution used in Six Sigma is the normal distribution.

The Normal Distribution has these 3 unique characteristics:

1. Only Random Error is present
2. There is no evidence of Assignable Cause
3. There are no drifts or shifts in the data, as evidenced by the fact that Mean = Median = Mode

The obvious conclusion from items 1-3 is that if the data is not normally distributed, then the following are likely true:

Probably more than random error is present
Probably there is evidence of assignable special cause

Note on the Normal Distribution

As you assess the distributions in your data set, understand that it’s difficult to determine what’s affecting a process if the data set is normally distributed. Special cause is harder to determine when your data set is a pretty little normal distribution. When it’s skewed in some way, then it’s much easier.

Most Used and Abused Distribution

While pretty and smooth, the normal distribution is the most used probability distribution – and because it’s so misunderstood, it’s also the most abused. And while it serves as the foundation of many statistical tools that we’ll learn later in the Measure Phase and in the Analyze Phase, encountering the normal distribution in real life is not common.

Characteristics of Normal Distribution

The Normal Distribution is a function of two parameters: The Mean and the Standard Deviation.
Each combination of the Mean and Standard Deviation can produce a unique normal curve. Below are pictures of normal distributions.

Notice how they are different?

If your distribution has the normal bell shape, but is uniquely different from the “Standard” Normal Curve, then we can transform our unique normal distribution to the “Standard” Normal Distribution.

Why be Normal?

Why do we want to do this? It allows us to use the Z Table, which lets us compare different normal distributions and estimate tail area proportions. This act of converting our data to the Standard Normal is called “Normalizing” (it is also commonly called standardizing).

By Normalizing our data, we convert the raw score into the standard Z Scores with a Mean = 0 and a Standard Deviation = 1. This allows us to use the Z Table to make estimates.
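As a minimal sketch of the conversion described above, the Z Score of a raw value x is (x - Mean) / Standard Deviation. The process values below (a mean of 50 and a standard deviation of 5) are hypothetical and chosen only for illustration; the tail area is computed with the standard library's error function rather than looked up in a printed Z Table.

```python
import math

def z_score(x, mean, std_dev):
    """Convert a raw score to a Z Score (mean 0, standard deviation 1)."""
    return (x - mean) / std_dev

def standard_normal_cdf(z):
    """Area under the Standard Normal curve to the left of z."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

# Hypothetical process: mean = 50, standard deviation = 5 (illustrative only).
z = z_score(60.0, 50.0, 5.0)
upper_tail = 1.0 - standard_normal_cdf(z)  # proportion expected above 60

print(f"Z = {z:.2f}, upper tail area = {upper_tail:.5f}")
```

The computed tail area plays the same role as the value you would read from a Z Table for that Z Score.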

Area Under the Curve and Process Capability

I understand that this material is getting a little academic. I want you to keep the end in mind: this information will help us later on determine process capability, a key measure in Six Sigma. More on that later.

Proportion of Distribution

The area under the curve between any 2 points represents the proportion of distribution between those points. If we can estimate the area under the curve between 2 points, then we’ll be able to estimate process capability.
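To illustrate the idea, here is a rough sketch that estimates the proportion of a normal distribution falling between two points. The mean, standard deviation, and the two limits are hypothetical values picked for the example, and the calculation assumes the data really is normally distributed.

```python
import math

def normal_cdf(x, mean, std_dev):
    """Area under a normal curve to the left of x."""
    z = (x - mean) / std_dev
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def proportion_between(lower, upper, mean, std_dev):
    """Area under the normal curve between two points."""
    return normal_cdf(upper, mean, std_dev) - normal_cdf(lower, mean, std_dev)

# Hypothetical process: mean = 100, standard deviation = 2,
# with illustrative limits at 96 and 104 (2 standard deviations each side).
within = proportion_between(96.0, 104.0, 100.0, 2.0)

print(f"Proportion between the limits: {within:.4f}")
```

If the two points are specification limits, the proportion outside them (1 minus this result) is an estimate of the fraction of output that misses the specification.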

Extending what we now know about the area under the curve, we can more accurately estimate how our processes are performing. This curve will be familiar to you – it’s the foundation of Six Sigma.

The Empirical Rule

Based on this data and what we know about the area under the curve, we can make the following conclusions:

68.27 % of the data will fall within +/- 1 standard deviation
95.45 % of the data will fall within +/- 2 standard deviations
99.73 % of the data will fall within +/- 3 standard deviations
99.9937 % of the data will fall within +/- 4 standard deviations
99.999943 % of the data will fall within +/- 5 standard deviations
99.9999998 % of the data will fall within +/- 6 standard deviations

This means that, for normally distributed data, once we get beyond 3 standard deviations from the mean, the probability of occurrence is very low (about 0.27%). Other distribution shapes can have heavier tails, so these percentages should not be assumed to hold for them.
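The percentages listed for the normal distribution can be reproduced with a short sketch. For a normal distribution, the proportion within +/- k standard deviations of the mean equals erf(k / sqrt(2)); the function name `coverage_within` is just an illustrative choice.

```python
import math

def coverage_within(k):
    """Proportion of a normal distribution within +/- k standard deviations."""
    return math.erf(k / math.sqrt(2.0))

for k in range(1, 7):
    print(f"+/- {k} standard deviations: {coverage_within(k) * 100:.7f} %")
```

Note that these are the two-sided areas of an unshifted normal curve, which is what the list above reports.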

Below are several distributions you might encounter in your DMAIC project work:

Other Distributions Used in Black Belt DMAIC Projects

I’ve already indicated that the Normal Distribution will be the most commonly used distribution in your time running DMAIC projects. But, in the wild – in real life – encountering a data set that is approximated by the Normal Distribution will not be a common occurrence. So, to help you get accustomed to what you might see, below are a few distributions to keep in mind.

Binomial Distribution

What is it?

The Binomial Distribution is used to model discrete data and applies when the population is large (where N is greater than 50) and the sample size is small compared to the population.

When to Use it?

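Use the Binomial Distribution when each unit falls into one of two outcomes (for example, defective or not defective). A common rule of thumb is to use it when the proportion of defects is equal to or greater than 0.1; when the proportion is much smaller and the sample is large, the Poisson Distribution is often used as an approximation.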
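As a small sketch of how the Binomial Distribution is applied, the code below computes the probability of finding a given number of defective units in a sample. The sample size of 20 and the defect proportion of 0.1 are hypothetical, and the sketch assumes each unit is defective independently with the same probability. It uses `math.comb`, which requires Python 3.8 or later.

```python
import math

def binomial_pmf(k, n, p):
    """Probability of exactly k defectives in a sample of n units."""
    return math.comb(n, k) * (p ** k) * ((1.0 - p) ** (n - k))

# Hypothetical sample: n = 20 units, proportion defective p = 0.1.
n, p = 20, 0.1
p_zero = binomial_pmf(0, n, p)
p_at_most_one = binomial_pmf(0, n, p) + binomial_pmf(1, n, p)

print(f"P(0 defectives)      = {p_zero:.4f}")
print(f"P(at most 1 defective) = {p_at_most_one:.4f}")
```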

Poisson Distribution

What is it?

From experience, the Poisson Distribution is a common distribution in waiting lines, call centers, and in many transactional processes. Don’t be surprised if you see it in trying to approximate data from those areas.

When to Use it?

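The Poisson Distribution can be used to model discrete data, specifically counts of events that occur in a fixed interval of time or space, such as the number of calls arriving per minute or the number of defects per unit.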
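Here is a minimal sketch of a Poisson calculation for count data. The average rate of 4 calls per minute is a made-up figure for a call-center style example, and the sketch assumes calls arrive independently at a constant average rate.

```python
import math

def poisson_pmf(k, rate):
    """Probability of exactly k events when the average count per interval is rate."""
    return math.exp(-rate) * (rate ** k) / math.factorial(k)

# Hypothetical call center: an average of 4 calls arrive per minute.
rate = 4.0
p_no_calls = poisson_pmf(0, rate)
p_more_than_six = 1.0 - sum(poisson_pmf(k, rate) for k in range(7))

print(f"P(no calls in a minute)        = {p_no_calls:.4f}")
print(f"P(more than 6 calls in a minute) = {p_more_than_six:.4f}")
```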

Chi Square Distribution

What is it?

The Chi Square Distribution is formed by summing the squares of independent standard normal random variables. For example, if z1, z2, ..., zk are independent standard normal random variables, then the sum of their squares, z1^2 + z2^2 + ... + zk^2, will form the Chi Square Distribution with k degrees of freedom.
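The construction can be checked with a quick simulation sketch. It draws standard normal values, squares and sums them, and compares the average of those sums with the number of variables summed, which is the known mean of a Chi Square Distribution. The choice of 3 variables, 20,000 samples, and the seed are arbitrary assumptions for illustration.

```python
import random

def chi_square_sample(k, rng):
    """One Chi Square value: the sum of k squared standard normal draws."""
    return sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(k))

# Arbitrary illustrative settings: 3 variables, 20,000 samples, fixed seed.
rng = random.Random(0)
k = 3
samples = [chi_square_sample(k, rng) for _ in range(20000)]
sample_mean = sum(samples) / len(samples)

print(f"Sample mean of the sums: {sample_mean:.3f} (theoretical mean is {k})")
```

Because every term is a square, all simulated values are non-negative, which is why the Chi Square curve is skewed to the right rather than bell shaped.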

When to Use it?

The Chi Square is common (from my experience) in healthcare processes and in areas where discrete data is found such as Go/No Go, Present/Not Present, etc.

Next Up

In our next module, we’ll treat different data distributions and learn various ways we can visualize the data using graphical methods.

