https://github.com/jchester/spc-kit

A toolkit for statistical process control using SQL

License

Apache-2.0 license
SPC Kit

Very much a work-in-progress, but here's the basic idea: perform
statistical process control calculations in SQL. Why?

 1. The database is closest to the data and is usually the fastest
    place to manipulate it, since no rows have to leave the database
    first.
 2. SQL is a lingua franca that any language and framework can
    interoperate with easily.

But by all that's holy take note of the LICENSE, in which I disclaim
all warranties. If you use this for something involving real
consequences, that's on you.

What the heck is statistical process control?

The short version is:

  * A process shows two kinds of variability:
     1. Common or ordinary variability, which can be seen all the
        time and is statistically predictable.
     2. Special or assignable variability, which is out of the
        ordinary.
  * You can use some simple rules to detect the special/assignable
    events, so you can investigate what is going wrong.
  * You can use some simple rules to compare common variability to
    your target performance, so you can figure out whether
    improvement is necessary (and afterwards whether you've managed
    to improve things).

Because statistical process control is based on simple data and
simple rules, it doesn't require a PhD to apply successfully. Folks
were doing this stuff by hand in the 50s without fuss. Turning it
into SQL makes it even easier to apply in a modern context.
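
As a rough illustration of how simple those rules can be, here is a sketch that computes limits for individual measurements (the XmR flavor) from a baseline and flags later points that fall outside them. It is plain Python, not the SQL in this repo; the function names and the data are made up for the example, and 1.128 is the standard d2 constant for moving ranges of two consecutive points.

```python
from statistics import mean

D2_FOR_PAIRS = 1.128  # standard d2 constant for ranges of two points


def xmr_limits(baseline):
    """Return (lower, center, upper) limits for individual measurements."""
    if len(baseline) < 2:
        raise ValueError("need at least two baseline measurements")
    center = mean(baseline)
    # Moving range: absolute difference between consecutive measurements.
    moving_ranges = [abs(b - a) for a, b in zip(baseline, baseline[1:])]
    spread = 3 * mean(moving_ranges) / D2_FOR_PAIRS
    return center - spread, center, center + spread


def flag_special(points, lower, upper):
    """Return the points that fall outside the limits."""
    return [p for p in points if p < lower or p > upper]


# Hypothetical data: a baseline period, then newer measurements to check.
baseline = [10, 12, 11, 13, 10, 12, 11, 13]
lower, center, upper = xmr_limits(baseline)
flagged = flag_special([12, 17, 9, 6], lower, upper)
print(flagged)
```

Points inside the limits are treated as common variability; the flagged ones are candidates for investigation as special/assignable events.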

See "References" for some more detailed reading.

What it can do

  * Report out-of-control samples on variables using:
      + x̄R (aka XbarR) limits. These detect out-of-control sample
        averages, based on the variability of ranges of samples.
        (See: Montgomery §6.2.1, Eqn 6.4)
      + R (aka Rbar) limits. These detect out-of-control sample
        ranges. (See: Montgomery §6.2.1, Eqn 6.5)
      + x̄s (aka XbarS) limits. These detect out-of-control sample
        averages, based on the variability of the standard deviation
        of samples. (See: Montgomery §6.3, Eqn 6.28)
      + s (aka Sbar) limits. These detect out-of-control sample
        standard deviations. (See: Montgomery §6.3, Eqns 6.25 & 6.27)
      + Limits for individual measurements (aka XmR). These are
        applied to samples with a single measurement and track the
        individual values (X) and the moving ranges between
        consecutive measurements (mR). Sensitive to departure from
        normality. (See: Montgomery §6.4, Eqn 6.33; Wheeler &
        Chambers §3.6)
      + Limits for Exponentially-Weighted Moving Averages (EWMA).
        These track shifts in the mean. Useful adjunct to the usual
        Shewhart charts. (See: Montgomery §9.2, Eqns 9.25 & 9.26)
  * Report out-of-control samples on attributes using:
      + p limits, available in both conformant (aka yield chart) and
        non-conformant (aka fallout chart) flavors. (See: Montgomery
        §7.2, Eqn 7.8)
      + np limits, available in both conformant and non-conformant
        flavors. (See: Montgomery §7.2.1, Eqn 7.13)
      + c limits. (See: Montgomery §7.3.1, Eqn 7.17)
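
To make the first two items in the list above concrete, here is a small sketch of the textbook XbarR and R limit calculations: the grand mean plus or minus A2 times the mean range for averages, and D3 and D4 times the mean range for ranges. It is Python, not the SQL shipped in this repo. The function name, the dictionary layout and the data are assumptions for the example; the constants are the usual tabulated values, and only sample sizes 2 to 5 are included.

```python
# Control chart constants for equal-sized samples, as tabulated in standard
# SPC references (rounded). Only sizes 2 to 5 are included in this sketch.
CONSTANTS = {
    2: {"A2": 1.880, "D3": 0.0, "D4": 3.267},
    3: {"A2": 1.023, "D3": 0.0, "D4": 2.574},
    4: {"A2": 0.729, "D3": 0.0, "D4": 2.282},
    5: {"A2": 0.577, "D3": 0.0, "D4": 2.114},
}


def xbar_r_limits(samples):
    """Return (lower, center, upper) limits for sample averages and ranges."""
    if not samples:
        raise ValueError("need at least one sample")
    sizes = {len(s) for s in samples}
    if len(sizes) != 1:
        raise ValueError("all samples must have the same number of measurements")
    n = sizes.pop()
    k = CONSTANTS[n]  # raises KeyError for sizes this sketch does not cover

    means = [sum(s) / n for s in samples]
    ranges = [max(s) - min(s) for s in samples]
    grand_mean = sum(means) / len(means)
    mean_range = sum(ranges) / len(ranges)

    return {
        # Limits for sample averages: grand mean +/- A2 * mean range.
        "xbar": (grand_mean - k["A2"] * mean_range,
                 grand_mean,
                 grand_mean + k["A2"] * mean_range),
        # Limits for sample ranges: D3 * mean range and D4 * mean range.
        "r": (k["D3"] * mean_range, mean_range, k["D4"] * mean_range),
    }


# Hypothetical data: three samples of five measurements each.
limits = xbar_r_limits([
    [10, 12, 11, 13, 14],
    [11, 11, 12, 10, 11],
    [13, 12, 14, 12, 14],
])
print(limits)
```

A sample whose average falls outside the "xbar" limits, or whose range falls outside the "r" limits, would be reported as out of control.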

Sample sizes are assumed to be equal throughout a window.
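
With a constant sample size, the p limits reduce to a single set of numbers per window. The sketch below shows the usual three-sigma calculation for the non-conformant (fallout) flavor; for the conformant flavor you would pass counts of conforming items instead. It is Python, not this repo's SQL, and the function name and data are assumptions for the example.

```python
from math import sqrt


def p_limits(nonconforming_counts, sample_size):
    """Return (lower, center, upper) limits for the fraction nonconforming.

    Assumes every sample contains exactly `sample_size` inspected items.
    """
    if not nonconforming_counts or sample_size <= 0:
        raise ValueError("need at least one sample and a positive sample size")
    total_inspected = sample_size * len(nonconforming_counts)
    p_bar = sum(nonconforming_counts) / total_inspected
    spread = 3 * sqrt(p_bar * (1 - p_bar) / sample_size)
    # A negative lower limit is meaningless for a fraction, so clamp it to 0.
    return max(0.0, p_bar - spread), p_bar, p_bar + spread


# Hypothetical data: four samples of 100 items each.
lower, center, upper = p_limits([4, 6, 5, 5], sample_size=100)
print(lower, center, upper)
```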

What it cannot do

Everything else. No variable sample sizes. No sensitizing rules. No u
charts. No Cusum. No Hotelling T². Etc.

Installation

The SQL dialect used is unapologetically PostgreSQL, so you need that
running first.

Then apply the sql/postgresql files in alphanumeric order. They are
prefixed with numbers for your convenience.

You can optionally add sample data from the data directory. I mostly
used these to check my calculations and rule queries.

Usage

Many of the details of what's what and how it works live in
PostgreSQL comments. But as a summary:

 1. Add your data to tables in spc_data.
      * Create entries in observed_systems for each system you wish to
        observe.
      * Create instruments under each system.
      * Create samples under each instrument for each sampling period.
      * Create measurements under each sample. Every sample must have
        the same number of measurements.
 2. Then establish your limits.
      * Identify which time periods are your limit establishment
        windows: the samples you will use to calculate limits for
        subsequent control. Add these windows to your data.
      * Add control windows that immediately follow a limit
        establishment window and finish before the next limit
        establishment window.
 3. Read back rules applied to samples from spc_reports.
 4. Ignore spc_intermediates, unless you want to understand the
    calculations from end to end.
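
The overall flow in the steps above (calculate limits from a limit establishment window, then judge the samples in the control window that follows) can be sketched in a few lines of SQL. The example below is only an illustration and makes several assumptions: it uses Python's sqlite3 module so that it runs standalone, the two tables and their columns are hypothetical simplifications rather than the real spc_data schema, and the windows are hard-coded as period ranges. The real tables, views and rules are in the sql directory.

```python
import sqlite3

# Hypothetical, simplified stand-in for the idea behind spc_data. The real
# spc-kit tables and columns are different; see the sql directory.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE samples (id INTEGER PRIMARY KEY, period INTEGER NOT NULL);
    CREATE TABLE measurements (
        sample_id INTEGER REFERENCES samples(id),
        value     REAL
    );
""")

# Periods 1-3 play the limit establishment window, periods 4-5 the control
# window. Every sample has five measurements.
data = {
    1: [10, 12, 11, 13, 14],
    2: [11, 11, 12, 10, 11],
    3: [13, 12, 14, 12, 14],
    4: [12, 11, 13, 12, 12],
    5: [15, 16, 14, 15, 15],
}
for period, values in data.items():
    conn.execute("INSERT INTO samples (id, period) VALUES (?, ?)", (period, period))
    conn.executemany(
        "INSERT INTO measurements (sample_id, value) VALUES (?, ?)",
        [(period, v) for v in values],
    )

A2 = 0.577  # tabulated constant for averages of samples of five measurements

query = """
WITH stats AS (
    -- Per-sample average and range.
    SELECT s.id, s.period,
           AVG(m.value)                AS mean,
           MAX(m.value) - MIN(m.value) AS rng
    FROM samples s
    JOIN measurements m ON m.sample_id = s.id
    GROUP BY s.id, s.period
),
limits AS (
    -- Limits come only from the limit establishment window.
    SELECT AVG(mean) - :a2 * AVG(rng) AS lcl,
           AVG(mean) + :a2 * AVG(rng) AS ucl
    FROM stats
    WHERE period BETWEEN 1 AND 3
)
-- Apply those limits to the control window.
SELECT stats.period
FROM stats, limits
WHERE stats.period BETWEEN 4 AND 5
  AND (stats.mean < limits.lcl OR stats.mean > limits.ucl)
ORDER BY stats.period
"""
out_of_control = [row[0] for row in conn.execute(query, {"a2": A2})]
print(out_of_control)
```

Keeping the limit calculation and the rule check in separate steps mirrors the split described above: limits are fixed by the establishment window and do not move as new control samples arrive.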

References

Listed in suggested order of priority.

  * Stjernlof, C. "Statistical Process Control: A Practitioner's
    Guide", Two Wrongs.
  * Montgomery, Douglas C. Introduction to Statistical Quality
    Control, 8th EMEA Ed.
  * Wheeler, Donald J and Chambers, David S. Understanding
    Statistical Process Control, 3rd Ed.
