E 456 - 96 Standard Terminology For Relating To Quality and Statistics1

Designation: E 456 96
AMERICAN SOCIETY FOR TESTING AND MATERIALS

100 Barr Harbor Dr., West Conshohocken, PA 19428
Reprinted from the Annual Book of ASTM Standards. Copyright ASTM
Standard Terminology for

Relating to Quality and Statistics1
This standard is issued under the fixed designation E 456; the number immediately following the designation indicates the year of
original adoption or, in the case of revision, the year of last revision. A number in parentheses indicates the year of last reapproval. A
superscript epsilon (e) indicates an editorial change since the last revision or reapproval.
1. Scope aliases, nin a fractional factorial design, two or more effects

1.1 This terminology includes those quality and statistical which are estimated by the same contrast and which,
terms in wide use in ASTM for which standard definitions therefore, cannot be estimated separately. E 1325
appear desirable. assignable cause, na factor that contributes to variation, and
which is feasible to detect and identify.
2. Referenced Documents
NOTE 2Many factors will contribute to variation but it may not be
2.1 ASTM Standards: feasible (economically or otherwise) to identify some of them.
E 177 Practice for the Use of the Terms Precision and Bias
attribute data, nobserved values or determinations which
in ASTM Test Methods2
indicate the presence or absence of specific characteristics.
E 1325 Terminology Relating to Design of Experiments2
E 1402 Terminology Relating to Sampling2 DISCUSSIONItems or units of material may be evaluated by counting
or measurement. Attributes are counted whereas variables are mea-
3. Significance and Use sured. Attribute distributions are discrete. See variables data.
3.1 This terminology is the general terminology standard for attributes, method of, nmeasurement of quality by the
terms defined by Committee E-11. method of attributes consists of noting the presence (or
3.2 Citation is made to other E-11 standards which contain absence) of some characteristic or attribute in each of the
more extensive information regarding the particular term and units in the group under consideration, and counting how
its usage. These references may be to other practices and many units do (or do not) possess the quality attribute, or
guides or to more specific terminology standards, such as how many such events occur in the unit, group, or area.
Terminology E 1325. average run length (ARL)(1) sample sense, nthe aver-
4. Terminology age number of times that a process will have been sampled
and evaluated before a shift in process level is signaled, and
acceptance (control chart or acceptance control chart (2) unit sense, nthe average number of units that will have
usage, n), na decision that the process is operating in a been produced before a shift in level is signaled.
satisfactory manner with respect to the statistical measures
DISCUSSIONA long ARL is desirable for a process located at its
being plotted: action limits: control limits. specified level (so as to minimize calling for unneeded investigation or
accepted reference value, na value that serves as an corrective action) and a short ARL is desirable for a process shifted to
agreed-upon reference for comparison, and which is derived some undesirable level (so that corrective action will be called for
as: (1) a theoretical or established value, based on scientific promptly). ARL curves are used to describe the relative quickness in
principles, (2) an assigned or certified value, based on detecting level shifts of various control chart systems.
experimental work of some national or international organi- balanced incomplete block design (BIB), nan incomplete
zation, or (3) a consensus or certified value, based on block design in which each block contains the same number
collaborative experimental work under the auspices of a k of different versions from the t versions of a single
scientific or engineering group. principal factor arranged so that every pair of versions
accuracy, nthe closeness of agreement between a test result occurs together in the same number, l, of blocks from the b
and an accepted reference value. blocks. E 1325
NOTE 1The term accuracy, when applied to a set of test results, batch, na definite quantity of some product or material
involves a combination of a random component and of a common produced under conditions that are considered uniform.
systematic error or bias component.
NOTE 3A batch is usually smaller than a lot.
1
This terminology is under the jurisdiction of ASTM Committee E-11 on Quality
bias, nthe difference between the expectation of the test
and Statistics and is the direct responsibility of Subcommittee E11.60 on Terminol- results and an accepted reference value.
ogy.
Current edition approved June 10, 1996. Published September 1996. Originally NOTE 4Bias is the total systematic error as contrasted to random
published as E 456 72. Last previous edition E 456 92. error. There may be one or more systematic error components contributing
2
Annual Book of ASTM Standards, Vol 14.02. to the bias. A larger systematic difference from the accepted reference
1
COPYRIGHT American Society for Testing and Materials
Licensed by Information Handling Services
E 456
value is reflected by a larger bias value. sis format usable for both simple and complex designs. When any set of
orthogonal contrasts is used, the procedure, as in the example, is
characteristic, na property of items in a sample or popula- straightforward. When terms are not orthogonal, the orthogonalization
tion which, when measured, counted or otherwise observed, process to adjust for the common element in nonorthogonal contrast is
helps to distinguish between the items. also systematic and can be programmed. E 1325
cluster sampling, nwhen the primary sampling unit com-
prises a bundle of elementary units or a group of subunits, control(evaluation), nan evaluation to check, test, or
the term cluster sampling may be applied. verify; (authority): the act of guiding, directing, or manag-
ing; (stability): a state of process in which the variability is
DISCUSSIONExamples of cluster sampling are: selection of city attributable to a constant system of chance causes.
blocks as primary sampling units; selection of a household as a cluster control chart factor, na factor, usually varying with sample
of people (of which only one may be interviewed); selection of bundles
of rods or pipe from a shipment; and selection, from a shipment, of
size, to convert specified statistics or parameters into a
cartons that contain boxes or packages within them. central line value or control limit appropriate to the control
chart.
completely randomized design, na design in which the control chart method, nthe method of using control charts
treatments are assigned at random to the full set of experi- to determine whether or not processes are in a stable state.
mental units. E 1325 control limits, nlimits on a control chart which are used as
completely randomized factorial design, na factorial ex- criteria for signaling the need for action, or for judging
periment (including all replications) run in a completely whether a set of data does or does not indicate a state of
randomized design. E 1325 statistical control.
component of variance, na part of a total variance identified conventional true value of a quantity, nvalue attributed to
with a specified source of variability. a particular quantity and accepted, sometimes by conven-
composite design, na design developed specifically for tion, as having an uncertainty appropriate for a given
fitting second order response surfaces to study curvature, purpose.
constructed by adding further selected treatments to those
obtained from a 2n factorial (or its fraction). E 1325 NOTE 988Conventional true value is sometimes called 88assigned
value, 88best value, 88conventional value, or 88reference value. 88Ref-
confounded factorial design, na factorial experiment in erence value, in this sense, should not be confused with 88reference
which only a fraction of the treatment combinations are run value in the sense of an influence quantity affecting a measuring
in each block and where the selection of the treatment instrument.
combinations assigned to each block is arranged so that one NOTE 10Frequently, a number of results of measurements of a
or more prescribed effects is(are) confounded with the block quantity is used to establish a conventional true value.
effect(s), while the other effects remain free from confound- DISCUSSIONWhen warning limits are used, the control limits are
ing. often called action limits. Action may be in the form of investigation
of the source(s) of an assignable cause, making a process adjustment,
NOTE 5All factor level combinations are included in the experiment. or terminating a process. Criteria other than control limits are also used
E 1325 frequently.
confounding, ncombining indistinguishably the main effect dependent variable, nSee response variable.
of a factor or a differential effect between factors (interac- design of experiments, nthe arrangement in which an
tions) with the effect of other factor(s), block factor(s) or experimental program is to be conducted, and the selection
interactions(s). of the levels (versions) of one or more factors or factor
combinations to be included in the experiment. Synonyms
NOTE 6Confounding is a useful technique that permits the effective
use of specified blocks in some experiment designs. This is accomplished
include experiment design and experimental design.
by deliberately preselecting certain effects or differential effects as being E 1325
of little interest, and arranging the design so that they are confounded with deviation, nthe difference between a measurement or quasi-
block effects or other preselected principal factor or differential effects, measurement and its stated value or intended level.
while keeping the other more important effects free from such complica-
tions. Sometimes, however, confounding results from inadvertent changes DISCUSSIONDeviation should be stated as a difference in terms of
to a design during the running of an experiment or from incomplete the appropriate data units. Sometimes these units will be original
planning of the design, and it serves to diminish, or even to invalidate, the measurement units; sometimes they will be quasi-measurements; that
is, a scaled rating of subjective judgments; sometimes they will be
effectiveness of an experiment. E 1325
designated values representing all continuous or discrete measurements
contrast, na linear function of the observations for which falling in defined cells or classes.
the sum of the coefficients is zero. error of result, nthe test result minus the accepted reference
NOTE 7With observations Y1, Y2,..., Yn, the linear function value (of the characteristic).
a1Y1 + a2Y2 + ... + an Yn is a contrast if, and only if (ai 5 0, where the ai
NOTE 11It is not possible to correct for random error.
values are called the contrast coefficients. E 1325
experimental design, nsee design of experiments. E 1325
contrast analysis, na technique for estimating the param- experiment space, nthe materials, equipment, environmen-
eters of a model and making hypothesis tests on preselected tal conditions and so forth that are available for conducting
linear combinations of the treatments (contrasts). an experiment. E 1325
NOTE 8Contrast analysis involves a systematic tabulation and analy- experimental unit, na portion of the experiment space to
2
E 456
which a treatment is applied or assigned in the experiment. ate precisions may be more characteristic of individual laboratories than of
the test method.
NOTE 12The unit may be a patient in a hospital, a group of animals,
a production batch, a section of a compartmented tray, etc. E 1325 intermediate precision conditions, nconditions under
which test results are obtained with the same test method
evolutionary operation (EVOP), na sequential form of using test units or test specimens (see Practice E 691,2 10.3)
experimentation conducted in production facilities during taken at random from a single quantity of material that is as
regular production. nearly homogeneous as possible, and with changing condi-
NOTE 13The principal theses of EVOP are that knowledge to improve tions such as operator, measuring equipment, location within
the process should be obtained along with a product, and that designed the laboratory, and time.
experiments using relatively small shifts in factor levels (within produc- item, n(1) an object or quantity of material on which a set of
tion tolerances) can yield this knowledge at minimum cost. The range of observations can be made: (2) an observed value or test
variation of the factors for any one EVOP experiment is usually quite result obtained from an object or quantity of material.
small in order to avoid making out of tolerance products, which may
require considerable replication, in order to be able to clearly detect the DISCUSSIONThe second usage in the definition is generally limited
effect of small changes. E 1325 to generic descriptions such as in the definition of population. Terms
such as observation, measurement, test result, unit, value or
factorial experiment (general), nin general, an experiment yield are more common in specific applications. A set as used here
in which all possible treatments formed from two or more may be one or more variables.
factors, each being studied at two or more levels (versions) level (of a factor), na given value, a specification of
are examined so that interactions (differential effects) as well procedure or a specific setting of a factor.
as main effects can be estimated. E 1325
2n factorial experiment, na factorial experiment in which n NOTE 1788Version is a general term applied both to quantitative and
factors are studied, each of them in two levels (versions). qualitative factors. The more restrictive term 88level is frequently used to
express more precisely the quantitative characteristic. For example, two
E 1325 versions of a catalyst may be presence and absence. Four levels of a heat
fractional factorial design, na factorial experiment in treatment may be 100C, 120C, 140C, and 160C. E 1325
which only an adequately chosen fraction of the treatments
required for the complete factorial experiment is selected to lota definite quantity of a product or material accumulated
be run. under conditions that are considered uniform for sampling
purposes.
NOTE 14This procedure is sometimes called fractional replication.
lower control limit (LCL), ncontrol limit for points below
frame, na list, compiled for sampling purposes, which the central line.
designates the items (units) of a population or universe to be lower tolerance limit (LTL) (lower specification limit), na
considered in a study. tolerance limit that defines the lower conformance boundary
for an individual unit of a manufacturing or service opera-
DISCUSSIONWhen a frame is available, sampling schemes can be
devised for selection of the units directly (one-stage), or in two or more
tion.
stages. In multi-stage sampling, a frame is needed for each stage. As an main effect, average effect, na term describing a measure
example, the cartons of a lot could be the first-stage units, packages for the comparison of the responses at each level (version) of
within the carton could be second-stage units, and items within the a factor averaged over all levels (versions) of other factors in
packages could be the third-stage units. the experiment.
fully nested experiment, na nested experiment in which the NOTE 18The term 88main effect may describe the parameter in an
second factor is nested within levels (versions) of the first assumed model or the estimate of this parameter. E 1325
factor and each succeeding factor is nested within versions
mixture design, na design in which two or more ingredients
of the previous factor. E 1325
or components shall be mixed and the response is a property
hierarchical experiment, nsee nested experiment.
of the resulting mixture that does not depend upon the
incomplete block design, na design in which the experi-
amount of the mixture.
ment space is subdivided into blocks in which there are
insufficient experimental units available to run a complete NOTE 19The proportions of each of the q components (Xi) in the
set of treatments or replicate of the experiment. E 1325 c
mixture shall satisfy the conditions O # Xi # 1 and ( Xi 5 1; and
intermediate precisions, nthe closeness of agreement be- i51
tween test results obtained under specified intermediate each experimental point is defined in terms of these proportions.
precision conditions. NOTE 20In some fields of application the experimental mixtures are
described by the terms 88formulation or 88blend. The use of mixture
NOTE 15The specific measure and the specific conditions must be designs is appropriate for experimenting with the formulations of manu-
specified for each intermediate measure of precision; thus, 88standard factured products, such as paints, gasoline, foods, rubber, and textiles.
deviation of test results among operators in a laboratory, or 88day-to-day NOTE 21In some applications, the proportions of the components of
standard deviation within a laboratory for the same operator. the mixture may vary between 0 and 100 % of the mixture (88complete
NOTE 16Because the training of operators, the agreement of different domain). In others, there may be operative restraints, so that at least one
pieces of equipment in the same laboratory and the variation of environ- component cannot attain 0 or 100 % (88reduced domain). E 1325
mental conditions with longer time intervals all depend on the degree of
within-laboratory control, the intermediate measures of precision are method of least squares, na technique of estimation of a
likely to vary appreciably from laboratory to laboratory. Thus, intermedi- parameter which minimizes (e2, where e is the difference
3
E 456
between the observed value and the predicted value derived NOTE 26It is not unusual to find that experiments consist of both
from the assumed model. E 1325 factorial and nested segments. See nested experiment. E 1325
natural process limits (NPL), nlimits which include a Plackett-Burman designs, na set of screening designs using
stated fraction of the individuals in a population. orthogonal arrays that permit evaluation of the linear effects
NOTE 22Natural process limits will not ordinarily be the dimensional of up to n 5 t 1 factors in a study of t, treatment
limits shown on an engineering drawing. They are mostly used to compare combinations. E 1325
the natural capability of the process to tolerance limits. population, nthe totality of items or units of material under
DISCUSSIONFor populations with a normal (Gaussian) distribution, consideration.
the natural process limits ordinarily will be at 63 s. If placed around
the standard level, these limits identify the boundaries which will DISCUSSIONThe word items may be interpreted in the sense of
include approximately 99.7 % of the individuals in a process that is measurements, or possible measurements, for a single characteristic, or
properly centered and in a state of statistical control. In many occasionally for multiple characteristics, on all items or units of
circumstances (several machines making the same product that serially material being considered. The word totality may refer to items not
feed into the process) it is recognized that in addition to the variability available for inclusion in samples as well as those which are available.
around a single level, an acceptable zone of standard levels (for the
different machines) is required. Then the NPL may be placed around precision, nthe closeness of agreement between independent
the Acceptable Process Levels (APL) that define this zone so that the test results obtained under stipulated conditions.
NPL identify the boundaries within which at least 99.7 % of the
NOTE 27Precision depends on random errors and does not relate to
individuals will be included in a process located at the APL, or inside
the true value or the specified value.
the zone. It should be noted that there is no assumption made that the
NOTE 28The measure of precision usually is expressed in terms of
process levels within the zone are random variables.
imprecision and computed as a standard deviation of the test results. Less
nested experiment, nan experiment to examine the effect of precision is reflected by a larger standard deviation.
two or more factors in which the same level (version) of a NOTE 2988Independent test results means results obtained in a
factor cannot be used with all levels (versions) of other manner not influenced by any previous result on the same or similar test
object. Quantitative measures of precision depend critically on the
factors. Synonym: hierarchical experiment. E 1325
stipulated conditions. Repeatability and reproducibility conditions are
observation, n(1) the process of obtaining information particular sets of extreme stipulated conditions.
regarding the presence or absence of an attribute of a test
specimen, or of making a reading on a characteristic or probability sample, na sample of which the sampling units
dimension of a test specimen, or (2) the attribute or mea- have been selected by a chance process such that, at each
surement information obtained from the process. (The term step of selection, a specified probability of selection can be
88observed value is preferred for this second usage.) attached to each sampling unit available for selection.
NOTE 23See Annex A1. NOTE 30These probabilities of selection need not be equal. If equal,
see simple random sample. See the general termsample. Also, see
observed value, nthe value obtained by carrying out the Practice E 1052 in this volume.
complete protocol of the test method once, being either a
single test determination or an average or other specified random error of result, na component of the error which,
combination of a specified number of test determinations. in the course of a number of test results for the same
characteristic, varies in an unpredictable way.
NOTE 24See Annex A1. randomization, nthe procedure used to allot treatments at
orthogonal array, na table of coefficients identifying the random to the experimental units so as to provide a high
levels, or some weight associated with the levels, for each degree of independence in the contributions of experimental
factor to be used in the analysis of specified effects, which error to estimates of treatment effects.
are arranged in such a manner that each effect will be NOTE 31An essential element in the design of experiments is to
independent of the other effects. E 1325 provide estimates of effects free from biases due to undetected assignable
orthogonal contrasts, ntwo contrasts are orthogonal if the causes within the experimental space. Randomization is a process to
contrast coefficients of the two sets satisfy the condition that, minimize this risk. The operational procedure for assignment 88at random
when multiplied in corresponding pairs, the sum of the involves the use of random numbers or some similar method for assuring
products is equal to zero. See contrast and contrast analy- that each unit has an equal chance of being selected for each treatment.
sis. E 1325 E 1325
partially balanced incomplete block design (PBIB), nan randomized block design, na design in which the experi-
incomplete block design in which each block contains the ment space is subdivided into blocks of experimental units,
same number k, of different versions from the t versions of the units within each block being more homogeneous than
the principal factor. units in different blocks.
NOTE 25The arrangement is such that not all pairs of versions occur NOTE 32In each block the treatments are allocated randomly to the
together in the same number of the blocks; some versions can therefore be experimental units within each block. Replication is obtained by the use of
compared with greater precision than others. E 1325 two or more blocks, depending on the precision desired, and a separate
randomization is made in each block. E 1325
partially nested experiment, na nested experiment in
which several factors may be crossed as in factorial experi- randomized block factorial design, na factorial experiment
ments and other factors nested within the crossed combina- run in a randomized block design in which each block
tions. includes a complete set of factorial combinations. E 1325
4
E 456
repeatability, nprecision under repeatability conditions. absolute difference between two test results obtained under
reproducibility conditions may be expected to occur with a
NOTE 33Repeatability is one of the concepts or categories of the
precision of a test method. probability of approximately 0.95 (95 %).
NOTE 34Measures of repeatability defined in this compilation are NOTE 40The reproducibility limit is 2.8 ('1.96 =2 ) times the
repeatability standard deviation and repeatability limit. reproducibility standard deviation. The multiplier is independent of the
repeatability conditions, nconditions where independent size of the interlaboratory study (that is, of the number of laboratories
participating), as explained in Practice E 177.2
test results are obtained with the same method on identical NOTE 41The approximation to 0.95 is reasonably good (say 0.90 to
test items in the same laboratory by the same operator using 0.98) when many laboratories (30 or more) are involved but is likely to be
the same equipment within short intervals of time. poor when fewer than eight laboratories are studied.
NOTE 35See precision Note 3. reproducibility standard deviation (SR), nthe standard
DISCUSSIONThe same operator, same equipment requirement deviation of test results obtained under reproducibility con-
means that for a particular step in the measurement process, the same
ditions.
combination of operator and equipment is used for every test result.
Thus, one operator may prepare the test specimens, a second measure NOTE 42Other measures of the dispersion of test results obtained
the dimensions and a third measure the mass in a test method for under reproducibility conditions are the 88reproducibility variance and
determining density. the 88reproducibility coefficient of variation.
DISCUSSIONBy in the shortest practical period of time is meant NOTE 43The reproducibility standard deviation includes, in addition
that the test results, at least for one material, are obtained in a time to between-laboratory variability, the repeatability standard deviation and
period not less than in normal testing and not so long as to permit a contribution from the interaction of laboratory factors (that is, differ-
significant change in test material, equipment or environment. ences between operators, equipment and environments) with material
factors (that is, the differences between properties of the materials other
repeatability limit (r), nthe value below which the absolute than that property of interest).
difference between two individual test results obtained under
repeatability conditions may be expected to occur with a residual error, nthe difference between the observed result
probability of approximately 0.95 (95 %). and the predicted value (estimated treatment response);
Observed Result minus Predicted Value. E 1325
NOTE 36The repeatability limit is 2.8 ('1.96 =2 ) times the
response surface, nthe pattern of predicted responses based
repeatability standard deviation. This multiplier is independent of the size
of the interlaboratory study, as explained in Practice E 177.2 on the empirical model derived from the experiment obser-
NOTE 37The approximation to 0.95 is reasonably good (say 0.90 to vations. E 1325
0.98) when many laboratories (30 or more) are involved, but is likely to response variable, nthe variable that shows the observed
be poor when fewer than eight laboratories are studied. results of an experimental treatment. Synonym dependent
repeatability standard deviation, nthe standard deviation variable. E 1325
of test results obtained under repeatability conditions. robustness, ninsensitivity of a statistical test to departures
from underlying assumptions.
NOTE 38It is a measure of the dispersion of the distribution of test
results under repeatability conditions. DISCUSSIONMany statistical test procedures depend on the form of
NOTE 39Similarly, 88repeatability variance and 88repeatability coef- the assumed distribution of the population sampled to obtain exact
ficient of variation could be defined and used as measures of the values for the probability statements. If departures from the assumed
dispersion of test results under repeatability conditions. distribution do not materially affect the decisions which would be based
DISCUSSIONIn an interlaboratory study, this is the pooled standard on the statistical tests involved, the test is considered robust. For
deviation of test results obtained under repeatability conditions. See example, tests based on an assumption of normality that compare
Practice E 691. averages generally are robust even though the underlying distribution
DISCUSSIONThe repeatability standard deviation, usually consid- of individual items in the population is not normal. On the other hand,
ered a property of the test method, will generally be smaller than the the F-statistic for comparing variances may be an indicator of lack of
within-laboratory standard deviation. (See within-laboratory standard normality rather than a simple variance comparison.
deviation.) ruggedness, ninsensitivity of a test method to departures
reproducibility, nprecision under reproducibility condi- from specified test or environmental conditions.
tions. DISCUSSIONAn evaluation of the ruggedness of a test method or
reproducibility conditions, nconditions where test results an empirical model derived from an experiment is useful in determining
are obtained with the same method on identical test items in whether the results or decisions will be relatively invariant over some
different laboratories with different operators using different range of environmental variability under which the test method or the
equipment. model is likely to be applied.
DISCUSSIONIdentical material means either the same test units or ruggedness test, na planned experiment in which environ-
test specimens are tested by all the laboratories as for a nondestructive mental factors or test conditions are deliberately varied in
test or test units or test specimens are taken at random from a single order to evaluate the effects of such variation.
quantity of material that is as nearly homogeneous as possible. (See
Practice E 691.) DISCUSSIONSince there usually are many environmental factors that
DISCUSSIONA different laboratory of necessity means a different might be considered in a ruggedness test, it is customary to use a
operator, different equipment, and different location and under different screening type of experiment design (see screening design) which
supervisory control. concentrates on examining many first order effects and generally
assume that second order effects such as interactions and curvature are
reproducibility limit, n(R) the value below which the relatively negligible. Often in evaluating the ruggedness of a test
5
E 456
method, if there is an indication that the results of a test method are specification limits, nsee tolerance limits.
highly dependent on the levels of the environmental factors, there is a staggered nested experiment, na nested experiment in
sufficient indication that certain levels of environmental factors must be which the nested factors are run within only a subset of the
included in the specifications for the test method, or even that the test
method itself will need further revision.
versions of the first or succeeding factors. E 1325
standard deviation, nthe most usual measure of the disper-
run, n(1) an uninterrupted sequence of occurrences of the sion of observed values or results expressed as the positive
same attribute or event in a series of observations, and (2) a square root of the variance.
consecutive set of successively increasing run-up or succes- statistic, na quantity calculated from a sample of observa-
sively decreasing run-down values in a series of variable tions, most often to form an estimate of some population
measurements. parameter.
DISCUSSIONIn control chart applications, some variable measure- statistical measure, nstatistic or mathematical function of a
ments are treated as attributes in determining runs. For example, a run statistic.
might be considered a series of a specified number of consecutive DISCUSSIONThe word statistical emphasizes that measures are
points above or below the central line.3 subject to inherent errors and that, in estimating a population parameter,
sample, na group of items, observations, test results, or they represent a sample, with inherent sampling variability.
portions of material, taken from a large collection of items, subgroup, n(1) object sense, na set of units or quantity of
observations, test results, or quantities of material, which material obtained by subdividing a larger group of units or
serves to provide information that may be used as a basis for quantity of material, and (2) measurement sense, na set
making a decision concerning the larger collection. of groups of observations obtained by subdividing a larger
DISCUSSIONThe sample may be the units of material themselves or group of observations. See rational subgroup.
the set of the observations collected from them. The decision may or systematic error of result, na component of the error,
may not involve taking action on the units of material, or on the which in the course of a number of test results for the same
process. It is necessary to describe whether the sample is to be selected characteristic, remains constant or varies in a predictable
on a simple random, a stratified random, or other specified basis. way.
Probability samples, that is, samples selected by chance using appro-
priate randomization, are required to make confidence interval state- NOTE 46Systematic errors and their causes may be known or un-
ments and similar statistical inferences about the parameters of the known.
sampled population.
systematic sampling, nsample selection procedure in which
sample size, nthe number of units in a sample or the number every kth element is selected from the universe or popula-
of observations in a sample. tion; for example, u, u + k, u + 2k, u + 3k, etc., where u is in
sampling fraction, f, nthe ratio f of the number of sampling the interval 1 to k.
units selected for the sample to the number of sampling units
DISCUSSIONIf k 5 20 and u 5 7 is the initial unit selected, then
available. sampling units 7, 27, 47, 67, ..., would comprise the sample. When N/k
NOTE 44For the simple random sample case, f 5 n/N where n is the is not an integer, there is a small bias due to the end effect. When u is
sample size and N is the number of sampling units available. When f > selected by a chance process and N/k is an integer, the systematic
0.10 estimation of the precision of an estimator should take account of this sample will provide unbiased estimates of the population average or
magnitude of f. See finite population correction. total. Situations for which N/k is not an integer usually ignore the small
or negligible bias in estimating the mean or total. Schemes have been
sampling with replacement, na procedure used with some developed for non-integer N/k to overcome sampling bias. See Jessen.4
probability sampling plans in which a selected unit is Estimation of the precision of an average computed from a
replaced after any step in selection so that this sampling unit systematic sample is a difficult problem that has no generally satisfac-
tory solution. Independent replicate systematic samples provide an
is available for selection again at the next step of selection, approach to variance estimation, but have been rejected by some
or at any other succeeding step of the sample selection writers. In some ASTM situations where replicate samples may be
procedure. obtained on a routine basis, the technique may be useful. See Cochran5
screening design, na balanced design, requiring relatively for an extended discussion of variance estimation for systematic
minimal amount of experimentation, to evaluate the lower sampling.
order effects of a relatively large number of factors in terms test determination, n(1) the process of deriving from one or
of contributions to variability or in terms of estimates of more test observations (observed values) the presence or
parameters for a model. absence of an attribute or the value of a characteristic or
NOTE 45In screening designs, the term lower order effects is some- dimension of a single test specimen, or (2) the attribute
times limited to first order terms such as linear components of main (presence or absence) or value derived from the process (see
effects, but often includes both first order terms and second order terms test specimen).
such as two factor interactions and quadratic curvature components of
main effects. E 1325 NOTE 47See Annex A1.
4
Jessen, R. J., Statistical Survey Techniques, John Wiley & Sons, Inc., New
3
Other examples may be found in references such as Nelson, L. S., Interpreting York, 1978, Sec. 12.2.
5
Shewhart X Control Charts, Journal of Quality Technology, Vol 17, No. 2, April Cochran, W. G., Sampling Techniques, John Wiley & Sons, Inc., New York,
1985. 1977, Chapter 8.
6
E 456
test observation, nsee observation. nents of error: (1) bias, and (2) the random error attributed to
test result, nthe value of a characteristic obtained by the imprecision of the measurement process.
carrying out a specified test method.
DISCUSSIONQuantitative measures of uncertainty generally require
NOTE 48The test method should specify that one or a number of descriptive statements of explanation because of differing traditions of
individual observations be made and their average or another appropriate usage and because of differing circumstances. For example: (1) the bias
function, such as the medium or the standard deviation, be reported as the and imprecision may both be negligible; (2) the bias may not be
test result. It also may require standard corrections to be applied, such as negligible while the imprecision is negligible; (3) neither the bias nor
correction of gas volumes to standard temperature and pressure. A test the imprecision may be negligible; (4) the bias may be negligible while
result, therefore, can be a result calculated from several observed values. the imprecision is not negligible.
In the simple case, the test result is the observed value itself.
unit, nan object on which a measurement or observation
test specimen, nthe portion of a test unit needed to obtain a may be made.
single test determination.
DISCUSSIONThe word unit is commonly used in the sense of a
NOTE 49When used for a physical test, this is sometimes called 88test unit of product (service, etc.)the entity of product inspected in order
piece. For a chemical test, it is sometimes called test portion or test to determine its classification or its measurements. This entity may be
sample. For optical and other tests, it is also sometimes called test sample. a single article, a set of like articles treated collectively, a subassembly,
In interlaboratory evaluation of test methods and other statistical proce- a stated quantity of material, etc. The unit of product or service need not
dures, it is best to reserve the word sample for the whole amount of be the same as the unit of purchase, supply, production, or shipment.
material involved and not the individual test specimens, pieces or portions
being tested. universe (population), nthe totality of the set of items,
NOTE 50See Annex A1. units, or measurements, etc., real or conceptual, that is under
consideration.
test unit, nthe total quantity of material (containing one or
more test specimens) needed to obtain a test result as NOTE 54This definition of universe is being revised to incorporate the
specified in the test method. See test result. concept of including one or more populations. Use with caution.
tolerance limits (specification limits), nlimits that define upper control limit (UCL), ncontrol limit for points above
the conformance boundaries for an individual unit of a the central line.
manufacturing or service operation. upper tolerance limit (UTL) (upper specification limit),
DISCUSSIONLimits may be established either with or without the use na tolerance limit applicable to the upper conformance
of probability considerations. Tolerance limits may be in the form of a boundary for an individual unit of a manufacturing or service
single (unilateral) limit (upper or lower) or double (bilateral) limits operation.
(upper and lower). Double, or two-sided limits occur more frequently. variables, method of, nmeasurement of quality by the
Double limits are often stated as a symmetrical deviation from a stated method of variables consists of measuring and recording the
value, but they need not be symmetrical. Frequently the term specifi-
numerical magnitude of a quality characteristic for each of
cation limits is used instead of tolerance limits. While tolerance limits
is generally preferred in terms of evaluating the manufacturing or the units in the group under consideration.
service requirements, specification limits may be more appropriate for NOTE 55This involves reference to a continuous scale of some kind.
categorizing material, product, or service in terms of their stated
requirements. variables data, nmeasurements which vary and may take
tolerance specification, nthe total allowable variation any of a specified set of numerical values.
around a level or state (upper limit minus lower limit), or the DISCUSSIONThe term random variable or variate is often used
maximum acceptable excursion of a characteristic. to indicate that each of the specified set of values is associated with a
specified relative frequency or probability, and that each is a random
DISCUSSIONThe determination of the amount of variation to be sample from a continuous or a discrete, or discontinuous, population
allowed involves the product or service requirements and consideration encompassing the specified values.
of process capability (see natural process limits), measurement
variability, and other appropriate elements or some compromise among variance, na measure of the squared dispersion of observed
these. values or measurements expressed as a function of the sum
treatment, na combination of the levels (versions) of each of the squared deviations from the population mean or
of the factors assigned to an experimental unit, synonym sample average.
treatment combination. NOTE 56The sample variance, or variance of a sample of n observed
treatment combination, nsee treatment. values, is computed as s2 5 [1/(n 1)][((yi y)2]. The sample standard
trueness, nthe closeness of agreement between the popula- deviation s is the positive square root of the sample variance. The
tion mean of the measurements or test results and the population variance s2 5 *R (y )2f (y)dy, where R is the region over
which the random variable y is defined, and where f (y) is the probability
accepted reference value.
density function and is the population mean of y. The population
NOTE 51The measure of trueness usually is expressed in terms of standard deviation (s) is the positive square root of the population
bias. Greater bias means less favorable trueness. variance.
NOTE 5288Population mean is, conceptually, the average value of an DISCUSSIONA listing of the sample variance s2 should always be
indefinitely large number of test results. accompanied by the degrees of freedom on which it is based. The
NOTE 53Trueness is the systematic component of accuracy. degrees of freedom for the sample variance described above are (n 1).
uncertainty, nan indication of the variability associated with within-laboratory standard deviation, nthe standard de-
a measured value that takes into account two major compo- viation of test results obtained within a laboratory for a
7
E 456
single material under conditions that may include such Youden square, na type of block design derived from
elements as different operators, equipment, and longer time certain Latin squares by deleting, or adding, rows (or
intervals. columns) so that one block factor remains complete blocks
NOTE 57Because the training of operators, the agreement of different and the second block factor constitutes balanced incomplete
pieces of equipment in the same laboratory and the variation of environ- blocks. E 1325
mental conditions with longer time intervals depend on the degree of
within-laboratory control, the within-laboratory standard deviation is
likely to vary appreciably from laboratory to laboratory.
ANNEX
(Mandatory Information)
A1. MEASUREMENT TERMINOLOGY
A1.1 A test method often has three distinct stages: (1) the test method specifies that only one test determination is to be
direct observation of dimensions or characteristics, (2) the made, then the test determination value is the test result of the
combining of the observed values to obtain a single test test method. Some test methods require that several determi-
determination, and (3) the combining of a number of test nations be made and the values obtained be averaged or
determinations to obtain the test result of the test method. The otherwise combined to obtain the test result of the test method.
term measurement may be applied to any one or more of these Averaging of several determinations is often used to reduce the
stages of the measurement process. effect of local variations of the property within the material.
A1.2 In the simplest of test methods a single direct
observation is also the test determination and the test result. A1.3 Precision statements for ASTM test methods are
For example, a test observation required by a test method may usually based on test results, not test determinations or obser-
be the mass of a test specimen prepared and weighed in a vations. If for some compelling reason an ASTM committee
specified way. The observation would also be the test determi- wished to address the issue of variation between test determi-
nation of the mass of the test specimen, and if only one nations (in addition to the variation among test results), the
specimen is to be weighed, the observed weight would also be committee can do so with a clear declaration (of what is being
the test result of the test method. Another test method may done) to avoid confusion. Sampling plans and product speci-
require the measurement of the area of the test specimen as fications should specify the sample size in terms of the number
well as the mass, and then direct that the mass be divided by of replicate test results. A test method should specify the
the area to obtain the mass per unit area of the test specimen. required observations to obtain a test determination and the
The whole process of measuring the mass and the area and number of test determinations to be averaged or otherwise
calculating the mass per unit area is a test determination. If the combined to obtain a single test result.
The American Society for Testing and Materials takes no position respecting the validity of any patent rights asserted in connection
with any item mentioned in this standard. Users of this standard are expressly advised that determination of the validity of any such
patent rights, and the risk of infringement of such rights, are entirely their own responsibility.
This standard is subject to revision at any time by the responsible technical committee and must be reviewed every five years and
if not revised, either reapproved or withdrawn. Your comments are invited either for revision of this standard or for additional standards
and should be addressed to ASTM Headquarters. Your comments will receive careful consideration at a meeting of the responsible
technical committee, which you may attend. If you feel that your comments have not received a fair hearing you should make your
views known to the ASTM Committee on Standards, 100 Barr Harbor Drive, West Conshohocken, PA 19428.
8

E 456 - 96 Standard Terminology For Relating To Quality and Statistics1

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

E 456 - 96 Standard Terminology For Relating To Quality and Statistics1

Uploaded by

Copyright:

Available Formats

Designation: E 456 96

AMERICAN SOCIETY FOR TESTING AND MATERIALS

Standard Terminology for

1. Scope aliases, nin a fractional factorial design, two or more effects

A1. MEASUREMENT TERMINOLOGY

You might also like