ASTM E456-02 R04.pdf

Designati Desi gnation: on: E 456 – 02

An American National Standard

e1

Standard Terminology

Relating to Quality and Statistics 1 This standard is issued under the fixed designation E 456; the number immediately following the designation indicates the year of original adoption or, in the case of revision, the year of last revision. A number in parentheses indicates the year of last reapproval. A superscript supers cript epsilon (e) indicates an editorial change since the last revision or reapproval.

e1 NOTE—Editorial changes were made throughout in January 2004.

1. Sco Scope pe 1.1 This terminology terminology includes includes those quality quality and stat statisti istical cal terms in wid terms widee use in AST ASTM M for which sta standa ndard rd defi definit nition ionss appear desirable. 2. Referenced Documents 2.1 ASTM Standards: 2 E 29 29 Prac Practice tice for Using Signi Significan ficantt Digit Digitss in Test Test Data to Determine Conformance with Specifications E 177 Pract Practice ice for the Use of the Terms Terms Precision Precision and Bias in ASTM Test Methods E 178 Pract Practice ice for Deali Dealing ng with Outlying Outlying Obser Observati vations ons E 1169 1169 Guide for Conducting Ruggedness Tests Tests E 1325 Te Terminology rminology Relating to Design of Experiments E 1402 Te Terminology rminology Relating to Sampling E 1488 Guide for Statistica Statisticall Procedures Procedures to Use in Devel Developoping and Applying Test Methods E 199 1994 4 Pra Practi ctice ce for Use of Pro Proces cesss Ori Orient ented ed AOQL and LTPD Sampling Plans E 2281 Pract Practice ice for Proc Process ess and Measu Measureme rement nt Capab Capabilit ility y Indices E 2282 Guide for Defining the Test Test Result of a Test Method E 2334 Practice for Setting Setting an Upper Confidence Confidence Bound for a Fraction or Number of Non-Conforming Items, or a Rate of Occurrence for Non-Conformities, Non-Conformities, Using Attribute Data, When there is a Zero Response in the Sample 3. Signi Significanc ficancee and Use 3.1 This terminology is is the general terminology terminology standard for terms defined by Committee E-11. 3.2 Cita Citation tion is made to other E-11 E-11 standards standards which contain more extensive information regarding the particular term and 1

This terminology is under the jurisdiction of ASTM Committee E 11 on Quality and Statistics and is the direct responsibility of Subcommittee E11.60 on Terminology. Currentt edition approved Curren approved Oct. 10, 2002 2002.. Publi Published shed November 2002. Originally published as E 456 – 72. Last previous edition E 456 – 96. 2 For referenced ASTM standards, visit the ASTM website, www.astm.org, or contact ASTM Customer Service at service@ [email protected] astm.org. g. For For Annual Annual Book of ASTM volumee inform information, ation, refer to the standard’s Document Document Summar Summary y page on Standardsvolum Standards the ASTM website website..

its usage. usage. The These se ref refere erence ncess may may be to oth other er pra practi ctices ces and guides gui des or to mor moree spe specifi cificc ter termi minol nology ogy sta standa ndards rds,, suc such h as Terminology E 1325. 4. Terminology acceptanc accept ancee (co (contr ntrol ol cha chart rt or acc accept eptanc ancee con contr trol ol cha chart rt usage, n), n—a decision that the process is operating in a satisfactory manner with respect to the statistical measures being plotted: action limits: control limits . valu luee th that at se serv rves es as an accepted acce pted refe referenc rencee value value,, n—a va agreed-upon reference for comparison, and which is derived as: (1) a theoretical or established value, based on scientific principl prin ciples, es, ( 2) an as assi sign gned ed or ce cert rtifi ified ed va valu lue, e, ba base sed d on experimental work of some national or international organizati za tion on,, or (3) a co cons nsen ensu suss or ce cert rtifi ified ed va valu lue, e, ba base sed d on collabora coll aborative tive experimental experimental work under the auspi auspices ces of a scientific or engineering group. accuracy, n —the closeness of agreement between a test result and an accepted reference value. NOTE 1—The 1—The ter term m acc accura uracy cy,, whe when n app applied lied to a set of tes testt res results ults,, involve invol vess a co comb mbina inatio tion n of a ra rando ndom m co comp mpon onen entt an and d of a co comm mmon on systematic error or bias component.

aliases, n—in a fractional factorial design, two or more effects whic wh ich h ar aree es esti tima mate ted d by th thee sa same me co cont ntra rast st an and d wh whic ich, h, E 1325 therefore, cannot be estimated separately. assignable cause, n —a factor that contributes to variation, and which is feasible to detect and identify. NOTE 2—Many factors will contribute to variation but it may not be feasible (economically or otherwise) to identify some of them.

attribute data, n—observed values or determinations which indicate the presence or absence of specific characteristics. DISCUSSION—Items or units of material may be evaluated by counting or measurement. measurement. Attributes are counted whereas variables are measured. Attribute Attribute distribu distributions tions are discrete. See variables See variables data. data .

attributes, attribute s, metho method d of, n—me —measu asurem rement ent of qua qualit lity y by the metho me thod d of att attrib ribute utess con consis sists ts of not noting ing the pre presen sence ce (or absence) of some characteristic or attribute in each of the units in the group under consideration, and counting how

Copyright © ASTM International, 100 Barr Harbor Drive, PO Box C700, West Conshohocken, PA 19428-2959, United States.

Copyright ASTM International Reproduced by IHS under license with ASTM

1Document provided by I HS Licensee=Instituto Mexicano Del Petroleo/3139900100, Petroleo/3139900100, 10/27/2004 13:11:48 MDT Questions or comments about this message: please call the Document Policy Group at 303-397-2295.

` , , ` , ` , , ` , , ` ` , ` , , ` ` ` , , ` ` , ` ` ` ` , , ` ` ` ` , , , ` -

E 456 – 02 many units do (or do not) possess the quality attribute, or how many such events occur in the unit, group, or area. E 2334 average outgoing quality (AOQ)—the average percent defective of outgoing product including all accepted lots or batches, after any defective units found in them are replaced by acceptable units, plus all lots or batches which are not accepted after such lots or batches have been effectively 100 % inspected and all defective units replaced by acceptE 1994 able units. average outgoing quality limit (AOQL)—the maximum of the AOQs for all possible incoming percentages defective for the process, for a given acceptance sampling plan. E 1994 average quality protection—a type of protection in which there is prescribed some chosen value of average percent defective in the product after inspection (average outgoing quality limit (AOQL), that shall not be exceeded in the long run no matter what may be the level of percent defective in E 1994 the product submitted to the inspector. average run length (ARL)—(1) sample sense, n—the average number of times that a process will have been sampled and evaluated before a shift in process level is signaled, and (2) unit sense, n—the average number of units that will have been produced before a shift in level is signaled. DISCUSSION—A long ARL is desirable for a process located at its specified level (so as to minimize calling for unneeded investigation or corrective action) and a short ARL is desirable for a process shifted to some undesirable level (so that corrective action will be called for promptly). ARL curves are used to describe the relative quickness in detecting level shifts of various control chart systems.

average standard deviation, s¯, n—arithmetic average of sample standard deviations. E 2281 balanced incomplete block design (BIB), n—an incomplete block design in which each block contains the same number k of different versions from the t versions of a single principal factor arranged so that every pair of versions occurs together in the same number, l , of blocks from the b blocks. E 1325 batch, n—a definite quantity of some product or material produced under conditions that are considered uniform. NOTE 3—A batch is usually smaller than a lot.

bias, n—the difference between the expectation of the test results and an accepted reference value. NOTE 4—Bias is the total systematic error as contrasted to random error. There may be one or more systematic error components contributing to the bias. A larger systematic difference from the accepted reference value is reflected by a larger bias value.

characteristic, n —a property of items in a sample or population which, when measured, counted or otherwise observed, helps to distinguish between the items. E 2282 cluster sampling, n—when the primary sampling unit comprises a bundle of elementary units or a group of subunits, the term cluster sampling may be applied. DISCUSSION—Examples of cluster sampling are: selection of city blocks as primary sampling units; selection of a household as a cluster of people (of which only one may be interviewed); selection of bundles of rods or pipe from a shipment; and selection, from a shipment, of --`,,,````,,````,``,,```,,`,-`-`,,`,,`,`,,`---


e1

cartons that contain boxes or packages within them.

completely randomized design, n—a design in which the treatments are assigned at random to the full set of experimental units. E 1325 completely randomized factorial design, n—a factorial experiment (including all replications) run in a completely randomized design. E 1325 component of variance, n—a part of a total variance identified with a specified source of variability. composite design, n—a design developed specifically for fitting second order response surfaces to study curvature, constructed by adding further selected treatments to those obtained from a 2 n factorial (or its fraction). E 1325 confidence bound, n—see confidence limit . E 2334 confidence coefficient, n—the value, C , of the probability associated with a confidence interval or statistical coverage interval. It is often expressed as a percentage. ISO 3534-1 E 2334 confidence level, n—see confidence coeffıcient . E 2334 confidence limit, n —each of the limits, T 1 and T 2, of the two sided confidence interval, or the limit T of the one sided confidence interval. E 2334 confounded factorial design, n—a factorial experiment in which only a fraction of the treatment combinations are run in each block and where the selection of the treatment combinations assigned to each block is arranged so that one or more prescribed effects is(are) confounded with the block effect(s), while the other effects remain free from confounding. NOTE 5—All factor level combinations are included in the experiment.

E 1325 confounding, n —combining indistinguishably the main effect of a factor or a differential effect between factors (interactions) with the effect of other factor(s), block factor(s) or interactions(s). NOTE 6—Confounding is a useful technique that permits the effective use of specified blocks in some experiment designs. This is accomplished by deliberately preselecting certain effects or differential effects as being of little interest, and arranging the design so that they are confounded with block effects or other preselected principal factor or differential effects, while keeping the other more important effects free from such complications. Sometimes, however, confounding results from inadvertent changes to a design during the running of an experiment or from incomplete planning of the design, and it serves to diminish, or even to invalidate, the effectiveness of an experiment.

E 1325

consumer’s risk—the probability that a lot whose percentage defective is equal to the LTPD will be accepted by the plan. E 1994 contrast, n—a linear function of the observations for which the sum of the coefficients is zero. NOTE 7—With observations Y 1, Y 2,..., Y n, the linear function a1Y 1 + a2Y 2 + ... + a n Y n is a contrast if, and only if ( ai = 0, where the a i values are called the contrast coefficients.

E 1325

contrast analysis, n—a technique for estimating the parameters of a model and making hypothesis tests on preselected linear combinations of the treatments (contrasts). 2Document provided by I HS Licensee=Instituto Mexicano Del Petroleo/3139900100, 10/27/2004 13:11:48 MDT Questions or comments about this message: please call the Document Policy Group at 303-397-2295.

` , , , ` ` ` ` , , ` ` ` ` , ` ` , , ` ` ` , , ` , ` ` , , ` , , ` , ` , , ` -

E 456 – 02

NOTE 8—Contrast analysis involves a systematic tabulation and analysis format usable for both simple and complex designs. When any set of orthogonal contrasts is used, the procedure, as in the example, is straightforward. When terms are not orthogonal, the orthogonalization process to adjust for the common element in nonorthogonal contrast is also systematic and can be programmed. E 1325

control—(evaluation), n—an evaluation to check, test, or verify; (authority): the act of guiding, directing, or managing; (stability): a state of process in which the variability is attributable to a constant system of chance causes. control chart factor, n —a factor, usually varying with sample size, to convert specified statistics or parameters into a central line value or control limit appropriate to the control chart. control chart method, n —the method of using control charts to determine whether or not processes are in a stable state. control limits, n —limits on a control chart which are used as criteria for signaling the need for action, or for judging whether a set of data does or does not indicate a state of statistical control. conventional true value of a quantity, n —value attributed to a particular quantity and accepted, sometimes by convention, as having an uncertainty appropriate for a given purpose. 88

88

NOTE 9— Conventional true value” is sometimes called assigned value”, best value”, conventional value”, or reference value”. Reference value”, in this sense, should not be confused with reference value” in the sense of an influence quantity affecting a measuring instrument. NOTE 10—Frequently, a number of results of measurements of a quantity is used to establish a conventional true value. DISCUSSION—When warning limits are used, the control limits are often called “action limits.” Action may be in the form of investigation of the source(s) of an “assignable cause”, making a process adjustment, or terminating a process. Criteria other than control limits are also used frequently. 88

88

88

88

88

dependent variable, n— See response variable. E 1325 design of experiments, n—the arrangement in which an experimental program is to be conducted, and the selection of the levels (versions) of one or more factors or factor combinations to be included in the experiment. Synonyms include experiment design and experimental design. E 1325 deviation, n —the difference between a measurement or quasimeasurement and its stated value or intended level. DISCUSSION— Deviation should be stated as a difference in terms of the appropriate data units. Sometimes these units will be original measurement units; sometimes they will be quasi-measurements; that is, a scaled rating of subjective judgments; sometimes they will be designated values representing all continuous or discrete measurements falling in defined cells or classes.

error of result, n —the test result minus the accepted reference value (of the characteristic). NOTE 11—It is not possible to correct for random error.

evolutionary operation (EVOP), n—a sequential form of experimentation conducted in production facilities during regular production. NOTE 12—The principal theses of EVOP are that knowledge to improve


e1

the process should be obtained along with a product, and that designed experiments using relatively small shifts in factor levels (within production tolerances) can yield this knowledge at minimum cost. The range of variation of the factors for any one EVOP experiment is usually quite small in order to avoid making out of tolerance products, which may require considerable replication, in order to be able to clearly detect the

E 1325

effect of small changes.

experimental design, n— see design of experiments. E 1325 experiment space, n —the materials, equipment, environmental conditions and so forth that are available for conducting E 1325 an experiment. experimental unit, n—a portion of the experiment space to which a treatment is applied or assigned in the experiment. NOTE 13—The unit may be a patient in a hospital, a group of animals, a production batch, a section of a compartmented tray, etc.

E 1325

factorial experiment (general), n —in general, an experiment in which all possible treatments formed from two or more factors, each being studied at two or more levels (versions) are examined so that interactions (differential effects) as well E 1325 as main effects can be estimated. n 2 factorial experiment, n —a factorial experiment in which n factors are studied, each of them in two levels (versions). E 1325 fractional factorial design, n—a factorial experiment in which only an adequately chosen fraction of the treatments required for the complete factorial experiment is selected to be run. NOTE 14—This procedure is sometimes called fractional replication.

E 1325 frame, n—a list, compiled for sampling purposes, which designates the items (units) of a population or universe to be considered in a study. DISCUSSION—When a frame is available, sampling schemes can be devised for selection of the units directly (one-stage), or in two or more stages. In multi-stage sampling, a frame is needed for each stage. As an example, the cartons of a lot could be the first-stage units, packages within the carton could be second-stage units, and items within the packages could be the third-stage units.

fully nested experiment, n —a nested experiment in which the second factor is nested within levels (versions) of the first factor and each succeeding factor is nested within versions E 1325 of the previous factor. hierarchical experiment, n— see nested experiment. E 1325 incomplete block design, n—a design in which the experiment space is subdivided into blocks in which there are insufficient experimental units available to run a complete E 1325 set of treatments or replicate of the experiment. intermediate precisions, n—the closeness of agreement between test results obtained under specified intermediate precision conditions. NOTE 15—The specific measure and the specific conditions must be specified for each intermediate measure of precision; thus, standard deviation of test results among operators in a laboratory,” or day-to-day standard deviation within a laboratory for the same operator.” 88

88

3Document provided by I HS Licensee=Instituto Mexicano Del Petroleo/3139900100, 10/27/2004 13:11:48 MDT Questions or comments about this message: please call the Document Policy Group at 303-397-2295.

E 456 – 02 NOTE 16—Because the training of operators, the agreement of different pieces of equipment in the same laboratory and the variation of environmental conditions with longer time intervals all depend on the degree of within-laboratory control, the intermediate measures of precision are likely to vary appreciably from laboratory to laboratory. Thus, intermediate precisions may be more characteristic of individual laboratories than of the test method.

intermediate precision conditions, n—conditions under which test results are obtained with the same test method using test units or test specimens (see Practice E 691, 2 10.3) taken at random from a single quantity of material that is as nearly homogeneous as possible, and with changing conditions such as operator, measuring equipment, location within the laboratory, and time. item, n—(1) an object or quantity of material on which a set of observations can be made: ( 2) an observed value or test result obtained from an object or quantity of material. DISCUSSION—The second usage in the definition is generally limited to generic descriptions such as in the definition of “population.” Terms such as “observation,” “measurement,” “test result,” “unit,” “value” or “yield” are more common in specific applications. A set as used here may be one or more variables.

level (of a factor), n—a given value, a specification of procedure or a specific setting of a factor. 88

NOTE 17— Version” is a general term applied both to quantitative and qualitative factors. The more restrictive term level” is frequently used to express more precisely the quantitative characteristic. For example, two versions of a catalyst may be presence and absence. Four levels of a heat 88

treatment may be 100°C, 120°C, 140°C, and 160°C.

E 1325

long term standard deviation, s LT , n—sample standard deviation of all individual (observed) values taken over a long period of time. DISCUSSION—A long period of time may be defined as shifts, weeks, or months, etc.

E 2281

lot—a definite quantity of a product or material accumulated under conditions that are considered uniform for sampling purposes. lot quality protection—a type of protection in which there is prescribed some chosen value of limiting percent defective in a lot (lot tolerance percent defective, (LTPD)) and also some chosen value for the probability (called the consumer’s risk) of accepting a submitted lot that has a percent defective E 1994 equal to the lot tolerance percent defective. lot tolerance percent defective (LTPD)—the percentage of defective units in a batch or lot for which, for purposes of acceptance sampling, the consumer wishes the probability of acceptance to be restricted to a specified low value, specifically 10 % for this practice. This is also referred to by the more general term limiting quality taken at 10 % consumer E 1994 risk. lower control limit (LCL), n —control limit for points below the central line. lower process capability index, C pkl, n—index describing process capability in relation to the lower specification limit. E 2281 lower process performance index, P pkl, n—index describing process performance in relation to the lower specification

e1

E 2281 limit. lower tolerance limit (LTL) (lower specification limit), n —a tolerance limit that defines the lower conformance boundary for an individual unit of a manufacturing or service operation. main effect, average effect, n—a term describing a measure for the comparison of the responses at each level (version) of a factor averaged over all levels (versions) of other factors in the experiment. 88

NOTE 18—The term main effect” may describe the parameter in an assumed model or the estimate of this parameter. E 1325

method of least squares, n—a technique of estimation of a parameter which minimizes (e2, where e is the difference between the observed value and the predicted value derived from the assumed model. E 1325 minimum process capability index, C pk , n—smaller of the upper process capability index and the lower process capaE 2281 bility index. minimum process performance index, P pk , n—smaller of the upper process performance index and the lower process E 2281 performance index. mixture design, n —a design in which two or more ingredients or components shall be mixed and the response is a property of the resulting mixture that does not depend upon the amount of the mixture. NOTE 19—The proportions of each of the q components ( X i) in the c mixture shall satisfy the conditions O # X i # 1 and ( X i = 1; and each i51 experimental point is defined in terms of these proportions. NOTE 20—In some fields of application the experimental mixtures are described by the terms formulation” or blend.” The use of mixture designs is appropriate for experimenting with the formulations of manufactured products, such as paints, gasoline, foods, rubber, and textiles. NOTE 21—In some applications, the proportions of the components of the mixture may vary between 0 and 100 % of the mixture ( complete domain”). In others, there may be operative restraints, so that at least one component cannot attain 0 or 100 % ( reduced domain”). E 1325 88

88

88

88

natural process limits (NPL), n—limits which include a stated fraction of the individuals in a population. NOTE 22— Natural process limits will not ordinarily be the dimensional limits shown on an engineering drawing. They are mostly used to compare the natural capability of the process to tolerance limits. DISCUSSION—For populations with a normal (Gaussian) distribution, the natural process limits ordinarily will be at 6 3 s . If placed around the standard level, these limits identify the boundaries which will include approximately 99.7 % of the individuals in a process that is properly centered and in a state of statistical control. In many circumstances (several machines making the same product that serially feed into the process) it is recognized that in addition to the variability around a single level, an acceptable zone of “standard” levels (for the different machines) is required. Then the NPL may be placed around the Acceptable Process Levels (APL) that define this zone so that the NPL identify the boundaries within which at least 99.7 % of the individuals will be included in a process located at the APL, or inside the zone. It should be noted that there is no assumption made that the process levels within the zone are random variables.

nested experiment, n—an experiment to examine the effect of two or more factors in which the same level (version) of a factor cannot be used with all levels (versions) of other

--`,,,````,,````,``,,```,,`,-`-`,,`,,`,`,,`---



E 456 – 02 E 1325 factors. Synonym: hierarchical experiment. non-conforming item, n—an item containing at least one non-conformity. DISCUSSION—The term “defective item” is also used in this context.

E 2334 observation, n—(1) the process of obtaining information regarding the presence or absence of an attribute of a test specimen, or of making a reading on a characteristic or dimension of a test specimen, or ( 2) the attribute or measurement information obtained from the process. (The term observed value” is preferred for this second usage.)

e1

Plackett-Burman designs, n—a set of screening designs using orthogonal arrays that permit evaluation of the linear effects of up to n = t − 1 factors in a study of t , treatment combinations. E 1325 population, n —the totality of items or units of material under consideration. DISCUSSION—The word “items” may be interpreted in the sense of measurements, or possible measurements, for a single characteristic, or occasionally for multiple characteristics, on all items or units of material being considered. The word “totality” may refer to items not available for inclusion in samples as well as those which are available.

E 2334

88

NOTE 23—See Annex A1.

E 2282

observation, n—the process of obtaining information regarding the presence or absence of an attribute of a test specimen, or of making a reading on a characteristic or dimension of a test specimen. NOTE 24—Observation is also associated with the attribute or measurement information obtained from the process. The term “observed value” is ` , , , ` ` ` ` , , ` ` ` ` , ` ` , , ` ` ` , , ` , ` ` , , ` , , ` , ` , , ` -

preferred for this second usage.

E 2281

observed value, n—the value obtained by carrying out the complete protocol of the test method once, being either a single test determination or an average or other specified combination of a specified number of test determinations. NOTE 25—See Annex A1.

E 2282

observed value, n—the value obtained by making an observation. E 2281 orthogonal array, n—a table of coefficients identifying the levels, or some weight associated with the levels, for each factor to be used in the analysis of specified effects, which are arranged in such a manner that each effect will be independent of the other effects. E 1325 orthogonal contrasts, n—two contrasts are orthogonal if the contrast coefficients of the two sets satisfy the condition that, when multiplied in corresponding pairs, the sum of the products is equal to zero. See contrast and contrast analysis. E 1325 outlier—see outlying observation. E 178 outlying observation, n —an observation that appears to deviate markedly in value from other members of the sample in which it appears. E 178 partially balanced incomplete block design (PBIB), n—an incomplete block design in which each block contains the same number k , of different versions from the t versions of the principal factor. NOTE 26—The arrangement is such that not all pairs of versions occur together in the same number of the blocks; some versions can therefore be compared with greater precision than others.

E 1325

partially nested experiment, n—a nested experiment in which several factors may be crossed as in factorial experiments and other factors nested within the crossed combinations. NOTE 27—It is not unusual to find that experiments consist of both factorial and nested segments. See nested experiment.


E 1325

precision, n—the closeness of agreement between independent test results obtained under stipulated conditions. NOTE 28—Precision depends on random errors and does not relate to the true value or the specified value. NOTE 29—The measure of precision usually is expressed in terms of imprecision and computed as a standard deviation of the test results. Less precision is reflected by a larger standard deviation. NOTE 30— Independent test results” means results obtained in a manner not influenced by any previous result on the same or similar test object. Quantitative measures of precision depend critically on the stipulated conditions. Repeatability and reproducibility conditions are particular sets of extreme stipulated conditions. 88

probability sample, n —a sample of which the sampling units have been selected by a chance process. At each step of selection, a specified probability of selection can be attached to each sampling unit available for selection. E 1402 probability sample, n —a sample of which the sampling units have been selected by a chance process such that, at each step of selection, a specified probability of selection can be attached to each sampling unit available for selection. NOTE 31—These probabilities of selection need not be equal. If equal, see simple random sample. See the general term—sample. Also, see Practice E 1052 in this volume.

process capability, PC, n —statistical estimate of the outcome of a characteristic from a process that has been demonstrated to be in a state of statistical control. E 2281 process capability index, C p, n —an index describing process capability in relation to specified tolerance. E 2281 process performance, PP, n—statistical measure of the outcome of a characteristic from a process that may not have been demonstrated to be in a state of statistical control. E 2281 process performance index, P p, n —index describing process performance in relation to specified tolerance. E 2281 random error of result, n—a component of the error which, in the course of a number of test results for the same characteristic, varies in an unpredictable way. randomization, n—the procedure used to allot treatments at random to the experimental units so as to provide a high degree of independence in the contributions of experimental error to estimates of treatment effects. NOTE 32—An essential element in the design of experiments is to provide estimates of effects free from biases due to undetected assignable causes within the experimental space. Randomization is a process to minimize this risk. The operational procedure for assignment at random” involves the use of random numbers or some similar method for assuring 88


E 456 – 02 that each unit has an equal chance of being selected for each treatment.

E 1325 randomized block design, n—a design in which the experiment space is subdivided into blocks of experimental units, the units within each block being more homogeneous than units in different blocks. ` , , , ` ` ` ` , , ` ` ` ` , ` ` , , ` ` ` , , ` , ` ` , , ` , , ` , ` , , ` -

NOTE 33—In each block the treatments are allocated randomly to the experimental units within each block. Replication is obtained by the use of two or more blocks, depending on the precision desired, and a separate

E 1325

randomization is made in each block.

randomized block factorial design, n—a factorial experiment run in a randomized block design in which each block includes a complete set of factorial combinations. E 1325 range, R, n—the largest observation minus the smallest E 2281 observation in a set of values or observations. repeatability, n—precision under repeatability conditions. NOTE 34—Repeatability is one of the concepts or categories of the precision of a test method. NOTE 35—Measures of repeatability defined in this compilation are repeatability standard deviation and repeatability limit.

repeatability conditions, n—conditions where independent test results are obtained with the same method on identical test items in the same laboratory by the same operator using the same equipment within short intervals of time. NOTE 36—See precision Note 3. DISCUSSION—The “same operator, same equipment” requirement means that for a particular step in the measurement process, the same combination of operator and equipment is used for every test result. Thus, one operator may prepare the test specimens, a second measure the dimensions and a third measure the mass in a test method for determining density. DISCUSSION—By “in the shortest practical period of time” is meant that the test results, at least for one material, are obtained in a time period not less than in normal testing and not so long as to permit significant change in test material, equipment or environment.

repeatability limit (r), n —the value below which the absolute difference between two individual test results obtained under repeatability conditions may be expected to occur with a probability of approximately 0.95 (95 %). NOTE 37—The repeatability limit is 2.8 ('1.96 =2 ) times the repeatability standard deviation. This multiplier is independent of the size of the interlaboratory study, as explained in Practice E 177.2 NOTE 38—The approximation to 0.95 is reasonably good (say 0.90 to 0.98) when many laboratories (30 or more) are involved, but is likely to be poor when fewer than eight laboratories are studied.

repeatability standard deviation, n—the standard deviation of test results obtained under repeatability conditions. NOTE 39—It is a measure of the dispersion of the distribution of test results under repeatability conditions. NOTE 40—Similarly, repeatability variance” and repeatability coefficient of variation” could be defined and used as measures of the dispersion of test results under repeatability conditions. DISCUSSION—In an interlaboratory study, this is the pooled standard deviation of test results obtained under repeatability conditions. See Practice E 691. DISCUSSION—The repeatability standard deviation, usually considered a property of the test method, will generally be smaller than the 88


88

e1

within-laboratory standard deviation. (See within-laboratory standard deviation.)

reproducibility, n—precision under reproducibility conditions. reproducibility conditions, n—conditions where test results are obtained with the same method on identical test items in different laboratories with different operators using different equipment. DISCUSSION— Identical material means either the same test units or test specimens are tested by all the laboratories as for a nondestructive test or test units or test specimens are taken at random from a single quantity of material that is as nearly homogeneous as possible. (See Practice E 691.) DISCUSSION—A different laboratory of necessity means a different operator, different equipment, and different location and under different supervisory control.

reproducibility limit, n—( R) the value below which the absolute difference between two test results obtained under reproducibility conditions may be expected to occur with a probability of approximately 0.95 (95 %). NOTE 41—The reproducibility limit is 2.8 ('1.96 =2 ) times the reproducibility standard deviation. The multiplier is independent of the size of the interlaboratory study (that is, of the number of laboratories participating), as explained in Practice E 177.2 NOTE 42—The approximation to 0.95 is reasonably good (say 0.90 to 0.98) when many laboratories (30 or more) are involved but is likely to be poor when fewer than eight laboratories are studied.

reproducibility standard deviation (S R), n—the standard deviation of test results obtained under reproducibility conditions. NOTE 43—Other measures of the dispersion of test results obtained under reproducibility conditions are the reproducibility variance” and the reproducibility coefficient of variation.” 88

88

NOTE 44—The reproducibility standard deviation includes, in addition to between-laboratory variability, the repeatability standard deviation and a contribution from the interaction of laboratory factors (that is, differences between operators, equipment and environments) with material factors (that is, the differences between properties of the materials other than that property of interest).

residual error, n—the difference between the observed result and the predicted value (estimated treatment response); E 1325 Observed Result minus Predicted Value. response surface, n —the pattern of predicted responses based on the empirical model derived from the experiment observations. E 1325 response variable, n—the variable that shows the observed results of an experimental treatment. Synonym dependent variable. E 1325 robustness, n—insensitivity of a statistical test to departures from underlying assumptions. DISCUSSION—Many statistical test procedures depend on the form of the assumed distribution of the population sampled to obtain exact values for the probability statements. If departures from the assumed distribution do not materially affect the decisions which would be based on the statistical tests involved, the test is considered “robust.” For example, tests based on an assumption of normality that compare averages generally are robust even though the underlying distribution of individual items in the population is not normal. On the other hand,


E 456 – 02 the F-statistic for comparing variances may be an indicator of lack of normality rather than a simple variance comparison.

ruggedness, n—insensitivity of a test method to departures from specified test or environmental conditions. DISCUSSION—An evaluation of the “ruggedness” of a test method or an empirical model derived from an experiment is useful in determining whether the results or decisions will be relatively invariant over some range of environmental variability under which the test method or the model is likely to be applied. E 1169

ruggedness test, n—a planned experiment in which environmental factors or test conditions are deliberately varied in order to evaluate the effects of such variation. DISCUSSION—Since there usually are many environmental factors that might be considered in a ruggedness test, it is customary to use a “screening” type of experiment design (see screening design) which concentrates on examining many first order effects and generally assume that second order effects such as interactions and curvature are relatively negligible. Often in evaluating the ruggedness of a test method, if there is an indication that the results of a test method are highly dependent on the levels of the environmental factors, there is a sufficient indication that certain levels of environmental factors must be included in the specifications for the test method, or even that the test method itself will need further revision. E 1169

run, n—(1) an uninterrupted sequence of occurrences of the same attribute or event in a series of observations, and ( 2) a consecutive set of successively increasing run-up or successively decreasing run-down values in a series of variable measurements. DISCUSSION—In control chart applications, some variable measurements are treated as attributes in determining runs. For example, a run might be considered a series of a specified number of consecutive points above or below the central line.3

sample, n—a group of items, observations, test results, or portions of material, taken from a large collection of items, observations, test results, or quantities of material, which serves to provide information that may be used as a basis for making a decision concerning the larger collection. DISCUSSION—The sample may be the units of material themselves or the set of the observations collected from them. The decision may or may not involve taking action on the units of material, or on the process. It is necessary to describe whether the sample is to be selected on a simple random, a stratified random, or other specified basis. Probability samples, that is, samples selected by chance using appropriate randomization, are required to make confidence interval statements and similar statistical inferences about the parameters of the sampled population. E 2334

sample size, n—the number of units in a sample or the number of observations in a sample. E 2334 sampling fraction, f, n —the ratio f of the number of sampling units selected for the sample to the number of sampling units available. NOTE 45—For the simple random sample case, f = n/N where n is the sample size and N is the number of sampling units available. When f >

3

Other examples may be found in references such as Nelson, L. S., “Interpreting ¯ Control Charts,” Journal of Quality Technology , Vol 17, No. 2, April Shewhart X 1985.

e1

0.10 estimation of the precision of an estimator should take account of this magnitude of f. See finite population correction.

sampling with replacement, n—a procedure used with some probability sampling plans in which a selected unit is replaced after any step in selection so that this sampling unit is available for selection again at the next step of selection, or at any other succeeding step of the sample selection procedure. screening design, n—a balanced design, requiring relatively minimal amount of experimentation, to evaluate the lower order effects of a relatively large number of factors in terms of contributions to variability or in terms of estimates of parameters for a model. NOTE 46—In screening designs, the term lower order effects is sometimes limited to first order terms such as linear components of main effects, but often includes both first order terms and second order terms such as two factor interactions and quadratic curvature components of main effects. E 1325

short term standard deviation, sST , n—the inherent variation present when a process is operating in a state of statistical control, expressed in terms of standard deviation. DISCUSSION—This may also be stated as the inherent process variation. E 2281

significant digit, n—any of the figures 0 through 9, except leading zeros and some trailing zeros, which is used with its place value to denote a numerical quantity to some desired approximation NOTE 47—The digit zero may either indicate a specific value or indicate place only. Zeros leading the first nonzero digit of a number indicate order of magnitude only and are not significant digits. For example, the number 0.0034 has two significant digits. Zeros trailing the last nonzero digit for numbers represented with a decimal point are significant digits. For example, the numbers 1270. and 32.00 each have four significant digits. The significance of trailing zeros for numbers represented without use of a decimal point can only be identified from knowledge of the source of the value. For example, a modulus strength, stated as 140 000 Pa, may have as few as two or as many as six significant digits. To eliminate ambiguity, the exponential notation may be used. Thus, 1.40 3 105 indicates that the modulus is reported to the nearest 0.013 10 5 or 1000 Pa. Use of appropriate SI prefixes is recommended for metric units to reduce the need for trailing zeros of uncertain significance. Thus, 140 kPa and 0.140 MPa each indicate that the modulus is reported to the nearest 1 kPa or 1000 Pa, while 140 kPa may again have two or three significant digits. E 29

special cause, n —source of intermittent variation in a process. ISO 3534-2 DISCUSSION—Sometimes “special cause” is taken to be synonymous with “assignable cause.” However a distinction should be recognized. A special cause is assignable only when it is specifically identified. Also a common cause may be assignable. DISCUSSION—A special cause arises because of specific circumstances which are not always present. As such, in a process subject to special causes, the magnitude of the variation from time to time is unpredictable. E 2281

specification limits, n— see tolerance limits. stable process, n—process in a state of statistical control; process condition when all special causes of variation have

--`,,,````,,````,``,,```,,`,-`-`,,`,,`,`,,`---



E 456 – 02 been removed.

ISO 3534-2

DISCUSSION—Observed variation can then be attributed to random (common) causes. Such a process will generally behave as though the results are simple random samples from the same population. DISCUSSION—This state does not imply that the random variation is large or small, within or outside of specification, but rather that the variation is predictable using statistical techniques. DISCUSSION—The process capability of a stable process is usually improved by fundamental changes that reduce or remove some of the random causes present and/or adjusting the mean towards the preferred value. ` , , , ` ` ` ` , , ` ` ` ` , ` ` , , ` ` ` , , ` , ` ` , , ` , , ` , ` , , ` -

DISCUSSION—Continual adjustment of a stable process will increase variation.

E 2281

staggered nested experiment, n—a nested experiment in which the nested factors are run within only a subset of the versions of the first or succeeding factors. E 1325 standard deviation, n —the most usual measure of the dispersion of observed values or results expressed as the positive square root of the variance. statistic, n—a quantity calculated from a sample of observations, most often to form an estimate of some population parameter. statistical measure, n —statistic or mathematical function of a statistic. DISCUSSION—The word statistical emphasizes that measures are subject to inherent errors and that, in estimating a population parameter, they represent a sample, with inherent sampling variability.

statistical procedures, n—the organized techniques and methods used to collect, analyze, and interpret data. DISCUSSION—Statistical procedures include the sampling considerations or the experiment design for the collection of data, or both, and the numerical and graphical approaches to summarize and analyze the collected data.

E 1488

subgroup, n—(1) object sense, n—a set of units or quantity of material obtained by subdividing a larger group of units or quantity of material, and ( 2) measurement sense, n—a set of groups of observations obtained by subdividing a larger group of observations. See rational subgroup. systematic error of result, n—a component of the error, which in the course of a number of test results for the same characteristic, remains constant or varies in a predictable way. NOTE 48—Systematic errors and their causes may be known or unknown.

systematic sampling, n—sample selection procedure in which every kth element is selected from the universe or population; for example, u, u + k, u + 2k, u + 3k, etc., where u is in the interval 1 to k. DISCUSSION—If k = 20 and u = 7 is the initial unit selected, then sampling units 7, 27, 47, 67, ..., would comprise the sample. When N/k is not an integer, there is a small bias due to the end effect. When u is selected by a chance process and N/k is an integer, the systematic sample will provide unbiased estimates of the population average or total. Situations for which N/k is not an integer usually ignore the small or negligible bias in estimating the mean or total. Schemes have been


e1

developed for non-integer N/k to overcome sampling bias. See Jessen.4 Estimation of the precision of an average computed from a systematic sample is a difficult problem that has no generally satisfactory solution. Independent replicate systematic samples provide an approach to variance estimation, but have been rejected by some writers. In some ASTM situations where replicate samples may be obtained on a routine basis, the technique may be useful. See Cochran5 for an extended discussion of variance estimation for systematic sampling.

test determination, n—(1) the process of deriving from one or more test observations (observed values) the presence or absence of an attribute or the value of a characteristic or dimension of a single test specimen, or ( 2) the attribute (presence or absence) or value derived from the process (see test specimen ). NOTE 49—See Annex A1.

test determination, n —the value of a characteristic or dimension of a single test specimen derived from one or more E 2282 observed values. test method, n—a definitive procedure that produces a test result. DISCUSSION—Examples of test methods include, but are not limited to: identification, measurement, and evaluation of one or more qualities, characteristics, or properties. [ASTM Regulations 2.2.6]

E 2282 test observation, n— see observation. test result, n—the value of a characteristic obtained by carrying out a specified test method. NOTE 50—The test method should specify that one or a number of individual observations be made and their average or another appropriate function, such as the medium or the standard deviation, be reported as the test result. It also may require standard corrections to be applied, such as correction of gas volumes to standard temperature and pressure. A test result, therefore, can be a result calculated from several observed values. In the simple case, the test result is the observed value itself. E 2282

test specimen, n —the portion of a test unit needed to obtain a single test determination. 88

NOTE 51—When used for a physical test, this is sometimes called test piece.” For a chemical test, it is sometimes called test portion or test sample. For optical and other tests, it is also sometimes called test sample. In interlaboratory evaluation of test methods and other statistical procedures, it is best to reserve the word sample for the whole amount of material involved and not the individual test specimens, pieces or portions being tested. NOTE 52—See Annex A1. E 2282

test unit, n—the total quantity of material (containing one or more test specimens) needed to obtain a test result as specified in the test method. See test result. E 2282 tolerance limits (specification limits), n—limits that define the conformance boundaries for an individual unit of a manufacturing or service operation. DISCUSSION—Limits may be established either with or without the use

4

Jessen, R. J., “Statistical Survey Techniques,” John Wiley & Sons, Inc., New York, 1978, Sec. 12.2. 5 Cochran, W. G., “Sampling Techniques,” John Wiley & Sons, Inc., New York, 1977, Chapter 8.


E 456 – 02 of probability considerations. Tolerance limits may be in the form of a single (unilateral) limit (upper or lower) or double (bilateral) limits (upper and lower). Double, or two-sided limits occur more frequently. Double limits are often stated as a symmetrical deviation from a stated value, but they need not be symmetrical. Frequently the term specification limits is used instead of tolerance limits. While tolerance limits is generally preferred in terms of evaluating the manufacturing or service requirements, specification limits may be more appropriate for categorizing material, product, or service in terms of their stated requirements.

tolerance specification, n—the total allowable variation around a level or state (upper limit minus lower limit), or the maximum acceptable excursion of a characteristic. DISCUSSION—The determination of the amount of variation to be allowed involves the product or service requirements and consideration of process capability (see natural process limits), measurement variability, and other appropriate elements or some compromise among these.

e1

upper control limit (UCL), n —control limit for points above the central line. upper process capability index, C pku, n—index describing process capability in relation to the upper specification limit. E 2281 upper process performance index (P pku), n—index describing process performance in relation to the upper specification E 2281 limit. upper tolerance limit (UTL) (upper specification limit), n—a tolerance limit applicable to the upper conformance boundary for an individual unit of a manufacturing or service operation. variables, method of, n—measurement of quality by the method of variables consists of measuring and recording the numerical magnitude of a quality characteristic for each of the units in the group under consideration. NOTE 57—This involves reference to a continuous scale of some kind.

treatment, n—a combination of the levels (versions) of each of the factors assigned to an experimental unit, synonym treatment combination. E 1325 treatment combination, n— see treatment. E 1325 trueness, n —the closeness of agreement between the population mean of the measurements or test results and the accepted reference value.

variables data, n—measurements which vary and may take any of a specified set of numerical values.

NOTE 53—The measure of trueness usually is expressed in terms of bias. Greater bias means less favorable trueness. NOTE 54— Population mean” is, conceptually, the average value of an indefinitely large number of test results. NOTE 55—Trueness is the systematic component of accuracy.

variance, n —a measure of the squared dispersion of observed values or measurements expressed as a function of the sum of the squared deviations from the population mean or sample average.

88

uncertainty, n—an indication of the variability associated with a measured value that takes into account two major components of error: ( 1) bias, and (2) the random error attributed to the imprecision of the measurement process. DISCUSSION—Quantitative measures of uncertainty generally require descriptive statements of explanation because of differing traditions of usage and because of differing circumstances. For example: (1) the bias and imprecision may both be negligible; (2) the bias may not be negligible while the imprecision is negligible; (3) neither the bias nor the imprecision may be negligible; (4) the bias may be negligible while the imprecision is not negligible.

unit, n—an object on which a measurement or observation may be made. DISCUSSION—The word “unit” is commonly used in the sense of a unit of product (service, etc.)—the entity of product inspected in order to determine its classification or its measurements. This entity may be a single article, a set of like articles treated collectively, a subassembly, a stated quantity of material, etc. The unit of product or service need not be the same as the unit of purchase, supply, production, or shipment.

universe (population), n—the totality of the set of items, units, or measurements, etc., real or conceptual, that is under consideration. NOTE 56—This definition of universe is being revised to incorporate the concept of including one or more populations. Use with caution.


DISCUSSION—The term “random variable” or “variate” is often used to indicate that each of the specified set of values is associated with a specified relative frequency or probability, and that each is a random sample from a continuous or a discrete, or discontinuous, population encompassing the specified values.

NOTE 58—The sample variance, or variance of a sample of n observed values, is computed as s2 = [1/(n − 1)][(( yi − y¯ )2]. The sample standard deviation s is the positive square root of the sample variance. The population variance s2 = * R ( y − µ)2 f ( y)dy, where R is the region over which the random variable y is defined, and where f ( y) is the probability density function and µ is the population mean of y. The population standard deviation (s) is the positive square root of the population variance. DISCUSSION—A listing of the sample variance s2 should always be accompanied by the degrees of freedom on which it is based. The degrees of freedom for the sample variance described above are (n − 1).

within-laboratory standard deviation, n—the standard deviation of test results obtained within a laboratory for a single material under conditions that may include such elements as different operators, equipment, and longer time intervals. NOTE 59—Because the training of operators, the agreement of different pieces of equipment in the same laboratory and the variation of environmental conditions with longer time intervals depend on the degree of within-laboratory control, the within-laboratory standard deviation is likely to vary appreciably from laboratory to laboratory.

Youden square, n—a type of block design derived from certain Latin squares by deleting, or adding, rows (or columns) so that one block factor remains complete blocks and the second block factor constitutes balanced incomplete E 1325 blocks.


` , , ` , ` , , ` , , ` ` , ` , , ` ` ` , , ` ` , ` ` ` ` , , ` ` ` ` , , , ` -

E 456 – 02

e1

ANNEX (Mandatory Information) A1. MEASUREMENT TERMINOLOGY

A1.1 A test method often has three distinct stages: ( 1) the direct observation of dimensions or characteristics, ( 2) the combining of the observed values to obtain a single test determination, and ( 3) the combining of a number of test determinations to obtain the test result of the test method. The term measurement may be applied to any one or more of these stages of the measurement process.

test method specifies that only one test determination is to be made, then the test determination value is the test result of the test method. Some test methods require that several determinations be made and the values obtained be averaged or otherwise combined to obtain the test result of the test method. Averaging of several determinations is often used to reduce the effect of local variations of the property within the material.

A1.2 In the simplest of test methods a single direct observation is also the test determination and the test result. For example, a test observation required by a test method may be the mass of a test specimen prepared and weighed in a specified way. The observation would also be the test determination of the mass of the test specimen, and if only one specimen is to be weighed, the observed weight would also be the test result of the test method. Another test method may require the measurement of the area of the test specimen as well as the mass, and then direct that the mass be divided by the area to obtain the mass per unit area of the test specimen. The whole process of measuring the mass and the area and calculating the mass per unit area is a test determination. If the

A1.3 Precision statements for ASTM test methods are usually based on test results, not test determinations or observations. If for some compelling reason an ASTM committee wished to address the issue of variation between test determinations (in addition to the variation among test results), the committee can do so with a clear declaration (of what is being done) to avoid confusion. Sampling plans and product specifications should specify the sample size in terms of the number of replicate test results. A test method should specify the required observations to obtain a test determination and the number of test determinations to be averaged or otherwise combined to obtain a single test result.

` , , , ` ` ` ` , , ` ` ` ` , ` ` , , ` ` ` , , ` , ` ` , , ` , , ` , ` , , ` -

ASTM International takes no position respecting the validity of any patent rights asserted in connection with any item mentioned in this standard. Users of this standard are expressly advised that determination of the validity of any such patent rights, and the risk of infringement of such rights, are entirely their own responsibility. This standard is subject to revision at any time by the responsible technical committee and must be reviewed every five years and if not revised, either reapproved or withdrawn. Your comments are invited either for revision of this standard or for additional standards and should be addressed to ASTM International Headquarters. Your comments will receive careful consideration at a meeting of the responsible technical committee, which you may attend. If you feel that your comments have not received a fair hearing you should make your views known to the ASTM Committee on Standards, at the address shown below. This standard is copyrighted by ASTM International, 100 Barr Harbor Drive, PO Box C700, West Conshohocken, PA 19428-2959, United States. Individual reprints (single or multiple copies) of this standard may be obtained by contacting ASTM at the above address or at 610-832-9585 (phone), 610-832-9555 (fax), or [email protected] (e-mail); or through the ASTM website (www.astm.org).



ASTM E456-02 R04.pdf

Recommend Documents