Lab Corner: Z-scores

By Mike Anderson, P.E.

As a little niño, I loved watching the old “Zorro” television show. Walt Disney Productions…not surprisingly. Zorro was a nobleman pretending to be self-absorbed with his wealthy lifestyle but secretly donning a mask to hide his appearance (a ploy that I don’t think will work in today’s CSI-crazed society, by the way) while he fights corruption and injustice with a foil and slashes Z shapes into wood, clothes and skin to leave his mark. Way cool. Probably responsible for my otherwise inexplicable love of the “¡Three Amigos!” and “Nacho Libre.”

If I see “z” today, I think of something a little different…“z-scores.” Ah, to be young again.

Z-scores are a way of comparing an observed result to the mean by considering the standard deviation (a term that captures how spread out the data is within a set of values). Statisticians are probably berating me now for oversimplifying the explanation. The rest of you are probably thinking “that was oversimplifying it?”

Basically, a z-score tells you how far your data is from the mean, using the standard deviation as the unit of measure. A z-score of 2 means that your data is two standard deviations from the mean. The “+” or “-” sign tells you whether the data is higher or lower than the mean. It is an expedient way of reviewing your data to let you know where a problem might exist.
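For readers who like to see the arithmetic, here is a minimal Python sketch of that calculation. The function name `z_score` and the numbers fed into it are made up for illustration; they do not come from any PSP report.

```python
def z_score(result, mean, std_dev):
    """Return how many standard deviations `result` lies from `mean`.

    A positive value means the result is above the mean,
    a negative value means it is below.
    """
    if std_dev <= 0:
        raise ValueError("standard deviation must be positive")
    return (result - mean) / std_dev


# Hypothetical numbers, for illustration only.
print(z_score(result=104.0, mean=100.0, std_dev=2.0))  # 2.0  (two standard deviations above)
print(z_score(result=97.0, mean=100.0, std_dev=2.0))   # -1.5 (one and a half below)
```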

AASHTO re:source (formerly AMRL) provides this information in its Proficiency Sample Program (PSP) reports, but converts it to a numerical rating. In this rating system, a “5” is great, indicating that your data is within 1 standard deviation of the mean, and a “0” is poor, indicating that your data is more than 3 standard deviations from the mean. The rating also carries a sign: negative when your result is below the mean, positive when it is above.

The z-score matters, but so does the standard deviation. Consider two tests where the average is the same, but the standard deviation is different. Your lab data is represented by the circle in the following figures. Even though your data has exactly the same deviation from the mean in Figure 1 as it does in Figure 2, in one case you would have a z-score of approximately -1 (a rating of “-4” in the AASHTO re:source PSP system) and in the other case you would have a z-score of approximately -2 (a rating of “-2” in the AASHTO re:source PSP system).
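The rating conversion and the two-figure comparison can be sketched in Python as follows. Only the two ends of the scale (within 1 standard deviation for a 5, beyond 3 for a 0) are stated here; the half-standard-deviation steps used for ratings 4 through 1 are an assumption, chosen because they agree with the two examples above. The function name `psp_rating` and all of the numbers are hypothetical.

```python
def psp_rating(z):
    """Convert a z-score to a signed 0-to-5 rating.

    Stated in the article: |z| <= 1 earns a 5, |z| > 3 earns a 0.
    ASSUMPTION: ratings 4, 3, 2, 1 step down every half standard deviation.
    """
    bands = [(1.0, 5), (1.5, 4), (2.0, 3), (2.5, 2), (3.0, 1)]  # (upper |z| limit, rating)
    magnitude = 0  # anything beyond 3 standard deviations
    for upper_limit, rating in bands:
        if abs(z) <= upper_limit:
            magnitude = rating
            break
    # The sign shows whether the result is above (+) or below (-) the mean.
    return magnitude if z >= 0 else -magnitude


# Hypothetical numbers: same mean and same lab result in both tests,
# but the second test has half the standard deviation of the first.
mean, lab_result = 100.0, 89.0
z_fig1 = (lab_result - mean) / 10.0  # wider spread
z_fig2 = (lab_result - mean) / 5.0   # tighter spread

print(z_fig1, psp_rating(z_fig1))  # -1.1 -4
print(z_fig2, psp_rating(z_fig2))  # -2.2 -2
```

The lab result sits 11 units below the mean in both cases. Only the spread of the other labs changed, and that alone moved the rating from -4 to -2.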

The concern with any new test, like the Multiple-Stress Creep Recovery (MSCR) test, is how tight the standard deviation is. The lower the standard deviation, the more confidence the supplier or agency can have in the value of the result. As an example, the table shows data from PG Asphalt Binder Sample 245.

The data for the MSCR Jnr3.2 and RTFO G*/sin δ tests indicate that the lab would have received a rating of 5 in each case. For Jnr3.2, the lab result is lower than the mean by approximately 0.144 kPa⁻¹ (rating of -5). For G*/sin δ, the lab result is higher than the mean by approximately 0.159 kPa (rating of +5).

Two observations:

First, this data makes sense, as lower values of Jnr3.2 should correspond with higher values of G*/sin δ.

Second, even though the lab data shows approximately the same deviation from the mean for Jnr3.2 and G*/sin δ, the higher standard deviation for the Jnr3.2 test means that the lab z-score is smaller in magnitude, indicating that the result is, relatively speaking, closer to the mean.

The takeaway – it isn’t enough to see great scores on your PSP report. You need to look further into standard deviation to really understand testing variability. The lower the standard deviation, the more confidence you can have in the test result.

When you get your results, please note that we don’t recommend allowing your technicians to slash your lab number into the wall.

Anderson is the Director of Research and Laboratory Services for the Asphalt Institute.