Regressions and calibration curves

by admin

Important : This article, and therefore, the use of the tool for curves of MCM Alchimia does not take into account the assumptions related to the application of Least Squares, such as normality, linearity, homoscedasticity, and independence. The study and validation of these assumptions allow us to know the quality and applicability of the linear representation from the input data, however MCM Alchimia, allows to connect curves constructed from any input data set performing unplaced least squares and does not ensure or control compliance with these assumptions.

It is common to find trials, analytical methods or calibrations that are based on the measurement at different levels of the analyte or in different positions of the reading scale, when it is an instrument. In this way, in most cases a linear relation between them is considered, obtaining by the least squares method the coefficients of the equation of the line:

In this way when it comes to equipment calibration and the correction is plotted according to the reading scale, it is easy to obtain this value by applying the equation above, from the coefficients obtained by least squares and the reading.

But it is also usual especially in the area of ​​chemistry to construct the calibration curve with certified reference materials of known concentration and then use this curve to obtain the concentration of an analyte from an observed response.

In the least squares analysis we have three types of uncertainty associated with the curve.

  • Random errors in reference responses yi.
  • Random effects on reference values ​​ xi .
  • Effect associated with the presumption of linearity, which will be given by the waste.

If the values ​​ xi and yi do not present uncertainty, the curve obtained will be the one that minimizes the squares of the waste. In case of random errors in reference values ​​ xi and yi are greater than zero, it is possible to generate pseudo-randomizations with the distribution associated to each parameter, obtaining by least squares a curve for each data set. This generates an additional uncertainty in the regression coefficients, in addition to the uncertainty determined by the residuals.

In the graph below, the beam of points that each observation would represent if there is a significant uncertainty in both axes is presented symbolically. This situation therefore yields a bundle of curves that satisfy the least squares of the residuals, generating uncertainty in both regression coefficients. This uncertainty is marked with blue in the graph.

Furthermore, this uncertainty will be increased by the uncertainty inherent in the curve, which is determined as a function of the standard deviation of the residual errors. This uncertainty, in general, is greater than that of the simulation and is represented by the orange zone in the graph.

What does MCM Alchimia software do when connecting a curve?

After all the values ​​are entered into the panel, in order to use the curve in our model, we must “connect” the curve with the button provided for that purpose. When we do this, MCM Alchimia performs a simulation with the data set, performing a random sampling with the input data and the response. For each of the iterations, the regression coefficients b 0 and b 1 are obtained, and a value of the coefficient of determination R 2 .

As in the traditional simulation, a population of data for these parameters with unknown distribution is obtained, and will depend on the distributions and values ​​of the input data.

In order not to make assumptions or inadequate approximations about these statistical properties, MCM Alchimia keeps the set of values ​​in vectors associated with these random variables, which will be available throughout the process.

On the other hand, with the average values of each pair of input data, the application calculates the uncertainty of the curve, as a function of the regression residuals.

This repository of sampling data will be used according to the need of our model, either to use the coefficients b0 and b1 with their uncertainty, to consider the uncertainty of the curve or to use the interpolation functions that will be described later.