Thursday 27. September 2001
I will provide a historical perspective of chemometrics (and other -metrics), starting already in the 19th century, talking in particular about the interplay between statistics and problems from chemistry, but also about some lack of interplay in more recent years. A person named Gosset will be given by me an important role in the talk.
I will also talk about the character of the statistical problems of modern chemometrics, why statisticians should study them and will benefit from it, and perhaps some statistical challenges. In particular I certainly cannot avoid discussing the (in-)famous PLS, as one important theme or example. Another question to be discussed is the role of chemometrics outside applications in chemistry.
A well-known problem in two-way analysis is the rotation freedom in bilinear models. This problem has been known for decades and is especially problematic in situations where identification of the underlying bilinear parameters is sought. It can be shown that the rotational freedom does not exist for some three-way models and the implication of this fundamental property is illustrated with some practical problems.
For example, it will be shown that it is possible to separate, mathematically, measured signals of mixtures (e.g. fluorescence measurements of food samples or cellular phone signals received at antenna) with the use of such multi-way models. Thus, both quantitative and qualitative analysis is possible directly from measured mixtures.
Finally, some attention will be paid to the evident lack of knowledge on the statistical properties of these models.
There is no such thing as "THE BEST" regression method. PLS Regression is just one among many good methods. But to my knowledge, there is no other statistical method with comparable versatility (Martens H. & Martens M. 2001). When compared to competing regression methods, each optimised properly, the PLS Regression seems invariably to come out among the best ones, with respect to statistical predictive ability. Therefore, it is particularly well suited for non-statisticians - researchers who want to use their important contextual knowledge during the actual data analysis, and who cannot take the time to learn many different, abstract statistical methods.
The PLS Regression arose from unfulfilled data-analytical needs and in frustration over traditional statistics. It came about through the close collaboration between the statistician Herman Wold and his son, the chemist Svante Wold, combined with an intense co-operation between Svante and myself as chemometricians.
The development of the PLS Regression took place in the beginning of the 80-ies, between two widely different but sometimes equally suffocating scientific cultures: 1) The mathematical modelling in traditional chemistry and physics, focusing on hard causal models. 2) The parameter estimation of traditional statistical modelling, focusing too much on distribution theory and hypothesis testing and too little on the discovery process in multivariate data.
The PLS Regression (PLSR) is just one out of at least three disjoint methods called "PLS" - all based on Herman Wold's original work. The bi-linear PLSR has proven useful in a number of scientific fields, ranging from chemistry and biotechnology to psychology and marketing, for a wide range of applications (calibration of instruments, reduced-rank regression/prediction, symmetrical and assymmetrical classification/discrimination. When stabilized in certain details (internal ridging etc), and with bi-linear jack-knifing at various levels, it provides a more graphically accessible significance assessment with the potential of simplifying traditional ANOVA/MANOVA/Covariance analysis, in fixed as in mixed models.
References:
Martens H. and Næs T. (1989) Multivariate Calibration. J. Wiley & Sons, Ltd., 450 pages. (Citation Index: p.t. >1500)
Martens H. and Martens M. (2001) Multivariate Analysis of Quality. An Introduction. J. Wiley & Sons, Ltd., 440 pages
Wold, S., Martens, H. and Wold, H. 1983: The Multivariate Calibration Problem in Chemistry solved by the PLS Method. Proc. Conf. Matrix Pencils, (A. Ruhe and B. Kågström, eds.), March 1982, Lecture Notes in Mathematics, Springer Verlag, Heidelberg, 286-293.