Development and comparison of regression models for the determination of quality parameters in margarine spread samples using NIR spectroscopy

Anita Rácz, Marietta Fodor, K. Heberger

Research output: Contribution to journalArticle

1 Citation (Scopus)

Abstract

Fat and dry material contents (connected to moisture) are one of the most important parameters in the quality control of butter, margarine and margarine spreads (dairy spreads). More than a hundred margarine samples were used to model their fat and dry material content based on Fourier transform-near infrared (FT-NIR) spectroscopy in transmission and reflectance modes for the quality control of margarine. We also carried out a systematic comparison of various modeling techniques such as PLS regression, principal component regression (PCR) and support vector machines (SVM). Moreover, three types of cross-validation, three types of variable selection and the effect of different spectral types (transmission and reflectance) were also compared with factorial ANOVA tests. We examined the effect of the applied datasets (calibration, test samples, and both sets) based on the original predicted values. Sum of ranking differences (SRD), a novel comparison tool, was applied for the task. We showed that the SRD values can be used as a promising and useful performance parameter for the ranking and evaluation of numerous regression models. Four datasets with 42-42 transmission and 34-34 reflectance models were used for the evaluations. Finally, we have found the best models in each case based on their SRD values. The properly validated SVM models proved to be the best for all of the four used datasets. Although the method comparison is data set dependent, the suggested methodology is applicable generally and unambiguously. These final models can be used for fast and easy quality control of margarine samples instead of the time-consuming original analytical techniques.

Original languageEnglish
Pages (from-to)3089-3099
Number of pages11
JournalAnalytical Methods
Volume10
Issue number25
DOIs
Publication statusPublished - Jul 7 2018

Fingerprint

Spectroscopy
Quality control
Oils and fats
Support vector machines
Fats
Near infrared spectroscopy
Dairies
Analysis of variance (ANOVA)
Fourier transforms
Moisture
Calibration

ASJC Scopus subject areas

  • Analytical Chemistry
  • Chemical Engineering(all)
  • Engineering(all)

Cite this

Development and comparison of regression models for the determination of quality parameters in margarine spread samples using NIR spectroscopy. / Rácz, Anita; Fodor, Marietta; Heberger, K.

In: Analytical Methods, Vol. 10, No. 25, 07.07.2018, p. 3089-3099.

Research output: Contribution to journalArticle

@article{9ce691b32ead4269a7fb73f8c8c11b64,
title = "Development and comparison of regression models for the determination of quality parameters in margarine spread samples using NIR spectroscopy",
abstract = "Fat and dry material contents (connected to moisture) are one of the most important parameters in the quality control of butter, margarine and margarine spreads (dairy spreads). More than a hundred margarine samples were used to model their fat and dry material content based on Fourier transform-near infrared (FT-NIR) spectroscopy in transmission and reflectance modes for the quality control of margarine. We also carried out a systematic comparison of various modeling techniques such as PLS regression, principal component regression (PCR) and support vector machines (SVM). Moreover, three types of cross-validation, three types of variable selection and the effect of different spectral types (transmission and reflectance) were also compared with factorial ANOVA tests. We examined the effect of the applied datasets (calibration, test samples, and both sets) based on the original predicted values. Sum of ranking differences (SRD), a novel comparison tool, was applied for the task. We showed that the SRD values can be used as a promising and useful performance parameter for the ranking and evaluation of numerous regression models. Four datasets with 42-42 transmission and 34-34 reflectance models were used for the evaluations. Finally, we have found the best models in each case based on their SRD values. The properly validated SVM models proved to be the best for all of the four used datasets. Although the method comparison is data set dependent, the suggested methodology is applicable generally and unambiguously. These final models can be used for fast and easy quality control of margarine samples instead of the time-consuming original analytical techniques.",
author = "Anita R{\'a}cz and Marietta Fodor and K. Heberger",
year = "2018",
month = "7",
day = "7",
doi = "10.1039/c8ay01055b",
language = "English",
volume = "10",
pages = "3089--3099",
journal = "Analytical Methods",
issn = "1759-9660",
publisher = "Royal Society of Chemistry",
number = "25",

}

TY - JOUR

T1 - Development and comparison of regression models for the determination of quality parameters in margarine spread samples using NIR spectroscopy

AU - Rácz, Anita

AU - Fodor, Marietta

AU - Heberger, K.

PY - 2018/7/7

Y1 - 2018/7/7

N2 - Fat and dry material contents (connected to moisture) are one of the most important parameters in the quality control of butter, margarine and margarine spreads (dairy spreads). More than a hundred margarine samples were used to model their fat and dry material content based on Fourier transform-near infrared (FT-NIR) spectroscopy in transmission and reflectance modes for the quality control of margarine. We also carried out a systematic comparison of various modeling techniques such as PLS regression, principal component regression (PCR) and support vector machines (SVM). Moreover, three types of cross-validation, three types of variable selection and the effect of different spectral types (transmission and reflectance) were also compared with factorial ANOVA tests. We examined the effect of the applied datasets (calibration, test samples, and both sets) based on the original predicted values. Sum of ranking differences (SRD), a novel comparison tool, was applied for the task. We showed that the SRD values can be used as a promising and useful performance parameter for the ranking and evaluation of numerous regression models. Four datasets with 42-42 transmission and 34-34 reflectance models were used for the evaluations. Finally, we have found the best models in each case based on their SRD values. The properly validated SVM models proved to be the best for all of the four used datasets. Although the method comparison is data set dependent, the suggested methodology is applicable generally and unambiguously. These final models can be used for fast and easy quality control of margarine samples instead of the time-consuming original analytical techniques.

AB - Fat and dry material contents (connected to moisture) are one of the most important parameters in the quality control of butter, margarine and margarine spreads (dairy spreads). More than a hundred margarine samples were used to model their fat and dry material content based on Fourier transform-near infrared (FT-NIR) spectroscopy in transmission and reflectance modes for the quality control of margarine. We also carried out a systematic comparison of various modeling techniques such as PLS regression, principal component regression (PCR) and support vector machines (SVM). Moreover, three types of cross-validation, three types of variable selection and the effect of different spectral types (transmission and reflectance) were also compared with factorial ANOVA tests. We examined the effect of the applied datasets (calibration, test samples, and both sets) based on the original predicted values. Sum of ranking differences (SRD), a novel comparison tool, was applied for the task. We showed that the SRD values can be used as a promising and useful performance parameter for the ranking and evaluation of numerous regression models. Four datasets with 42-42 transmission and 34-34 reflectance models were used for the evaluations. Finally, we have found the best models in each case based on their SRD values. The properly validated SVM models proved to be the best for all of the four used datasets. Although the method comparison is data set dependent, the suggested methodology is applicable generally and unambiguously. These final models can be used for fast and easy quality control of margarine samples instead of the time-consuming original analytical techniques.

UR - http://www.scopus.com/inward/record.url?scp=85049237390&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85049237390&partnerID=8YFLogxK

U2 - 10.1039/c8ay01055b

DO - 10.1039/c8ay01055b

M3 - Article

AN - SCOPUS:85049237390

VL - 10

SP - 3089

EP - 3099

JO - Analytical Methods

JF - Analytical Methods

SN - 1759-9660

IS - 25

ER -