Machine Learning Approaches Applied to GC-FID Fatty Acid Profiles to Discriminate Wild from Farmed Salmon.

Foods

Centro de Investigação de Montanha (CIMO), Instituto Politécnico de Bragança, Campus de Sta. Apolónia, 5301-857 Bragança, Portugal.

Published: November 2020


Category Ranking

98%

Total Visits

921

Avg Visit Duration

2 minutes

Citations

20

Article Abstract

In the last decade, there has been an increasing demand for wild-captured fish, which attains higher prices compared to farmed species, thus being prone to mislabeling practices. In this work, fatty acid composition coupled to advanced chemometrics was used to discriminate wild from farmed salmon. The lipids extracted from salmon muscles of different production methods and origins (26 wild from Canada, 25 farmed from Canada, 24 farmed from Chile and 25 farmed from Norway) were analyzed by gas chromatography with flame ionization detector (GC-FID). All the tested chemometric approaches, namely principal components analysis (PCA), t-distributed stochastic neighbor embedding (t-SNE) and seven machine learning classifiers, namely k-nearest neighbors (kNN), decision tree, support vector machine (SVM), random forest, artificial neural networks (ANN), naïve Bayes and AdaBoost, allowed for differentiation between farmed and wild salmons using the 17 features obtained from chemical analysis. PCA did not allow clear distinguishing between salmon geographical origin since farmed samples from Canada and Chile overlapped. Nevertheless, using the 17 features in the models, six out of the seven tested machine learning classifiers allowed a classification accuracy of ≥99%, with ANN, naïve Bayes, random forest, SVM and kNN presenting 100% accuracy on the test dataset. The classification models were also assayed using only the best features selected by a reduction algorithm and the best input features mapped by t-SNE. The classifier kNN provided the best discrimination results because it correctly classified all samples according to production method and origin, ultimately using only the three most important features (16:0, 18:2n6c and 20:3n3 + 20:4n6). In general, the classifiers presented good generalization with the herein proposed approach being simple and presenting the advantage of requiring only common equipment existing in most labs.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7695029PMC
http://dx.doi.org/10.3390/foods9111622DOI Listing

Publication Analysis

Top Keywords

machine learning
12
fatty acid
8
discriminate wild
8
farmed
8
wild farmed
8
farmed salmon
8
canada farmed
8
analysis pca
8
learning classifiers
8
random forest
8

Similar Publications

Preclinical stroke research faces a critical translational gap, with animal studies failing to reliably predict clinical efficacy. To address this, the field is moving toward rigorous, multicenter preclinical randomized controlled trials (mpRCTs) that mimic phase 3 clinical trials in several key components. This collective statement, derived from experts involved in mpRCTs, outlines considerations for designing and executing such trials.

View Article and Find Full Text PDF

Background: Subcellular localisation is a determining factor of protein function. Mass spectrometry-based correlation profiling experiments facilitate the classification of protein subcellular localisation on a proteome-wide scale. In turn, static localisations can be compared across conditions to identify differential protein localisation events.

View Article and Find Full Text PDF

To address the technical challenges associated with determining the chronological order of overlapping stamps and textual content in forensic document examination, this study proposes a novel non-destructive method that integrates hyperspectral imaging (HSI) with convolutional neural networks (CNNs). A multi-type cross-sequence dataset was constructed, comprising 60 samples of handwriting-stamp sequences and 20 samples of printed text-stamp sequences, all subjected to six months of natural aging. Spectral responses were collected across the 400-1000 nm range in the overlapping regions.

View Article and Find Full Text PDF

Oral cancer is a major global health burden, ranking sixth in prevalence, with oral squamous cell carcinoma (OSCC) being the most common type. Importantly, OSCC is often diagnosed at late stages, underscoring the need for innovative methods for early detection. The oral microbiome, an active microbial community within the oral cavity, holds promise as a biomarker for the prediction and progression of cancer.

View Article and Find Full Text PDF

Postoperative aphasia (POA) is a common complication in patients undergoing surgery for language-eloquent lesions. This study aimed to enhance the prediction of POA by leveraging preoperative navigated transcranial magnetic stimulation (nTMS) language mapping and diffusion tensor imaging (DTI)-based tractography, incorporating deep learning (DL) algorithms. One hundred patients with left-hemispheric lesions were retrospectively enrolled (43 developed postoperative aphasia, as the POA group; 57 did not, as the non-aphasia (NA) group).

View Article and Find Full Text PDF