Repeated double cross-validation applied to the PCA-LDA classification of SERS spectra: a case study with serum samples from hepatocellular carcinoma patients

Gurian, Elisa; Di Silvestre, Alessia; Mitri, Elisa; Pascut, Devis; Tiribelli, Claudio; Giuffrè, Mauro; Crocè, Lory Saveria; Sergo, Valter; Bonifacio, Alois

doi:10.1007/s00216-020-03093-7

Repeated double cross-validation applied to the PCA-LDA classification of SERS spectra: a case study with serum samples from hepatocellular carcinoma patients

Research Paper
Open access
Published: 08 December 2020

Volume 413, pages 1303–1312, (2021)
Cite this article

Download PDF

You have full access to this open access article

Analytical and Bioanalytical Chemistry Aims and scope Submit manuscript

Repeated double cross-validation applied to the PCA-LDA classification of SERS spectra: a case study with serum samples from hepatocellular carcinoma patients

Download PDF

Elisa Gurian¹,
Alessia Di Silvestre¹,
Elisa Mitri¹,
Devis Pascut²,
Claudio Tiribelli²,
Mauro Giuffrè^2,3,
Lory Saveria Crocè^2,3,
Valter Sergo^1,4 &
…
Alois Bonifacio ORCID: orcid.org/0000-0002-2251-7786¹

4231 Accesses
30 Citations
2 Altmetric
Explore all metrics

Abstract

Intense label-free surface-enhanced Raman scattering (SERS) spectra of serum samples were rapidly obtained on Ag plasmonic paper substrates upon 785 nm excitation. Spectra from the hepatocellular carcinoma (HCC) patients showed consistent differences with respect to those of the control group. In particular, uric acid was found to be relatively more abundant in patients, while hypoxanthine, ergothioneine, and glutathione were found as relatively more abundant in the control group. A repeated double cross-validation (RDCV) strategy was applied to optimize and validate principal component analysis-linear discriminant analysis (PCA-LDA) models. An analysis of the RDCV results indicated that a PCA-LDA model using up to the first four principal components has a good classification performance (average accuracy was 81%). The analysis also allowed confidence intervals to be calculated for the figures of merit, and the principal components used by the LDA to be interpreted in terms of metabolites, confirming that bands of uric acid, hypoxanthine, ergothioneine, and glutathione were indeed used by the PCA-LDA algorithm to classify the spectra.

The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation

Article Open access 02 January 2020

Quantitative Mass Spectrometry-Based Proteomics: An Overview

Cost-sensitive learning for imbalanced medical data: a review

Article Open access 01 March 2024

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Surface-enhanced Raman scattering (SERS) spectroscopy is an analytical technique based on the inelastic scattering of a laser by analytes adsorbed on nanostructured metal surfaces with adequate plasmonic properties [1, 2]. As for normal Raman spectroscopy, bands in SERS spectra are related to the different vibrational modes of the analyte molecules. Different molecular structures will yield different spectra, making vibrational spectroscopies as Raman and SERS very structure-specific. However, SERS benefits from a much greater sensitivity than Raman, due to the intensity enhancement granted by its interaction with the plasmonic surface. These characteristics, together with the availability of relatively inexpensive and portable instrumentation, as well as a fast analytical response, make SERS extremely appealing for bioanalytical applications, many of which are listed in recent reviews [3, 4].

One of the simplest approach used when applying SERS to bioanalysis, usually referred to as label-free SERS, consists of putting a biofluid containing the analyte or a mixture of analytes in contact with a nanostructured metal surface (such as metal nanoparticles) for direct detection of the target molecule(s). While in some cases a specific analyte is sought, in many cases, especially when developing a diagnostic method, an untargeted approach is adopted. By using this strategy, the rich biochemical complexity of biofluids such as blood plasma or serum is explored, and not just one but several metabolites are considered in a multi-marker approach to diagnosis. Thus, in a study where label-free SERS is used to characterize biofluid samples for diagnostic or prognostic purposes, spectra become a sort of metabolic fingerprints, in which bands originate from those narrow subset of metabolites with a higher affinity for the nanostructured metal surface [5].

Label-free SERS of biofluids such as plasma, serum, urine, or saliva is rapidly emerging as a promising method for the diagnosis of several pathologies [3,4,5], especially by using multivariate data analysis and predictive modelling methods to fully exploit the intrinsic multivariate information present in the spectral dataset. By using multivariate prediction algorithms [6,7,8], even what are usually considered extremely small spectral differences can be exploited for classification purposes. However, multivariate methods are a two-edged sword, and while being extremely powerful tools to exploit the information contained in SERS spectra, they should be carefully validated and the results correctly presented. To avoid overfitting, and thus a gross over-estimation of the classification performance of a method, a careful approach should be adopted when trying to optimize and validate a model. Another issue in predictive models is the estimation of the uncertainties of figures of merit (FOM, also addressed as “quality performance metrics”) such as accuracy, sensitivity, specificity, NPV, PPV, and AUC, often used [9] to express the performance of a classification model. The uncertainty about a model performance can be conveyed by specifying confidence intervals for FOM. However, such confidence intervals cannot be derived from a single model, but require an adequate number of different models.

Among the different strategies available for optimization and assessment of models, the repeated double cross-validation [6, 10] (RDCV, see Methods and Discussion for details) has one advantage it automatically optimizes model parameters, thus avoiding arbitrary choices by researchers, while keeping train and test data sets for optimization and validation well separated. These features help to minimize the possibility of overfitting. Moreover, the repeated cross-validation generates many different models that can be used to calculate confidence intervals for FOM.

This paper aims to apply RDCV for classification, using a “principal component analysis - linear discriminant analysis” approach (PCA-LDA [6], see Methods and Discussion for details) on a label-free SERS dataset. RDCV has been originally proposed and used for regression [10], and although a number of studies applied this approach to classification as well on several types of spectroscopic data [11,12,13,14,15], to our knowledge, it has never been applied with this purpose to SERS data. As a case study to assess the use of RDCV, we use a dataset of label-free SERS spectra of serum of two groups of subjects: patients with hepatocellular carcinoma (HCC) and a control group.

The focus on HCC derives from the evidence that early diagnosis for this cancer still represents an unmet clinical need. HCC is the most common type of primary liver cancer, represents the seventh most frequent cancer and the fourth leading cause of cancer-related death worldwide in 2018 [16]. The late diagnosis has a negative impact on patients’ life expectancy since it lowers the chances of effective treatment options. HCC is the only cancer diagnosed through imaging techniques without the need for histological confirmation. However, imaging techniques have some limitations in terms of sensitivity, costs, and patient’s compliance. Short-term surveillance with these techniques is still considered not clinically efficient and cost-effective. New non-invasive tools are thus needed to for early HCC detection, and label-free SERS of serum or other biofluids might be a viable candidate.

Materials and methods

Materials and chemicals

All chemicals used for the SERS substrate fabrication were purchased from Merck and used as received. Pure cellulose qualitative filter paper (grade 410, 2 μm average pore size) was purchased from VWR International Srl (Milano, IT). Ultrapure water (Milli-Q) was used for preparing all solutions.

Human serum samples

Fasting blood samples were collected at time of diagnosis from 72 consecutive male subjects with HCC referring to the Liver Center of the University Hospital of Trieste (Italy) and from 72 consecutive healthy blood donors recruited in 2018 at the Transfusion Clinic of the University Hospital of Trieste (Italy) (Table 1, and Table S1 in the Supplementary Information (ESM)). All the patients provided written informed consent and patient anonymity has been preserved. The investigation was conducted according to the principles expressed in the Declaration of Helsinki. The study was approved by the regional ethical committee (Comitato Etico Regionale Unico del Friuli Venezia Giulia, Prot. No. 2018 Os-008-ASUITS, CINECA no. 2225). HCC was diagnosed according to the EASL criteria and staged according to the Barcelona Clinic Liver Cancer (BCLC) [17].

Table 1 Characteristics of the study populations. Age expressed as median (1st quartile–3rd quartile). For more characteristics, see Table S1 in ESM

Full size table

Sample collection and storage

Serum samples were obtained from 6 mL of whole blood collected in Vacuette® serum separating tubes (Greiner Bio-One International GmbH, Kremsmünster, Austria) and centrifuged at 3500 rpm for 10 min. Supernatants were transferred in 1-mL Eppendorf tubes and subsequently frozen at − 80 °C for long-term storage (until SERS analysis). For HCC, patients’ samples were collected at the time of diagnosis before any treatment.

SERS substrate fabrication

The plasmonic paper substrates in use were fabricated according to an in-house developed procedure, following a dip-coating of filter paper with citrate-reduced silver nanoparticles [18]. The synthesis of the colloidal nanoparticles follows the recipe of Lee and Meisel [19]. Briefly, 10 mL of sodium citrate 1.1% w/w has been added dropwise to 500 mL boiling solution of AgNO₃ 1.1 mM under magnetic stirring for 1 h and kept at dark. All glassware used for this synthesis was previously cleaned with nitric acid and Nochromix solutions (GODAX Labs Inc.), and thoroughly rinsed with Milli-Q water after each cleaning step. The resulting nanoparticles have been concentrated 10 times in volume with an ultra-centrifuge (60 min at 45000 rpm). Afterward, 1 cm² filter paper squares were placed well-wise in a 24 multi-well plates with 3 mL of the concentrated Ag colloid. The addition of 62 μL of 1 M sodium citrate tribasic allowed NP aggregation and precipitation on the paper. After 7 days of incubation, the supernatant was removed and the substrates were transferred and stored in Milli-Q water, in dark and at room temperature, until use. The substrates prepared as described were stable for 3 months.

SERS instrumentation

The spectra collection has been performed in air at room temperature with an i-Raman Plus portable system (BWS465-785S) through a compatible Raman video microscope (BAC151B) and with the BWSpec software (version 4.03_23_c), by B&W Tek (Newark, DE). Excitation was obtained with a 785-nm laser with an output power of about 400 mW. Laser light delivery to the sample and scattering collection occurred through an optical fiber probe connected to a compatible Raman video microscope. The instrument spectrograph had an average spectral resolution of 2.4 cm⁻¹. The laser spot diameter at the sample was of 105 μm, obtained by using a × 20 Olympus objective (N.A. 0.25, working distance 8.8 mm). Spectra collection was performed with a single accumulation of 10 s CCD exposure, and with a laser power at the sample of 38 mW (10% of the maximum laser output). Using these experimental conditions, no substrate photo-degradation was reported. Paracetamol samples were used as standard reference samples during every measurement session to check spectrometer wavelength calibration.

Sample preparation and SERS measurement

Serum samples were immediately analyzed after thawing. Five microliter drops of serum were dropped on the surface of the plasmonic paper substrates and let dry for 20 min. Later, the plasmonic paper substrates were placed under the i-Raman plus portable microscope objective on a glass microscope slide, and spectra were collected at room temperature (25 °C) in three technical replicas for each sample, which were averaged before further preprocessing and analysis. Data was collected on 5 different days and over 3 different batches of substrates. Sample collection was stratified over the different batches of substrates and over various days, so that on each day, an equal number of samples from both H0T and CTR classes and from each substrate batch was measured. This way, differences observed between classes cannot be related to the measurement day or to the substrate batch used. Also, measurements were randomly collected by two different operators.

Data preprocessing, analysis, and visualization

Spectra have been entirely processed using the R environment for data analysis [20]—version 3.6.2 (2019-12-12). In particular, the package hyperSpec [21] was used for data import and visualization. The preprocessing steps included (i) Raman shift range selection (400 to 1800 cm⁻¹) and data interpolation by local polynomial regression fitting (loess) to a new wavelength axis with a spacing of 2 cm⁻¹, (ii) baseline correction (package baseline [22], method modpolyfit, polynomial degree = 4), (iii) vector normalization. Examples of baselines are shown in Fig. S1 of the ESM. After baseline correction, the Raman shift range was further cropped from 430 to 1730 cm⁻¹ to delete possible artifacts due to the baseline subtraction present at the borders of the spectral range. A PCA-LDA prediction algorithm was used, in which a number of principal components (PC) were selected for a linear discriminant analysis (LDA). Principal components analysis (PCA) was performed using the prcomp function, centering but not scaling data. The cumulative proportion of explained Variance for the first 20 principal components of the dataset is available as ESM (Fig. S2A). The function lda from the MASS package [23] was used for the LDA. A RDCV [10] was chosen as validation strategy, in which the number of PC to be used in each LDA model was iteratively optimized using independent portions of the dataset in an “inner k-fold cross-validation loop” (k = 7), while an “outer k-fold cross-validation loop” (k = 3) is used to cross-validate the optimized models on independent folds of the dataset. Data partitions in both loops were created using the createFolds function of the caret package [24]. Data partition was stratified, so that each fold contained the same proportions of the classes considered. Note that the PCA was performed for each loop only for the train set, so that train and test sets were kept well separated and no information from the test set was introduced in the PCA-LDA model. The double cross-validation was repeated n times (n = 100), generating 300 optimized partial models (each from k-1 folds). For the RDCV, functions were also used from packages chemometrics [25], e1071 [26], and ROCR [27].

Confusion matrices were obtained for each of the 100 repetitions of the cross-validation by summing the partial confusion matrices of each fold. Quality performance metrics (sensitivity; specificity; accuracy; PPV—positive predicted values; NPV—negative predicted values; and AUC—area under the curve) for each repetition were calculated then from these confusion matrices, yielding a distribution of 100 values for each metric. The confidence intervals (95%) for sensitivity, specificity, accuracy, PPV, and NPV were calculated using the binom.confint function of the binom package [28], assuming binomial distributions. ROC curves for each repetition were generated by summing the prediction probabilities of each fold obtained with the ROCR package [27]. The confidence intervals for the AUC were calculated using the cvAUC package [29], according to LeDell et al. [30].

All figures were prepared using the R environment for data analysis [20]. Boxplots have been produced using the ggplot2 [31] package, and the ggsignif [32] package was used to calculate and display significant differences between distributions.

Results and discussion

Median SERS spectra of serum from the two classes considered, i.e., patients diagnosed with hepatocellular carcinoma (H0T) and controls (CTR) are reported in Fig. 1, along with the median difference spectrum. For the first time, a large dataset of SERS spectra of serum has been collected using Ag “plasmonic paper” substrates, i.e., paper coated with Ag nanoparticles, previously described and characterized by our group [18]. The spectra in Fig. 1 display the characteristic purine bands of label-free SERS of serum and plasma previously reported for other substrates [5]. The main advantage of using such paper-based solid substrates, with respect to colloidal substrates, is that an intense SERS spectrum can be rapidly obtained without the need to de-proteinize serum samples to promote aggregation [33], as the nanoparticles on the plasmonic paper are already aggregated. Simply depositing few microliters of serum directly on the plasmonic paper, without the need of any sample preprocessing or mixing with metal colloids, allows the collection of an intense SERS spectrum. The SERS spectra in Fig. 1 present some similarities with those recently reported from plasma on a slightly different plasmonic paper [34], where purine bands still dominate the spectrum. As SERS spectra of plasma and serum are not expected to show marked differences [33], the differences between these two spectral datasets could be due to still unknown differences in the physicochemical characteristics of the two surfaces (arising from different preparation protocols), or perhaps to the different population characteristics of the subjects involved in the study (only obese subjects were enrolled for the other study).

SERS spectra from the two classes present some dissimilarities, as evidenced by the difference spectrum represented in the lower part of Fig. 1. A cursory inspection of positive and negative bands in the difference spectrum suggests that uric acid [33] (positive bands at 594, 638, 812, 888, and 1132 cm⁻¹) is relatively more abundant in the sera of HCC patients than in those of controls, whereas hypoxanthine [33] (negative bands at 724 cm⁻¹), ergothioneine [35] (negative bands at 480, 1220, 1442, 1582 cm⁻¹), and perhaps glutathione [36] (negative bands at 664 and 912 cm⁻¹) are relatively less abundant in HCC patients.

Similar differences involving an increase in the uric acid-hypoxanthine SERS band intensity ratio were reported for liver diseases in general by Shao et al. [37], and more specifically for different fatty liver stages (NASH vs. NAFL) in a recent paper by our group [34]. As hypoxanthine is ultimately converted to uric acid by xanthine oxidase, these reports seem to suggest a role of the purine metabolism, and in particular of xanthine oxidase, as a general marker for liver function. Such conjecture has been recently supported by other papers as well [38].

On the other hand, a relative decrease in the SERS intensity of ergothioneine bands for liver cancer patients with respect to controls has been also reported (although with a different band interpretation) by Xiao et al. [39] and Liu et al. [40]. Ergothioneine, a natural amino acid that we assume with the diet and that has been found in high concentrations in the liver [41], appears to be often observed in SERS spectra of various biofluids, including serum and plasma [35]. Although its role is still not known, one of the most often cited hypotheses is its possible role as a potent antioxidant [41]. The fact that bands tentatively assigned to glutathione were also found to be less intense in HCC patients, consistently with those of ergothioneine, indicates a different oxidative status in HCC patients compared to controls. Interestingly, oxidative stress has been indeed suggested to play a relevant role in liver carcinogenesis from different etiologies [42].

Building upon these spectral differences, a predictive model can be trained to classify SERS spectra of serum collected on plasmonic paper as belonging to subjects with (i.e., positive class, labeled as H0T) or without HCC (i.e., negative class or controls, labeled as CTR). A RDCV strategy [10] has been adopted to optimize and evaluate the performance of a PCA-LDA model to classify the SERS spectra of serum.

The RDCV generated optimized models differing one another by the composition of train and test segments for the outer RDCV loop, and for the number of PC used in the LDA algorithm, as resulting from an optimization independently performed in the inner RDCV loop. RDCV is structured so that each optimization and validation step is performed on an independent test set. Thus, overfitting is avoided by each model, since no information from the test set is used to build the model used to predict it.

The only information needed as external input is the maximum number of PC to be considered for the inner loop. In this study, the maximum number of PC for the optimization loop was set to 7, as the first 7 PC calculated from the PCA of the entire dataset explained up to 90% of the spectral variance (see Fig. S2A in the ESM). A visual inspection of the loadings of PC7 (Fig. S2B in ESM) indicates that relevant spectral information is still present, ruling out the possibility of including just noise.

In the inner loop, the optimal number of PC is chosen by applying the so-called one-standard-error rule [6, 43]. The cross-validation error curves for all the models obtained by the RDCV are reported in Fig. 2a, showing that the models do not improve by including more than 4 PC. Consistently with this picture, Fig. 2b shows that most of the models were optimal when up to 3 or 4 PC were included as variables for the LDA, whereas a negligible fraction retained more than 4 PC. These results are suggesting that the PC after the 4th are not meaningful in differentiating between the two classes, although we still do not know which ones, among the first four, are the most relevant.

In the studies reported so far dealing with the classification of label-free SERS spectra of biofluids [3, 5], the value of the parameter for the classification algorithm (e.g., number of PCs or latent variables for PCA-LDA or PLS-DA) was arbitrarily selected on the basis of the information available from the whole dataset (e.g., the cumulative variance explained by a PC or the p value for a statistical test); this was not the case for RDCV. Conversely, in the present study, the use of a RDCV ensured an automated parameter selection for each model based on cross-validation, without using any information from the spectra to be predicted in the outer loop, thus avoiding the risk of overfitting during model optimization.

The performance of each optimized model generated by the RDCV was validated in the outer RDCV loop by comparing the predictions to the actual classes of an independent test set. Each iteration of the RDCV produced a confusion matrix (also known as error matrix), and the statistics of all the confusion matrices thus obtained is summarized in Fig. 3. The medians of the distributions for true positive, true negative, false positive, and false negative values give a first estimation on the overall performance of the PCA-LDA algorithm when up to 4 PCs are used (e.g., on a total of 72 spectra from sera of HCC patients, 62 are correctly predicted while 10 are misclassified as controls).

FOM such as accuracy, sensitivity, and specificity were calculated from the confusion matrix of each optimized model, yielding a distribution of values for each FOM from which confidence intervals were calculated (Table 2). Being able to estimate confidence intervals for these FOM is another advantage of using a repeated cross-validation strategy, as it allows an uncertainty estimation of the predictive capabilities of the model.

Table 2 Figures of merit calculated from the optimized models generated from the RDCV

Full size table

The average FOM and the corresponding confidence intervals suggest that a PCA-LDA model can distinguish between the two groups with an overall accuracy of about 80%, favoring model sensitivity (86%) at the expenses of specificity (76%). Another perspective on the performance of the PCA-LDA model can be gained by inspecting the LD scores (Fig. 4a) and the ROC curves (Fig. 4b) for each model generated by the RDCV. To further assess the statistical significance of these results, they were compared to those obtained from a validation of permuted data (i.e., permutation test [12]) in which the class labels were randomly assigned (Fig. S3 in the ESM). The permutation test confirmed the significance of the results obtained from the RDCV validation from the dataset with the correct class labels.

While an analysis of the optimized models (Fig. 2) illustrates the most frequent number of optimal PC (i.e., 3 or 4), it does not provide information about which components are most important for the performance of the PCA-LDA model. This information can be gained by looking at the medians of the PCA scores for each class, for each PC (Fig. 5). While PC 1, 3, and 4 all seems to be useful to distinguish between the two classes, the second PC seems to be irrelevant.

The question arises about a biochemical interpretation of these PC: what metabolites allow the distinction between the two groups? An inspection of the loadings of PC 1, 3, and 4 (Fig. 6) can help in answering this question, by interpreting the loadings in terms of spectral bands. The negative peaks in the loadings of PC1 can be interpreted as hypoxanthine bands, whereas the positive loadings seem to be correlated with the uric acid bands, corroborating the impression that these two metabolites are important in discriminating between the two groups (with uric acid relatively more abundant and hypoxanthine less abundant in the H0T group). The positive peaks in the loadings of PC3 are less easy to interpret than those of PC1, but negative peaks can be interpreted as bands due to hypoxanthine, ergothioneine, and (tentatively) glutathione, confirming the role of these substances in distinguishing between H0T and CTR classes, being relatively less abundant in the H0T class. Ergothioneine bands (especially the intense band at 480 cm⁻¹) can be also identified in the positive peaks of the PC4 loadings, endorsing the hypothesis that this metabolite is relatively more abundant in the CTR group. In general, the information in Figs. 5 and 6 is corroborating the picture given by Fig. 1, suggesting that the PCA-LDA models are indeed using these spectral differences to discriminate between classes.

The possibility of checking the workings of the classification model in terms of biochemical information given by the loadings is a further advantage of the PCA-LDA (but also of the PLS-LDA) models with respect to other less transparent algorithms (e.g., non-linear models such as support vector machines [10]) working more like “black boxes” concerning spectral interpretation. The fact that the model is based on real spectral differences (and not just noise or artifacts) is an indication that overfitting is less likely, while a biochemical interpretation of the differences used by the model can be exploited to gain a better insight into the biochemistry of the disease.

The clinical relevance of uric acid in relation to cancer risk, recurrence, and mortality has been suggested since many years [44] and it has been extensively reviewed, among others, by Fini et al. in 2012 [45], and more recently by Battelli et al. [46]. The association of hyperuricemia with cancer occurrence and recurrence has been reported in various cancer types, including HCC. More recently, high levels of serum uric acid were specifically suggested as risk factor for recurrence of HCC [47]. Hypoxanthine is metabolically related to uric acid via the xanthine oxidoreductase enzyme. Unfortunately, no information exists about a possible correlation of ergothioneine to HCC or to any cancer. However, we consider this as an interesting finding that has to be further explored in relation to HCC, especially when considering the antioxidant potential of this molecule. The unbalanced redox state is one of the drivers of hepatic carcinogenesis, as oxidative stress induces genomic damage and genetic instability leading to mutations.

Conclusions

Label-free SERS spectra of whole serum can be rapidly obtained from Ag plasmonic paper substrates. Spectra from the HCC and CTR groups showed consistent differences, which were exploited by the PCA-LDA models for classification purposes, with satisfying results in terms of performance. The use of a RDCV approach for the PCA-LDA applied to label-free SERS data allowed to automatically determine the number of PC to be used in LDA, and to calculate confidence intervals for FOM. Most importantly, the analysis of the RDCV results allowed to pinpoint which are the most relevant PC for the LDA model, and to interpret their loadings in terms of metabolites. This analysis confirmed that uric acid, hypoxanthine, ergothioneine and possibly glutathione, which were responsible for most spectral differences observed, have been effectively used by the PCA-LDA algorithm for classification. These metabolites are thus possible candidates as HCC markers, and might be investigated in further studies.

Data availability

The dataset consisting of all spectra is available for download on Zenodo (zenodo.org), DOI: https://doi.org/10.5281/zenodo.4277797.

References

Langer J, Jimenez de Aberasturi D, Aizpurua J, Alvarez-Puebla RA, Auguié B, Baumberg JJ, et al. Present and future of surface-enhanced Raman scattering. ACS Nano. 2020;14:28–117. https://doi.org/10.1021/acsnano.9b04224.
Article CAS PubMed Google Scholar
Wang X, Huang S-C, Hu S, Yan S, Ren B. Fundamental understanding and applications of plasmon-enhanced Raman spectroscopy. Nat Rev Phys. 2020;2:253–71. https://doi.org/10.1038/s42254-020-0171-y.
Article Google Scholar
Zheng X-S, Jahn IJ, Weber K, Cialla-May D, Popp J. Label-free SERS in biological and biomedical applications: recent progress, current challenges and opportunities. Spectrochim Acta A Mol Biomol Spectrosc. 2018;197:56–77. https://doi.org/10.1016/j.saa.2018.01.063.
Article CAS PubMed Google Scholar
Zong C, Xu M, Xu L-J, Wei T, Ma X, Zheng X-S, et al. Surface-enhanced Raman spectroscopy for bioanalysis: reliability and challenges. Chem Rev. 2018;118:4946–80. https://doi.org/10.1021/acs.chemrev.7b00668.
Article CAS PubMed Google Scholar
Bonifacio A, Cervo S, Sergo V. Label-free surface-enhanced Raman spectroscopy of biofluids: fundamental aspects and diagnostic applications. Anal Bioanal Chem. 2015;407:8265–77. https://doi.org/10.1007/s00216-015-8697-z.
Article CAS PubMed Google Scholar
Varmuza K, Filzmoser P. Introduction to multivariate statistical analysis in chemometrics: CRC Press; 2016.
Lussier F, Thibault V, Charron B, Wallace GQ, Masson J-F. Deep learning and artificial intelligence methods for Raman and surface-enhanced Raman scattering. TrAC Trends Anal Chem. 2020;124:115796. https://doi.org/10.1016/j.trac.2019.115796.
Article CAS Google Scholar
Yang J, Xu J, Zhang X, Wu C, Lin T, Ying Y. Deep learning for vibrational spectral analysis: recent progress and a practical guide. Anal Chim Acta. 2019;1081:6–17. https://doi.org/10.1016/j.aca.2019.06.012.
Article CAS PubMed Google Scholar
Cuadros-Rodríguez L, Pérez-Castaño E, Ruiz-Samblás C. Quality performance metrics in multivariate classification methods for qualitative analysis. TrAC Trends Anal Chem. 2016;80:612–24. https://doi.org/10.1016/j.trac.2016.04.021.
Article CAS Google Scholar
Filzmoser P, Liebmann B, Varmuza K. Repeated double cross validation. J Chemom. 2009;23:160–71. https://doi.org/10.1002/cem.1225.
Article CAS Google Scholar
Varmuza K, Filzmoser P, Hilchenbach M, Krüger H, Silén J. KNN classification — evaluated by repeated double cross validation: recognition of minerals relevant for comet dust. Chemom Intell Lab Syst. 2014;138:64–71. https://doi.org/10.1016/j.chemolab.2014.07.011.
Article CAS Google Scholar
Westerhuis JA, Hoefsloot HCJ, Smit S, Vis DJ, Smilde AK, van Velzen EJJ, et al. Assessment of PLSDA cross validation. Metabolomics. 2008;4:81–9. https://doi.org/10.1007/s11306-007-0099-6.
Article CAS Google Scholar
Pérez-Guaita D, Kuligowski J, Lendl B, Wood BR, Quintás G. Assessment of discriminant models in infrared imaging using constrained repeated random sampling – cross validation. Anal Chim Acta. 2018;1033:156–64. https://doi.org/10.1016/j.aca.2018.05.019.
Article CAS PubMed Google Scholar
Guo S, Bocklitz T, Neugebauer U, Popp J. Common mistakes in cross-validating classification models. Anal Methods. 2017;9:4410–7. https://doi.org/10.1039/C7AY01363A.
Article Google Scholar
Féré M, Gobinet C, Liu LH, Beljebbar A, Untereiner V, Gheldof D, et al. Implementation of a classification strategy of Raman data collected in different clinical conditions: application to the diagnosis of chronic lymphocytic leukemia. Anal Bioanal Chem. 2020;412:949–62. https://doi.org/10.1007/s00216-019-02321-z.
Article CAS PubMed Google Scholar
Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68:394–424. https://doi.org/10.3322/caac.21492.
Article Google Scholar
European Association For The Study Of The Liver; European Organisation For Research And Treatment Of Cancer. EASL–EORTC clinical practice guidelines: management of hepatocellular carcinoma. J Hepatol. 2012;56:908–43. https://doi.org/10.1016/j.jhep.2011.12.001.
Article Google Scholar
Dalla Marta S, Novara C, Giorgis F, Bonifacio A, Sergo V. Optimization and characterization of paper-made surface enhanced Raman scattering (SERS) substrates with Au and Ag NPs for quantitative analysis. Materials. 2017;10. https://doi.org/10.3390/ma10121365.
Lee PC, Meisel D. Adsorption and surface-enhanced Raman of dyes on silver and gold sols. J Phys Chem. 1982;86:3391–5. https://doi.org/10.1021/j100214a025.
Article CAS Google Scholar
R Core Team. R: a language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2019.
Google Scholar
Beleites C, Sergo V. hyperSpec: a package to handle hyperspectral data sets in R’ (ver 0.99–20180627). http://hyperspec.r-forge.r-project.org. Accessed 2 Dec 2020.
Liland KH, Almøy T, Mevik B-H. Optimal choice of baseline correction for multivariate calibration of spectra. Appl Spectrosc. 2010. https://doi.org/10.1366/000370210792434350.
Venables WN, Ripley BD. Modern applied statistics with S. 4th ed. New York: Springer-Verlag; 2002.
Book Google Scholar
Wing MKC from J, Weston S, Williams A, Keefer C, Engelhardt A, Cooper T, Mayer Z, Kenkel B, Team the RC, Benesty M, Lescarbeau R, Ziem A, Scrucca L. caret: classification and regression training, R package version 6.0–84. 2019. https://CRAN.R-project.org/package=caret. Accessed 2 Dec 2020.
Varmuza K, Filzmoser P. chemometrics: multivariate statistical analysis in chemometrics. R package version 1.4.2. https://CRAN.R-project.org/package=chemometrics. Accessed 2 Dec 2020.
Meyer D, Dimitriadou E, Hornik K, Weingessel A, Leisch F. e1071: Misc Functions of the Department of Statistics, Probability Theory Group (Formerly: E1071), TU Wien. R package version 1.7–3. 2019. https://CRAN.R-project.org/package=e1071. Accessed 2 Dec 2020.
Sing T, Sander O, Beerenwinkel N, Lengauer T. ROCR: visualizing classifier performance in R. Bioinformatics. 2005;21:3940–1. https://doi.org/10.1093/bioinformatics/bti623.
Article CAS Google Scholar
Dorai-Raj S. Binom: binomial confidence intervals for several parameterizations. R package version 1.1–1. 2014. https://CRAN.R-project.org/package=binom. Accessed 2 Dec 2020.
LeDell E, Petersen M, van der Laan M cvAUC: cross-validated area under the ROC curve confidence intervals. R package version 1.1.0. 2014. https://CRAN.R-project.org/package=cvAUC. Accessed 2 Dec 2020.
LeDell E, Petersen M, van der Laan M. Computationally efficient confidence intervals for cross-validated area under the ROC curve estimates. Electron J Stat. 2015;9:1583–607. https://doi.org/10.1214/15-EJS1035.
Article PubMed PubMed Central Google Scholar
Wickham H. ggplot2: elegant graphics for data analysis. New York: Springer-Verlag; 2009.
Book Google Scholar
Ahlmann-Eltze C. ggsignif: significance brackets for “ggplot2”. R package version 0.6.0. 2019. https://CRAN.R-project.org/package=ggsignif. Accessed 2 Dec 2020.
Bonifacio A, Dalla Marta S, Spizzo R, Cervo S, Steffan A, Colombatti A, et al. Surface-enhanced Raman spectroscopy of blood plasma and serum using Ag and Au nanoparticles: a systematic study. Anal Bioanal Chem. 2014;406:2355–65. https://doi.org/10.1007/s00216-014-7622-1.
Article CAS PubMed Google Scholar
Gurian E, Giraudi P, Rosso N, Tiribelli C, Bonazza D, Zanconati F, et al. Differentiation between stages of non-alcoholic fatty liver diseases using surface-enhanced Raman spectroscopy. Anal Chim Acta. 2020;1110:190–8. https://doi.org/10.1016/j.aca.2020.02.040.
Article CAS PubMed Google Scholar
Fornasaro S, Gurian E, Pagarin S, Genova E, Stocco G, Decorti G, et al. Ergothioneine, a dietary amino acid with a high relevance for the interpretation of label-free surface enhanced Raman scattering (SERS) spectra of many biological samples. Spectrochim Acta A Mol Biomol Spectrosc. 2021;246:119024. https://doi.org/10.1016/j.saa.2020.119024.
Article CAS PubMed Google Scholar
Genova E, Pelin M, Decorti G, Stocco G, Sergo V, Ventura A, et al. SERS of cells: what can we learn from cell lysates? Anal Chim Acta. 2018;1005:93–100. https://doi.org/10.1016/j.aca.2017.12.002.
Article CAS PubMed Google Scholar
Shao L, Zhang A, Rong Z, Wang C, Jia X, Zhang K, et al. Fast and non-invasive serum detection technology based on surface-enhanced Raman spectroscopy and multivariate statistical analysis for liver disease. Nanomedicine Nanotechnol Biol Med. 2018;14:451–9. https://doi.org/10.1016/j.nano.2017.11.022.
Article CAS Google Scholar
Brennan P, Clare K, George J, Dillon JF. Determining the role for uric acid in non-alcoholic steatohepatitis development and the utility of urate metabolites in diagnosis: an opinion review. World J Gastroenterol. 2020;26:1683–90. https://doi.org/10.3748/wjg.v26.i15.1683.
Article CAS PubMed PubMed Central Google Scholar
Xiao R, Zhang X, Rong Z, Xiu B, Yang X, Wang C, et al. Non-invasive detection of hepatocellular carcinoma serum metabolic profile through surface-enhanced Raman spectroscopy. Nanomedicine Nanotechnol Biol Med. 2016;12:2475–84. https://doi.org/10.1016/j.nano.2016.07.014.
Article CAS Google Scholar
Liu R, Xiong Y, Guo Y, Si M, Tang W. Label-free and non-invasive BS-SERS detection of liver cancer based on the solid device of silver nanofilm. J Raman Spectrosc. 2018;49:1426–34. https://doi.org/10.1002/jrs.5408.
Article CAS Google Scholar
Halliwell B, Cheah IK, Tang RMY. Ergothioneine – a diet-derived antioxidant with therapeutic potential. FEBS Lett. 2018;592:3357–66. https://doi.org/10.1002/1873-3468.13123.
Article CAS PubMed Google Scholar
Fu Y, Chung F-L. Oxidative stress and hepatocarcinogenesis. Hepatoma Res. 2018;4:39. https://doi.org/10.20517/2394-5079.2018.29.
Article CAS PubMed PubMed Central Google Scholar
Hastie T, Tibshirani R, Friedman J. The elements of statistical learning: data mining, inference, and prediction. 2nd ed. New York: Springer-Verlag; 2009.
Book Google Scholar
Ames BN, Cathcart R, Schwiers E, Hochstein P. Uric acid provides an antioxidant defense in humans against oxidant- and radical-caused aging and cancer: a hypothesis. Proc Natl Acad Sci. 1981;78:6858–62. https://doi.org/10.1073/pnas.78.11.6858.
Article CAS PubMed Google Scholar
Fini MA, Elias A, Johnson RJ, Wright RM. Contribution of uric acid to cancer risk, recurrence, and mortality. Clin Transl Med. 2012;1:e16. https://doi.org/10.1186/2001-1326-1-16.
Article Google Scholar
Battelli MG, Bortolotti M, Polito L, Bolognesi A. Metabolic syndrome and cancer risk: the role of xanthine oxidoreductase. Redox Biol. 2019;21:101070. https://doi.org/10.1016/j.redox.2018.101070.
Article CAS PubMed Google Scholar
Hayashi M, Yamada S, Tanabe H, Takami H, Inokawa Y, Sonohara F, et al. High serum uric acid levels could be a risk factor of hepatocellular carcinoma recurrences. Nutr Cancer. 2020;0:1–8. https://doi.org/10.1080/01635581.2020.1779758.
Article CAS Google Scholar

Download references

Acknowledgments

A.B. thanks Claudia Beleites and Stefano Fornasaro for the insightful discussions on model stability and confidence intervals for performance parameters. Authors thank Luca Giovanni Mascaretti and the Transfusion Medicine Department, Azienda Sanitaria Universitaria Integrata Giuliano Isontina (ASUGI) of Trieste.

Funding

Open access funding provided by Università degli Studi di Trieste within the CRUI-CARE Agreement. The study was supported by Surface-enhanced Raman microRNA for cancer (SERMI4CANCER) PORFESR 2014–2020 FVG, decree no.3028 (05/05/2017) and no.4526 (16/06/2017) and by an internal grant from Fondazione Italiana Fegato.

Author information

Authors and Affiliations

Raman Spectroscopy Lab, Dipartimento di Ingegneria e Architettura (DIA), University of Trieste, via Valerio 6, 34127, Trieste, TS, Italy
Elisa Gurian, Alessia Di Silvestre, Elisa Mitri, Valter Sergo & Alois Bonifacio
Fondazione Italiana Fegato – ONLUS, Area Science Park, SS14, km163.5, 34149, Basovizza, Trieste, TS, Italy
Devis Pascut, Claudio Tiribelli, Mauro Giuffrè & Lory Saveria Crocè
Department of Medical Sciences, University of Trieste, Strada di Fiume, 447, 34129, Trieste, Italy
Mauro Giuffrè & Lory Saveria Crocè
Faculty of Health Sciences, University of Macau, Macau, SAR, People’s Republic of China
Valter Sergo

Authors

Elisa Gurian
View author publications
You can also search for this author in PubMed Google Scholar
Alessia Di Silvestre
View author publications
You can also search for this author in PubMed Google Scholar
Elisa Mitri
View author publications
You can also search for this author in PubMed Google Scholar
Devis Pascut
View author publications
You can also search for this author in PubMed Google Scholar
Claudio Tiribelli
View author publications
You can also search for this author in PubMed Google Scholar
Mauro Giuffrè
View author publications
You can also search for this author in PubMed Google Scholar
Lory Saveria Crocè
View author publications
You can also search for this author in PubMed Google Scholar
Valter Sergo
View author publications
You can also search for this author in PubMed Google Scholar
Alois Bonifacio
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Elisa Gurian: Conceptualization, methodology, investigation, writing (original draft). Alessia Di Silvestre: Investigation. Elisa Mitri: Investigation. Devis Pascut: Investigation and data curation. Claudio Tiribelli: Writing (review and editing) and resources. Mauro Giuffrè: Resources. Lory Saveria Crocè: Resources. Valter Sergo: Writing (review and editing) and resources. Alois Bonifacio: Conceptualization, methodology, validation, formal analysis, writing (original draft), writing (review and editing), visualization, supervision.

Corresponding author

Correspondence to Alois Bonifacio.

Ethics declarations

Conflict of interest

The authors declare that they have no conflicts of interests.

Ethics approval

The investigation was conducted according to the principles expressed in the Declaration of Helsinki. The study was approved by the regional ethical committee (Comitato Etico Regionale Unico del Friuli Venezia Giulia, Prot. No. 2018 Os-008-ASUITS, CINECA no. 2225).

Consent to participate

All the patients enrolled in the study provided written informed consent and patient anonymity has been preserved.

Consent for publication

Not applicable.

Code availability

The R code used for data preprocessing, analysis, and visualization is available for download on Zenodo (zenodo.org), DOI: https://doi.org/10.5281/zenodo.4277797.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

ESM 1

(PDF 768 kb).

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gurian, E., Di Silvestre, A., Mitri, E. et al. Repeated double cross-validation applied to the PCA-LDA classification of SERS spectra: a case study with serum samples from hepatocellular carcinoma patients. Anal Bioanal Chem 413, 1303–1312 (2021). https://doi.org/10.1007/s00216-020-03093-7

Download citation

Received: 17 October 2020
Revised: 19 November 2020
Accepted: 23 November 2020
Published: 08 December 2020
Issue Date: February 2021
DOI: https://doi.org/10.1007/s00216-020-03093-7

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Repeated double cross-validation applied to the PCA-LDA classification of SERS spectra: a case study with serum samples from hepatocellular carcinoma patients

Abstract

Similar content being viewed by others

The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation

Quantitative Mass Spectrometry-Based Proteomics: An Overview

Cost-sensitive learning for imbalanced medical data: a review

Introduction