Systematic Error Removal Using Random Forest for Normalizing Large-Scale Untargeted Lipidomics Data

Identifikátory výsledku

Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F67985823%3A_____%2F19%3A00504336" target="_blank" >RIV/67985823:_____/19:00504336 - isvavai.cz</a>
Výsledek na webu
<a href="https://pubs.acs.org/doi/10.1021/acs.analchem.8b05592" target="_blank" >https://pubs.acs.org/doi/10.1021/acs.analchem.8b05592</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1021/acs.analchem.8b05592" target="_blank" >10.1021/acs.analchem.8b05592</a>

Alternativní jazyky

Jazyk výsledku
angličtina
Název v původním jazyce
Systematic Error Removal Using Random Forest for Normalizing Large-Scale Untargeted Lipidomics Data
Popis výsledku v původním jazyce
Large-scale untargeted lipidomics experiments involve the measurement of hundreds to thousands of samples. Such data sets are usually acquired on one instrument over days or weeks of analysis time. Such extensive data acquisition processes introduce a variety of systematic errors, including batch differences, longitudinal drifts, or even instrument-to instrument variation. Technical data variance can obscure the true biological signal and hinder biological discoveries. To combat this issue, we present a novel normalization approach based on using quality control pool samples (QC). This method is called systematic error removal using random forest (SERRF) for eliminating the unwanted systematic variations in large sample sets. We compared SERRF with 15 other commonly used normalization methods using six lipidomics data sets from three large cohort studies (832, 1162, and 2696 samples). SERRF reduced the average technical errors for these data sets to 5% relative standard deviation. We conclude that SERRF outperforms other existing methods and can significantly reduce the unwanted systematic variation, revealing biological variance of interest.
Název v anglickém jazyce
Systematic Error Removal Using Random Forest for Normalizing Large-Scale Untargeted Lipidomics Data
Popis výsledku anglicky
Large-scale untargeted lipidomics experiments involve the measurement of hundreds to thousands of samples. Such data sets are usually acquired on one instrument over days or weeks of analysis time. Such extensive data acquisition processes introduce a variety of systematic errors, including batch differences, longitudinal drifts, or even instrument-to instrument variation. Technical data variance can obscure the true biological signal and hinder biological discoveries. To combat this issue, we present a novel normalization approach based on using quality control pool samples (QC). This method is called systematic error removal using random forest (SERRF) for eliminating the unwanted systematic variations in large sample sets. We compared SERRF with 15 other commonly used normalization methods using six lipidomics data sets from three large cohort studies (832, 1162, and 2696 samples). SERRF reduced the average technical errors for these data sets to 5% relative standard deviation. We conclude that SERRF outperforms other existing methods and can significantly reduce the unwanted systematic variation, revealing biological variance of interest.

Klasifikace

Druh
J<sub>imp</sub> - Článek v periodiku v databázi Web of Science
CEP obor
—
OECD FORD obor
10406 - Analytical chemistry

Návaznosti výsledku

Projekt
—
Návaznosti
I - Institucionalni podpora na dlouhodoby koncepcni rozvoj vyzkumne organizace

Ostatní

Rok uplatnění
2019
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů

Údaje specifické pro druh výsledku

Název periodika
Analytical Chemistry
ISSN
0003-2700
e-ISSN
—
Svazek periodika
91
Číslo periodika v rámci svazku
5
Stát vydavatele periodika
US - Spojené státy americké
Počet stran výsledku
7
Strana od-do
3590-3596
Kód UT WoS článku
000460709200057
EID výsledku v databázi Scopus
2-s2.0-85062373533

Podobné výsledky(10)

Comparison of Mass Spectrometry Based Approaches in Lipidomics Intra-laboratory comparison of four analytical platforms for lipidomic quantitation using hydrophilic interaction liquid chromatography or supercritical fluid chromatography coupled to quadrupole - time-of-flight mass spectrometry Lipidomic analysis of biological samples: Comparison of liquid chromatography, supercritical fluid chromatography and direct infusion mass spectrometry methods

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Systematic Error Removal Using Random Forest for Normalizing Large-Scale Untargeted Lipidomics Data

Identifikátory výsledku

Alternativní jazyky

Klasifikace

Návaznosti výsledku

Ostatní

Údaje specifické pro druh výsledku

Podobné výsledky(10)

Co hledáte?

Rychlé hledání

Chytré vyhledávání

Popis výsledku

Identifikátory výsledku

Identifikátory výsledku

Alternativní jazyky

Alternativní jazyky

Klasifikace

Klasifikace

Návaznosti výsledku

Návaznosti výsledku

Ostatní

Ostatní

Údaje specifické pro druh výsledku

Údaje specifické pro druh výsledku

Podobné výsledky(10)