Data normalization and scaling: Consequences for the analysis in omics sciences
The result's identifiers
Result code in IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61989592%3A15310%2F18%3A73589128" target="_blank" >RIV/61989592:15310/18:73589128 - isvavai.cz</a>
Result on the web
—
DOI - Digital Object Identifier
—
Alternative languages
Result language
angličtina
Original language name
Data normalization and scaling: Consequences for the analysis in omics sciences
Original language description
The main task in the analysis of omics data is to understand biological information in the data. From a statistical point of view, classification analysis is one of the goals. If the data are consisting of groups of, e.g., controls and patients, the accurate prediction of new samples is desirable. To understand the processes in the human body or in other organisms, the interpretation of the model is necessary. In the two-group setting, the information about important features is one of the main tasks in omics disciplines. In metabolomics, the problem is called biomarker identification, while in genetics this is called fold changes problem, where it is examined for a feature, how many times the average concentration in one group is higher/lower than for the other group. In statistics, this is often referred to as the feature selection problem. Section 4 analyzes the impact of pretreatment methods on publicly available real-world data sets in terms of classification and feature selection analysis. As an example of omics disciplines, the data sets are originating from the metabolomics field. Section 5 discusses and summarizes the main findings and provides some overall recommendations.
Czech name
—
Czech description
—
Classification
Type
C - Chapter in a specialist book
CEP classification
—
OECD FORD branch
10103 - Statistics and probability
Result continuities
Project
<a href="/en/project/GF15-34613L" target="_blank" >GF15-34613L: Statistics in metabolomics for biomarker research in medicine</a><br>
Continuities
P - Projekt vyzkumu a vyvoje financovany z verejnych zdroju (s odkazem do CEP)
Others
Publication year
2018
Confidentiality
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Data specific for result type
Book/collection name
Data analysis for omics sciences: Methods and applications
ISBN
978-0-444-64044-4
Number of pages of the result
32
Pages from-to
165-196
Number of pages of the book
706
Publisher name
Elsevier
Place of publication
Amsterdam
UT code for WoS chapter
—