openHSU logo
  • English
  • Deutsch
  • Log In
  • Communities & Collections
  1. Home
  2. Helmut-Schmidt-University / University of the Federal Armed Forces Hamburg
  3. Publications
  4. 3 - Publication references (without full text)
  5. HarmonizR enables data harmonization across independent proteomic datasets with appropriate handling of missing values
 
Options
Show all metadata fields

HarmonizR enables data harmonization across independent proteomic datasets with appropriate handling of missing values

Publication date
2022-06-20
Document type
Research article
Author
Voß, Hannah
Schlumbohm, Simon 
Barwikowski, Philip
Wurlitzer, Marcus
Dottermusch, Matthias
Neumann, Philipp 
Schlüter, Hartmut
Neumann, Julia E.
Krisp, Christoph
Organisational unit
High Performance Computing 
Projekt DEAL
DOI
10.1038/s41467-022-31007-x
URI
https://openhsu.ub.hsu-hh.de/handle/10.24405/14377
Scopus ID
2-s2.0-85132208409
Pubmed ID
35725563
ISSN
2041-1723
2041-1723
Series or journal
Nature Communications
Periodical volume
13
Peer-reviewed
✅
Part of the university bibliography
✅
  • Additional Information
Abstract
Dataset integration is common practice to overcome limitations in statistically underpowered omics datasets. Proteome datasets display high technical variability and frequent missing values. Sophisticated strategies for batch effect reduction are lacking or rely on error-prone data imputation. Here we introduce HarmonizR, a data harmonization tool with appropriate missing value handling. The method exploits the structure of available data and matrix dissection for minimal data loss, without data imputation. This strategy implements two common batch effect reduction methods-ComBat and limma (removeBatchEffect()). The HarmonizR strategy, evaluated on four exemplarily analyzed datasets with up to 23 batches, demonstrated successful data harmonization for different tissue preservation techniques, LC-MS/MS instrumentation setups, and quantification approaches. Compared to data imputation methods, HarmonizR was more efficient and performed superior regarding the detection of significant proteins. HarmonizR is an efficient tool for missing data tolerant experimental variance reduction and is easily adjustable for individual dataset properties and user preferences.
Version
Not applicable (or unknown)
Access right on openHSU
Metadata only access

  • Cookie settings
  • Privacy policy
  • Send Feedback
  • Imprint