Institute of Environmental Science and Research
Browse

EpiDiP/NanoDiP: a versatile unsupervised machine learning edge computing platform for epigenomic tumour diagnostics

Version 2 2024-08-28, 21:16
Version 1 2024-08-28, 03:51
journal contribution
posted on 2024-08-28, 21:16 authored by Jürgen Hench, Claus Hultschig, Jon Brugger, Luigi Mariani, Raphael Guzman, Jehuda Soleman, Severina Leu, Miles C. Benton, Irenäus Maria Stec, Ivana Bratic Hench, Per Hoffmann, Patrick Harter, Katharina J Weber, Anne Albers, Christian Thomas, Martin Hasselblatt, Ulrich Schüller, Lisa Restelli, David Capper, Ekkehard Hewer, Joachim Diebold, Danijela Kolenc, Ulf C. Schneider, Elisabeth Rushing, Rosa della Monica, Lorenzo Chiariotti, Martin Sill, Daniel Schrimpf, Andreas von Deimling, Felix Sahm, Christian Kölsche, Markus Tolnay, Stephan Frank

DNA methylation analysis based on supervised machine learning algorithms with static reference data, allowing diagnostic tumour typing with unprecedented precision, has quickly become a new standard of care. Whereas genome-wide diagnostic methylation profiling is mostly performed on microarrays, an increasing number of institutions additionally employ nanopore sequencing as a faster alternative. In addition, methylation-specific parallel sequencing can generate methylation and genomic copy number data. Given these diverse approaches to methylation profiling, to date, there is no single tool that allows (1) classification and interpretation of microarray, nanopore and parallel sequencing data, (2) direct control of nanopore sequencers, and (3) the integration of microarray-based methylation reference data. Furthermore, no software capable of entirely running in routine diagnostic laboratory environments lacking high-performance computing and network infrastructure exists. To overcome these shortcomings, we present EpiDiP/NanoDiP as an open-source DNA methylation and copy number profiling suite, which has been benchmarked against an established supervised machine learning approach using in-house routine diagnostics data obtained between 2019 and 2021. Running locally on portable, cost- and energy-saving system-on-chip as well as gpGPU-augmented edge computing devices, NanoDiP works in offline mode, ensuring data privacy. It does not require the rigid training data annotation of supervised approaches. Furthermore, NanoDiP is the core of our public, free-of-charge EpiDiP web service which enables comparative methylation data analysis against an extensive reference data collection. We envision this versatile platform as a useful resource not only for neuropathologists and surgical pathologists but also for the tumour epigenetics research community. In daily diagnostic routine, analysis of native, unfixed biopsies by NanoDiP delivers molecular tumour classification in an intraoperative time frame.

Funding

Stiftung Kinderkrebs beider Basel, Switzerland (#2023-F006/007 to JH and SF)

History

Submitter

Salila Bryant