Coding of Genomic Sequencing Data

TNT members involved in this project:
Yeremia G. Adhisantoso, M.Sc.
Dr.-Ing. Marco Munderloh
Prof. Dr.-Ing. Jörn Ostermann
Dipl.-Ing. Jan Voges

Over the past years technological advances in genomic sequencing - the process of reading out genomic information from biological samples - have led to a faster and more cost-efficient approach to sequence individual genomes and other genomic samples. Because of the enormous amount of sequencing data generated the processing, storage and analysis of sequencing data entails novel challenges for the scientific community. New processes and tools have to be developed to overcome the current limitations in terms of storage space, processing speed and many more. Our goal is to develop novel algorithms to enhance data processing "from the tissue to the hard drive".

In the scope of this project we actively contribute to the series of MPEG-G standards (ISO/IEC 23092). More information is available on the MPEG-G website.

If you are interested in writing your thesis and thereby in contributing to this project please contact Jan Voges.

  • J. Voges, and J. Ostermann: Streaming für die Genomforschung, Binaire, vol. 2019, no. 2, 2019 (link)

Show all publications
  • Jan Voges, Tom Paridaens, Fabian Müntefering, Liudmila S Mainzer, Brian Bliss, Mingyu Yang, Idoia Ochoa, Jan Fostier, Jörn Ostermann, Mikel Hernaez
    GABAC: an arithmetic coding solution for genomic data
    Bioinformatics, Oxford University Press, Vol. 36, No. 7, pp. 2275-2277, December 2019, edited by John Hancock
  • Brian E Bliss, Joshua M Allen, Saurabh Baheti, Matthew A Bockol, Shubham Chandak, Jaime Delgado, Jan Fostier, Josep L Gelpi, Steven N Hart, Mikel Hernaez Arrazola, Matthew E Hudson, Michael T Kalmbach, Eric W Klee, Liudmila S Mainzer, Fabian Müntefering, Daniel Naro, Idoia Ochoa-Alvarez, Jörn Ostermann, Tom Paridaens, Christian A Ross, Jan Voges, Eric D Wieben, Mingyu Yang, Tsachy Weissman, Mathieu Wiepert
    Genie: an MPEG-G conformant software to compress genomic data
    International Conference for High Performance Computing, Networking, Storage and Analysis (SC19), p. (poster), Denver, CO (US), November 2019
  • Tom Paridaens, Jan Voges, Mikel Hernaez, Jan Fostier, Jörn Ostermann
    GABAC: an arithmetic coding solution for genomic data
    27th Conference on Intelligent Systems for Molecular Biology (ISMB) and 18th European Conference on Computational Biology (ECCB) 2019, International Society for Computational Biology (ISCB), Vol. 8, p. 1463 (poster), Basel (CH), July 2019