%SUMMARY %- ABSTRACT %- INTRODUCTION %# BASICS %- \acs{DNA} STRUCTURE %- DATA TYPES % - BAM/FASTQ % - NON STANDARD %- COMPRESSION APPROACHES % - SAVING DIFFERENCES WITH GIVEN BASE \acs{DNA} % - HUFFMAN ENCODING % - PROBABILITY APPROACHES (WITH BASE?) % %# COMPARING TOOLS %- %# POSSIBLE IMPROVEMENT %- \acs{DNA}S STOCHASTICAL ATTRIBUTES %- IMPACT ON COMPRESSION %\chapter{Analysis for Possible Compression Improvements} \chapter{Feasibillity Analysis for New Algorithm Considering Stochastic Organisation of Genomes} % first thoughts: % - just save one nuceleotide every n bits % - save checksum for whole genome % - use algorithms (from new discoveries) to recreate genome % - check checksum -> finished : retry % - can run recursively and threaded % - im falle von testdata: hetzer, dedizierter hardware, auf server compilen, specs aufschreiben -> 'lscpu' || 'cat /proc/cpuinfo'