k2_dna_structure.tex 2.3 KB

12345678910111213141516171819202122232425262728293031323334
  1. %SUMMARY
  2. %- ABSTRACT
  3. %- INTRODUCTION
  4. %# BASICS
  5. %- \acs{DNA} STRUCTURE
  6. %- DATA TYPES
  7. % - BAM/FASTQ
  8. % - NON STANDARD
  9. %- COMPRESSION APPROACHES
  10. % - SAVING DIFFERENCES WITH GIVEN BASE \acs{DNA}
  11. % - HUFFMAN ENCODING
  12. % - PROBABILITY APPROACHES (WITH BASE?)
  13. %
  14. %# COMPARING TOOLS
  15. %-
  16. %# POSSIBLE IMPROVEMENT
  17. %- \acs{DNA}S STOCHASTICAL ATTRIBUTES
  18. %- IMPACT ON COMPRESSION
  19. \chapter{Structure Of Biological Data}
  20. To strengthen the understanding how and where biological information is stored, this section starts with a quick and general rundown on the structure of any living organism.
  21. % todo add picture
  22. All living organisms, like plants and animals, are made of cells (a human body can consist out of several trillion cells).
  23. % human body estimated 3.72 x 10^13 cells https://www.tandfonline.com/doi/full/10.3109/03014460.2013.807878
  24. A cell in itsel is a living organism, the smalles one possible. A cell got two layers, the inner one is called nucleus wich contains chromosomes. The chromosomes hold the genetic information in form of \acs{DNA}.
  25. \section{DNA}
  26. \ac{DNA} is often seen in the form of a double helix. A double helix consists, as the name suggestes, of two single helix. Each of them consists of two main components: the Suggar Phosphat backbone, which is irelavant for this Paper and the Bases. The arrangement of Bases represents the Information stored in the \acs{DNA}. A base is an organic molecule, they are called Nucleotides. %Nucleotides have special attributes and influence other Nucleotides in the \acs{DNA} Sequence
  27. % describe Genomes?
  28. \section{Nucleotides}
  29. For this paper, nucleotides are the most important parts of the \acs{DNA}. A Nucleotide can have one of four forms: it can be either adenine, thymine, guanine or cytosine. Each of them got a Counterpart on the helix, to be more explicit: adenine can only bond with thymine, guanine can only bond with cytosine. This means with the content of one helix, the other one can be determined by ``inverting'' the first. The counterpart for e.g.: adenine, guanine, adenine would be: thymine, cytosine, thymine. For the sake of simplicity, one does not write out the full name of each nucleotide but only use its initial: AGA in one Helix, TCT in the other.
  30. % it there is only one section -> remove it or move everything into introduction