One of the major next-generation sequencing (NGS) technologies that are most frequently used in medical research is Ion Torrent. Software for Ion Torrent machines provides output in BAM files that are huge in size. Additionally, their compression is also space expensive.
For efficient compression of these BAM files into CRAM format, new software has been introduced. This software is known as IonCRAM . IonCRAM is a lossless reference-based compression tool. It is capable of compressing BAM files into archiving CRAM files such that it could save up to 43% of space. This is open-source software and is freely available at GitHub, CodeOcean, and http://ioncram.saudigenomeproject.com.
How IonCRAM works?
Its algorithm is based on binning the flow signals and quality values initially introduced by Illumina . At first, the algorithm sorts the reads by genomic coordinates in the BAM file followed by their prefix while sorting the respective CIGAR string to bring similar reads closer to each other. The sorted reads are then scanned for blocks of reads that are mapped to the same locus. Finally, flow signals from each block are collected and compressed together.
IonCRAM is available for Ubuntu and CentOS. For further reading, click here.
- Shokrof, M., Abouelhoda, M. IonCRAM: a reference-based compression tool for ion torrent sequence files. BMC Bioinformatics 21, 397 (2020).
- Illumina inc., “Understanding Illumina Quality Scores,” 2012.
[Tutorial] Trailing of paired end reads using Trimmomatic tool in GALAXY.
Trimmomatic is a read trimming tool for Illumina NGS data . It is a flexible tool providing several functions to be operated on reads. These functions include trailing, leading, and several other quality control operations. In this article, we are going to perform trailing on NGS paired-end reads data using the GALAXY platform . (more…)
How to extract methylation call using Bismark?
Bismark is bioinformatics to map bisulfite treated sequencing reads and to perform methylation calls . In this article, we are going to extract methylation information from Bismark alignment outputs. (more…)
Installing PANDAseq on Ubuntu
Installing Bismark on Ubuntu
FiNGS- A New Software providing Filters for Next Generation Sequencing
We use somatic variant callers to detect mutations in cancer samples by comparing sequencing data tumor and normal sample pairs. This is followed by some ad-hoc filtering that may produce low precision data resulting in a large number of false positives. (more…)
Assembly of high-throughput mRNA-Seq data: A review
Transcriptome represents the complete set of all expressed transcripts (RNA molecules) present in a cell or tissue at a given point of time. The transcriptome is always dynamic in nature and keeps on changing with time driven by the external and internal environment. (more…)
Predictive metagenomics profiling: why, what and how ?
What is predictive metagenomics profiling?
Recently, predictive metagenomics profiling (PMP) has been added to the microbial ecologist’s arsenal of strategies for probing microbial communities. (more…)
High throughput sequencing has revolutionized the new world of bioinformatics research. Since everyone is aware of the Human Genome project in which the human genome has been sequenced, millions of species have been sequenced so far. Sequencing is a very important aspect of bioinformatics so new faster and better sequencing techniques are needed . New sequencing platforms produce biological sequence fragments faster and cheaper.