A new suite of tools called SequelTools has been developed for analyzing PacBio raw sequence data. PacBio is a third-generation DNA sequencing technology that can detect methylated bases, produces very long reads, and sequences in real time.
SequelTools is a command-line program that consists of tools for quality control, read subsampling, and read filtering. According to the authors, no other tool is yet available that performs quality control, subsampling, and filtering of PacBio raw sequence data.
SequelTools consists of three tools that can be used one at a time.
1. Quality Control (QC) tool
This tool processes SMRTcells and generates multiple statistics and plots. These plots describe the quality of input data.
2. Read subsampling tool
It subsamples reads based on different criteria selected by the user such as random CLR selection or longest subreads per CLR.
3. Read filtering tool
This tool allows users to normalize data by minimum length or by filtering out certain low-quality scrap reads.
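To make the "longest subread per CLR" criterion concrete, here is a minimal Python sketch. It is not SequelTools' own code. It assumes subread names follow the usual PacBio pattern `movie/zmw/start_end`, so that all subreads sharing the same movie and ZMW belong to one CLR. The function name and the example reads are hypothetical.

```python
def longest_subread_per_clr(subreads):
    """Keep only the longest subread for each CLR.

    `subreads` is an iterable of (name, sequence) pairs. Names are assumed
    to look like 'movie/zmw/start_end'; the (movie, zmw) pair identifies
    the CLR that a subread came from.
    """
    best = {}
    for name, seq in subreads:
        movie, zmw = name.split("/")[:2]
        key = (movie, zmw)
        # Replace the stored subread only if this one is strictly longer.
        if key not in best or len(seq) > len(best[key][1]):
            best[key] = (name, seq)
    return list(best.values())


# Hypothetical example: two subreads from ZMW 10, one from ZMW 11.
reads = [
    ("m1/10/0_4", "ACGT"),
    ("m1/10/5_13", "ACGTACGT"),
    ("m1/11/0_2", "AC"),
]
selected = longest_subread_per_clr(reads)
```

In this example, `selected` keeps the 8-base subread for ZMW 10 and the only subread for ZMW 11.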
These tools are easy to use through command-line arguments. The main script, written in bash, uses Samtools to convert between the BAM and SAM file formats.
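The BAM-to-SAM conversion mentioned above can be reproduced outside SequelTools with a plain `samtools view` call. The sketch below wraps that call in Python. It is an illustration only, not the SequelTools script itself, and the function names and file names are hypothetical.

```python
import shutil
import subprocess


def bam_to_sam_command(bam_path, sam_path):
    """Build the samtools command that writes a BAM file out as SAM.

    -h keeps the header in the output; -o names the output file.
    """
    return ["samtools", "view", "-h", "-o", sam_path, bam_path]


def convert_bam_to_sam(bam_path, sam_path):
    """Run the conversion; requires samtools to be installed and on PATH."""
    if shutil.which("samtools") is None:
        raise RuntimeError("samtools was not found on PATH")
    subprocess.run(bam_to_sam_command(bam_path, sam_path), check=True)


# Hypothetical file names, shown only to illustrate the command that is built.
command = bam_to_sam_command("example.subreads.bam", "example.subreads.sam")
```

Calling `convert_bam_to_sam` with real paths runs the same command and raises an error if Samtools exits with a non-zero status.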
SequelTools is developed in R, bash, and Python. It requires BAM format files as input. The suite is freely accessible, fast, and efficient, and is available at https://github.com/ISUgenomics/SequelTools. It can be run on any operating system. The performance of SequelTools was also tested on benchmark data. The benchmark showed that SequelTools took around half an hour when processing scraps only and a little more than an hour when processing scraps and subreads.
For further details, see the original publication:
- Hufnagel, D. E., Hufford, M. B., & Seetharam, A. S. (2020). SequelTools: a suite of tools for working with PacBio Sequel raw sequence data. BMC Bioinformatics, 21(1), 1-11.