Connect with us

NGS

[Tutorial] Trailing of paired end reads using Trimmomatic tool in GALAXY.

Dr. Muniba Faiza

Published

on

How to generate config file for docking using Autodock Tools?

Trimmomatic is a read trimming tool for Illumina NGS data [1]. It is a flexible tool providing several functions to be operated on reads. These functions include trailing, leading, and several other quality control operations. In this article, we are going to perform trailing on NGS paired-end reads data using the GALAXY platform [2].

Trailing means cutting off the reads from the 3′-end (i.e., from right to left). We have an interleaved file of paired-end reads, let’s call it ‘input.fastq’. 

Deinterlacing the input file

  • Since this file is interleaved, we will have to deinterlace it. Click on the ‘FASTQ de-interlacer’ in the GALAXY tools.
  • Select fastq dataset, i.e., your interleaved file containing paired-end reads.
  • Click ‘Execute’.
    This will create four different files: one file for forward reads, one for reverse reads, and reads without a pair are placed into the other two files.

Running Trimmomatic

After deinterlacing our file, we will perform trailing on our two first output files.

  • Select ‘Trimmomatic’. 
  • Select ‘Paired-end (two separate input files)’ from the dropdown.
  • Now, select for the R1 input file, select output file from the last step, namely ‘FASTQ de-interlaced left mates from data2‘ fand then for the R2 input file, select another output file, namely, ‘FASTQ de-interlaced right mates from data2‘ from the next dropdown menu.
  • Now, select the trimming operation. We need to cut off at the end of the reads, therefore, we will select ‘Cut bases off the ens of a read, if below a threshold quality (TRAILING)’.
  • The generally considered minimum quality to keep a base is 20. So, enter 20 in the minimum quality text box.
  • Click ‘Execute‘. If the de-interlacer has finished the previous job, then trailing will start right away, otherwise, you will have to wait for the de-interlacer job to finish first.

References

  1. Bolger, A. M., Lohse, M., & Usadel, B. (2014). Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics30(15), 2114-2120.
  2. https://usegalaxy.org/

Dr. Muniba is a Bioinformatician based in New Delhi, India. She has completed her PhD in Bioinformatics from South China University of Technology, Guangzhou, China. She has cutting edge knowledge of bioinformatics tools, algorithms, and drug designing. When she is not reading she is found enjoying with the family. Know more about Muniba

NGS

How to extract methylation call using Bismark?

Dr. Muniba Faiza

Published

on

How to extract methylation call using Bismark?

Bismark is bioinformatics to map bisulfite treated sequencing reads and to perform methylation calls [1]. In this article, we are going to extract methylation information from Bismark alignment outputs. (more…)

Continue Reading

NGS

Installing PANDAseq on Ubuntu

Dr. Muniba Faiza

Published

on

Installing pandaseq on Ubuntu

PANDAseq is a bioinformatics tool that aligns paired-ends of Illumina sequences [1]. In this article, we are going to install PANDAseq on Ubuntu. (more…)

Continue Reading

NGS

Installing Bismark on Ubuntu

Dr. Muniba Faiza

Published

on

Installing Bismark on Ubuntu

Bismark is a bioinformatics tool to map bisulfite treated sequencing reads to a genome [1]. It also determines cytosine methylation sites. In this article, we will install Bismark on Ubuntu. (more…)

Continue Reading

NGS

FiNGS- A New Software providing Filters for Next Generation Sequencing

Published

on

fings

We use somatic variant callers to detect mutations in cancer samples by comparing sequencing data tumor and normal sample pairs. This is followed by some ad-hoc filtering that may produce low precision data resulting in a large number of false positives. (more…)

Continue Reading

NGS

IonCRAM: New Tool for Ion Torrent Sequence Files Compression

Published

on

IonCRAM for Ion Torrent file compression

One of the major next-generation sequencing (NGS) technologies that are most frequently used in medical research is Ion Torrent. Software for Ion Torrent machines provides output in BAM files that are huge in size. Additionally, their compression is also space expensive. (more…)

Continue Reading

HTS

Assembly of high-throughput mRNA-Seq data: A review

Published

on

By

Transcriptome represents the complete set of all expressed transcripts (RNA molecules) present in a cell or tissue at a given point of time. The transcriptome is always dynamic in nature and keeps on changing with time driven by the external and internal environment. (more…)

Continue Reading

Meta Analysis

Predictive metagenomics profiling: why, what and how ?

Published

on

What is predictive metagenomics profiling?

Recently, predictive metagenomics profiling (PMP) has been added to the microbial ecologist’s arsenal of strategies for probing microbial communities. (more…)

Continue Reading

NGS

ALFALFA explained

Dr. Muniba Faiza

Published

on

High throughput sequencing has revolutionized the new world of bioinformatics research. Since everyone is aware of the Human Genome project in which the human genome has been sequenced, millions of species have been sequenced so far. Sequencing is a very important aspect of bioinformatics so new faster and better sequencing techniques are needed . New sequencing platforms produce biological sequence fragments faster and cheaper.

(more…)

Continue Reading