Bioinformatics Review

Aligning DNA reads against a local database using DIAMOND

Dr. Muniba Faiza
Last updated: September 28, 2020 6:12 pm

DIAMOND is a program for high-throughput pairwise alignment of DNA reads and protein sequences against a protein reference database [1]. It is designed for the high-performance analysis of large sequence datasets. In this article, we will build a local database of protein sequences and align DNA reads against it.

Contents
  • Creating a reference database
  • Aligning DNA reads
  • References

Creating a reference database

Save all protein sequences in a single FASTA file, for example ‘db.fa’. Then build a reference database from these sequences using the following command.

$ diamond makedb --in db.fa -d nr_db

Here, --in specifies the input FASTA file and -d defines the output DIAMOND database file.
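Before building the database, it can help to confirm that the input file really contains FASTA records. The following is a minimal sketch rather than part of DIAMOND itself: the two sequences are made-up placeholders, the temporary working directory is an assumption for illustration, and the makedb step only runs if diamond is found on the PATH.

```shell
#!/bin/sh
set -eu

# Work in a throwaway directory so the example does not touch real data.
workdir=$(mktemp -d)
cd "$workdir"

# Two made-up protein records standing in for a real db.fa (illustration only).
cat > db.fa <<'EOF'
>protA hypothetical example protein
MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ
>protB hypothetical example protein
MSDNELKKLAEEALKRGDYEEAIKLY
EOF

# Every FASTA record starts with '>', so counting headers counts sequences.
n_seqs=$(grep -c '^>' db.fa)
echo "db.fa contains $n_seqs sequences"

# Build the DIAMOND database only when the program is installed.
if command -v diamond >/dev/null 2>&1; then
    diamond makedb --in db.fa -d nr_db
else
    echo "diamond not found on PATH; skipping makedb"
fi
```

With a real dataset, only the grep check and the makedb call are needed; the header count should match the number of sequences you expect in the database.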

If you wish to include taxonomy information as well, you can add the following arguments.

--taxonmap <gzip file> to map NCBI protein accession numbers to taxon ids. This is the gzip-compressed prot.accession2taxid mapping file, which can be downloaded from the NCBI taxonomy FTP site.

--taxonnodes <file> to supply the taxonomy nodes. This is the nodes.dmp file contained in the NCBI taxonomy dump archive.

--taxonnames <file> to supply the taxon names. This is the names.dmp file contained in the same NCBI taxonomy dump archive.

Now, the reference database is created as ‘nr_db.dmnd’.
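The taxonomy options described above are passed to the same makedb call. A hedged sketch follows: the local file names prot.accession2taxid.gz, nodes.dmp, and names.dmp are assumptions about how the NCBI downloads were saved, and the build is skipped unless diamond and every input file are present.

```shell
#!/bin/sh
set -eu

# Assumed local file names; adjust to wherever the NCBI files were saved.
taxonmap="prot.accession2taxid.gz"
taxonnodes="nodes.dmp"
taxonnames="names.dmp"

# Check that the program and every input file are available before running.
ready=yes
command -v diamond >/dev/null 2>&1 || ready=no
for f in db.fa "$taxonmap" "$taxonnodes" "$taxonnames"; do
    [ -f "$f" ] || ready=no
done

if [ "$ready" = yes ]; then
    # Same makedb call as before, with the three taxonomy inputs added.
    diamond makedb --in db.fa -d nr_db \
        --taxonmap "$taxonmap" \
        --taxonnodes "$taxonnodes" \
        --taxonnames "$taxonnames"
    status="built"
else
    echo "diamond or an input file is missing; nothing was built"
    status="skipped"
fi
```

Checking the inputs first avoids starting a long database build that would fail partway through on a missing taxonomy file.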

Aligning DNA reads

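Save all DNA reads in FASTA format, for example as ‘dna_reads.fna’. Align the reads against the reference database using the ‘blastx’ command of DIAMOND, which translates each DNA read and compares it with the protein sequences in the database. If you are aligning protein sequences instead, use ‘blastp’ in place of ‘blastx’.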

$ diamond blastx -d nr_db -q dna_reads.fna -o aligned_reads.m8 --sensitive --outfmt 6

By default, DIAMOND writes the BLAST tabular format (--outfmt 6), while --outfmt 0 selects the BLAST pairwise format. The sensitivity, output format, gap penalties, and other parameters can be set through the command-line options described in the DIAMOND documentation.
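Once the alignment finishes, tabular output can be summarised with standard shell tools. The sketch below assumes the 12 standard BLAST tabular columns (query id, subject id, percent identity, alignment length, mismatches, gap openings, query start, query end, subject start, subject end, e-value, bit score) and keeps the highest-scoring hit for each read. The rows are made-up placeholders written to a temporary file, not real DIAMOND results.

```shell
#!/bin/sh
set -eu

workdir=$(mktemp -d)
m8="$workdir/aligned_reads.m8"

# Made-up tab-separated rows in the 12-column layout (illustration only).
printf 'read1\tprotA\t95.0\t30\t1\t0\t1\t90\t1\t30\t1e-10\t60.5\n' >  "$m8"
printf 'read1\tprotB\t80.0\t28\t5\t0\t4\t87\t2\t29\t1e-05\t41.2\n' >> "$m8"
printf 'read2\tprotB\t88.0\t25\t3\t0\t1\t75\t1\t25\t1e-07\t48.9\n' >> "$m8"

# For each query (column 1), keep the subject (column 2) with the highest
# bit score (column 12), then print query, subject, and bit score.
best_hits=$(awk -F '\t' '
    !($1 in score) || ($12 + 0) > score[$1] {
        score[$1] = $12 + 0
        hit[$1] = $2 "\t" $12
    }
    END { for (q in hit) print q "\t" hit[q] }
' "$m8" | sort)

echo "$best_hits"
```

To use this on real results, point the m8 variable at your own output file and drop the three printf lines. This only applies when the alignment was written in tabular format.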


References

  1. Buchfink, B., Xie, C., & Huson, D. H. (2015). Fast and sensitive protein alignment using DIAMOND. Nature Methods, 12(1), 59-60.