Bioinformatics ReviewBioinformatics Review
Notification Show More
Font ResizerAa
  •  Home
  • Docking
  • MD Simulation
  • Tools
  • More Topics
    • Softwares
    • Sequence Analysis
    • Algorithms
    • Bioinformatics Programming
    • Bioinformatics Research Updates
    • Drug Discovery
    • Phylogenetics
    • Structural Bioinformatics
    • Editorials
    • Tips & Tricks
    • Bioinformatics News
    • Featured
    • Genomics
    • Bioinformatics Infographics
  • Community
    • BiR-Research Group
    • Community Q&A
    • Ask a question
    • Join Telegram Channel
    • Join Facebook Group
    • Join Reddit Group
    • Subscription Options
    • Become a Patron
    • Write for us
  • About Us
    • About BiR
    • BiR Scope
    • The Team
    • Guidelines for Research Collaboration
    • Feedback
    • Contact Us
    • Recent @ BiR
  • Subscription
  • Account
    • Visit Dashboard
    • Login
Font ResizerAa
Bioinformatics ReviewBioinformatics Review
Search
Have an existing account? Sign In
Follow US
Sequence AnalysisSoftwareTools

Aligning DNA reads against a local database using DIAMOND

Dr. Muniba Faiza
Last updated: September 28, 2020 6:12 pm
Dr. Muniba Faiza
Share
2 Min Read
pairwise alignment using DIAMOND
SHARE

DIAMOND is a program for high throughput pairwise alignment of DNA reads and protein sequences [1]. It is used for the high-performance analysis of large sequence data. In this article, we will make a local database of protein sequences and align protein sequences against the reference database.

Contents
  • Creating a reference database
  • Aligning DNA reads
    • References

Creating a reference database

Keep all FASTA sequences in a file, let’s name it as ‘db.fa’. Now make a reference database of these sequences using the following command.

$ diamond makedb --in db.fa -d nr_db

Here, -d defines the output DIAMOND database file.

If you wish to provide taxonomy features as well, then you can use the following arguments.

--taxonmap <gzip file> to map NCBI protein accession numbers to taxon ids. The gzip file can be downloaded from here.

--taxonnodes <gzip file> to map taxon nodes. The gzip file can be downloaded from here.

--taxonnames <gzip file> to map taxon names. The gzip file can be downloaded from here.

Now, the reference database is created as ‘nr_db.dmnd’.

Aligning DNA reads

Save all DNA reads in FASTA format, let’s name it as ‘dna_reads.fna’. Align the DNA reads pairwise using the ‘blastx’ module of DIAMOND. If you are aligning protein sequences, then use ‘blastp’ instead of ‘blastx’.

$ diamond blastx -d nr_db -q dna_reads.fna -o aligned_reads.m8 --sensitive --outfmt 0

The default output is the BLAST tabular format. You can set the output format, go through the command line options mentioned here. You can set the sensitivity, output format, gap penalty, and other parameters.


References

  1. Buchfink, B., Xie, C., & Huson, D. H. (2015). Fast and sensitive protein alignment using DIAMOND. Nature methods, 12(1), 59-60.
TAGGED:Align DNA readscommand-lineDIAMOND
Share This Article
Facebook Copy Link Print
ByDr. Muniba Faiza
Follow:
Dr. Muniba is a Bioinformatician based in New Delhi, India. She has completed her PhD in Bioinformatics from South China University of Technology, Guangzhou, China. She has cutting edge knowledge of bioinformatics tools, algorithms, and drug designing. When she is not reading she is found enjoying with the family. Know more about Muniba
Leave a Comment

Leave a Reply Cancel reply

You must be logged in to post a comment.

ai tools vs traditional tools in bioinformatics
AI Tools vs Traditional Tools in Bioinformatics- Which one to select?
Algorithms Artificial Intelligence Machine Learning Software Tools
AI vs Physics in Molecular Docking
AI vs Physics in Molecular Docking: Towards Faster and More Accurate Pose Prediction
Artificial Intelligence Drug Discovery Machine Learning
10 years of Bioinformatics Review: From a Blog to a Bioinformatics Knowledge Hub!
Editorial
Starting in Bioinformatics? Do This First!
Starting in Bioinformatics? Do This First!
Tips & Tricks

You Might Also Like

IPD2.0- An Updated Version of IPD to Analyze the SARS-CoV-2 Genome
SARSSoftwareTools

IPD2.0- An Updated Version of IPD to Analyze the SARS-CoV-2 Genome

June 2, 2021
alvis
SoftwareTools

Alvis- A New Tool for Alignment & Visualization of Contigs & Reads and Chimera Detection

March 21, 2021
NGlyAlign: A tool to align Highly Variable Regions in HIV envelope
Sequence AnalysisSoftwareTools

NGlyAlign- A New Tool to Align Highly Variable Regions in HIV Sequences

February 18, 2021
MD SimulationSoftwareTools

Tutorial: MD simulation output analysis of protein using GROMACS

April 6, 2023
Copyright 2024 IQL Technologies
  • Journal
  • Customer Support
  • Contact Us
  • FAQs
  • Terms of Use
  • Privacy Policy
  • Cookie Policy
  • Sitemap
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?

Not a member? Sign Up