In this article, we summarize the most widely used tools (online/ standalone) for transcription binding site prediction in DNA sequences.
This tool identifies putative transcription factor binding sites in DNA sequences . It acts as a virtual laboratory where it predicts the transcription factor binding sites based on constructed specific binding site weight matrices from the TANSFAC database . It provides an easy-to-use graphical user interface and downloadable output files. PROMO is currently using version 8.3 of TRANSFAC . It is available online at http://alggen.lsi.upc.es/cgi-bin/promo_v3/promo/promoinit.cgi?dirDB=TF_8.3
It is a database of eukaryotic transcription factors, their DNA binding profiles, and genomic binding sites . TRANSFAC consists of information about transcriptional regulation that can help in predicting potential transcription factor binding sites in DNA sequences. However, this is not freely accessible. You can buy it online at https://genexplain.com/transfac/#section0.
3. JASPAR CORE
JASPAR CORE is an online curated database of transcription factor binding sites [3,4]. This database consists of a non-redundant set of profiles and experimentally defined transcription factor binding sites of eukaryotes. It stores the profiles in the form of position frequency matrices (PFMs) and transcription factor flexible models (TFFMs). Users can find profiles for multiple species across six taxonomic groups. It is freely accessible at http://jaspar.genereg.net/.
This is another online prediction tool of transcription factor binding sites in DNA sequences . TFBIND searches for TATA boxes, GC boxes, CCAAT boxes, transcription start sites (TSS). The tool uses a weight matrix as defined by TRANSFAC R.3.4. It is quite easy to use. Users have to upload or enter a nucleic acid sequence in FASTA format and submit it. TFBIND also allows the users to obtain compressed results. It is freely accessible at http://tfbind.hgc.jp/.
Tfsitescan is an online predictor of transcription factor binding sites in DNA sequences . It works best with sequences consisting of around 500 nucleotides. Users can search for mammalian sites, prokaryotic sites, amphibian sites, yeast, avian, drosophila, and many other sites. It is freely available at http://www.ifti.org/cgi-bin/ifti/Tfsitescan.pl
- Messeguer, X., Escudero, R., Farré, D., Nuñez, O., Martı́nez, J., & Albà, M. M. (2002). PROMO: detection of known transcription regulatory elements using species-tailored searches. Bioinformatics, 18(2), 333-334.
- Wingender, E., Dietze, P., Karas, H., & Knüppel, R. (1996). TRANSFAC: a database on transcription factors and their DNA binding sites. Nucleic acids research, 24(1), 238-241.
- Khan, A., Fornes, O., Stigliani, A., Gheorghe, M., Castro-Mondragon, J. A., Van Der Lee, R., … & Mathelier, A. (2018). JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework. Nucleic acids research, 46(D1), D260-D266.
- Fornes, O., Castro-Mondragon, J. A., Khan, A., Van der Lee, R., Zhang, X., Richmond, P. A., … & Mathelier, A. (2020). JASPAR 2020: update of the open-access database of transcription factor binding profiles. Nucleic acids research, 48(D1), D87-D92.
- Tsunoda, T., & Takagi, T. (1999). Estimating transcription factor bindability on DNA. Bioinformatics (Oxford, England), 15(7), 622-630.
Video Tutorial: Calculating binding pocket volume using PyVol plugin.
How to generate topology from SMILES for MD Simulation?
[Tutorial] Installing jdock on Ubuntu (Linux).
jdock is an extended version of idock . It has the same features as the idock along with some bug fixes. However, the binary name and the GitHub repository names are changed. We are installing jdock on Ubuntu (Linux).
How to upgrade cmake on Ubuntu (Linux)?
In bioinformatics, cmake is used to install multiple software including GROMACS, jdock, and so on. Here is a short tutorial on how to upgrade cmake on Ubuntu and get rid of the previous version. (more…)
How to install GMXPBSA on Ubuntu (Linux)?
[Tutorial] Installing Pyrx on Windows.
How to solve ‘Could NOT find CUDA: Found unsuitable version “10.1”‘ error during GROMACS installation?
Compiling GROMACS  with GPU can be trivial. Previously, we have provided a few articles on the same. In this article, we will solve an error frequently occurring during GROMACS installation.
Installing Autodock4 on MacOS.
How to install Autodock4 on Ubuntu?
DS Visualizer: Uses & Applications
Protein structure & folding information exploited from remote homologs.
Remote homologs are similar protein structures that share similar functions, but there is no easily detectable sequence similarity in them. A new study has revealed that the protein folding information can be exploited from remote homologous structures. A new tool is developed to recognize such proteins and predict their structure and folding pathway. (more…)
Pathonoia- A new tool to detect pathogens in RNA-seq data.
Detecting viruses and bacteria in RNA-seq data with less false positive rate is a difficult task. A new tool is introduced to detect pathogens in RNA-seq data with high precision and recall known as Pathonoia .
AlphaFill- New algorithm to fill ligands in AlphaFold models.
AlphaFold is a popular artificial intelligence based protein prediction tool . Though it predicts good protein structures, it lacks the capability to predict the small molecules present in the structure such as ligands. For this purpose, AlphaFill is introduced by Hekkelman et al.,. (more…)
How to calculate binding pocket volume using PyVol plugin in PyMol?
How to generate electron density map using Pymol?
Installing PyVOL plugin in Pymol on Ubuntu (Linux).
How to execute matlab from terminal in Ubuntu (Linux)?
How to install Kpax on Ubuntu (Linux)?
Kpax is a bioinformatics program to search and align protein structures . It is currently available for Linux platforms only. In this article, we are going to install the latest version of Kpax (5.1.3) on Ubuntu (Linux). (more…)
How to run do_dssp command (mkdssp) in Gromacs 2022?
In the latest version of GROMACS (2022) , there are some issues regarding the gmx do_dssp command. Apparently, this command either does not run displaying a fatal error, or if it runs then it does not read any frame from MD simulation files. In this article, we are going to run the same command for GROMACS 2022. (more…)