cis-regulatory elements are DNA sequence segments that regulate gene expression. cis-regulatory elements consist of some regions such as promoters, enhancers, and so on. These regions consist of specific sequence motifs.
A newly expanded suite is developed to model the motif occurrence combinatorics in DNA sequences . This suite is known as MOCCA (Motif Occurrence Combinatorics Algorithms). It is based on support vector machines (SVM) and random forest (RF) models, SVM-MOCCA and RF-MOCCA respectively.
Users enter motifs, training sequences, and model specifications. The sequences are provided in FATSA format or can be generated by either an i.i.d. model or an N-th order Markov chain. Users can enter IUPAC nucleotide code motifs and Position Weight Matrix (PWM) motifs.
MOCCA implements three types of models:
- Dummy PREdictor
- including a new hierarchical model RF-MOCCA.
MOCCA is implemented in C++ with a minimal number of dependencies to ease the process of installation. It can be installed on Unix-based systems.
MOCCA is freely available to download from Github.
For more details, read here.
- Bredesen, B.A., Rehmsmeier, M. (2021). MOCCA: a flexible suite for modelling DNA sequence motif occurrence combinatorics. BMC Bioinformatics 22, 234.
vs_Analysis.py: A Python Script to Analyze Virtual Screening Results of Autodock Vina
The output files obtained as a result of virtual screening (VS) using Autodock Vina may be large in number. It is difficult or quite impossible to analyze them manually. Therefore, we are providing a Python script to fetch top results (i.e., compounds showing low binding affinities). (more…)