Bioinformatics ReviewBioinformatics Review
Notification Show More
Font ResizerAa
  •  Home
  • Docking
  • MD Simulation
  • Tools
  • More Topics
    • Softwares
    • Sequence Analysis
    • Algorithms
    • Bioinformatics Programming
    • Bioinformatics Research Updates
    • Drug Discovery
    • Phylogenetics
    • Structural Bioinformatics
    • Editorials
    • Tips & Tricks
    • Bioinformatics News
    • Featured
    • Genomics
    • Bioinformatics Infographics
  • Community
    • BiR-Research Group
    • Community Q&A
    • Ask a question
    • Join Telegram Channel
    • Join Facebook Group
    • Join Reddit Group
    • Subscription Options
    • Become a Patron
    • Write for us
  • About Us
    • About BiR
    • BiR Scope
    • The Team
    • Guidelines for Research Collaboration
    • Feedback
    • Contact Us
    • Recent @ BiR
  • Subscription
  • Account
    • Visit Dashboard
    • Login
Font ResizerAa
Bioinformatics ReviewBioinformatics Review
Search
Have an existing account? Sign In
Follow US
Bioinformatics ProgrammingCheminformaticsPython

How to obtain SMILES of ligands using PDB ligand IDs?

Dr. Muniba Faiza
Last updated: January 5, 2023 11:19 pm
Dr. Muniba Faiza
Share
2 Min Read
How to obtain SMILES of ligands using PDB ligand IDs?
SHARE

Fetching SMILE strings for a given number of SDF files of chemical compounds is not such a trivial task. We can quickly obtain them using RDKit or OpenBabel. But what if you don’t have SDF files of ligands in the first place? All you have is Ligand IDs from PDB. If they are a few then you can think of downloading SDF files manually but still, it seems time-consuming, especially when you have multiple compounds to work with. Therefore, we provide a Python script that will read all Ligand IDs and fetch their SDF files, and will finally convert them into SMILE strings.

Contents
  • Requirements
  • Usage
  • Availability
    • References

pdb_ligand_id-to-smi.ipynb is a Python notebook that will fetch SMILES for each ligand ID using RDKit [1] provided in a CSV file.

Requirements

This script requires Python3 and uses RDKit along with some additional packages. Install them using the following commands.

$ conda create -c conda-forge -n my-rdkit-env rdkit
$ conda activate my-rdkit-env
$ conda install pandas

Usage

Provide all Ligand IDs in the ‘lig-ids.csv‘ file and save it. Run the Jupyter notebook to get the results. The script will read ligand IDs, then download their respective SDF files, followed by combining them into a single SDF file. Finally, it will fetch SMILES from RDKit and write results in the ‘smiles.txt‘ file.

Availability

The script is available on GitHub in the ‘cheminformatics‘ repository.


References

  1. Landrum, G. (2013). Rdkit documentation. Release, 1 (1-79), 4.
TAGGED:Githubpdb ligand id to smilespythonrdkit
Share This Article
Facebook Copy Link Print
ByDr. Muniba Faiza
Follow:
Dr. Muniba is a Bioinformatician based in New Delhi, India. She has completed her PhD in Bioinformatics from South China University of Technology, Guangzhou, China. She has cutting edge knowledge of bioinformatics tools, algorithms, and drug designing. When she is not reading she is found enjoying with the family. Know more about Muniba
Leave a Comment

Leave a Reply Cancel reply

You must be logged in to post a comment.

ai tools vs traditional tools in bioinformatics
AI Tools vs Traditional Tools in Bioinformatics- Which one to select?
Algorithms Artificial Intelligence Machine Learning Software Tools
AI vs Physics in Molecular Docking
AI vs Physics in Molecular Docking: Towards Faster and More Accurate Pose Prediction
Artificial Intelligence Drug Discovery Machine Learning
10 years of Bioinformatics Review: From a Blog to a Bioinformatics Knowledge Hub!
Editorial
Starting in Bioinformatics? Do This First!
Starting in Bioinformatics? Do This First!
Tips & Tricks

You Might Also Like

Clustering using MCL
Bioinformatics ProgrammingSoftwareTools

How to perform graph-based clustering of peptide/protein sequences using MCL?

June 22, 2020
Installing cdk on Ubuntu
CheminformaticsSoftwareTools

Installing CDK (Chemistry Development Kit) on Ubuntu (Linux)

July 4, 2021
AlgorithmsBioinformatics ProgrammingPerl

How to read fasta sequences as hash using perl?

May 20, 2020
How to calculate drug-likeness using RDKit?
Bioinformatics ProgrammingPython

How to calculate drug-likeness using RDKit?

August 29, 2022
Copyright 2024 IQL Technologies
  • Journal
  • Customer Support
  • Contact Us
  • FAQs
  • Terms of Use
  • Privacy Policy
  • Cookie Policy
  • Sitemap
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?

Not a member? Sign Up