
DrugBank Database
Provided By: John Snow Labs

DrugBank Database
Provided By: John Snow Labs
This data package contains information on DrugBank identifiers, names, and synonyms to permit easy linking and integration into any type of project. DrugBank is a richly annotated resource that combines detailed drug data with comprehensive drug target and drug action information. DrugBank is widely used to facilitate in silico drug target discovery, drug design, drug docking or screening, drug metabolism prediction, drug interaction prediction and general pharmaceutical education.
Product offers
The following offers are available for this product. Choose an offer to view the pricing and access duration options for the offer. Select an offer and continue to subscribe. Your subscription begins on the date that you accept the offer. Additional taxes or fees might apply.
Public offer
Payment schedule: Upfront payment | Offer auto-renewal: Supported
$0 for 12 months
Overview
Overview
The DrugBank database is a unique bioinformatics and cheminformatics resource that combines detailed drug (i.e. chemical, pharmacological and pharmaceutical) data with comprehensive drug target (i.e. sequence, structure, and pathway) information. The database contains 9591 drug entries including 2037 FDA-approved small molecule drugs, 241 FDA-approved biotech (protein/peptide) drugs, 96 nutraceuticals and over 6000 experimental drugs. Additionally, 4661 non-redundant protein (i.e. drug target/enzyme/transporter/carrier) sequences are linked to these drug entries.
License Information
The use of John Snow Labs datasets is free for personal and research purposes. For commercial use please subscribe to the Data Library on AWS. The subscription will allow you to use all John Snow Labs datasets and data packages for commercial purposes.
Metadata
Description | Value |
---|---|
Data Package Complexity | Simple |
Available Enrichments | N/A |
Keywords | Drugbank Target, Drug Molecules List, Drugbank Download, Drugbank Database PDF, Drugbank XML |
Other Titles | How to Cite Drugbank, DrugBank Vocabulary Dataset |
Included Datasets
DrugBank Vocabulary
DrugBank is a richly annotated resource that combines drug data with drug target and drug action information. Released in 2006, DrugBank has been widely used to facilitate in silico drug target discovery, drug design, drug docking or screening, drug metabolism prediction, drug interaction prediction and general pharmaceutical education. "Drug Bank Vocabulary" contains information on DrugBank identifiers, names, and synonyms to permit easy linking and integration into any type of project.
Data Engineering Overview
We deliver high-quality data
- Each dataset goes through 3 levels of quality review
- 2 Manual reviews are done by domain experts
- Then, an automated set of 60+ validations enforces every datum matches metadata & defined constraints
- Data is normalized into one unified type system
- All dates, unites, codes, currencies look the same
- All null values are normalized to the same value
- All dataset and field names are SQL and Hive compliant
- Data and Metadata
- Data is available in both CSV and Apache Parquet format, optimized for high read performance on distributed Hadoop, Spark & MPP clusters
- Metadata is provided in the open Frictionless Data standard, and its every field is normalized & validated
- Data Updates
- Data updates support replace-on-update: outdated foreign keys are deprecated, not deleted
Our data is curated and enriched by domain experts
Each dataset is manually curated by our team of doctors, pharmacists, public health & medical billing experts:
- Field names, descriptions, and normalized values are chosen by people who actually understand their meaning
- Healthcare & life science experts add categories, search keywords, descriptions and more to each dataset
- Both manual and automated data enrichment supported for clinical codes, providers, drugs, and geo-locations
- The data is always kept up to date – even when the source requires manual effort to get updates
- Support for data subscribers is provided directly by the domain experts who curated the data sets
- Every data source’s license is manually verified to allow for royalty-free commercial use and redistribution.
Need Help?
- If you have questions about our products, contact us at info@johnsnowlabs.com.
About Us
John Snow Labs , an AI and NLP for healthcare company, provides state-of-the-art software, models, and data to help healthcare and life science organizations build, deploy, and operate AI projects.
Provided By
Fulfillment Method
AWS Data Exchange
Data dictionaries
A data dictionary is a visual representation of the contents of a data set. The data represented here is for evaluation purposes only and may not accurately represent the actual content of the product.
Field name | Description | Example | Data type | Primary key | Nullable | Max length | Data set | Table name | Table description |
---|---|---|---|---|---|---|---|---|---|
Drug_Bank_Id | Unique Drug Bank ID consisting of a 2 letter prefix (DB) and a 5 number suffix. Identifier of the chemical in the DrugBank database ('|'-delimited list). The DrugBank database is a unique bioinformatics and cheminformatics resource that combines detailed drug (i.e. chemical, pharmacological and pharmaceutical) data with comprehensive drug target (i.e. sequence, structure, and pathway) information. | - | string | - | false | - | DrugBank Vocabulary | drugbank_vocabulary | |
Accession_Number | Accession number consists of the 4 letter prefix and a 5 number suffix. Each Accession number is unique to the drug's generic name. The 4 letter suffix (APRD, EXPT, BIOD, NUTR) indicates the type of drug (APRD=approved small molecule drug, EXPT=experimental drug, BIOD=biotech drug, NUTR=nutraceutical or natural product). ('|'-delimited list) | - | string | - | - | - | DrugBank Vocabulary | drugbank_vocabulary | |
Common_Name_of_Drug | Chemical name of the drug as provided by the manufacturer | - | string | - | false | - | DrugBank Vocabulary | drugbank_vocabulary | |
CAS_Registry_Number | Chemical Abstract Service (CAS) identification number | - | string | - | - | - | DrugBank Vocabulary | drugbank_vocabulary | |
FDA_UNII_Code | UNII (Substance Registration System - Unique Ingredient Identifier (UNII) code is used for substances in drugs, biologics, foods, and devices. The UNII is a non- proprietary, free, unique, unambiguous, non semantic, alphanumeric identifier based on a substance’s molecular structure and/or descriptive information | - | string | - | - | - | DrugBank Vocabulary | drugbank_vocabulary | |
Synonym | Alternate names (protein names, abbreviations, etc.) by which the chemical is known. ('|'-delimited list) | Hirudin variant-1 | Lepirudin | Lepirudin recombinant | string | - | - | - | DrugBank Vocabulary | drugbank_vocabulary | |
Standard_InChI_Key | Standard InChI key. InCHI (The IUPAC International Chemical Identifier) is a non-proprietary identifier for chemical substances that can be used in printed and electronic data sources thus enabling easier linking of diverse data compilations. | - | string | - | - | - | DrugBank Vocabulary | drugbank_vocabulary |
Data sets (1)
You will receive access to the following data sets
Revision access rules
All historical revisions | All future revisions
Name | Type | Data dictionary | AWS Region |
---|---|---|---|
DrugBank Vocabulary | Included | US East (N. Virginia) |
Data dictionaries and samples
Sample data is for evaluation purposes only and may not accurately represent the actual content of the product.
Name | Resource | File type | File size | Description | ||
---|---|---|---|---|---|---|
DrugBank Vocabulary | Data set | - | - | - | ||
Data dictionary | Data dictionary | text/csv | - |
Usage information
By subscribing to this product, you agree that your use of this product is subject to the provider's offer terms including pricing information and Data Subscription Agreement . Your use of AWS services remains subject to the AWS Customer Agreement or other agreement with AWS governing your use of such services.
Support information
Support contact email address
Support contact URL
Refund policy
No refunds offered. For any questions email us at info@johnsnowlabs.com
General AWS Data Exchange support