Select your cookie preferences

We use essential cookies and similar tools that are necessary to provide our site and services. We use performance cookies to collect anonymous statistics, so we can understand how customers use our site and make improvements. Essential cookies cannot be deactivated, but you can choose “Customize” or “Decline” to decline performance cookies.

If you agree, AWS and approved third parties will also use cookies to provide useful site features, remember your preferences, and display relevant content, including relevant advertising. To accept or decline all non-essential cookies, choose “Accept” or “Decline.” To make more detailed choices, choose “Customize.”

Sign in
Your Saved List Become a Channel Partner Sell in AWS Marketplace Amazon Web Services Home Help

DrugBank Database

Provided By: John Snow Labs

DrugBank Database

Provided By: John Snow Labs

This data package contains information on DrugBank identifiers, names, and synonyms to permit easy linking and integration into any type of project. DrugBank is a richly annotated resource that combines detailed drug data with comprehensive drug target and drug action information. DrugBank is widely used to facilitate in silico drug target discovery, drug design, drug docking or screening, drug metabolism prediction, drug interaction prediction and general pharmaceutical education.

Product offers

The following offers are available for this product. Choose an offer to view the pricing and access duration options for the offer. Select an offer and continue to subscribe. Your subscription begins on the date that you accept the offer. Additional taxes or fees might apply.

Public offer

Payment schedule: Upfront payment | Offer auto-renewal: Supported
$0 for 12 months

Overview

Overview

The DrugBank database is a unique bioinformatics and cheminformatics resource that combines detailed drug (i.e. chemical, pharmacological and pharmaceutical) data with comprehensive drug target (i.e. sequence, structure, and pathway) information. The database contains 9591 drug entries including 2037 FDA-approved small molecule drugs, 241 FDA-approved biotech (protein/peptide) drugs, 96 nutraceuticals and over 6000 experimental drugs. Additionally, 4661 non-redundant protein (i.e. drug target/enzyme/transporter/carrier) sequences are linked to these drug entries.


License Information

The use of John Snow Labs datasets is free for personal and research purposes. For commercial use please subscribe to the Data Library on AWS. The subscription will allow you to use all John Snow Labs datasets and data packages for commercial purposes.


Metadata

DescriptionValue
Data Package ComplexitySimple
Available EnrichmentsN/A
KeywordsDrugbank Target, Drug Molecules List, Drugbank Download, Drugbank Database PDF, Drugbank XML
Other TitlesHow to Cite Drugbank, DrugBank Vocabulary Dataset

Included Datasets

  1. DrugBank Vocabulary

    DrugBank is a richly annotated resource that combines drug data with drug target and drug action information. Released in 2006, DrugBank has been widely used to facilitate in silico drug target discovery, drug design, drug docking or screening, drug metabolism prediction, drug interaction prediction and general pharmaceutical education. "Drug Bank Vocabulary" contains information on DrugBank identifiers, names, and synonyms to permit easy linking and integration into any type of project.


Data Engineering Overview

We deliver high-quality data
  • Each dataset goes through 3 levels of quality review
    • 2 Manual reviews are done by domain experts
    • Then, an automated set of 60+ validations enforces every datum matches metadata & defined constraints
  • Data is normalized into one unified type system
    • All dates, unites, codes, currencies look the same
    • All null values are normalized to the same value
    • All dataset and field names are SQL and Hive compliant
  • Data and Metadata
    • Data is available in both CSV and Apache Parquet format, optimized for high read performance on distributed Hadoop, Spark & MPP clusters
    • Metadata is provided in the open Frictionless Data standard, and its every field is normalized & validated
  • Data Updates
    • Data updates support replace-on-update: outdated foreign keys are deprecated, not deleted

Our data is curated and enriched by domain experts

Each dataset is manually curated by our team of doctors, pharmacists, public health & medical billing experts:

  • Field names, descriptions, and normalized values are chosen by people who actually understand their meaning
  • Healthcare & life science experts add categories, search keywords, descriptions and more to each dataset
  • Both manual and automated data enrichment supported for clinical codes, providers, drugs, and geo-locations
  • The data is always kept up to date – even when the source requires manual effort to get updates
  • Support for data subscribers is provided directly by the domain experts who curated the data sets
  • Every data source’s license is manually verified to allow for royalty-free commercial use and redistribution.

Need Help?


About Us

John Snow Labs , an AI and NLP for healthcare company, provides state-of-the-art software, models, and data to help healthcare and life science organizations build, deploy, and operate AI projects.

Provided By
Fulfillment Method
AWS Data Exchange

Data dictionaries

A data dictionary is a visual representation of the contents of a data set. The data represented here is for evaluation purposes only and may not accurately represent the actual content of the product.

Field name
Description
Example
Data type
Primary key
Nullable
Max length
Data set
Table name
Table description
Drug_Bank_Id

Unique Drug Bank ID consisting of a 2 letter prefix (DB) and a 5 number suffix. Identifier of the chemical in the DrugBank database ('|'-delimited list). The DrugBank database is a unique bioinformatics and cheminformatics resource that combines detailed drug (i.e. chemical, pharmacological and pharmaceutical) data with comprehensive drug target (i.e. sequence, structure, and pathway) information.

-
string
-
false
-
DrugBank Vocabulary
drugbank_vocabulary
Accession_Number

Accession number consists of the 4 letter prefix and a 5 number suffix. Each Accession number is unique to the drug's generic name. The 4 letter suffix (APRD, EXPT, BIOD, NUTR) indicates the type of drug (APRD=approved small molecule drug, EXPT=experimental drug, BIOD=biotech drug, NUTR=nutraceutical or natural product). ('|'-delimited list)

-
string
-
-
-
DrugBank Vocabulary
drugbank_vocabulary
Common_Name_of_Drug

Chemical name of the drug as provided by the manufacturer

-
string
-
false
-
DrugBank Vocabulary
drugbank_vocabulary
CAS_Registry_Number

Chemical Abstract Service (CAS) identification number

-
string
-
-
-
DrugBank Vocabulary
drugbank_vocabulary
FDA_UNII_Code

UNII (Substance Registration System - Unique Ingredient Identifier (UNII) code is used for substances in drugs, biologics, foods, and devices. The UNII is a non- proprietary, free, unique, unambiguous, non semantic, alphanumeric identifier based on a substance’s molecular structure and/or descriptive information

-
string
-
-
-
DrugBank Vocabulary
drugbank_vocabulary
Synonym

Alternate names (protein names, abbreviations, etc.) by which the chemical is known. ('|'-delimited list)

Hirudin variant-1 | Lepirudin | Lepirudin recombinant

string
-
-
-
DrugBank Vocabulary
drugbank_vocabulary
Standard_InChI_Key

Standard InChI key. InCHI (The IUPAC International Chemical Identifier) is a non-proprietary identifier for chemical substances that can be used in printed and electronic data sources thus enabling easier linking of diverse data compilations.

-
string
-
-
-
DrugBank Vocabulary
drugbank_vocabulary

Data sets (1)

You will receive access to the following data sets

Revision access rules
All historical revisions | All future revisions
Name
Type
Data dictionary
AWS Region
DrugBank Vocabulary
Included
US East (N. Virginia)

Data dictionaries and samples

Sample data is for evaluation purposes only and may not accurately represent the actual content of the product.

Name
Resource
File type
File size
Description
DrugBank Vocabulary
Data set
-
-
-
Data dictionary
Data dictionary
text/csv
-

Usage information

By subscribing to this product, you agree that your use of this product is subject to the provider's offer terms including pricing information and Data Subscription Agreement . Your use of AWS services remains subject to the AWS Customer Agreement  or other agreement with AWS governing your use of such services.

Support information

Support contact email address
Refund policy
No refunds offered. For any questions email us at info@johnsnowlabs.com
General AWS Data Exchange support