Disease-specific differentiation between drugs and non-drugs using principal component analysis of their molecular descriptor space

Alfonso T. García-Sosa, Mare Oja, C. Hetényi, Uko Maran

Research output: Contribution to journalArticle

8 Citations (Scopus)

Abstract

The physicochemical descriptor space has been extensively mapped and described in the literature for orally administered drugs and lead compounds. However, consideration of negative examples (non-drugs) or disease pathophysiology is not common in many studies. In the present work, a principal component analysis was carried out using drugs and non-drugs taking into account disease- and organ-specific categories, as well as different administration routes in addition to oral. The study involves 1386 relevant small-molecules including natural and synthetic products. Drug-specific as well as disease-category-specific or organ-specific regions and their respective threshold sets (ranges of descriptors) relative to non-drugs were elucidated on the scores plot and validated with external, independent sets of drugs and non-drugs. The respective loadings plot of molecular descriptors was rationalized in terms of physicochemically relevant groups related to the components of solvation free energy. The results of this analysis can contribute to the improved profiling of drug candidates and libraries making use of disease- and organ-specificity coded by physicochemical descriptors and ligand binding efficiency.

Original languageEnglish
Pages (from-to)369-383
Number of pages15
JournalMolecular Informatics
Volume31
Issue number5
DOIs
Publication statusPublished - May 2012

Fingerprint

Principal Component Analysis
Principal component analysis
Pharmaceutical Preparations
Lead compounds
Solvation
Organ Specificity
Free energy
Ligands
Libraries
Molecules

Keywords

  • Drugs
  • Non-drugs
  • Principal component analysis

ASJC Scopus subject areas

  • Organic Chemistry
  • Computer Science Applications
  • Drug Discovery
  • Molecular Medicine
  • Structural Biology

Cite this

Disease-specific differentiation between drugs and non-drugs using principal component analysis of their molecular descriptor space. / García-Sosa, Alfonso T.; Oja, Mare; Hetényi, C.; Maran, Uko.

In: Molecular Informatics, Vol. 31, No. 5, 05.2012, p. 369-383.

Research output: Contribution to journalArticle

@article{428268227acc42c993deffa902f211bc,
title = "Disease-specific differentiation between drugs and non-drugs using principal component analysis of their molecular descriptor space",
abstract = "The physicochemical descriptor space has been extensively mapped and described in the literature for orally administered drugs and lead compounds. However, consideration of negative examples (non-drugs) or disease pathophysiology is not common in many studies. In the present work, a principal component analysis was carried out using drugs and non-drugs taking into account disease- and organ-specific categories, as well as different administration routes in addition to oral. The study involves 1386 relevant small-molecules including natural and synthetic products. Drug-specific as well as disease-category-specific or organ-specific regions and their respective threshold sets (ranges of descriptors) relative to non-drugs were elucidated on the scores plot and validated with external, independent sets of drugs and non-drugs. The respective loadings plot of molecular descriptors was rationalized in terms of physicochemically relevant groups related to the components of solvation free energy. The results of this analysis can contribute to the improved profiling of drug candidates and libraries making use of disease- and organ-specificity coded by physicochemical descriptors and ligand binding efficiency.",
keywords = "Drugs, Non-drugs, Principal component analysis",
author = "Garc{\'i}a-Sosa, {Alfonso T.} and Mare Oja and C. Het{\'e}nyi and Uko Maran",
year = "2012",
month = "5",
doi = "10.1002/minf.201100094",
language = "English",
volume = "31",
pages = "369--383",
journal = "Molecular Informatics",
issn = "1868-1743",
publisher = "Wiley - VCH Verlag GmbH & CO. KGaA",
number = "5",

}

TY - JOUR

T1 - Disease-specific differentiation between drugs and non-drugs using principal component analysis of their molecular descriptor space

AU - García-Sosa, Alfonso T.

AU - Oja, Mare

AU - Hetényi, C.

AU - Maran, Uko

PY - 2012/5

Y1 - 2012/5

N2 - The physicochemical descriptor space has been extensively mapped and described in the literature for orally administered drugs and lead compounds. However, consideration of negative examples (non-drugs) or disease pathophysiology is not common in many studies. In the present work, a principal component analysis was carried out using drugs and non-drugs taking into account disease- and organ-specific categories, as well as different administration routes in addition to oral. The study involves 1386 relevant small-molecules including natural and synthetic products. Drug-specific as well as disease-category-specific or organ-specific regions and their respective threshold sets (ranges of descriptors) relative to non-drugs were elucidated on the scores plot and validated with external, independent sets of drugs and non-drugs. The respective loadings plot of molecular descriptors was rationalized in terms of physicochemically relevant groups related to the components of solvation free energy. The results of this analysis can contribute to the improved profiling of drug candidates and libraries making use of disease- and organ-specificity coded by physicochemical descriptors and ligand binding efficiency.

AB - The physicochemical descriptor space has been extensively mapped and described in the literature for orally administered drugs and lead compounds. However, consideration of negative examples (non-drugs) or disease pathophysiology is not common in many studies. In the present work, a principal component analysis was carried out using drugs and non-drugs taking into account disease- and organ-specific categories, as well as different administration routes in addition to oral. The study involves 1386 relevant small-molecules including natural and synthetic products. Drug-specific as well as disease-category-specific or organ-specific regions and their respective threshold sets (ranges of descriptors) relative to non-drugs were elucidated on the scores plot and validated with external, independent sets of drugs and non-drugs. The respective loadings plot of molecular descriptors was rationalized in terms of physicochemically relevant groups related to the components of solvation free energy. The results of this analysis can contribute to the improved profiling of drug candidates and libraries making use of disease- and organ-specificity coded by physicochemical descriptors and ligand binding efficiency.

KW - Drugs

KW - Non-drugs

KW - Principal component analysis

UR - http://www.scopus.com/inward/record.url?scp=84860505355&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84860505355&partnerID=8YFLogxK

U2 - 10.1002/minf.201100094

DO - 10.1002/minf.201100094

M3 - Article

AN - SCOPUS:84860505355

VL - 31

SP - 369

EP - 383

JO - Molecular Informatics

JF - Molecular Informatics

SN - 1868-1743

IS - 5

ER -