Bioinformatical approaches to characterize intrinsically disordered/unstructured proteins

Z. Dosztányi, Bálint Mészáros, I. Simon

Research output: Contribution to journalArticle

86 Citations (Scopus)

Abstract

Intrinsically disordered/unstructured proteins exist without a stable three-dimensional (3D) structure as highly flexible conformational ensembles. The available genome sequences revealed that these proteins are surprisingly common and their frequency reaches high proportions in eukaryotes. Due to their vital role in various biological processes including signaling and regulation and their involvement in various diseases, disordered proteins and protein segments are the focus of many biochemical, molecular biological, pathological and pharmaceutical studies. These proteins are difficult to study experimentally because of the lack of unique structure in the isolated form. Their amino acid sequence, however, is available, and can be used for their identification and characterization by bioinformatic tools, analogously to globular proteins. In this review, we first present a small survey of current methods to identify disordered proteins or protein segments, focusing on those that are publicly available as web servers. In more detail we also discuss approaches that predict disordered regions and specific regions involved in protein binding by modeling the physical background of protein disorder. In our review we argue that the heterogeneity of disordered segments needs to be taken into account for a better understanding of protein disorder.

Original languageEnglish
Article numberbbp061
Pages (from-to)225-243
Number of pages19
JournalBriefings in Bioinformatics
Volume11
Issue number2
DOIs
Publication statusPublished - Dec 10 2009

Fingerprint

Intrinsically Disordered Proteins
Proteins
Biological Phenomena
Computational Biology
Eukaryota
Protein Binding
Bioinformatics
Amino Acid Sequence
Drug products
Amino acids
Genome
Servers
Genes

Keywords

  • Binary classification
  • Coupled folding and binding
  • Machine-learning algorithm
  • Prediction method
  • Protein disorder

ASJC Scopus subject areas

  • Molecular Biology
  • Information Systems

Cite this

Bioinformatical approaches to characterize intrinsically disordered/unstructured proteins. / Dosztányi, Z.; Mészáros, Bálint; Simon, I.

In: Briefings in Bioinformatics, Vol. 11, No. 2, bbp061, 10.12.2009, p. 225-243.

Research output: Contribution to journalArticle

@article{093d252bbee04e59aea85ac04a4cfa36,
title = "Bioinformatical approaches to characterize intrinsically disordered/unstructured proteins",
abstract = "Intrinsically disordered/unstructured proteins exist without a stable three-dimensional (3D) structure as highly flexible conformational ensembles. The available genome sequences revealed that these proteins are surprisingly common and their frequency reaches high proportions in eukaryotes. Due to their vital role in various biological processes including signaling and regulation and their involvement in various diseases, disordered proteins and protein segments are the focus of many biochemical, molecular biological, pathological and pharmaceutical studies. These proteins are difficult to study experimentally because of the lack of unique structure in the isolated form. Their amino acid sequence, however, is available, and can be used for their identification and characterization by bioinformatic tools, analogously to globular proteins. In this review, we first present a small survey of current methods to identify disordered proteins or protein segments, focusing on those that are publicly available as web servers. In more detail we also discuss approaches that predict disordered regions and specific regions involved in protein binding by modeling the physical background of protein disorder. In our review we argue that the heterogeneity of disordered segments needs to be taken into account for a better understanding of protein disorder.",
keywords = "Binary classification, Coupled folding and binding, Machine-learning algorithm, Prediction method, Protein disorder",
author = "Z. Doszt{\'a}nyi and B{\'a}lint M{\'e}sz{\'a}ros and I. Simon",
year = "2009",
month = "12",
day = "10",
doi = "10.1093/bib/bbp061",
language = "English",
volume = "11",
pages = "225--243",
journal = "Briefings in Bioinformatics",
issn = "1467-5463",
publisher = "Oxford University Press",
number = "2",

}

TY - JOUR

T1 - Bioinformatical approaches to characterize intrinsically disordered/unstructured proteins

AU - Dosztányi, Z.

AU - Mészáros, Bálint

AU - Simon, I.

PY - 2009/12/10

Y1 - 2009/12/10

N2 - Intrinsically disordered/unstructured proteins exist without a stable three-dimensional (3D) structure as highly flexible conformational ensembles. The available genome sequences revealed that these proteins are surprisingly common and their frequency reaches high proportions in eukaryotes. Due to their vital role in various biological processes including signaling and regulation and their involvement in various diseases, disordered proteins and protein segments are the focus of many biochemical, molecular biological, pathological and pharmaceutical studies. These proteins are difficult to study experimentally because of the lack of unique structure in the isolated form. Their amino acid sequence, however, is available, and can be used for their identification and characterization by bioinformatic tools, analogously to globular proteins. In this review, we first present a small survey of current methods to identify disordered proteins or protein segments, focusing on those that are publicly available as web servers. In more detail we also discuss approaches that predict disordered regions and specific regions involved in protein binding by modeling the physical background of protein disorder. In our review we argue that the heterogeneity of disordered segments needs to be taken into account for a better understanding of protein disorder.

AB - Intrinsically disordered/unstructured proteins exist without a stable three-dimensional (3D) structure as highly flexible conformational ensembles. The available genome sequences revealed that these proteins are surprisingly common and their frequency reaches high proportions in eukaryotes. Due to their vital role in various biological processes including signaling and regulation and their involvement in various diseases, disordered proteins and protein segments are the focus of many biochemical, molecular biological, pathological and pharmaceutical studies. These proteins are difficult to study experimentally because of the lack of unique structure in the isolated form. Their amino acid sequence, however, is available, and can be used for their identification and characterization by bioinformatic tools, analogously to globular proteins. In this review, we first present a small survey of current methods to identify disordered proteins or protein segments, focusing on those that are publicly available as web servers. In more detail we also discuss approaches that predict disordered regions and specific regions involved in protein binding by modeling the physical background of protein disorder. In our review we argue that the heterogeneity of disordered segments needs to be taken into account for a better understanding of protein disorder.

KW - Binary classification

KW - Coupled folding and binding

KW - Machine-learning algorithm

KW - Prediction method

KW - Protein disorder

UR - http://www.scopus.com/inward/record.url?scp=77950957385&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77950957385&partnerID=8YFLogxK

U2 - 10.1093/bib/bbp061

DO - 10.1093/bib/bbp061

M3 - Article

C2 - 20007729

AN - SCOPUS:77950957385

VL - 11

SP - 225

EP - 243

JO - Briefings in Bioinformatics

JF - Briefings in Bioinformatics

SN - 1467-5463

IS - 2

M1 - bbp061

ER -