An algebro-topological description of protein domain structure

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Standard

An algebro-topological description of protein domain structure. / Penner, Robert Clark; Knudsen, Michael; Wiuf, Carsten; Andersen, Jørgen Ellegaard.

I: PLoS ONE, Bind 6, Nr. 5, e19670, 27.05.2011.

Publikation: Bidrag til tidsskriftTidsskriftartikelForskningfagfællebedømt

Harvard

Penner, RC, Knudsen, M, Wiuf, C & Andersen, JE 2011, 'An algebro-topological description of protein domain structure', PLoS ONE, bind 6, nr. 5, e19670. https://doi.org/10.1371/journal.pone.0019670

APA

Penner, R. C., Knudsen, M., Wiuf, C., & Andersen, J. E. (2011). An algebro-topological description of protein domain structure. PLoS ONE, 6(5), [e19670]. https://doi.org/10.1371/journal.pone.0019670

Vancouver

Penner RC, Knudsen M, Wiuf C, Andersen JE. An algebro-topological description of protein domain structure. PLoS ONE. 2011 maj 27;6(5). e19670. https://doi.org/10.1371/journal.pone.0019670

Author

Penner, Robert Clark ; Knudsen, Michael ; Wiuf, Carsten ; Andersen, Jørgen Ellegaard. / An algebro-topological description of protein domain structure. I: PLoS ONE. 2011 ; Bind 6, Nr. 5.

Bibtex

@article{f3727d5c60ce4be8ac46244ba2ade6c1,
title = "An algebro-topological description of protein domain structure",
abstract = "The space of possible protein structures appears vast and continuous, and the relationship between primary, secondary and tertiary structure levels is complex. Protein structure comparison and classification is therefore a difficult but important task since structure is a determinant for molecular interaction and function. We introduce a novel mathematical abstraction based on geometric topology to describe protein domain structure. Using the locations of the backbone atoms and the hydrogen bonds, we build a combinatorial object - a so-called fatgraph. The description is discrete yet gives rise to a 2-dimensional mathematical surface. Thus, each protein domain corresponds to a particular mathematical surface with characteristic topological invariants, such as the genus (number of holes) and the number of boundary components. Both invariants are global fatgraph features reflecting the interconnectivity of the domain by hydrogen bonds. We introduce the notion of robust variables, that is variables that are robust towards minor changes in the structure/fatgraph, and show that the genus and the number of boundary components are robust. Further, we invesigate the distribution of different fatgraph variables and show how only four variables are capable of distinguishing different folds. We use local (secondary) and global (tertiary) fatgraph features to describe domain structures and illustrate that they are useful for classification of domains in CATH. In addition, we combine our method with two other methods thereby using primary, secondary, and tertiary structure information, and show that we can identify a large percentage of new and unclassified structures in CATH.",
author = "Penner, {Robert Clark} and Michael Knudsen and Carsten Wiuf and Andersen, {J{\o}rgen Ellegaard}",
year = "2011",
month = may,
day = "27",
doi = "10.1371/journal.pone.0019670",
language = "English",
volume = "6",
journal = "PLoS ONE",
issn = "1932-6203",
publisher = "Public Library of Science",
number = "5",

}

RIS

TY - JOUR

T1 - An algebro-topological description of protein domain structure

AU - Penner, Robert Clark

AU - Knudsen, Michael

AU - Wiuf, Carsten

AU - Andersen, Jørgen Ellegaard

PY - 2011/5/27

Y1 - 2011/5/27

N2 - The space of possible protein structures appears vast and continuous, and the relationship between primary, secondary and tertiary structure levels is complex. Protein structure comparison and classification is therefore a difficult but important task since structure is a determinant for molecular interaction and function. We introduce a novel mathematical abstraction based on geometric topology to describe protein domain structure. Using the locations of the backbone atoms and the hydrogen bonds, we build a combinatorial object - a so-called fatgraph. The description is discrete yet gives rise to a 2-dimensional mathematical surface. Thus, each protein domain corresponds to a particular mathematical surface with characteristic topological invariants, such as the genus (number of holes) and the number of boundary components. Both invariants are global fatgraph features reflecting the interconnectivity of the domain by hydrogen bonds. We introduce the notion of robust variables, that is variables that are robust towards minor changes in the structure/fatgraph, and show that the genus and the number of boundary components are robust. Further, we invesigate the distribution of different fatgraph variables and show how only four variables are capable of distinguishing different folds. We use local (secondary) and global (tertiary) fatgraph features to describe domain structures and illustrate that they are useful for classification of domains in CATH. In addition, we combine our method with two other methods thereby using primary, secondary, and tertiary structure information, and show that we can identify a large percentage of new and unclassified structures in CATH.

AB - The space of possible protein structures appears vast and continuous, and the relationship between primary, secondary and tertiary structure levels is complex. Protein structure comparison and classification is therefore a difficult but important task since structure is a determinant for molecular interaction and function. We introduce a novel mathematical abstraction based on geometric topology to describe protein domain structure. Using the locations of the backbone atoms and the hydrogen bonds, we build a combinatorial object - a so-called fatgraph. The description is discrete yet gives rise to a 2-dimensional mathematical surface. Thus, each protein domain corresponds to a particular mathematical surface with characteristic topological invariants, such as the genus (number of holes) and the number of boundary components. Both invariants are global fatgraph features reflecting the interconnectivity of the domain by hydrogen bonds. We introduce the notion of robust variables, that is variables that are robust towards minor changes in the structure/fatgraph, and show that the genus and the number of boundary components are robust. Further, we invesigate the distribution of different fatgraph variables and show how only four variables are capable of distinguishing different folds. We use local (secondary) and global (tertiary) fatgraph features to describe domain structures and illustrate that they are useful for classification of domains in CATH. In addition, we combine our method with two other methods thereby using primary, secondary, and tertiary structure information, and show that we can identify a large percentage of new and unclassified structures in CATH.

UR - http://www.scopus.com/inward/record.url?scp=79956347574&partnerID=8YFLogxK

U2 - 10.1371/journal.pone.0019670

DO - 10.1371/journal.pone.0019670

M3 - Journal article

C2 - 21629687

AN - SCOPUS:79956347574

VL - 6

JO - PLoS ONE

JF - PLoS ONE

SN - 1932-6203

IS - 5

M1 - e19670

ER -

ID: 203896441