TY - JOUR
T1 - Systematic Analysis of Primary Sequence Domain Segments for the Discrimination Between Class C GPCR Subtypes
AU - König, Caroline
AU - Alquézar, René
AU - Vellido, Alfredo
AU - Giraldo, Jesús
N1 - Publisher Copyright:
© 2018, Springer-Verlag GmbH Germany, part of Springer Nature.
PY - 2018/3/1
Y1 - 2018/3/1
N2 - G-protein-coupled receptors (GPCRs) are a large and diverse super-family of eukaryotic cell membrane proteins that play an important physiological role as transmitters of extracellular signal. In this paper, we investigate Class C, a member of this super-family that has attracted much attention in pharmacology. The limited knowledge about the complete 3D crystal structure of Class C receptors makes necessary the use of their primary amino acid sequences for analytical purposes. Here, we provide a systematic analysis of distinct receptor sequence segments with regard to their ability to differentiate between seven class C GPCR subtypes according to their topological location in the extracellular, transmembrane, or intracellular domains. We build on the results from the previous research that provided preliminary evidence of the potential use of separated domains of complete class C GPCR sequences as the basis for subtype classification. The use of the extracellular N-terminus domain alone was shown to result in a minor decrease in subtype discrimination in comparison with the complete sequence, despite discarding much of the sequence information. In this paper, we describe the use of Support Vector Machine-based classification models to evaluate the subtype-discriminating capacity of the specific topological sequence segments.
AB - G-protein-coupled receptors (GPCRs) are a large and diverse super-family of eukaryotic cell membrane proteins that play an important physiological role as transmitters of extracellular signal. In this paper, we investigate Class C, a member of this super-family that has attracted much attention in pharmacology. The limited knowledge about the complete 3D crystal structure of Class C receptors makes necessary the use of their primary amino acid sequences for analytical purposes. Here, we provide a systematic analysis of distinct receptor sequence segments with regard to their ability to differentiate between seven class C GPCR subtypes according to their topological location in the extracellular, transmembrane, or intracellular domains. We build on the results from the previous research that provided preliminary evidence of the potential use of separated domains of complete class C GPCR sequences as the basis for subtype classification. The use of the extracellular N-terminus domain alone was shown to result in a minor decrease in subtype discrimination in comparison with the complete sequence, despite discarding much of the sequence information. In this paper, we describe the use of Support Vector Machine-based classification models to evaluate the subtype-discriminating capacity of the specific topological sequence segments.
KW - Biocuration
KW - G-protein-coupled receptors
KW - Machine learning
KW - Pharmaco-proteomics
KW - Support vector machines
UR - http://www.scopus.com/inward/record.url?scp=85042882349&partnerID=8YFLogxK
U2 - 10.1007/s12539-018-0286-3
DO - 10.1007/s12539-018-0286-3
M3 - Article
C2 - 29460086
AN - SCOPUS:85042882349
SN - 1913-2751
VL - 10
SP - 43
EP - 52
JO - Interdisciplinary Sciences - Computational Life Sciences
JF - Interdisciplinary Sciences - Computational Life Sciences
IS - 1
ER -