Abstract
G protein-coupled receptors (GPCRs) are a large and heterogeneous superfamily of receptors that are key cell players for their role as extracellular signal transmitters. Class C GPCRs, in particular, are of great interest in pharmacology. The lack of knowledge about their full 3-D structure prompts the use of their primary amino acid sequences for the construction of robust classifiers, capable of discriminating their different subtypes. In this paper, we describe the use of feature selection techniques to build Support Vector Machine (SVM)-based classification models from selected receptor subsequences described as n-grams. We show that this approach to classification is useful for finding class C GPCR subtype-specific motifs.
Original language | English |
---|---|
Pages (from-to) | 89-96 |
Number of pages | 8 |
Journal | Advances in Intelligent Systems and Computing |
Volume | 294 |
Issue number | AISC |
DOIs | |
Publication status | Published - 2014 |
Keywords
- Feature selection
- G-Protein coupled receptors
- n-grams
- Pharmaco-proteomics
- Support vector machines