TY - JOUR
T1 - Sequence complexity of histone H1 subtypes
AU - Ponte, Imma
AU - Vila, Roger
AU - Suau, Pedro
PY - 2003/3/1
Y1 - 2003/3/1
AB - H1 subtypes are involved in chromatin higher-order structure and gene regulation. H1 has a characteristic three-domain structure. We studied the length variation of the available H1 subtypes and showed that the length of the N-terminal and C-terminal domains was more variable than that of the central domain. The N-terminal and C-terminal domains were of low sequence complexity both at the nucleotide and at the amino acid level, whereas the globular domain was of high complexity. In most subtypes, low complexity was due only to cryptic simplicity, which reflects the clustering of a number of short and often imperfect sequence motifs. However, a subset of subtypes from eubacteria, plants, and invertebrates contained tandem repeats of short amino acid motifs (four to 12 residues), which could amount to a large proportion of the terminal domains. In addition, some other subtypes, such as those of Drosophila and mammalian H1t, were only marginally simple. The coexistence of these three kinds of subtypes suggests that the terminal domains could have originated in the amplification of short sequence motifs, which would then have evolved by point mutation and further slippage.
KW - Histone H1
KW - Length mutations
KW - Simplicity
KW - Slippage
KW - Tandem repeats
U2 - 10.1093/molbev/msg041
DO - 10.1093/molbev/msg041
M3 - Article
SN - 0737-4038
VL - 20
SP - 371
EP - 380
JO - Molecular Biology and Evolution
JF - Molecular Biology and Evolution
ER -