TY - JOUR
T1 - Drosophila Evolution over Space and Time (DEST) - A New Population Genomics Resource
AU - Glaser-Schmitt, Amanda
AU - Jelic, Mihailo
AU - Gallardo-Jiménez, Francisco D
AU - Puerma, Eva
AU - Tauber, Eran
AU - Schou, Mads F
AU - Fabian, Daniel K
AU - Ullastres, Anna
AU - Kapun, Martin
AU - Espinosa-Jimenez, M Luisa
AU - Bogaerts-Márquez, María
AU - Merenciano, Miriam
AU - Outten, Joseph
AU - Rota-Stabelli, Omar
AU - Buendía-Ruíz, Antonio J
AU - Wheat, Christopher W
AU - Guio, Lain
AU - Paris, Margot
AU - Patenkovic, Aleksandra
AU - Lazzaro, Brian P
AU - Schaeffer, Stephen W
AU - Merritt, Thomas J S
AU - García Guerreiro, Maria P
AU - Veselinovic, Marija Savic
AU - Ometto, Lino
AU - Staubach, Fabian
AU - Nunez, Joaquin C B
AU - Abbott, Jessica K
AU - Serga, Svitlana V
AU - Eric, Katarina
AU - Tanaskovic, Marija
AU - Casillas, Sònia
AU - Wang, Yun
AU - Grath, Sonja
AU - Coronado-Zamora, Marta
AU - Murga-Moreno, Jesús
AU - Orengo, Dorcas J
AU - Parsch, John
AU - Horváth, Vivien
AU - Onder, Banu S
AU - Argyridou, Eliza
AU - Behrman, Emily L
AU - Dyer, Kelly A
AU - Rajpurohit, Subhash
AU - Gómez-Julián, M Josefa
AU - Kankare, Maaria
AU - Loeschcke, Volker
AU - Guirao-Rico, Sara
AU - Stamenkovic-Radak, Marina
AU - Barbadilla Prados, Antoni
AU - Tern, Courtney
N1 - © The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
PY - 2021/12/9
Y1 - 2021/12/9
N2 - Drosophila melanogaster is a leading model in population genetics and genomics, and a growing number of whole-genome datasets from natural populations of this species have been published over the last years. A major challenge is the integration of disparate datasets, often generated using different sequencing technologies and bioinformatic pipelines, which hampers our ability to address questions about the evolution of this species. Here we address these issues by developing a bioinformatics pipeline that maps pooled sequencing (Pool-Seq) reads from D. melanogaster to a hologenome consisting of fly and symbiont genomes and estimates allele frequencies using either a heuristic (PoolSNP) or a probabilistic variant caller (SNAPE-pooled). We use this pipeline to generate the largest data repository of genomic data available for D. melanogaster to date, encompassing 271 previously published and unpublished population samples from over 100 locations in > 20 countries on four continents. Several of these locations have been sampled at different seasons across multiple years. This dataset, which we call Drosophila Evolution over Space and Time (DEST), is coupled with sampling and environmental meta-data. A web-based genome browser and web portal provide easy access to the SNP dataset. We further provide guidelines on how to use Pool-Seq data for model-based demographic inference. Our aim is to provide this scalable platform as a community resource which can be easily extended via future efforts for an even more extensive cosmopolitan dataset. Our resource will enable population geneticists to analyze spatio-temporal genetic patterns and evolutionary dynamics of D. melanogaster populations in unprecedented detail.
AB - Drosophila melanogaster is a leading model in population genetics and genomics, and a growing number of whole-genome datasets from natural populations of this species have been published over the last years. A major challenge is the integration of disparate datasets, often generated using different sequencing technologies and bioinformatic pipelines, which hampers our ability to address questions about the evolution of this species. Here we address these issues by developing a bioinformatics pipeline that maps pooled sequencing (Pool-Seq) reads from D. melanogaster to a hologenome consisting of fly and symbiont genomes and estimates allele frequencies using either a heuristic (PoolSNP) or a probabilistic variant caller (SNAPE-pooled). We use this pipeline to generate the largest data repository of genomic data available for D. melanogaster to date, encompassing 271 previously published and unpublished population samples from over 100 locations in > 20 countries on four continents. Several of these locations have been sampled at different seasons across multiple years. This dataset, which we call Drosophila Evolution over Space and Time (DEST), is coupled with sampling and environmental meta-data. A web-based genome browser and web portal provide easy access to the SNP dataset. We further provide guidelines on how to use Pool-Seq data for model-based demographic inference. Our aim is to provide this scalable platform as a community resource which can be easily extended via future efforts for an even more extensive cosmopolitan dataset. Our resource will enable population geneticists to analyze spatio-temporal genetic patterns and evolutionary dynamics of D. melanogaster populations in unprecedented detail.
UR - https://www.mendeley.com/catalogue/2e348095-41fd-3bf0-a922-79c3932f7d14/
U2 - 10.1093/molbev/msab259
DO - 10.1093/molbev/msab259
M3 - Article
C2 - 34469576
SN - 0737-4038
VL - 38
SP - 5782
EP - 5805
JO - Molecular Biology and Evolution
JF - Molecular Biology and Evolution
IS - 12
ER -