TY - JOUR
T1 - A crowdsourcing database for the copy-number variation of the Spanish population
AU - López-López, Daniel
AU - Roldán, Gema
AU - Fernández-Rueda, Jose L.
AU - Bostelmann, Gerrit
AU - Carmona, Rosario
AU - Aquino, Virginia
AU - Perez-Florido, Javier
AU - Ortuño, Francisco
AU - Pita, Guillermo
AU - Núñez-Torres, Rocío
AU - González-Neira, Anna
AU - Alonso, Angel
AU - Salgado-Garrido, Josefa
AU - Pasalodos-Sanchez, Sara
AU - Ayuso, Carmen
AU - Minguez, Pablo
AU - Avila-Fernandez, Almudena
AU - Corton, Marta
AU - Artuch, Rafael
AU - Borrego, Salud
AU - Antiñolo, Guillermo
AU - Carracedo, Angel
AU - Amigo, Jorge
AU - Castaño, Luis Antonio
AU - Tejada, Isabel
AU - Delmiro, Aitor
AU - Espinos, Carmina
AU - Grinberg, Daniel
AU - Guillén, Encarnación
AU - Lapunzina, Pablo
AU - Lopez-Escámez, Jose Antonio
AU - Gallego-Martinez, Alvaro
AU - Martí, Ramón
AU - Rovira, Eulalia
AU - Millán, José Mª
AU - Moreno, Miguel Angel
AU - Morin, Matías
AU - Moreno-Galdó, Antonio
AU - Fernández-Cancio, Mónica
AU - Morte, Beatriz
AU - Mulero, Victoriano
AU - García, Diana
AU - Nunes, Virginia
AU - Palau, Francesc
AU - Perez, Belén
AU - Jurado, Luis Pérez
AU - Perona, Rosario
AU - Pujol, Aurora
AU - Ramos, Feliciano
AU - Surrallés, Jordi
AU - Peña-Chilet, María
AU - Dopazo, Joaquin
N1 - Publisher Copyright:
© 2023, The Author(s).
PY - 2023/3/9
Y1 - 2023/3/9
N2 - Background: Despite being a very common type of genetic variation, the distribution of copy-number variations (CNVs) in the population is still poorly understood. The knowledge of the genetic variability, especially at the level of the local population, is a critical factor for distinguishing pathogenic from non-pathogenic variation in the discovery of new disease variants. Results: Here, we present the SPAnish Copy Number Alterations Collaborative Server (SPACNACS), which currently contains copy number variation profiles obtained from more than 400 genomes and exomes of unrelated Spanish individuals. By means of a collaborative crowdsourcing effort whole genome and whole exome sequencing data, produced by local genomic projects and for other purposes, is continuously collected. Once checked both, the Spanish ancestry and the lack of kinship with other individuals in the SPACNACS, the CNVs are inferred for these sequences and they are used to populate the database. A web interface allows querying the database with different filters that include ICD10 upper categories. This allows discarding samples from the disease under study and obtaining pseudo-control CNV profiles from the local population. We also show here additional studies on the local impact of CNVs in some phenotypes and on pharmacogenomic variants. SPACNACS can be accessed at: http://csvs.clinbioinfosspa.es/spacnacs/. Conclusion: SPACNACS facilitates disease gene discovery by providing detailed information of the local variability of the population and exemplifies how to reuse genomic data produced for other purposes to build a local reference database.
AB - Background: Despite being a very common type of genetic variation, the distribution of copy-number variations (CNVs) in the population is still poorly understood. The knowledge of the genetic variability, especially at the level of the local population, is a critical factor for distinguishing pathogenic from non-pathogenic variation in the discovery of new disease variants. Results: Here, we present the SPAnish Copy Number Alterations Collaborative Server (SPACNACS), which currently contains copy number variation profiles obtained from more than 400 genomes and exomes of unrelated Spanish individuals. By means of a collaborative crowdsourcing effort whole genome and whole exome sequencing data, produced by local genomic projects and for other purposes, is continuously collected. Once checked both, the Spanish ancestry and the lack of kinship with other individuals in the SPACNACS, the CNVs are inferred for these sequences and they are used to populate the database. A web interface allows querying the database with different filters that include ICD10 upper categories. This allows discarding samples from the disease under study and obtaining pseudo-control CNV profiles from the local population. We also show here additional studies on the local impact of CNVs in some phenotypes and on pharmacogenomic variants. SPACNACS can be accessed at: http://csvs.clinbioinfosspa.es/spacnacs/. Conclusion: SPACNACS facilitates disease gene discovery by providing detailed information of the local variability of the population and exemplifies how to reuse genomic data produced for other purposes to build a local reference database.
KW - Crowdsourcing
KW - DNA Copy Number Variations/genetics
KW - Databases, Factual
KW - Genomics
KW - Phenotype
UR - http://www.scopus.com/inward/record.url?scp=85149677788&partnerID=8YFLogxK
UR - https://www.mendeley.com/catalogue/e9699ad1-1894-3156-a6de-bdfd15f93f5e/
U2 - 10.1186/s40246-023-00466-8
DO - 10.1186/s40246-023-00466-8
M3 - Article
C2 - 36894999
AN - SCOPUS:85149677788
SN - 1473-9542
VL - 17
SP - 20
JO - Human Genomics
JF - Human Genomics
IS - 1
M1 - 20
ER -