Exploring data processing strategies in NGS target enrichment to disentangle radiations in the tribe Cardueae (Compositae)

Sonia Herrando-Moraira, Juan Antonio Calleja, Pau Carnicero, Kazumi Fujikawa, Mercè Galbany-Casals, Núria Garcia-Jacas, Hyoung Tak Im, Seung Chul Kim, Jian Quan Liu, Javier López-Alvarado, Jordi López-Pujol, Jennifer R. Mandel, Sergi Massó, Iraj Mehregan, Noemí Montes-Moreno, Elizaveta Pyak, Cristina Roquet, Llorenç Sáez, Alexander Sennikov, Alfonso Susanna, Roser Vilatersana

Research output: Contribution to journal › Article › Research

8 Citations (Scopus)

Abstract

© 2018 Elsevier Inc. Target enrichment is a cost-effective sequencing technique that holds promise for elucidating evolutionary relationships in fast-evolving lineages. However, the potential biases and impact of bioinformatic sequence treatments on phylogenetic inference have not yet been thoroughly explored. Here, we investigate this issue with the ultimate goal of shedding light on a highly diversified group of Compositae (Asteraceae) comprising four main genera: Arctium, Cousinia, Saussurea, and Jurinea. Specifically, we compared sequence data extraction methods implemented in two easy-to-use workflows, PHYLUCE and HybPiper, and assessed the impact of two filtering practices intended to reduce phylogenetic noise. In addition, we compared two phylogenetic inference methods: (1) the concatenation approach, in which all loci were concatenated in a supermatrix; and (2) the coalescence approach, in which gene trees were produced independently and then used to construct a species tree under coalescence assumptions. We confirm the usefulness of the set of 1061 COS targets (a nuclear conserved orthology loci set developed for the Compositae) across a variety of taxonomic levels. Intergeneric relationships were completely resolved: there are two sister groups, Arctium-Cousinia and Saussurea-Jurinea, in agreement with a morphological hypothesis. Intrageneric relationships among species of Arctium, Cousinia, and Saussurea are also well defined. Conversely, conflicting species relationships remain for Jurinea. Methodological choices significantly affected phylogenies in terms of topology, branch length, and support. Across all analyses, the phylogeny obtained using HybPiper and the strictest scheme for removing fast-evolving sites was judged to be optimal. Regarding methodological choices, we conclude that: (1) trees obtained under the coalescence approach are topologically more congruent with one another than those inferred using the concatenation approach; (2) refining treatments improved support values only under the concatenation approach; and (3) branch support values are maximized when fast-evolving sites are removed in the concatenation approach, and when a higher number of loci is analyzed in the coalescence approach.

Original language: English
Pages (from-to): 69-87
Journal: Molecular Phylogenetics and Evolution
Volume: 128
Publication status: Published - 1 Nov 2018

Keywords

  • Asteraceae
  • COS targets
  • HybPiper
  • NGS filtering strategies
  • Phylogenetic noise
  • PHYLUCE
