Research Article |
Corresponding author: Felix Grewe ( fgrewe@fieldmuseum.org ) Academic editor: Ekaphan Kraichak
© 2018 Felix Grewe, Elisa Lagostina, Huini Wu, Christian Printzen, H. Thorsten Lumbsch.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Grewe F, Lagostina E, Wu H, Printzen C, Lumbsch HT (2018) Population genomic analyses of RAD sequences resolves the phylogenetic relationship of the lichen-forming fungal species Usnea antarctica and Usnea aurantiacoatra. MycoKeys 43: 91-113. https://doi.org/10.3897/mycokeys.43.29093
|
Neuropogonoid species in the lichen-forming fungal genus Usnea exhibit great morphological variation that can be misleading for delimitation of species. We specifically focused on the species delimitation of two closely-related, predominantly Antarctic species differing in the reproductive mode and representing a so-called species pair: the asexual U. antarctica and the sexual U. aurantiacoatra. Previous studies have revealed contradicting results. While multi-locus studies based on DNA sequence data provided evidence that these two taxa might be conspecific, microsatellite data suggested they represent distinct lineages. By using RADseq, we generated thousands of homologous markers to build a robust phylogeny of the two species. Furthermore, we successfully implemented these data in fine-scale population genomic analyses such as DAPC and fineRADstructure. Both Usnea species are readily delimited in phylogenetic inferences and, therefore, the hypothesis that both species are conspecific was rejected. Population genomic analyses also strongly confirmed separated genomes and, additionally, showed different levels of co-ancestry and substructure within each species. Lower co-ancestry in the asexual U. antarctica than in the sexual U. aurantiacoatra may be derived from a wider distributional range of the former species. Our results demonstrate the utility of this RADseq method in tracing population dynamics of lichens in future analyses.
Antarctica, Ascomycota , lichens, Parmeliaceae , phylogeny, RADseq
Over the last decades, the use of DNA sequence data to delimit species and reconstruct phylogenetic relationships has become standard (
The general lineage species concept (
We here focus on a complex of two morphologically similar species that differ in their reproductive mode and are considered a species pair in the genus Usnea: U. aurantiacoatra and U. antarctica, the latter reproducing by asexual soredia (
The advent of next-generation sequencing (NGS), also referred to as high-throughput sequencing, drastically changed the scale of molecular datasets for systematic analyses and revolutionised our ability to assess evolutionary histories of organisms (
By using genome-wide single nucleotide polymorphisms (SNPs) produced by restriction site-associated DNA sequencing (RADseq) of predominantly Antarctic lichen-forming fungi, our main aim in this study was to clarify the taxonomy of asexual Usnea antarctica and sexual Usnea aurantiacoatra and test their hypothesised species pair relationship. To further support our findings, we applied population genomic methods to measure the degree of genomic divergence and infer the levels of co-ancestry for each species.
Samples were collected in Antarctica between December 2015 and January 2017. From these collected specimens, we chose to compare 105 representative specimens of the species U. antarctica and U. aurantiacoatra for this study (see details of specimens in Suppl. material
Total metagenomic DNA was extracted either by following a cetyltrimethylammonium bromide (CTAB) protocol as modified by
We first deep-sequenced and assembled a reference sequence of an Usnea strigosa specimen to aid in mapping lichen-fungal loci during the processing of metagenomic RADseq data. A paired-end Illumina sequencing library of 150 bp read length was constructed from total DNA with the Nextera XT DNA Library Prep Kit (Illumina, San Diego, CA, USA) and sequenced on a NextSeq platform at the University of Illinois Chicago’s Sequencing Core Facility (Chicago, IL, USA). The resulting reads were quality trimmed using the programme Trimmomatic v0.36 (
RADseq libraries for Usnea antarctica and U. aurantiacoatra were prepared as described previously (
The raw reads of U. antarctica and U. aurantiacoatra from the MiSeq sequencing were processed and assembled as described earlier for metagenomic datasets of lichens (
Phylogenetic trees were calculated from all unlinked SNPs of the filtered RADseq dataset, i.e. a matrix that was limited to one SNP per RAD locus. This matrix was used for a RAxML v7.2.8 (
To calculate differences in the population structure between U. antarctica and U. aurantiacoatra, we created a reduced dataset that included all sites with a minor allele frequency (MAF) greater than or equal to 0.05 and greater than 50% coverage using vcftools v0.1.15 (
To determine the degree to which both populations are subdivided, we estimated Gst (
The genetic structure of samples of U. antarctica and U. aurantiacoatra was evaluated with the Discriminant Analysis of Principal Components (DAPC) implemented in the R package adegenet v2.02. This non-parametric method first transforms the data using a principle components analysis (PCA) and subsequently distinguishes between two or more groups using a discriminant analysis (DA). The DAPC was conducted by using the first 60 principal components and all (two) DA-eigenvalues. In addition to the display of the genetic variation in genomic space, the DAPC allows a prediction of the group membership probability for each sample which is visualised in a STRUCTURE-like plot.
In addition to the nonparametric approach with DAPC, we used a model-based method to detect population subdivision using the programme fineRADstructure (
The U. strigosa reference sequence and all scripts that were used in this study are available online (https://github.com/felixgrewe/Usnea). All RAD sequences were deposited in the NCBI Sequence Read Archive (SRA) with accession number PRJNA505526.
We assembled a draft reference genome of U. strigosa to filter for fungal RAD loci from U. antarctica and U. aurantiacoatra. The Illumina NextSeq sequencing of the whole U. strigosa lichen resulted in 8,552,530 metagenomic paired-end reads. First, we trimmed these raw data which reduced the paired-end reads to 8,366,962 (97.78% of raw data). The trimmed read pairs were then assembled into 16,932 scaffolds (N50 = 12,750 bp) with a total size of 40.9 Mbp (including 1,187 scaffolds of sizes larger than 10 kb). Metagenomic binning identified 28.92 Mbp of the assembly as fungal derived from which we selected 1,100 scaffolds (N50 = 23,562 bp) with sizes larger than 10 kb; all but two of these scaffolds were continuous assemblies (contigs). The sorted draft genome of U. strigosa had a total size of 24.1 Mbp and an estimated completeness of 72.18%.
We included 105 specimens of U. antarctica and U. aurantiacoatra that were collected in Antarctica in four RADseq libraries (Suppl. material
The phylogenetic analysis of the RADseq data showed two distinct and highly supported clades corresponding to the phenotypically circumscribed species U. antarctica and U. aurantiacoatra (Figure
Phylogenetic tree inferred from the U. antarctica and U. aurantiacoatraRADseq data. The clades of each species are highlighted by brackets. Bootstrap values are indicated at the branches. The unit of branch length is substitutions per site. Note that branches leading to both major clades were abbreviated by 0.4 substitutions per site.
We determined the degree to which both species complexes are subdivided by Gst, G’st and D measurements. For these analyses, we included only SNPs with a MAF greater than 0.05 and more than 50% coverage. This reduced the RAD dataset to a total of 4,132 SNPs. We plotted the frequency of Gst, G’st and D measures for each SNP (Figure
The same reduced dataset of 4,132 SNPs was used to differentiate the genomes by their variation in a non-parametric approach with a DAPC (Figure
The separation of U. antarctica and U. aurantiacoatra was further supported by the results of a Bayesian model-based approach with the programme fineRADstructure. By converting the pyRAD allele output for fineRADstructure, we reduced the dataset to 3,803 unlinked SNPs with a minimum coverage of 4 samples. The resulting clustered co-ancestry matrix showed that both species shared more co-ancestry within each other than between species (Figure
Clustered fineRADstructure co-ancestry matrix. A Full dataset including U. antarctica collected on the Antarctic Peninsula in addition to U. antarctica and U. aurantiacoatra collected on King George Island and Elephant Island B Reduced dataset with all U. antarctica and U. aurantiacoatra collected on King George Island and Elephant Island. Two major clades are corresponding to the two species U. antarctica (top-left) and U. aurantiacoatra (bottom-right). The top and left trees were calculated from the co-ancestry matrix to sort the individuals by their population structure. The matrix is diagonally split into the top-right half showing raw data and the bottom-left half displaying aggregated data.
In this study, we used RAD sequencing for evaluating the delimitation of two predominantly Antarctic Usnea species. Phylogenetic evidence and population genomic analyses of the RADseq data strongly supported that the two species represent independent lineages. Although both species showed no overlapping genomic structure in a DAPC, we could compare levels of co-ancestry and detect genomic substructure within each species in a fineRADstructure plot.
In previous studies using multi-locus approaches, including the ITS barcoding marker (
We developed a RADseq method for lineages involved in intimate symbiotic associations (
RADseq data are extremely powerful, since the method generates a matrix of thousands of homologous loci derived from randomly distributed regions across the genome. Many studies have successfully used large RADseq datasets for phylogenetic analysis which were difficult to resolve due to insufficient signals in available markers (
The fineRADstructure matrix estimated lower co-ancestry (and hence higher genotypic variation) for the sexually-reproducing U. aurantiacoatra, compared to the asexually-reproducing U. antarctica when comparing samples that were collected in the same geographic range (Figure
Despite the lower co-ancestry of U. antarctica compared to U. aurantiacoatra, we detected three pairs of very close relatives with high co-ancestry of U. antarctica (Figure
We successfully used RADseq for phylogenetic and population genomic studies on two species of the lichen-fungal genus Usnea. Phylogenetic inference using RAD data clearly delimited the species U. antarctica and U. aurantiacoatra into two lineages, which were irresolvable using multi-locus DNA sequence markers. Furthermore, the RADseq approach offered sufficient genotyping data for conclusive population genomic analyses. We used RADseq to measure lower co-ancestry in the asexual U. antarctica than in the sexual U. aurantiacoatra, potentially derived from a wider geographical distribution of U. antarctica in our sample. These results show that RADseq has much potential for future phylogenetic and population genomic studies on lichens, particularly for groups of organisms which remained unresolved by multi-locus markers.
The authors thank the Robert A. Pritzker Center for Meteoritics and Polar Studies and Tawani Foundation for financial support. We want to acknowledge the Lauer family for the generous gift of a MiSeq sequencer to the Pritzker Lab for Molecular Systematics at the Field Museum. Fieldwork by Elisa Lagostina was funded by the German Science Foundation through grant Pr 567/18-1. Mikhael Andreev, Aline Lorenz and Mayara Scur are gratefully acknowledged for contributing samples from Elephant and King George Island and the Antarctic Peninsula. We also thank Marta Ribeiro and Kevin Feldheim (Chicago) for their invaluable help in the laboratory and Sarah Cowles (Miami), Steve D. Leavitt (Provo) and Pradeep K. Divakar (Madrid) for comments on the manuscript.
Origin of samples used for this study
Data type: species data
Overview of RADseq results after individual steps of RAD analyses
Data type: molecular data
Correlation of RADseq results after individual steps of RAD analyses
Data type: molecular data