FINDINGS: Our high-throughput workflow minimizes these risks via a 4-step strategy: (i) technical replication with 2 PCR replicates and 2 extraction replicates; (ii) use of multiple markers (12S, 16S, CytB); (iii) a "twin-tagging," 2-step PCR protocol; and (iv) use of the probabilistic taxonomic assignment method PROTAX, which can account for incomplete reference databases. Because annotation errors in the reference sequences can result in taxonomic misassignment, we supply a protocol for curating sequence datasets. For some taxonomic groups and some markers, curation resulted in >50% of sequences being deleted from public reference databases, owing to (i) limited overlap between our target amplicon and reference sequences, (ii) mislabelling of reference sequences, and (iii) redundancy. Finally, we provide a bioinformatic pipeline to process amplicons and conduct PROTAX assignment, and we tested it on an invertebrate-derived DNA dataset from 1,532 leeches from Sabah, Malaysia. Twin-tagging allowed us to detect and exclude sequences with non-matching tags. The shortest DNA fragment (16S) amplified most frequently across all samples but was less powerful for discriminating at species rank. Using a stringent and a lax acceptance criterion, we found 162 (stringent) and 190 (lax) vertebrate detections in 95 (stringent) and 109 (lax) leech samples.
CONCLUSIONS: Our metabarcoding workflow should help research groups increase the robustness of their results and therefore facilitate wider use of environmental and invertebrate-derived DNA, which is turning into a valuable source of ecological and conservation information on tetrapods.
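The twin-tagging check mentioned in the findings can be illustrated with a minimal sketch. It assumes that each read carries one tag on the forward primer and one on the reverse primer, and that a read is retained only when the two tags are identical. The function name `split_by_twin_tag`, the record fields `fwd_tag` and `rev_tag`, and the example reads are hypothetical and are not taken from the published pipeline.

```python
def split_by_twin_tag(reads):
    """Partition reads into those whose two tags match and those whose tags differ.

    Each read is a dict with hypothetical keys 'id', 'fwd_tag', and 'rev_tag'.
    """
    kept, rejected = [], []
    for read in reads:
        # Assumption: a valid twin-tagged amplicon carries the same tag on both ends.
        if read["fwd_tag"] == read["rev_tag"]:
            kept.append(read)
        else:
            rejected.append(read)
    return kept, rejected


# Illustrative reads only; the tag sequences are made up.
reads = [
    {"id": "r1", "fwd_tag": "ACGTAC", "rev_tag": "ACGTAC"},
    {"id": "r2", "fwd_tag": "ACGTAC", "rev_tag": "TTGCAA"},  # non-matching tags
    {"id": "r3", "fwd_tag": "TTGCAA", "rev_tag": "TTGCAA"},
]

kept, rejected = split_by_twin_tag(reads)
print([r["id"] for r in kept])
print([r["id"] for r in rejected])
```

Under this assumption, a read with non-matching tags cannot be attributed to a single sample with confidence, so it is set aside rather than assigned to either tag.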
RESULTS: We found three distinct matrilineal groups of red muntjacs: Sri Lankan red muntjacs (including those of the Western Ghats) diverged first from other muntjacs about 1.5 Mya; later, northern red muntjacs (including North India and Indochina) and southern red muntjacs (Sundaland) split around 1.12 Mya. The diversification of red muntjacs into these three main lineages was likely promoted by two Pleistocene barriers: one through the Indian subcontinent and one separating the Indochinese and Sundaic red muntjacs. Interestingly, we found a high level of gene flow within both the northern and the southern red muntjacs, indicating exchange between populations in Indochina and dispersal of red muntjacs over the exposed Sunda Shelf during the Last Glacial Maximum.
CONCLUSIONS: Our results provide new insights into the evolution of species in South and Southeast Asia, as we found clear genetic differentiation in a widespread and generalist species, corresponding to two known biogeographical barriers: the Isthmus of Kra and the central Indian dry zone. In addition, our molecular data support the delineation of either three monotypic species or three subspecies; more importantly, these data highlight the conservation importance of the Sri Lankan/South Indian red muntjac.