The topological nature of tag jumping in environmental DNA metabarcoding studies
The topological nature of tag jumping in environmental DNA metabarcoding studies
Metabarcoding of environmental DNA constitutes a state-of-the-art tool for environmental studies. One fundamental principle implicit in most metabarcoding studies is that individual sample amplicons can still be identified after being pooled with others—based on their unique combinations of tags—during the so-called demultiplexing step that follows sequencing. Nevertheless, it has been recognized that tags can sometimes be changed (i.e., tag jumping), which ultimately leads to sample crosstalk. Here, using four DNA metabarcoding data sets derived from the analysis of soils and sediments, we show that tag jumping follows very specific and systematic patterns. Specifically, we find a strong correlation between the number of reads in blank samples and their topological position in the tag matrix (described by vertical and horizontal vectors). This observed spatial pattern of artefactual sequences could be explained by polymerase activity, which leads to the exchange of the 3′ tag of single stranded tagged sequences through the formation of heteroduplexes with mixed barcodes. Importantly, tag jumping substantially distorted our data sets—despite our use of methods suggested to minimize this error. We developed a topological model to estimate the noise based on the counts in our blanks, which suggested that 40%–80% of the taxa in our soil and sedimentary samples were likely false positives introduced through tag jumping. We highlight that the amount of false positive detections caused by tag jumping strongly biased our community analyses.
621-631
Rodriguez-Martinez, Saúl
11b2345c-60fd-49d4-b630-4ac2e6242e4b
Klaminder, Jonatan
baa3bb9c-e105-4fc6-876b-b7c8bdc8deed
Morlock, Marina A.
d466bbf2-a924-4bfa-adfd-cdc23ba382f3
Dalen, Love
1c94226a-7597-4332-8f41-4f8f65049c53
Huang, Doreen Yu-Tuan
1262e2e9-3f09-4572-a1ff-1700467bac7e
6 January 2023
Rodriguez-Martinez, Saúl
11b2345c-60fd-49d4-b630-4ac2e6242e4b
Klaminder, Jonatan
baa3bb9c-e105-4fc6-876b-b7c8bdc8deed
Morlock, Marina A.
d466bbf2-a924-4bfa-adfd-cdc23ba382f3
Dalen, Love
1c94226a-7597-4332-8f41-4f8f65049c53
Huang, Doreen Yu-Tuan
1262e2e9-3f09-4572-a1ff-1700467bac7e
Rodriguez-Martinez, Saúl, Klaminder, Jonatan, Morlock, Marina A., Dalen, Love and Huang, Doreen Yu-Tuan
(2023)
The topological nature of tag jumping in environmental DNA metabarcoding studies.
Molecular Ecology Resources, 23 (3), .
(doi:10.1111/1755-0998.13745).
Abstract
Metabarcoding of environmental DNA constitutes a state-of-the-art tool for environmental studies. One fundamental principle implicit in most metabarcoding studies is that individual sample amplicons can still be identified after being pooled with others—based on their unique combinations of tags—during the so-called demultiplexing step that follows sequencing. Nevertheless, it has been recognized that tags can sometimes be changed (i.e., tag jumping), which ultimately leads to sample crosstalk. Here, using four DNA metabarcoding data sets derived from the analysis of soils and sediments, we show that tag jumping follows very specific and systematic patterns. Specifically, we find a strong correlation between the number of reads in blank samples and their topological position in the tag matrix (described by vertical and horizontal vectors). This observed spatial pattern of artefactual sequences could be explained by polymerase activity, which leads to the exchange of the 3′ tag of single stranded tagged sequences through the formation of heteroduplexes with mixed barcodes. Importantly, tag jumping substantially distorted our data sets—despite our use of methods suggested to minimize this error. We developed a topological model to estimate the noise based on the counts in our blanks, which suggested that 40%–80% of the taxa in our soil and sedimentary samples were likely false positives introduced through tag jumping. We highlight that the amount of false positive detections caused by tag jumping strongly biased our community analyses.
Text
Molecular Ecology Resources - 2022 - Rodriguez‐Martinez - The topological nature of tag jumping in environmental DNA
- Version of Record
More information
Accepted/In Press date: 17 November 2022
e-pub ahead of print date: 7 December 2022
Published date: 6 January 2023
Identifiers
Local EPrints ID: 505704
URI: http://eprints.soton.ac.uk/id/eprint/505704
ISSN: 1755-098X
PURE UUID: 7e0ca234-bb12-44cc-8f29-cf2f777f6207
Catalogue record
Date deposited: 16 Oct 2025 16:53
Last modified: 17 Oct 2025 02:18
Export record
Altmetrics
Contributors
Author:
Saúl Rodriguez-Martinez
Author:
Jonatan Klaminder
Author:
Marina A. Morlock
Author:
Love Dalen
Author:
Doreen Yu-Tuan Huang
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics