The University of Southampton
University of Southampton Institutional Repository

The topological nature of tag jumping in environmental DNA metabarcoding studies

The topological nature of tag jumping in environmental DNA metabarcoding studies
The topological nature of tag jumping in environmental DNA metabarcoding studies
Metabarcoding of environmental DNA constitutes a state-of-the-art tool for environmental studies. One fundamental principle implicit in most metabarcoding studies is that individual sample amplicons can still be identified after being pooled with others—based on their unique combinations of tags—during the so-called demultiplexing step that follows sequencing. Nevertheless, it has been recognized that tags can sometimes be changed (i.e., tag jumping), which ultimately leads to sample crosstalk. Here, using four DNA metabarcoding data sets derived from the analysis of soils and sediments, we show that tag jumping follows very specific and systematic patterns. Specifically, we find a strong correlation between the number of reads in blank samples and their topological position in the tag matrix (described by vertical and horizontal vectors). This observed spatial pattern of artefactual sequences could be explained by polymerase activity, which leads to the exchange of the 3′ tag of single stranded tagged sequences through the formation of heteroduplexes with mixed barcodes. Importantly, tag jumping substantially distorted our data sets—despite our use of methods suggested to minimize this error. We developed a topological model to estimate the noise based on the counts in our blanks, which suggested that 40%–80% of the taxa in our soil and sedimentary samples were likely false positives introduced through tag jumping. We highlight that the amount of false positive detections caused by tag jumping strongly biased our community analyses.
1755-098X
621-631
Rodriguez-Martinez, Saúl
11b2345c-60fd-49d4-b630-4ac2e6242e4b
Klaminder, Jonatan
baa3bb9c-e105-4fc6-876b-b7c8bdc8deed
Morlock, Marina A.
d466bbf2-a924-4bfa-adfd-cdc23ba382f3
Dalen, Love
1c94226a-7597-4332-8f41-4f8f65049c53
Huang, Doreen Yu-Tuan
1262e2e9-3f09-4572-a1ff-1700467bac7e
Rodriguez-Martinez, Saúl
11b2345c-60fd-49d4-b630-4ac2e6242e4b
Klaminder, Jonatan
baa3bb9c-e105-4fc6-876b-b7c8bdc8deed
Morlock, Marina A.
d466bbf2-a924-4bfa-adfd-cdc23ba382f3
Dalen, Love
1c94226a-7597-4332-8f41-4f8f65049c53
Huang, Doreen Yu-Tuan
1262e2e9-3f09-4572-a1ff-1700467bac7e

Rodriguez-Martinez, Saúl, Klaminder, Jonatan, Morlock, Marina A., Dalen, Love and Huang, Doreen Yu-Tuan (2023) The topological nature of tag jumping in environmental DNA metabarcoding studies. Molecular Ecology Resources, 23 (3), 621-631. (doi:10.1111/1755-0998.13745).

Record type: Article

Abstract

Metabarcoding of environmental DNA constitutes a state-of-the-art tool for environmental studies. One fundamental principle implicit in most metabarcoding studies is that individual sample amplicons can still be identified after being pooled with others—based on their unique combinations of tags—during the so-called demultiplexing step that follows sequencing. Nevertheless, it has been recognized that tags can sometimes be changed (i.e., tag jumping), which ultimately leads to sample crosstalk. Here, using four DNA metabarcoding data sets derived from the analysis of soils and sediments, we show that tag jumping follows very specific and systematic patterns. Specifically, we find a strong correlation between the number of reads in blank samples and their topological position in the tag matrix (described by vertical and horizontal vectors). This observed spatial pattern of artefactual sequences could be explained by polymerase activity, which leads to the exchange of the 3′ tag of single stranded tagged sequences through the formation of heteroduplexes with mixed barcodes. Importantly, tag jumping substantially distorted our data sets—despite our use of methods suggested to minimize this error. We developed a topological model to estimate the noise based on the counts in our blanks, which suggested that 40%–80% of the taxa in our soil and sedimentary samples were likely false positives introduced through tag jumping. We highlight that the amount of false positive detections caused by tag jumping strongly biased our community analyses.

Text
Molecular Ecology Resources - 2022 - Rodriguez‐Martinez - The topological nature of tag jumping in environmental DNA - Version of Record
Available under License Creative Commons Attribution.
Download (2MB)

More information

Accepted/In Press date: 17 November 2022
e-pub ahead of print date: 7 December 2022
Published date: 6 January 2023

Identifiers

Local EPrints ID: 505704
URI: http://eprints.soton.ac.uk/id/eprint/505704
ISSN: 1755-098X
PURE UUID: 7e0ca234-bb12-44cc-8f29-cf2f777f6207
ORCID for Doreen Yu-Tuan Huang: ORCID iD orcid.org/0000-0002-4096-0706

Catalogue record

Date deposited: 16 Oct 2025 16:53
Last modified: 17 Oct 2025 02:18

Export record

Altmetrics

Contributors

Author: Saúl Rodriguez-Martinez
Author: Jonatan Klaminder
Author: Marina A. Morlock
Author: Love Dalen
Author: Doreen Yu-Tuan Huang ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×