Novel breeding resources for the underutilised legume, lablab, based on a pangenome approach
Novel breeding resources for the underutilised legume, lablab, based on a pangenome approach
Individuals across a species exhibit substantial presence-absence variation, to the extent that a reference genome from a single individual only contains a subset of the species’ genome. Cataloguing genome regions absent from a reference genome can therefore reveal novel genome regions, and some of this variation can be adaptive. In this work, existing short sequencing reads for the underutilised crop lablab (Lablab purpureus (L.) Sweet) were used to identify regions of the genome absent from the reference genome. Lablab is made up of two distinct gene pools, each with wild and domesticated types therefore represents an opportunity to identify gene pool-specific variation. Approximately 7.7% of the reads from eight accessions failed to map to the lablab reference genome (cv. Highworth), putatively being novel, and these were assembled and collapsed into between 735 and 12,304 contigs. Four samples were focussed on (one each wild and domesticated from each of the gene pools) and the novel contigs compared, to identify those present only in subsets of samples. Whilst the number of contigs containing sequenced with similarity to known genes in other legumes was low, there were some enriched gene ontology (GO) terms that could relate to adaptive differences between the groups and therefore contain novel genes for future lablab breeding. The approached used here has potential use in any other species.
genomic variation, lablab, pangenome, underutilised crop
61-66
Chapman, Mark A.
8bac4a92-bfa7-4c3c-af29-9af852ef6383
25 March 2025
Chapman, Mark A.
8bac4a92-bfa7-4c3c-af29-9af852ef6383
Chapman, Mark A.
(2025)
Novel breeding resources for the underutilised legume, lablab, based on a pangenome approach.
Breeding Science, 75 (1), .
(doi:10.1270/jsbbs.24055).
Abstract
Individuals across a species exhibit substantial presence-absence variation, to the extent that a reference genome from a single individual only contains a subset of the species’ genome. Cataloguing genome regions absent from a reference genome can therefore reveal novel genome regions, and some of this variation can be adaptive. In this work, existing short sequencing reads for the underutilised crop lablab (Lablab purpureus (L.) Sweet) were used to identify regions of the genome absent from the reference genome. Lablab is made up of two distinct gene pools, each with wild and domesticated types therefore represents an opportunity to identify gene pool-specific variation. Approximately 7.7% of the reads from eight accessions failed to map to the lablab reference genome (cv. Highworth), putatively being novel, and these were assembled and collapsed into between 735 and 12,304 contigs. Four samples were focussed on (one each wild and domesticated from each of the gene pools) and the novel contigs compared, to identify those present only in subsets of samples. Whilst the number of contigs containing sequenced with similarity to known genes in other legumes was low, there were some enriched gene ontology (GO) terms that could relate to adaptive differences between the groups and therefore contain novel genes for future lablab breeding. The approached used here has potential use in any other species.
Text
75_24055
- Version of Record
More information
Accepted/In Press date: 25 September 2024
e-pub ahead of print date: 5 February 2025
Published date: 25 March 2025
Keywords:
genomic variation, lablab, pangenome, underutilised crop
Identifiers
Local EPrints ID: 501865
URI: http://eprints.soton.ac.uk/id/eprint/501865
ISSN: 1344-7610
PURE UUID: 6d82a275-9911-4132-b370-ad6cd90a2119
Catalogue record
Date deposited: 11 Jun 2025 16:48
Last modified: 22 Aug 2025 02:08
Export record
Altmetrics
Download statistics
Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
View more statistics