The University of Southampton
University of Southampton Institutional Repository

Aberrant 5' splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization

Aberrant 5' splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization
Aberrant 5' splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization
Despite a growing number of splicing mutations found in hereditary diseases, utilization of aberrant splice sites and their effects on gene expression remain challenging to predict. We compiled sequences of 346 aberrant 5'splice sites (5'ss) that were activated by mutations in 166 human disease genes. Mutations within the 5'ss consensus accounted for 254 cryptic 5'ss and mutations elsewhere activated 92 de novo 5'ss. Point mutations leading to cryptic 5'ss activation were most common in the first intron nucleotide, followed by the fifth nucleotide. Substitutions at position +5 were exclusively G>A transitions, which was largely attributable to high mutability rates of C/G>T/A. However, the frequency of point mutations at position +5 was significantly higher than that observed in the Human Gene Mutation Database, suggesting that alterations of this position are particularly prone to aberrant splicing, possibly due to a requirement for sequential interactions with U1 and U6 snRNAs. Cryptic 5'ss were best predicted by computational algorithms that accommodate nucleotide dependencies and not by weight-matrix models. Discrimination of intronic 5'ss from their authentic counterparts was less effective than for exonic sites, as the former were intrinsically stronger than the latter. Computational prediction of exonic de novo 5'ss was poor, suggesting that their activation critically depends on exonic splicing enhancers or silencers. The authentic counterparts of aberrant 5'ss were significantly weaker than the average human 5'ss. The development of an online database of aberrant 5'ss will be useful for studying basic mechanisms of splice-site selection, identifying splicing mutations and optimizing splice-site prediction algorithms.
genetics, consensus sequence, algorithms, rna, evaluation studies, expression, research support, base sequence, comparative study, mutation, software, nucleic acid, chemistry, databases, genetic engineering, inborn, rna splice sites, human, italy, gene expression, utilization, research, point mutation, sequence alignment, nucleotides, methods, humans, genes, dna mutational analysis, computational biology, genetic diseases, disease
0305-1048
4250-4263
Buratti, E.
bf128132-8b56-444c-a325-3b008aa1fb13
Chivers, M.
639fadfe-5293-45eb-8c38-22cbbf3f4b68
Kralovicova, J.
b3e0c1e7-05ed-445d-b3d9-ace11e3b4878
Romano, M.
0c26ba48-a574-4f6a-90bc-7e9bb7058bd1
Baralle, M.
ee248bdf-c13c-44a7-ac1e-e5a3f1c552b0
Krainer, A.R.
a7406da5-9d09-4904-9829-6136f1f013e0
Vorechovsky, I.
7245de2f-8c9b-4034-8935-9a451d9b682e
Buratti, E.
bf128132-8b56-444c-a325-3b008aa1fb13
Chivers, M.
639fadfe-5293-45eb-8c38-22cbbf3f4b68
Kralovicova, J.
b3e0c1e7-05ed-445d-b3d9-ace11e3b4878
Romano, M.
0c26ba48-a574-4f6a-90bc-7e9bb7058bd1
Baralle, M.
ee248bdf-c13c-44a7-ac1e-e5a3f1c552b0
Krainer, A.R.
a7406da5-9d09-4904-9829-6136f1f013e0
Vorechovsky, I.
7245de2f-8c9b-4034-8935-9a451d9b682e

Buratti, E., Chivers, M., Kralovicova, J., Romano, M., Baralle, M., Krainer, A.R. and Vorechovsky, I. (2007) Aberrant 5' splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization. Nucleic Acids Research, 35 (13), 4250-4263. (doi:10.1093/nar/gkm402).

Record type: Article

Abstract

Despite a growing number of splicing mutations found in hereditary diseases, utilization of aberrant splice sites and their effects on gene expression remain challenging to predict. We compiled sequences of 346 aberrant 5'splice sites (5'ss) that were activated by mutations in 166 human disease genes. Mutations within the 5'ss consensus accounted for 254 cryptic 5'ss and mutations elsewhere activated 92 de novo 5'ss. Point mutations leading to cryptic 5'ss activation were most common in the first intron nucleotide, followed by the fifth nucleotide. Substitutions at position +5 were exclusively G>A transitions, which was largely attributable to high mutability rates of C/G>T/A. However, the frequency of point mutations at position +5 was significantly higher than that observed in the Human Gene Mutation Database, suggesting that alterations of this position are particularly prone to aberrant splicing, possibly due to a requirement for sequential interactions with U1 and U6 snRNAs. Cryptic 5'ss were best predicted by computational algorithms that accommodate nucleotide dependencies and not by weight-matrix models. Discrimination of intronic 5'ss from their authentic counterparts was less effective than for exonic sites, as the former were intrinsically stronger than the latter. Computational prediction of exonic de novo 5'ss was poor, suggesting that their activation critically depends on exonic splicing enhancers or silencers. The authentic counterparts of aberrant 5'ss were significantly weaker than the average human 5'ss. The development of an online database of aberrant 5'ss will be useful for studying basic mechanisms of splice-site selection, identifying splicing mutations and optimizing splice-site prediction algorithms.

This record has no associated files available for download.

More information

Published date: July 2007
Keywords: genetics, consensus sequence, algorithms, rna, evaluation studies, expression, research support, base sequence, comparative study, mutation, software, nucleic acid, chemistry, databases, genetic engineering, inborn, rna splice sites, human, italy, gene expression, utilization, research, point mutation, sequence alignment, nucleotides, methods, humans, genes, dna mutational analysis, computational biology, genetic diseases, disease

Identifiers

Local EPrints ID: 59541
URI: http://eprints.soton.ac.uk/id/eprint/59541
ISSN: 0305-1048
PURE UUID: ba56a1ab-605f-4f97-a3f9-c3a8edf358eb
ORCID for M. Chivers: ORCID iD orcid.org/0000-0001-8581-2670
ORCID for I. Vorechovsky: ORCID iD orcid.org/0000-0002-6740-6502

Catalogue record

Date deposited: 03 Sep 2008
Last modified: 16 Mar 2024 03:44

Export record

Altmetrics

Contributors

Author: E. Buratti
Author: M. Chivers ORCID iD
Author: J. Kralovicova
Author: M. Romano
Author: M. Baralle
Author: A.R. Krainer
Author: I. Vorechovsky ORCID iD

Download statistics

Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.

View more statistics

Atom RSS 1.0 RSS 2.0

Contact ePrints Soton: eprints@soton.ac.uk

ePrints Soton supports OAI 2.0 with a base URL of http://eprints.soton.ac.uk/cgi/oai2

This repository has been built using EPrints software, developed at the University of Southampton, but available to everyone to use.

We use cookies to ensure that we give you the best experience on our website. If you continue without changing your settings, we will assume that you are happy to receive cookies on the University of Southampton website.

×