Aberrant 3' splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization
Aberrant 3' splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization
 
  The frequency distribution of mutation-induced aberrant 3' splice sites (3'ss) in exons and introns is more complex than for 5' splice sites, largely owing to sequence constraints upstream of intron/exon boundaries. As a result, prediction of their localization remains a challenging task. Here, nucleotide sequences of previously reported 218 aberrant 3'ss activated by disease-causing mutations in 131 human genes were compared with their authentic counterparts using currently available splice site prediction tools. Each tested algorithm distinguished authentic 3'ss from cryptic sites more effectively than from de novo sites. The best discrimination between aberrant and authentic 3'ss was achieved by the maximum entropy model. Almost one half of aberrant 3'ss was activated by AG-creating mutations and approximately 95% of the newly created AGs were selected in vivo. The overall nucleotide structure upstream of aberrant 3'ss was characterized by higher purine content than for authentic sites, particularly in position -3, that may be compensated by more stringent requirements for positive and negative nucleotide signatures centred around position -11. A newly developed online database of aberrant 3'ss will facilitate identification of splicing mutations in a gene or phenotype of interest and future optimization of splice site prediction tools
  databases, comparative study, mutation, evaluation studies, genetics, algorithms, nucleic acid, nucleotides, genes, disease, humans, genetic diseases, research, rna splice sites, analysis, base composition, dna mutational analysis, rna, exons, utilization, internet, introns, research support, phenotype, human, methods, software, inborn, computational biology
  
  
  4630-4641
  
    
      Vorechovsky, Igor
      
        7245de2f-8c9b-4034-8935-9a451d9b682e
      
     
  
  
   
  
  
    
      September 2006
    
    
  
  
    
      Vorechovsky, Igor
      
        7245de2f-8c9b-4034-8935-9a451d9b682e
      
     
  
       
    
 
  
    
      
  
  
  
  
  
  
    Vorechovsky, Igor
  
  
  
  
   
    (2006)
  
  
    
    Aberrant 3' splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization.
  
  
  
  
    Nucleic Acids Research, 34 (16), .
  
   (doi:10.1093/nar/gkl535). 
  
  
   
  
  
  
  
  
   
  
    
      
        
          Abstract
          The frequency distribution of mutation-induced aberrant 3' splice sites (3'ss) in exons and introns is more complex than for 5' splice sites, largely owing to sequence constraints upstream of intron/exon boundaries. As a result, prediction of their localization remains a challenging task. Here, nucleotide sequences of previously reported 218 aberrant 3'ss activated by disease-causing mutations in 131 human genes were compared with their authentic counterparts using currently available splice site prediction tools. Each tested algorithm distinguished authentic 3'ss from cryptic sites more effectively than from de novo sites. The best discrimination between aberrant and authentic 3'ss was achieved by the maximum entropy model. Almost one half of aberrant 3'ss was activated by AG-creating mutations and approximately 95% of the newly created AGs were selected in vivo. The overall nucleotide structure upstream of aberrant 3'ss was characterized by higher purine content than for authentic sites, particularly in position -3, that may be compensated by more stringent requirements for positive and negative nucleotide signatures centred around position -11. A newly developed online database of aberrant 3'ss will facilitate identification of splicing mutations in a gene or phenotype of interest and future optimization of splice site prediction tools
        
        This record has no associated files available for download.
       
    
    
   
  
  
  More information
  
    
      Published date: September 2006
 
    
  
  
    
  
    
  
    
  
    
  
    
  
    
     
        Keywords:
        databases, comparative study, mutation, evaluation studies, genetics, algorithms, nucleic acid, nucleotides, genes, disease, humans, genetic diseases, research, rna splice sites, analysis, base composition, dna mutational analysis, rna, exons, utilization, internet, introns, research support, phenotype, human, methods, software, inborn, computational biology
      
    
  
    
  
    
  
  
        Identifiers
        Local EPrints ID: 60372
        URI: http://eprints.soton.ac.uk/id/eprint/60372
        
          
        
        
        
          ISSN: 0305-1048
        
        
          PURE UUID: 2562912d-821b-4b17-9a42-ee6aca0d84aa
        
  
    
        
          
            
              
            
          
        
    
  
  Catalogue record
  Date deposited: 08 Sep 2008
  Last modified: 16 Mar 2024 03:32
  Export record
  
  
   Altmetrics
   
   
  
 
 
  
  
    Download statistics
    
      Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
      
      View more statistics