Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech
Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech
 
  Almost all speech contains simultaneous contributions from more than one acoustic source within the speaker's vocal tract. In this paper, we propose a method -- the pitch-scaled harmonic filter (PSHF) -- which aims to separate the voiced and turbulence-noise components of the speech signal during phonation, based on a maximum-likelihood approach. The PSHF outputs periodic and aperiodic components that are estimates of the respective contributions of the different types of acoustic source. It produces four reconstructed time series signals by decomposing the original speech signal, first, according to amplitude, and then according to power of the Fourier coefficients. Thus, one pair of periodic and aperiodic signals is optimized for subsequent time-series analysis, and another pair for spectral analysis. The performance of the PSHF algorithm was tested on synthetic signals, using three forms of disturbance (jitter, shimmer and additive noise), and the results were used to predict the performance on real speech. Processing recorded speech examples elicited latent features from the signals, demonstrating the PSHF's potential for analysis of mixed-source speech.
  periodic-aperiodic decomposition, speech modification, speech preprocessing
  713-726
  
    
      Jackson, Philip J.B.
      
        c658b148-ce3e-418d-ba12-9e970c9563dc
      
     
  
    
      Shadle, Christine H.
      
        d41e8c97-d8f9-41ca-a69a-4021e978f786
      
     
  
  
   
  
  
    
      October 2001
    
    
  
  
    
      Jackson, Philip J.B.
      
        c658b148-ce3e-418d-ba12-9e970c9563dc
      
     
  
    
      Shadle, Christine H.
      
        d41e8c97-d8f9-41ca-a69a-4021e978f786
      
     
  
       
    
 
  
    
      
  
  
  
  
  
  
    Jackson, Philip J.B. and Shadle, Christine H.
  
  
  
  
   
    (2001)
  
  
    
    Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech.
  
  
  
  
    IEEE Transactions on Speech and Audio Processing, 9 (7), .
  
   
  
  
   
  
  
  
  
  
   
  
    
    
      
        
          Abstract
          Almost all speech contains simultaneous contributions from more than one acoustic source within the speaker's vocal tract. In this paper, we propose a method -- the pitch-scaled harmonic filter (PSHF) -- which aims to separate the voiced and turbulence-noise components of the speech signal during phonation, based on a maximum-likelihood approach. The PSHF outputs periodic and aperiodic components that are estimates of the respective contributions of the different types of acoustic source. It produces four reconstructed time series signals by decomposing the original speech signal, first, according to amplitude, and then according to power of the Fourier coefficients. Thus, one pair of periodic and aperiodic signals is optimized for subsequent time-series analysis, and another pair for spectral analysis. The performance of the PSHF algorithm was tested on synthetic signals, using three forms of disturbance (jitter, shimmer and additive noise), and the results were used to predict the performance on real speech. Processing recorded speech examples elicited latent features from the signals, demonstrating the PSHF's potential for analysis of mixed-source speech.
         
      
      
        
          
            
  
    Text
 ieeesap01.pdf
     - Other
   
  
  
 
          
            
          
            
           
            
           
        
        
       
    
   
  
  
  More information
  
    
      Published date: October 2001
 
    
  
  
    
  
    
  
    
  
    
  
    
  
    
     
        Keywords:
        periodic-aperiodic decomposition, speech modification, speech preprocessing
      
    
  
    
     
        Organisations:
        Electronics & Computer Science
      
    
  
    
  
  
        Identifiers
        Local EPrints ID: 259158
        URI: http://eprints.soton.ac.uk/id/eprint/259158
        
        
        
        
          PURE UUID: 4a0d72a4-4110-46f5-8c90-03bfb70e6608
        
  
    
        
          
        
    
        
          
        
    
  
  Catalogue record
  Date deposited: 14 Mar 2004
  Last modified: 14 Mar 2024 06:20
  Export record
  
  
 
 
  
    
    
      Contributors
      
          
          Author:
          
            
            
              Philip J.B. Jackson
            
          
        
      
          
          Author:
          
            
            
              Christine H. Shadle
            
          
        
      
      
      
    
  
   
  
    Download statistics
    
      Downloads from ePrints over the past year. Other digital versions may also be available to download e.g. from the publisher's website.
      
      View more statistics