Newsgroups: comp.speech
Path: lyra.csx.cam.ac.uk!doc.ic.ac.uk!agate!howland.reston.ans.net!cs.utexas.edu!news.tamu.edu!news.utdallas.edu!corpgate!nrtphaa9.nt.com!brtph560!brtph8ee!hoequist
From: hoequist@brtph8ee.bnr.ca (Charles Hoequist P250)
Subject: Re: pitch shifting
Message-ID: <1994Aug25.122126.14577@brtph560.bnr.ca>
Sender: hoequist@brtph8ee (Charles Hoequist P250)
Date: Thu, 25 Aug 1994 12:21:26 GMT
References:  <Cv1y38.Gz@psych.toronto.edu>
Organization: BNR Inc., RTP, NC
Lines: 36

In article <Cv1y38.Gz@psych.toronto.edu>, bonnie@psych.toronto.edu (Bonnie Turner) writes:
|> I am looking for information regarding the effects of shifting the pitch
|> of a speech sample.  Specifically, what happens to the upper formants of 
|> a speech sample if the fundamental frequency is shifted several semitones?
|> 
|> If anyone could point me toward specific references regarding methods and
|> effects of pitch shifting speech samples, I'd really appreciate it!
|> 
|> Thank you!

The formant center frequencies should (ideally)remain the same, 
since you're presumably changing only a source characteristic (F0)
and leaving the filter unchanged.

There's a whole series of articles by R.J. McAulay and T.F. Quatieri on
resynthesis of speech with alterations in F0 and rate. Some 
citations:

"Speech Transformations Based on a Sinusoidal Representation", IEEE-ASSP
vol.34, no.6, pp.1149-1464. (1986)

"Speech Analysis/Synthesis Based on a Sinusoidal Representation", IEEE-ASSP
vol.34 no.4, pp.744-754 (1986)

"Magnitude-only Reconstruction Using a Sinusoidal Speech Model", 
ICASSP-84, paper 27.6

-- 
# # # # # # # # # # # # # # # # # # # # # # # # # #
Charles Hoequist, Jr.   | Internet: hoequist@bnr.ca
BNR, Inc.               |  voice: 919-991-8642
PO Box 13478            |  fax:   919-991-8008
Research Triangle Park NC 27709-3478
USA
The number you have dialed is imaginary. Please rotate your
telephone ninety degrees and try again.
