Chanwoo Kim's Homepage

Chanwoo Kim

Publications

You can also find my Google Scholar Profile

Peer-Reviewed International Journal Papers

T. N. Sainath, R. J. Weiss, K. W. Wilson, B. Li, A. Narayanan, E. Variani, M. Bacchiani, I. Shafran, A. Senior, K. Chin, A. Misra, and C. Kim, "Multichannel signal processing with deep neural networks for automatic speech recognition", IEEE/ACM Trans. Audio, Speech, Lang. Process., Feb. 2017. (PDF)
C. Kim and R. M. Stern. Power-Normalized Cepstral Coefficients for robust speech recognition. IEEE/ACM Trans. Audio, Speech, Lang. Process., Vol. 24, No. 7, July 2016. (PDF)
B. Cho, H. Kwon, J-W. Cho, C. Kim, R. M. Stern, and H. Park. "A subband-based stationary-component suppression method using harmonics and power ratio for reverberant speech recognition", IEEE Signal Process. Lett., Vol. 23, No. 6, June 2016. (PDF)
C. Kim, K. Seo, and W. Sung. Efficient media synchronization method for video telephony system. IEICE Trans. Information and Systems, E89-D(6):1901-1905, June 2006. (PDF)
C. Kim, K. Seo, and W. Sung. A robust formant extraction algorithm combining spectral peak-picking and roots polishing. EURASIP J. on Applied Signal Processing, 2006:Article ID 67960, 16 pages, 2006. (PDF)
C. Kim and K. Seo. Robust DTW-based recognition algorithm for hand-held consumer devices. IEEE Trans. Consumer Electronics, 51(2):699-709, May 2005. (PDF)

Book Chapter

T. N. Sainath, R. J. Weiss, K. W. Wilson, B. Li, A. Narayanan, E. Variani, M. Bacchiani, I. Shafran, A. Senior, K. Chin, A. Misra, and C. Kim, "Raw Multichannel Processing Using Deep Neural Networks," chapter in New Era for Robust Speech Recognition: Exploiting Deep Learning, S. Watanabe, M. Delcroix, F. Metze, and J. Hershey (eds.), Springer, 2017. (PDF)

Peer-Reviewed International Conference Papers

A. Menon, C. Kim, U. Kurokawa, and R. Stern, "Binaural processing for robust recognition of degraded speech," IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Dec. 2017 (accepted). (PDF)
C. Kim, A. Misra, K.K. Chin, T. Hughes, A. Narayanan, T. Sainath, and M. Bacchiani, "Generation of large-scale simulated utterances in virtual rooms to train deep-neural networks for far-field speech recognition in Google Home", INTERSPEECH-2017, Aug. 2017. (PDF)
B. Li, T. Sainath, J. Caroselli, A. Narayanan, M. Bacchiani, A. Misra, I. Shafran, G. Pundak, K.K. Chin, K-C Sim, R. Weiss, K. Wilson, E. Variani, C. Kim, O. Siohan, M. Weintraub, E. McDermott, R. Rose, and M. Shannon, "Acoustic Modeling for Google Home", In INTERSPEECH-2017, Aug. 2017. (PDF)
A. Menon, C. Kim, and R. M. Stern, Robust Speech Recognition Based on Binaural Auditory Processing, In INTERSPEECH-2017, Aug. 2017. (PDF)
C. Kim and K. K. Chin, Sound source separation algorithm using phase difference and angle distribution modeling near the target, In INTERSPEECH-2015, Sept. 2015. (PDF)
C. Kim, K. K. Chin, M. Bacchiani, and R. M. Stern, Robust speech recognition using temporal masking and thresholding algorithm, In INTERSPEECH-2014, Sept. 2014. (PDF)
H. Park, M. Maciejewski, C. Kim, and R. M. Stern, Robust Speech Recognition in Reverberant Environments Using Subband-Based Steady-State Monaural and Binaural Suppression, in INTERSPEECH-2014, Sept. 2014. (PDF)
C. Kim, C. Khawand, and R. M. Stern, Two-microphone source separation algorithm based on statistical modeling of angle distributions, in IEEE Int. Conf. Acoust., Speech, and Signal Processing, March 2012. (PDF)
C. Kim and R. M. Stern. Power-normalized cepstral coefficients (PNCC) for robust speech recognition, in IEEE Int. Conf. Acoust., Speech, and Signal Processing, March 2012. (PDF)
C. Kim, K. Kumar, and R. M. Stern. Binaural sound source separation motivated by auditory processing, in IEEE Int. Conf. Acoust., Speech, and Signal Processing, May 2011, pp. 4574-4577. (PDF)
K. Kumar, C. Kim, and R. M. Stern. Delta spectral cepstral coefficients for robust speech recognition, in IEEE Int. Conf. Acoust., Speech, and Signal Processing, May 2011, pp. 4784-4787. (PDF)
C. Kim and R. M. Stern. Nonlinear enhancement of onset for robust speech recognition. In INTERSPEECH-2010, Sept. 2010. (PDF) (MATLAB Source Code)
C. Kim, K. Eom, J. Lee, and R. M. Stern. Automatic selection of thresholds for signal separation algorithms based on interaural delay. In INTERSPEECH-2010, Sept. 2010. (PDF) (MATLAB Source Code)
C. Kim and R. M. Stern. Feature extraction for robust speech recognition based on maximizing the sharpness of the power distribution and on power flooring. In IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp. 4574-4577, March 2010. (PDF) (MATLAB Source Code)
C. Kim, K. Kumar, and R. M. Stern. Robust speech recognition using small power boosting algorithm. In IEEE Automatic Speech Recognition and Understanding Workshop, pp. 243-248, Dec. 2009. (PDF) (MATLAB Source Code)
C. Kim and R. M. Stern. Power function-based power distribution normalization algorithm for robust speech recognition. In IEEE Automatic Speech Recognition and Understanding Workshop, pp. 188-193, Dec. 2009. (PDF) (MATLAB Source Code)
C. Kim and R. M. Stern. Feature extraction for robust speech recognition using a power-law nonlinearity and power-bias subtraction. In INTERSPEECH-2009, pp. 28-31, Sept. 2009. (PDF) (MATLAB Source Code)
C. Kim, K. Kumar, B. Raj, and R. M. Stern. Signal separation for robust speech recognition based on phase difference information obtained in the frequency domain. In INTERSPEECH-2009, pp. 2495-2498, Sept. 2009. (PDF) (MATLAB Source Code)
C. Kim and R. M. Stern. Robust signal-to-noise ratio estimation based on waveform amplitude distribution analysis. In INTERSPEECH-2008, pp. 2598-2601, Sept. 2008. (PDF) (C++ Source Code)
R. M. Stern, E. Gouvea, C. Kim, K. Kumar, and H. Park. Binaural and multiple-microphone signal processing motivated by auditory perception. In Hands-Free Speech Communication and Microphone Arrays, 2008, pp. 98-103, May 2008. (PDF)
C. Kim, Y.-H. Chiu, and R.M. Stern. Physiologically-motivated synchrony-based processing for robust automatic speech recognition. In INTERSPEECH-2006, pp. 1975-1978, Sept. 2006. (PDF)
C. Kim, K. Seo, W. Sung, and S. Jung. Efficient audio/video synchronization method for video telephony system in consumer cellular phones. In IEEE Int. Conf. Consumer Elect., pp. 137-138, Jan. 2006. (PDF)
C. Kim and K. Seo. Robust DTW-based recognition algorithm for hand-held consumer device. In IEEE Int. Conf. Consumer Elect., pp. 433-434, Jan. 2005. (PDF)
C. Kim and W. Sung. Implementation of intonational quality assessment system. In INTERSPEECH-2002, pp. 1225-1228, Sept. 2002. (PDF)
C. Kim and W. Sung. Vowel pronunciation accuracy checking system based on phoneme segmentation and formants extraction. In Int. Conf. Speech Processing, pp. 447-452, Aug. 2001. (PDF)

Theses

C. Kim. Signal Processing for Robust Speech Recognition Motivated by Auditory Processing, Ph.D. Thesis, Carnegie Mellon University, 189 pp., Dec. 2010. (PDF)
C. Kim. Implementation of an Intonation and Pronunciation Checking System for Embedded Systems, M.S. Thesis, Seoul National University, 71 pp., Feb. 2001. (An online version of the document is available at http://library.snu.ac.kr)

Domestic (Korea) Conference Paper

(written in Korean)

C. Kim, S. Park, and K. Seo. Efficient audio/video synchronization method for video mobile communication terminals (in Korean). In Korea Computer Congress, pp. 355-357, July 2005.

US Patents Issued

C. Khawand and C. Kim. Target object angle determination using multiple cameras. United States Patent, (9,269,146), Feb. 2016. [Online]. Available: https://www.lens.org/lens/patent/US_9269146_B2
C. Kim and C. Khawand. Multi-microphone audio source separation based on combined statistical angle distributions. United States Patent, (9,131,295), Sept. 2015. [Online]. Available: https://www.lens.org/lens/patent/US_9131295_B2
C. Kim, K. Eom, J. Lee, and R. M. Stern. Signal separation system and method for automatically selecting threshold to separate sound sources. United States Patent, (8,718,293), May 2014. [Online]. Available: https://www.lens.org/lens/patent/US_8718293_B2
C. Kim. Formants extracting method combining spectral peak picking and roots extraction. United States Patent, (8,000,959), Aug. 2011. [Online]. Available: https://www.lens.org/lens/patent/US_8000959_B2
C. Kim. Speech distinction method. United States Patent, (7,761,294), July, 2010. [Online]. Available: https://www.lens.org/lens/patent/US_7761294_B2
K. Seo and C. Kim. Synchronizing video/audio data of mobile communication terminal. United States Patent, (7,710,943), May, 2010. [Online]. Available: https://www.lens.org/lens/patent/US_7710943_B2
C. Kim. Method of filtering speech signals to enhance quality of speech and apparatus thereof. United States Patent, (7,590,524), Sept. 2009. [Online]. Available: https://www.lens.org/lens/patent/US_7590524_B2
C. Kim. Baseband modem for speech recognition and mobile communication terminal using the same. United States Patent, (7,593,853), Sept. 2009. [Online]. Available: https://www.lens.org/lens/patent/US_7593853_B2
C. Kim. Mobile device and method for preventing undesired key depression in the same. United States Patent, (7,602,377), Oct. 2009. [Online]. Available: https://www.lens.org/lens/patent/US_7602377_B2
C. Kim. Speech coding apparatus with perceptual weighting and method therefor. United States Patent, (7,603,271), Oct. 2009. [Online]. Available: https://www.lens.org/lens/patent/US_7603271_B2
C. Kim. Telephone number retrieval system and method. United States Patent, (7,356,356), Apr. 2008. [Online]. Available: https://www.lens.org/lens/patent/US_7356356_B2

US Patent Applications

Many of these patents have also been applied for in other countries (European, Japanese, Chinese, and Korean patents).

C. Kim, C. Khawand, and J. Moon, Automatically optimizing capture of images of one or more subjects. United States Patent Application, (Application no. 2012/0300092). (US Patent Office Official Document)
C. Kim, Voice coding/decoding method and apparatus. United States Patent Application, (Application no. 20060015330). (US Patent Office Official Document)
C. Kim, Voice coding apparatus and method using PLP in mobile communications terminal. United States Patent Application, (Application no. 20060025991). (US Patent Office Official Document)
C. Kim, Mobile terminal having support power pack. United States Patent Application, (Application no. 20060111155). (US Patent Office Official Document)
C. Kim, Method for extracting feature vectors for speech recognition. United States Patent Application, (Application no. 20060129392). (US Patent Office Official Document)
C. Kim, Apparatus and method for reducing power consumption in a mobile communication terminal. United States Patent Application, (Application no. 20050057548). (US Patent Office Official Document)
C. Kim, Voice recognition method. United States Patent Application, (Application no. 20050131693). (US Patent Office Official Document)

Korean Patents Issued

The following are written in Korean.

C. Kim. Formant frequency detecting method of voice signal. Korea Patent, (Application no. 10-2003-0069175, Registration no. 10-0511316), Aug. 2005.
C. Kim. Slide type mobile communication terminal applying subdisplay device. Korea Patent, (Application no. 10-2003-0071130, Registration no. 10-0560919), March 2006.
C. Kim and K. Seo. A method and an apparatus of synchronization video signal with audio signal for mobile phone. Korea Patent, (Application no. 10-2004-0046697, Registration no. 10-0565333), March 2006.
C. Kim. Guidance method and apparatus for telephone number. Korea Patent, (Application no. 10-2003-0076089, Registration no. 10-0595610), June 2006.
C. Kim. Key pushing prevention method for portable apparatus. Korea Patent, (Application no. 10-2003-0081627, Registration no. 10-0595614), June 2006.
C. Kim. A method and an apparatus of advanced low bit rate linear prediction coding with PLP coefficient for mobile phone. Korea Patent, (Application no. 10-2004-0057739, Registration no. 10-0619893), Aug. 2006.
C. Kim. Speech distinction method. Korea Patent, (Application no. 10-2004-0097650, Registration no. 10-0631608), Sept. 2006.
C. Kim. Method and apparatus for enhancing quality of speech. Korea Patent, (Application no. 10-2004-0071371, Registration no. 10-0640865), Oct. 2006.
C. Kim. Baseband modem and mobile terminal for voice recognition. Korea Patent, (Application no. 10-2004-0071327, Registration no. 10-0640893), Oct. 2006.
C. Kim. A mobile terminal having a support power pack. Korea Patent, (Application no. 10-2004-0095924, Registration no. 10-0677397), Jan. 2007.
C. Kim. Separable mobile terminal. Korea Patent, (Application no. 10-2004-0067617, Registration no. 10-0677347), Jan. 2007.
C. Kim. Voice coding/decoding method, and apparatus for the same. Korea Patent, (Application no. 10-2004-0055634, Registration no. 10-0672355), Jan. 2007.
C. Kim. Mobile phone. Korea Patent, (Application no. 10-2004-0009331, Registration no. 10-0677304), Jan. 2007.
C. Kim. A multi-party system and method for requires reduced computational amount. Korea Patent, (Application no. 10-2005-0075404, Registration no. 10-0733713), June 2007.
C. Kim. Apparatus for removing noise by using hands-free mike of mobile terminal. Korea Patent, (Application no. 10-2005-0072454, Registration no. 10-0739178), July 2007.

About Me

Yeah, it's me! Chanwoo Kim

chanwcom at gmail dot com

Pittsburgh, PA

Latest News

I received the bronze award in the 17th Samsung Humantech thesis (2011), after receiving the honor prize in the 16th Samsung Humantech thesis (2010).
I graduated from CMU and received my Ph.D. degree on Dec. 16, 2010.
At ICASSP 2011, two papers were accepted: one with me as first author and the other with me as second author.
I defended my Ph.D. thesis on Sept. 20, 2010.
One more US patent (Patent No. 7,761,294) was issued on July 20, 2010.
Two INTERSPEECH 2010 papers were accepted on July 2, 2010.
This webpage was re-created in July 2010; I had deleted the previous web board and wiki years earlier because of unsolicited use of the board by bots.