Publications

Refereed Journals and Book Chapters:

·         Qin Jin, Tanja Schultz, and Alex Waibel. “Far-field Speaker Recognition”. In IEEE Transactions on Audio, Speech, and Language Processing (TASL), Vol. 15, No. 7, September 2007.

·         Hazim K. Ekenel and Qin Jin. “ISL Person Identification Systems in the CLEAR Evaluations”. In Multimodal Technologies for Perception of Humans, Lecture Notes in Computer Science, Springer Berlin / Heidelberg, May 18, 2007.

·         Stephen M. Chu, Hong-Kwang Kuo, Lidia Mangu, Qin Shi, Shilei Zhang, Yong Qin, Qin Jin, Ian Lane, and Yik-Cheung Tam. “Towards the State of the Art in Automatic Mandarin Broadcast Speech Transcription”. Handbook of Natural Language Processing and Machine Translation, Chapter 3.5.2, pp. 487-495, Springer, ISBN 978-1-4419-7712-0, 2011.

·         Roger Hsiao, Mark Fuhs, Yik-Cheung Tam, Qin Jin, Ian Lane, and Tanja Schultz, “CMU-InterACT Mandarin Transcription System for GALE”. Handbook of Natural Language Processing and Machine Translation, Chapter 3.5.3, pp. 496-504, Springer, ISBN 978-1-4419-7712-0, 2011.

·         Udhyakumar Nallasamy, Ian Lane, Mark Fuhs, Mohamed Noamany, Yik-Cheung Tam, Qin Jin and Tanja Schultz, “CMU-InterACT Arabic Speech Recognition System for GALE”. Handbook of Natural Language Processing and Machine Translation, Chapter 3.6.4, pp. 535-540, Springer, ISBN 978-1-4419-7712-0, 2011.

Refereed Conference Publications:

·         Kornel Laskowski and Qin Jin. “Harmonic Structure Transform for Speaker Recognition”. To appear in Proc. of the 12th Annual Conf. of the International Speech Communication Association (INTERSPEECH 2011), Florence, Italy, 28 August 2011.

·         Udhay Nallasamy, Michael Garbus, Florian Metze, Qin Jin, Thomas Schaaf, Tanja Schultz. “Analysis of Dialectal Influence in Pan-Arabic ASR”. To appear in Proc. of the 12th Annual Conf. of the International Speech Communication Association (INTERSPEECH 2011), Florence, Italy, 28 August 2011.

·         Qian Yang, Qin Jin, Tanja Schultz. “Investigation of Cross-show Speaker Diarization”. To appear in Proc. of the 12th Annual Conf. of the International Speech Communication Association (INTERSPEECH 2011), Florence, Italy, 28 August 2011.

·         Florian Metze, Roger Hsiao, Qin Jin, Udhay Nallasamy, Tanja Schultz. “The 2010 CMU GALE Speech-to-Text System”. In Proc. of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010), Makuhari, Japan, 26 September 2010.

·         Kornel Laskowski and Qin Jin. “Modeling Prosody for Speaker Recognition: Why Estimating Pitch May Be a Red Herring”. In Proc. of the 7th ISCA Speaker and Language Recognition Workshop (ODYSSEY2010), Brno, Czech Republic, 28 June - 01 July 2010.

·         Qin Jin, Runxin Li, Qian Yang, Kornel Laskowski, Tanja Schultz. “Speaker Identification with Distant Microphone Speech”, to appear in the 35th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP2010), Dallas, TX, USA, 14-19 April.

·         Qin Jin, Arthur Toth, Tanja Schultz, and Alan Black, “Speaker De-identification via Voice Transformation”, in IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU 2009), Merano, Italy.

·         Matthias Wölfel, Qian Yang, Qin Jin, Tanja Schultz. “Speaker Identification using Warped MVDR Cepstral Features”, in Interspeech 2009, Brighton, United Kingdom, 6 September 2009.

·         Runxin Li, Qin Jin, Tanja Schultz. “Improving Speaker Segmentation via Speaker Identification and Text Segmentation”, in Interspeech 2009, Brighton, United Kingdom, 6 September 2009.

·         Qin Jin, Arthur R. Toth, Tanja Schultz, Alan W Black. “Voice Convergin: Speaker De-Identification by Voice Transformation”, in proceedings of the 34th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP2009), Taipei, Taiwan, 19-24 April, pp. 3909-3912.

·         Kornel Laskowski and Qin Jin, “Modeling Instantaneous Intonation for Speaker Identification Using the Fundamental Frequency Variation Spectrum”. In proceedings of the 34th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP2009), Taipei, Taiwan, 19-24 April, pp. 4541-4544.

·         Mark C. Fuhs, Qin Jin, and Tanja Schultz. “Detecting Bandlimited Audio in Broadcast Television Shows”. In proceedings of the 34th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP2009), Taipei, Taiwan, 19-24 April, pp. 4589-4592.

·         Haizhou Li, Bin Ma, K-A. Lee, Hanwu Sun, Donglai Zhu, Khe Chai Sim, Changhuai You, Rong Tong, Ismo Karkkainen, Chien-Lin Huang, Vladimir Pervouchine, Wu Guo, Yijie Li, Lirong Dai, M. Nosratighods, T. Tharmarajah, Julien Epps, E. Ambikairajah, E.-S. Chng, Qin Jin, Tanja Schultz. “The I4U System In NIST 2008 Speaker Recognition Evaluation”. In proceedings of the 34th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP2009), Taipei, Taiwan, 19-24 April, pp. 4201-4204.

·         Roger Hsiao, Mark Fuhs, Yik-Cheung Tam, Qin Jin, Tanja Schultz. “The CMU-InterACT 2008 Mandarin Transcription System”, in proceedings of InterSpeech 2008, Brisbane, Australia, 22 September 2008.

·         Qin Jin, Tanja Schultz. “Robust Far-Field Speaker Identification Under Mismatched Conditions”, in proceedings of InterSpeech 2008, Brisbane, Australia, 22 September 2008.

·         Qin Jin, Kshitiz Kumar, Tanja Schultz, Richard M Stern, “Compensation Approaches for Far-field Speaker Identification”, NIST SRE Workshop 2008, Montreal, Canada, 17 June 2008.

·         Qin Jin, Arthur Toth, Alan Black, and Tanja Schultz. “Is Voice Transformation a Threat to Speaker Identification?”, in proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-2008), Las Vegas, April 2008.

·         Hazim Kemal Ekenel, Qin Jin, Rainer Stiefelhagen, “ISL Person Identification Systems in the CLEAR 2007 Evaluations”, Springer Lecture Notes in Computer Science, No. 4625, pp. 256-265, Proceedings of the International Evaluation Workshops, CLEAR 2007 and RT 2007, Baltimore, MD, USA, 1 May 2008.

·         Qin Jin, Szu-Chen Stan Jou, and Tanja Schultz. “Whispering Speaker Identification”, in proceedings of International Conference on Multimedia & Expo (ICME), Beijing, P.R.China, July 2007.

·         Hazim K. Ekenel, Mika Fischer, Qin Jin, Rainer Stiefelhagen. “Multi-modal Person Identification in a Smart Environment”, in Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR) Biometrics Workshop, Minneapolis, USA, June 2007.

·         Qin Jin, Yue Pan, and Tanja Schultz. “Far-field Speaker Recognition”, in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-2006), Toulouse, France, May 2006.

·         Hazim Kemal Ekenel and Qin Jin. “ISL Person Identification Systems in the CLEAR Evaluations”, in the Proceedings of CLEAR Evaluation Workshop (CLEAR-2006), Southampton, UK, April 2006.

·         Sebastian Stüker, Christian Fügen, Roger Hsiao, Shajith Ikbal, Qin Jin, Florian Kraft, Matthias Paulik, Martin Raab, Yik-Cheung Tam, and Matthias Wölfel. “The ISL TC-STAR Spring 2006 ASR Evaluation Systems”, in Proceedings of the TC-STAR Workshop on Speech-to-Speech Translation, Barcelona, Spain, 2006, ELDA.

·         Qin Jin and Tanja Schultz. "Speaker Segmentation and Clustering in Meetings". In proceedings of International Conference of Spoken Language Processing (ICSLP-2004), Jeju Island, South Korea, October 2004. [PDF]

·         Kornel Laskowski, Qin Jin, and Tanja Schultz. "Crosscorrelation-based Multispeaker Speech Activity Detection". In proceedings of International Conference of Spoken Language Processing (ICSLP-2004), Jeju Island, South Korea, October 2004. [PDF]

·         Florian Metze, Qin Jin, Christian Fügen, Kornel Laskowski, Yue Pan, and Tanja Schultz. "Issues in Meeting Transcription - The ISL Meeting Transcription System". In proceedings of International Conference of Spoken Language Processing (ICSLP-2004), Jeju Island, South Korea, October 2004. [PDF]

·         Hua Yu, Yik-Cheung Tam, Thomas Schaaf, Sebastian Stüker, Qin Jin, Mohamed Noamany, and Tanja Schultz. "The ISL RT04 Mandarin Broadcast News Evaluation System". EARS Rich Transcription Workshop, Palisades, NY, November 2004. [PDF]

·         Qin Jin, Jiri Navratil, Douglas Reynolds, Joseph Campbell, Walter Andrews, and Joy Abramson. "Combining Cross-Stream and Time Dimensions in Phonetic Speaker Recognition". In proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-2003), Hong Kong, China, April 2003. [PDF]

·         Jiri Navratil, Qin Jin, Walter Andrews, and Joseph Campbell. "Phonetic Speaker Recognition Using Maximum-likelihood Binary-Decision Tree Models". In proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-2003), Hong Kong, China, April 2003. [PDF]

·         D. Reynolds, W. Andrews, J. Campbell, J. Navratil, B. Peskin, A. Adami, Q. Jin, D. Klusacek, J. Abramson, R. Mihaescu, J. Godfrey, D. Jones, and B. Xiang. "The SuperSID Project: Exploiting High-level Information for High-accuracy Speaker Recognition". In proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-2003), Hong Kong, China, April 2003. [PDF]

·         Qin Jin, Tanja Schultz, and Alex Waibel. "Phonetic Speaker Identification". In proceedings of the International Conference of Spoken Language Processing (ICSLP-2002), Denver, CO, September 2002. [PDF]

·         Qin Jin, Tanja Schultz and Alex Waibel. "Speaker Identification Using Multilingual Phone Strings". In proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-2002), Orlando, Florida, May 2002. [PDF]

·         Tanja Schultz, Qin Jin, Kornel Laskowski, Alicia Tribble, and Alex Waibel. "Improvements in Non-verbal Cue Identification using Multilingual Phone Strings". In proceedings of the Speech-to-Speech Translation Workshop at the 40th Anniversary Meeting of the Association for Computational Linguistics (ACL-2002), Philadelphia, July 6-12, 2002. [PDF]

·         Tanja Schultz, Qin Jin, Kornel Laskowski, Alicia Tribble and Alex Waibel. "Speaker, Accent and Language Identification Using Multilingual Phone Strings". In proceedings of the Human Language Technology Meeting (HLT-2002), San Diego, March 2002. [PDF]

·         Qin Jin. "A Naive De-lambing Method for Speaker Identification". In proceedings of International Conference on Spoken Language Processing (ICSLP-2000), Beijing, P.R.China, October 2000. [PDF]

·         Qin Jin and Alex Waibel. "Application of LDA to Speaker Recognition". In proceedings of International Conference on Spoken Language Processing (ICSLP-2000), Beijing, P.R.China, October 2000. [PDF]

·         Qin Jin, Luo Si and Qixiu Hu. "A High-Performance Text-Independent Speaker Identification System Based on BCDM". In proceedings of International Conference on Spoken Language Processing (ICSLP-1998), Sydney, Australia, November 1998. [PDF]

·         Qin Jin, Luo Si and Qixiu Hu. "A New Set of Cepstra Features Based on Non-All-Pole Speech Production Model". In proceedings of the National Conference on Man-Machine Speech Communication (NCMMSC-1998), Harbin, P.R.China, July 1998 (in Chinese).