- 1
-
Suyoun Kim and Florian Metze.
Dialog-context aware end-to-end speech recognition.
In Proc. SLT, Athens; Greece, December 2018. IEEE.
Accepted.
- 2
-
Shruti Palaskar and Florian Metze.
Acoustic-to-word recognition
with sequence-to-sequence models.
In Proc. SLT, Athens; Greece, December 2018. IEEE.
Accepted.
- 3
-
Siddharth Dalmia, Xinjian Li, Florian Metze, and Alan W Black.
Domain robust feature extraction for rapid low resource ASR
development.
In Proc. SLT, Athens; Greece, December 2018. IEEE.
Accepted.
- 4
-
Ramon Sanabria and Florian Metze.
Hierarchical multi task
learning with CTC.
In Proc. SLT, Athens; Greece, December 2018. IEEE.
Accepted.
- 5
-
Florian Metze, Shruti Palaskar, and Ramon Sanabria.
Grounded sequence-to-sequence transduction on how-to videos - a
report from JSALT 2018.
In Proc. Asilomar Conference on Signals, Systems, and
Computers, Pacific Grove, CA, October 2018. IEEE.
Accepted.
- 6
-
Adrien Le Franc, Eric Riebling, Julien Karadayi, Yun Wang, Camila Scaff,
Florian Metze, and Alejandrina Cristia.
The ACLEW DiViMe: An easy-to-use
diarization tool.
In Proc. INTERSPEECH, Hyderabad; India, September 2018. ISCA.
- 7
-
Yun Wang, Juncheng B. Li, and Florian Metze.
Comparing the max and
noisy-or pooling functions in multiple instance learning for weakly
supervised sequence learning tasks.
In Proc. INTERSPEECH, Hyderabad; India, September 2018. ISCA.
- 8
-
Thomas Zenkel, Ramon Sanabria, Florian Metze, and Alex Waibel.
Subword and crossword units
for CTC acoustic models.
In Proc. INTERSPEECH, Hyderabad; India, September 2018. ISCA.
- 9
-
Shao-Yen Tseng, Juncheng B. Li, Yun Wang, Florian Metze, Joseph Szurley, and
Samarjit Das.
Multiple instance deep learning
for weakly supervised small-footprint audio event detection.
In Proc. INTERSPEECH, Hyderabad; India, September 2018. ISCA.
- 10
-
Niluthpol Mithun, Juncheng B. Li, Florian Metze, and Amit Roy-Chowdhury.
Learning joint embedding with
multimodal cues for cross-modal video-text retrieval.
In Proc. ICMR, Yokohama, Japan, June 2018. ACM.
Best paper.
- 11
-
Boyang Li, Beth Cardier, Tong Wang, and Florian Metze.
Annotating high-level structures of short
stories and personal anecdotes.
In Proc. LREC, Miyazaki, Japan, May 2018. ELRA.
- 12
-
Neville Ryant, Elika Bergelson, Kenneth Church, Alejandrina Cristia, Jun Du,
Sriram Ganapathy, Sanjeev Khudanpur, Diana Kowalski, Mahesh Krishnamoorthy,
Rajat Kulshreshta, Mark Liberman, Yu-Ding Lu, Matthew Maciejewski, Florian
Metze, Jan Profant, Lei Sun, Yu Tsao, and Zhou Yu.
Enhancement and
analysis of conversational speech: JSALT 2017.
In Proc. ICASSP, Calgary, BC; Canada, April 2018. IEEE.
- 13
-
Odette Scharenborg, Laurent Besacier, Alan Black, Mark Hasegawa-Johnson,
Florian Metze, Graham Neubig, Sebastian Stüker, Pierre Godard, Markus
Müller, Lucas Ondel, Shruti Palaskar, Philip Arthur, Francesco Ciannella,
Mingxing Du, Elin Larsen, Danny Merkx, Rachid Riad, Liming Wang, and Emmanuel
Dupoux.
Linguistic unit discovery from
multi-modal inputs in unwritten languages: Summary of the 'Speaking
Rosetta Stone' JSALT 2017 workshop.
In Proc. ICASSP, Calgary, BC; Canada, April 2018. IEEE.
- 14
-
Siddharth Dalmia, Ramon Sanabria, Florian Metze, and Alan Black.
Sequence-based
multi-lingual low resource speech recognition.
In Proc. ICASSP, Calgary, BC; Canada, April 2018. IEEE.
- 15
-
Shruti Palaskar, Ramon Sanabria, and Florian Metze.
End-to-end multi-modal speech
recognition.
In Proc. ICASSP, Calgary, BC; Canada, April 2018. IEEE.
- 16
-
Juncheng B. Li, Yun Wang, Joseph Szurley, Florian Metze, and Samarjit Das.
A light-weight
multimodal framework for improved environmental audio tagging.
In Proc. ICASSP, Calgary, BC; Canada, April 2018. IEEE.
- 17
-
Odette Scharenborg, Francesco Ciannella, Shruti Palaskar, Alan Black, Florian
Metze, Lucas Ondel, and Mark Hasegawa-Johnson.
Building an ASR system
for a low-resource language through the adaptation of a high-resource
language ASR system: Preliminary results.
In Proc. ICNLSSP 2017, Casablanca, Morocco, December 2017.
- 18
-
Niluthpol C. Mithun, Juncheng B. Li, Florian Metze, Amit K. Roy-Chowdhury, and
Samarjit Das.
CMU-UCR-Bosch @ TRECVID
2017: Video to text retrieval.
In Proc. TRECVID, Gaithersburg, MD, November 2017.
- 19
-
Shao-Yen Tseng, Juncheng B. Li, Yun Wang, Florian Metze, and Samarjit Das.
Large-scale weakly
supervised sound event detection (DCASE challenge 2017).
Technical report, Carnegie Mellon University, Munich; Germany, 2017.
- 20
-
Brian MacWhinney, Davida Fromm, Margie Forbes, and Florian Metze.
Automatic speech recognition of scripted productions from PWAs.
In Proc. Academy of Aphasia, number 39 in 1, Baltimore, MD;
U.S.A., November 2017.
- 21
-
Thomas Zenkel, Ramon Sanabria, Florian Metze, Jan Niehues, Matthias Sperber,
Sebastian Stüker, and Alex Waibel.
Comparison of decoding
strategies for CTC acoustic models.
In Proc. INTERSPEECH, Stockholm, Sweden, August 2017. ISCA.
- 22
-
Yun Wang and Florian Metze.
A transfer learning based feature extractor
for polyphonic sound event detection using connectionist temporal
classification.
In Proc. INTERSPEECH, Stockholm, Sweden, August 2017. ISCA.
- 23
-
Florian Metze, Yun Wang, Rajat Kulshreshta, and Markus Müller.
The CMU-KIT submissions to the OpenSAT 2017
evaluation.
Technical Report CMU-LTI-17-004, Carnegie Mellon University,
Pittsburgh, PA; U.S.A., July 2017.
- 24
-
Judy Chang, Emily Underwood, Abdesalam Soudi, and Florian Metze.
Differences in patient-provider communication in first obstetric visits
before and after implementation of electronic medical records (EMR).
In Proc. The Patient, The Practitioner, and The Computer (PPC)
Conference, Providence, RI; U.S.A., March 2017. Brown University.
- 25
-
Yun Wang and Florian Metze.
A first attempt at polyphonic sound event detection
using connectionist temporal classification.
In Proc. ICASSP, New Orleans, LA; U.S.A., March 2017. IEEE.
- 26
-
Abhinav Gupta, Yajie Miao, Leonardo Neves, and Florian Metze.
Visual features for context-aware speech
recognition.
In Proc. ICASSP, New Orleans, LA; U.S.A., March 2017. IEEE.
Best student paper candidate.
- 27
-
Juncheng B. Li, Wei Dai, Florian Metze, Shuhui Qu, and Samarjit Das.
A comparison of deep learning
methods for environmental sound detection.
In Proc. ICASSP, New Orleans, LA; U.S.A., March 2017. IEEE.
- 28
-
Ramon Sanabria, Florian Metze, and Fernando De la Torre.
Robust end-to-end deep
audiovisual speech recognition.
CoRR, abs/1611.06986, 2016.
- 29
-
Shinji Watanabe, Marc Delcroix, Florian Metze, and John R. Hershey, editors.
New Era for
Robust Speech Recognition - Exploiting Deep Learning, volume 1.
Springer, January 2017.
- 30
-
Yajie Miao and Florian Metze.
New Era for
Robust Speech Recognition - Exploiting Deep Learning, chapter End-to-End
Architectures for Speech Recognition.
Volume 1 of Watanabe et al. [29], January 2017.
- 31
-
Marvin Ritter, Markus Müller, Sebastian Stüker, Florian Metze, and Alex
Waibel.
Robust speech recognition for reverberated
environments.
In 12. ITG Fachtagung Sprachkommunikation, Paderborn,
Germany, October 2016. VDE.
- 32
-
Yashesh Gaur, Florian Metze, and Jeffrey P. Bigham.
Manipulating word lattices to
incorporate human corrections.
In Proc. INTERSPEECH, San Francisco, CA; U.S.A., September
2016. ISCA.
- 33
-
Yajie Miao and Florian Metze.
Open-domain audio-visual speech recognition: A
deep learning approach.
In Proc. INTERSPEECH, San Francisco, CA; U.S.A., September
2016. ISCA.
- 34
-
Rebecca Bates, Eric Fosler-Lussier, Florian Metze, Martha Larson, Gina-Anne
Levow, and Emily Mower Provost.
Experiences with shared
resources for research and education in speech and language processing.
In Proc. INTERSPEECH, San Francisco, CA; U.S.A., September
2016. ISCA.
- 35
-
Florian Metze, Eric Riebling, Anne S. Warlaumont, and Elika Bergelson.
Virtual machines and containers as a
platform for experimentation.
In Proc. INTERSPEECH, San Francisco, CA; U.S.A., September
2016. ISCA.
- 36
-
Yun Wang and Florian Metze.
Recurrent support vector machines for
audio-based multimedia event detection.
In Proc. ICMR, New York, NY; U.S.A., June 2016. ACM.
- 37
-
Yashesh Gaur, Walter S. Lasecki, Florian Metze, and Jeffrey P. Bigham.
The effects of automatic speech
recognition quality on human transcription latency.
In Proc. Web for All (W4A), Montreal; Canada, April 2016.
Best paper runner up.
- 38
-
Yajie Miao, Mohammad Gowayyed, Florian Metze, Xingyu Na, Tom Ko, and Alex
Waibel.
An empirical exploration of CTC acoustic
models.
In Proc. ICASSP, Shanghai; China, March 2016. IEEE.
- 39
-
Yun Wang, Leonardo Neves, and Florian Metze.
Audio-based multimedia event detection using deep
recurrent neural networks.
In Proc. ICASSP, Shanghai; China, March 2016. IEEE.
- 40
-
Shoou-I Yu, Lu Jiang, Zhongwen Xu, Zhenzhong Lan, Shicheng Xu, Xiaojun Chang,
Xuanchong Li, Zexi Mao, Chuang Gan, Yajie Miao, Xingzhong Du, Yang Cai, Lara
Martin, Nikolas Wolfe, Anurag Kumar, Huan Li, Ming Lin, Zhigang Ma, Yi Yang,
Deyu Meng, Shiguang Shan, Pinar Duygulu Sahin, Susanne Burger, Florian Metze,
Rita Singh, Bhiksha Raj, Teruko Mitamura, Richard Stern, Alexander Hauptmann,
Zhiyong Cheng, Jialie Shen, Xingzhong Du, and Xiaofang Zhou.
CMU Informedia@TRECVID 2015: MED/SIN/LNK/SED.
In Proc. TrecVID, Gaithersburg, MD; U.S.A., December 2015.
NIST.
- 41
-
Yajie Miao, Mohammad Gowayyed, and Florian Metze.
EESEN: End-to-End Speech Recognition
using Deep RNN Models and WFST-based Decoding.
In Proc. Automatic Speech Recognition and Understanding Workshop
(ASRU), Scottsdale, AZ; U.S.A., December 2015. IEEE.
- 42
-
Abeer Alwan and Elizabeth Shriberg (eds.).
The role
of speech science in developing robust speech processing applications.
Technical report, National Science Foundation; Arlington, VA, May
2015.
- 43
-
Yajie Miao, Hao Zhang, and Florian Metze.
Speaker
adaptive training of deep neural network acoustic models using i-vectors.
IEEE/ACM Transactions on Audio, Speech and Language Processing,
23(11):1938-1949, November 2015.
- 44
-
Florian Metze, Eric Riebling, Eric Fosler-Lussier, Andrew Plummer, and Rebecca
Bates.
The speech recognition virtual kitchen turns
one.
In Proc. INTERSPEECH, Dresden, Germany, September 2015. ISCA.
- 45
-
Yashesh Gaur, Florian Metze, Yajie Miao, and Jeffrey P. Bigham.
Using keyword spotting to help humans
correct captioning faster.
In Proc. INTERSPEECH, Dresden, Germany, September 2015. ISCA.
- 46
-
Yajie Miao and Florian Metze.
Distance-aware DNNs for robust speech
recognition.
In Proc. INTERSPEECH, Dresden, Germany, September 2015. ISCA.
- 47
-
Yajie Miao and Florian Metze.
On speaker adaptation of long short-term
memory recurrent neural networks.
In Proc. INTERSPEECH, Dresden, Germany, September 2015. ISCA.
- 48
-
Hao Zhang, Yajie Miao, and Florian Metze.
Regularizing DNN acoustic
models with Gaussian stochastic neurons.
In Proc. ICASSP, Brisbane; Australia, April 2015. IEEE.
- 49
-
Florian Metze, Ankur Gandhe, Yajie Miao, Zaid Sheikh, Yun Wang, Di Xu, Hao
Zhang, Jungsuk Kim, Ian Lane, Won Kyum Lee, Sebastian Stüker, and Markus
Müller.
Semi-supervised training in low-resource
ASR and KWS.
In Proc. ICASSP, Brisbane; Australia, April 2015. IEEE.
- 50
-
Xavier Anguera, Luis Javier Rodríguez-Fuentes, Andi Buzo, Florian Metze,
Igor Szöke, and Mikel Peñagarikano.
QUESST 2014: Evaluating
query-by-example speech search in a zero-resource setting with real-life
queries.
In Proc. ICASSP, Brisbane; Australia, April 2015. IEEE.
- 51
-
Markus Müller, Sebastian Stüker, Zaid Sheikh, Florian Metze, and Alex
Waibel.
Multilingual deep bottle
neck features - a study on language selection and training techniques.
In Proc. IWSLT, Lake Tahoe, NV; U.S.A., December 2014.
- 52
-
Jan Trmal, Guoguo Chen, Daniel Povey, Sanjeev Khudanpur, Pegah Ghahremani,
Xiaohui Zhang, Vimal Manohar, Chunxi Liu, Aren Jansen, Dietrich Klakow, David
Yarowsky, and Florian Metze.
A keyword search system using open source
software.
In Proc. IEEE Workshop on Spoken Language Technology, South
Lake Tahoe, NV; USA, December 2014. IEEE.
- 53
-
Yajie Miao, Lu Jiang, Hao Zhang, and Florian Metze.
Improvements to speaker adaptive training
of deep neural networks.
In Proc. IEEE Workshop on Spoken Language Technology, South
Lake Tahoe, NV; USA, December 2014. IEEE.
Best poster presentation.
- 54
-
Di Xu, Yun Wang, and Florian Metze.
EM-based phoneme
confusion matrix generation for low-resource spoken term detection.
In Proc. IEEE Workshop on Spoken Language Technology, South
Lake Tahoe, NV; USA, December 2014. IEEE.
- 55
-
Lara Martin, Matthew Stone, Florian Metze, and Jack Mostow.
A methodology for using
crowdsourced data to measure uncertainty in natural speech.
In Proc. IEEE Workshop on Spoken Language Technology, South
Lake Tahoe, NV; USA, December 2014. IEEE.
- 56
-
Shoou-I Yu, Lu Jiang, Zexi Mao, Xiaojun Chang, Xingzhong Du, Chuang Gan,
Zhenzhong Lan, Zhongwen Xu, Xuanchong Li, Yang Cai, Anurag Kumar, Yajie Miao,
Lara Martin, Nikolas Wolfe, Shicheng Xu, Huan Li, Ming Lin, Zhigang Ma,
Yi Yang, Deyu Meng, Shiguang Shan, Pinar Duygulu Sahin, Susanne Burger,
Florian Metze, Rita Singh, Bhiksha Raj, Teruko Mitamura, Richard Stern, and
Alexander Hauptmann.
Informedia@TRECVID 2014 - MED and MER.
In Proc. TRECVID, Orlando, FL; U.S.A., November 2014. NIST.
- 57
-
Xavier Anguera, Luis Javier Rodríguez-Fuentes, Igor Szöke, Andi Buzo,
and Florian Metze.
Query by example search on
speech at MediaEval 2014.
In Proc. MediaEval Workshop, Barcelona; Spain, October 2014.
- 58
-
Andrew Plummer, Eric Riebling, Anuj Kumar, Florian Metze, Eric Fosler-Lussier,
and Rebecca Bates.
The speech
recognition virtual kitchen: Launch party.
In Proc. INTERSPEECH, Singapore, September 2014. ISCA.
- 59
-
Xavier Anguera, Luis Javier Rodríguez-Fuentes, Igor Szöke, Andi Buzo,
Florian Metze, and Mikel Peñagarikano.
Query-by-example spoken term detection on
multilingual unconstrained speech.
In Proc. INTERSPEECH, Singapore, September 2014. ISCA.
- 60
-
Ankur Gandhe, Florian Metze, and Ian Lane.
Neural network
language models for low resource languages.
In Proc. INTERSPEECH, Singapore, September 2014. ISCA.
- 61
-
Yajie Miao, Hao Zhang, and Florian Metze.
Towards speaker adaptive training of
deep neural network acoustic models.
In Proc. INTERSPEECH, Singapore, September 2014. ISCA.
- 62
-
Yajie Miao and Florian Metze.
Improving language-universal feature
extraction with deep maxout and convolutional neural networks.
In Proc. INTERSPEECH, Singapore, September 2014. ISCA.
- 63
-
Yajie Miao, Hao Zhang, and Florian Metze.
Distributed learning of multilingual
DNN feature extractors using GPUs.
In Proc. INTERSPEECH, Singapore, September 2014. ISCA.
- 64
-
Di Xu and Florian Metze.
Word-based probabilistic phonetic retrieval
for low-resource spoken term detection.
In Proc. INTERSPEECH, Singapore, September 2014. ISCA.
- 65
-
Yun Wang and Florian Metze.
An in-depth comparison of keyword specific
thresholding and sum-to-one score normalization.
In Proc. INTERSPEECH, Singapore, September 2014. ISCA.
- 66
-
Sarah Cosentino, Susanne Burger, Lara Martin, Florian Metze, Tatsuhiro Kishi,
Kenji Hashimoto, Salvatore Sessa, Massimiliano Zecca, and Atsuo Takanishi.
A multisensory non-invasive system for
laughter analysis.
In Proc. 36th Annual International EMBS Conference, Chicago,
IL; U.S.A., August 2014. IEEE.
- 67
-
Florian Metze, Shourabh Rawat, and Yipei Wang.
Improved audio features for large-scale
multimedia event detection.
In Proc. ICME, Chengdu; China, July 2014. IEEE.
- 68
-
Xavier Anguera, Luis Javier Rodríguez-Fuentes, Igor Szöke, Andi Buzo,
Florian Metze, and Mikel Peñagarikano.
Query-by-example spoken term detection
evaluation on low-resource languages.
In Proc. SLTU, St. Petersburg, Russia, May 2014. ISCA.
- 69
-
Yipei Wang, Shourabh Rawat, and Florian Metze.
Exploring audio semantic
concept for event-based video retrieval.
In Proc. ICASSP, Firenze; Italy, May 2014. IEEE.
- 70
-
Yipei Wang, Shourabh Rawat, and Florian Metze.
Semi-automatic audio
semantic concept discovery for multimedia retrieval.
In Proc. ICASSP, Firenze; Italy, May 2014. IEEE.
- 71
-
Ankur Gandhe, Florian Metze, Alex Waibel, and Ian Lane.
Optimization of neural
network language models for keyword search.
In Proc. ICASSP, Firenze; Italy, May 2014. IEEE.
- 72
-
Yulia Tsvetkov, Florian Metze, and Chris Dyer.
Augmenting translation models with
simulated acoustic confusions for improved spoken language translation.
In Proc. EACL, Gothenburg; Sweden, April 2014. ACL.
- 73
-
Nikolas Wolfe, Vinay Vyas Vemuri, Lara J. Martin, Florian Metze, and Alan W.
Black.
Applause: A learning tool
for low-resource languages.
In Proc. Designing Speech and Language Interactions Workshop at
CHI, Toronto; Canada, April 2014. ACM.
- 74
-
Anuj Kumar, Florian Metze, Eric Riebling, and Matthew Kam.
Demystifying development of
speech recognizers for novices.
In Proc. Designing Speech and Language Interactions Workshop at
CHI, Toronto; Canada, April 2014. ACM.
- 75
-
Anuj Kumar, Florian Metze, and Matthew Kam.
Enabling the rapid
development and adoption of speech-user interfaces.
IEEE Computer Magazine, 46(1), January 2014.
- 76
-
Florian Metze, Xavier Anguera, Etienne Barnard, Marelie Davel, and Guillaume
Gravier.
Language
independent search in MediaEval's Spoken Web Search Task.
Computer Speech and Language, Special Issue on Information
Extraction & Retrieval, 2014.
- 77
-
Jonas Gehring, Quoc Bao Nguyen, Florian Metze, and Alex Waibel.
DNN acoustic modeling with
modular multi-lingual feature extraction networks.
In Proc. ASRU, Olomouc; Czech Republic, December 2013. IEEE.
- 78
-
Florian Metze, Zaid A. W. Sheikh, Alex Waibel, Jonas Gehring, Kevin Kilgour,
Quoc Bao Nguyen, and Van Huy Nguyen.
Models of tone for tonal and non-tonal
languages.
In Proc. ASRU, Olomouc; Czech Republic, December 2013. IEEE.
- 79
-
Ankur Gandhe, Long Qin, Florian Metze, Alexander Rudnicky, Ian Lane, and
Matthias Eck.
Using web text to improve keyword
spotting in speech.
In Proc. ASRU, Olomouc; Czech Republic, December 2013. IEEE.
- 80
-
Udhyakumar Nallasamy, Mark Fuhs, Monika Woszczyna, Florian Metze, and Tanja
Schultz.
Neighbour selection and adaptation
for rapid speaker-dependent ASR.
In Proc. ASRU, Olomouc; Czech Republic, December 2013. IEEE.
- 81
-
Yajie Miao, Florian Metze, and Shourabh Rawat.
Deep maxout networks for
low-resource speech recognition.
In Proc. ASRU, Olomouc; Czech Republic, December 2013. IEEE.
- 82
-
Zhen-Zhong Lan, Lu Jiang, Shoou-I Yu, Chenqiang Gao, Shourabh Rawat, Yang Cai,
Shicheng Xu, Haoquan Shen, Xuanchong Li, Yipei Wang, Waito Sze, Yan Yan,
Zhigang Ma, Nicolas Ballas, Deyu Meng, Wei Tong, Yi Yang, Susanne Burger,
Florian Metze, Rita Singh, Bhiksha Raj, Richard Stern, Teruko Mitamura, Eric
Nyberg, and Alexander Hauptmann.
Informedia E-Lamp @
TrecVID 2013 - Multimedia event detection and recounting (MED and
MER).
In Proc. TrecVID, Gaithersburg, MD; USA, November 2013. NIST.
- 83
-
Xavier Anguera, Florian Metze, Andi Buzo, Igor Szöke, and Luis Javier
Rodríguez-Fuentes.
The spoken web search task.
In Proc. MediaEval Workshop, Barcelona; Spain, October 2013.
http://www.multimediaeval.org/mediaeval2013/sws2013/.
- 84
-
Sujay Kumar Jauhar, Yun-Nung (Vivian) Chen, and Florian Metze.
Prosody-based unsupervised speech
summarization with two-layer mutually reinforced random walk.
In Proc. IJCNLP, Nagoya, Japan, October 2013.
- 85
-
Florian Metze, Eric Fosler-Lussier, and Rebecca Bates.
The speech recognition virtual
kitchen.
In Proc. INTERSPEECH, Lyon; France, August 2013. ISCA.
https://github.org/srvk.
- 86
-
Yun-Nung (Vivian) Chen and Florian Metze.
Multi-layer mutually reinforced random
walk with hidden parameters for improved multi-party meeting summarization.
In Proc. INTERSPEECH, Lyon; France, August 2013. ISCA.
- 87
-
Yajie Miao and Florian Metze.
Improving low-resource CD-DNN-HMM using
dropout and multilingual DNN training.
In Proc. INTERSPEECH, Lyon; France, August 2013. ISCA.
- 88
-
Anuj Kumar, Florian Metze, Wenyi Wang, and Matthew Kam.
Formalizing expert
knowledge for developing accurate speech recognizers.
In Proc. INTERSPEECH, Lyon; France, August 2013. ISCA.
- 89
-
Shourabh Rawat, Peter Schulam, Susanne Burger, Duo Ding, Yipei Wang, and
Florian Metze.
Robust audio codebooks for large scale event
detection in consumer videos.
In Proc. INTERSPEECH, Lyon; France, August 2013. ISCA.
- 90
-
Yulia Tsvetkov, Zaid Sheikh, and Florian Metze.
Identification and modeling of word fragments
in spontaneous speech.
In Proc. ICASSP, Vancouver, BC; Canada, May 2013. IEEE.
- 91
-
Yajie Miao, Florian Metze, and Alex Waibel.
Subspace mixture model for low-resource
speech recognition in cross-lingual settings.
In Proc. ICASSP, Vancouver, BC; Canada, May 2013. IEEE.
- 92
-
Yajie Miao, Florian Metze, and Alex Waibel.
Learning discriminative basis
coefficients for eigenspace MLLR unsupervised adaptation.
In Proc. ICASSP, Vancouver, BC; Canada, May 2013. IEEE.
- 93
-
Aren Jansen, Emmanuel Dupoux, Sharon Goldwater, Mark Johnson, Sanjeev
Khudanpur, Kenneth Church, Naomi Feldman, Hynek Hermansky, Florian Metze,
Richard Rose, Michael Seltzer, Pascal Clark, Ian McGraw, Balakrishnan
Varadarajan, Erin Bennett, Benjamin Borschinger, Justin Chiu, Ewan Dunbar,
Abdellah Fourtassi, David Harwath, Chia-Ying Lee, Keith Levin, Atta
Norouzian, Vijayaditya Peddinti, Rachael Richardson, Thomas Schatz, and
Samuel Thomas.
A summary of the 2012 JHU CLSP
workshop on zero resource speech technologies and models of early language
acquisition.
In Proc. ICASSP, Vancouver, BC; Canada, May 2013. IEEE.
- 94
-
Florian Metze, Xavier Anguera, Etienne Barnard, Marelie Davel, and Guillaume
Gravier.
The spoken web search task at
MediaEval 2012.
In Proc. ICASSP, Vancouver, BC; Canada, May 2013. IEEE.
- 95
-
Jonas Gehring, Yajie Miao, Florian Metze, and Alex Waibel.
Extracting deep bottleneck features using
stacked auto-encoders.
In Proc. ICASSP, Vancouver, BC; Canada, May 2013. IEEE.
- 96
-
Florian Metze, Duo Ding, Ehsan Younessian, and Alexander Hauptmann.
Beyond audio and
video retrieval: Topic oriented multimedia summarization.
International Journal of Multimedia Information Retrieval,
2013.
Springer.
- 97
-
Udhyakumar Nallasamy, Florian Metze, and Tanja Schultz.
Active learning for accent adaptation in
automatic speech recognition.
In Proc. SLT, Miami, FL; U.S.A., December 2012. IEEE.
- 98
-
Yun-Nung (Vivian) Chen and Florian Metze.
Two-layer mutually reinforced random walk
for improved multi-party meeting summarization.
In Proc. SLT, Miami, FL; U.S.A., December 2012. IEEE.
- 99
-
Shoou-I Yu, Zhongwen Xu, Duo Ding, Waito Sze, Francisco Vicente, Zhenzhong Lan,
Yang Cai, Shourabh Rawat, Peter Schulam, Sohail Bahmani, Antonio Juarez, Wei
Tong, Yi Yang, Susanne Burger, Florian Metze, Rita Singh, Bhiksha Raj,
Richard Stern, Teruko Mitamura, Eric Nyberg, and Alex Hauptmann.
Informedia E-LAMP @ TRECVID 2012 -
Multimedia event detection and recounting (MED and MER).
In Proc. TrecVID, Gaithersburg, MD; USA, November 2012. NIST.
- 100
-
Karen Livescu, Eric Fosler-Lussier, and Florian Metze.
Sub-word modeling
for automatic speech recognition.
Signal Processing Magazine, Special Issue “Fundamental
Technologies in Modern Speech Recognition”, 29(6):44-57, November 2012.
IEEE.
- 101
-
Florian Metze, Etienne Barnard, Marelie Davel, Charl van Heerden, Xavier
Anguera, Guillaume Gravier, and Nitendra Rajput.
The spoken web search task.
In Proc. MediaEval Workshop, Pisa; Italy, October 2012.
- 102
-
Susanne Burger, Qin Jin, Peter F. Schulam, and Florian Metze.
Noisemes: Manual annotation of environmental
noise in audio streams.
Technical Report CMU-LTI-12-07, Carnegie Mellon University,
Pittsburgh, PA; U.S.A., 2012.
- 103
-
Udhyakumar Nallasamy, Florian Metze, and Tanja Schultz.
Semi-supervised learning for speech
recognition in the context of accent adaptation.
In Proc. MLSLP2012 - The Second Symposium on Machine Learning in
Speech and Language Processing, Portland, OR; U.S.A., September 2012. ISCA.
- 104
-
Florian Metze and Eric Fosler-Lussier.
The speech recognition virtual kitchen:
An initial prototype.
In Proc. INTERSPEECH, Portland, OR, September 2012. ISCA.
- 105
-
Yun-Nung (Vivian) Chen and Florian Metze.
Integrating intra-speaker topic modeling and
temporal-based inter-speaker topic modeling in random walk for improved
multi-party meeting summarization.
In Proc. INTERSPEECH, Portland, OR, September 2012. ISCA.
- 106
-
Qin Jin, Peter F. Schulam, Shourabh Rawat, Susanne Burger, Duo Ding, and
Florian Metze.
Event-based video retrieval using
audio.
In Proc. INTERSPEECH, Portland, OR, September 2012. ISCA.
- 107
-
Tim Polzehl, Katrin Schoenenberg, Sebastian Möller, Florian Metze, Gelareh
Mohammadi, and Alessandro Vinciarelli.
On speaker-independent personality perception and
prediction from speech.
In Proc. INTERSPEECH, Portland, OR, September 2012. ISCA.
- 108
-
Udhyakumar Nallasamy, Florian Metze, and Tanja Schultz.
Enhanced polyphone decision tree adaptation
for accented speech recognition.
In Proc. INTERSPEECH, Portland, OR, September 2012. ISCA.
- 109
-
Ngoc Thang Vu, Wojtek Breiter, Florian Metze, and Tanja Schultz.
An investigation on initialization schemes
for multilayer perceptron training using multilingual data and their effect
on ASR performance.
In Proc. INTERSPEECH, Portland, OR, September 2012. ISCA.
- 110
-
Duo Ding, Florian Metze, Shourabh Rawat, Peter Franz Schulam, Susanne Burger,
Ehsan Younessian, Lei Bao, Michael G. Christel, and Alexander Hauptmann.
Beyond audio and video retrieval:
Towards multimedia summarization.
In Proc. ICMR, Hong Kong; China, June 2012. ACM.
- 111
-
Yun-Nung (Vivian) Chen and Florian Metze.
Intra-speaker topic modeling for
improved multi-party meeting summarization with integrated random walk.
In Proc. NAACL/HLT, Montreal; Canada, June 2012. ACM.
- 112
-
Duo Ding, Florian Metze, Shourabh Rawat, Peter F. Schulam, and Susanne Burger.
Generating natural language summaries for
multimedia.
In Proc. 7th International Natural Language Generation
Conference, Starved Rock, IL; USA, May 2012. ACL.
- 113
-
Stefan Steidl, Tim Polzehl, H. Timothy Bunnell, Ying Dou, Prasanna Kumar
Muthukumar, Daniel Perry, Kishore Prahallad, Callie Vaughn, Alan Black, and
Florian Metze.
Emotion identification for evaluation of
synthesized emotional speech.
In Proc. Speech Prosody, Shanghai; China, May 2012.
- 114
-
Ngoc Thang Vu, Florian Metze, and Tanja Schultz.
Multilingual
bottle-neck features and its application for under-resourced languages.
In Proc. 3rd Workshop on Spoken Language Technologies for
Under-resourced Languages, Cape Town; S. Africa, May 2012. MICA.
- 115
-
Jochen Weiner, Ngoc Thang Vu, Dominic Telaar, Florian Metze, Tanja Schultz,
Dau-Cheng Lyu, Eng-Siong Chng, and Haizhou Li.
Integration of language
identification into a recognition system for spoken conversations containing
code-switches.
In Proc. 3rd Workshop on Spoken Language Technologies for
Under-resourced Languages, Cape Town; S. Africa, May 2012. MICA.
- 116
-
Florian Metze, Nitendra Rajput, Xavier Anguera, Marelie Davel, Guillaume
Gravier, Charl van Heerden, Gautam V. Mantena, Armando Muscariello, Kishore
Prahallad, Igor Szöke, and Javier Tejedor.
The spoken web search task at
MediaEval 2011.
In Proc. ICASSP, Kyoto; Japan, March 2012. IEEE.
- 117
-
Alan W. Black, H. Timothy Bunnell, Ying Dou, Prasanna Kumar Muthukumar, Florian
Metze, Daniel Perry, Tim Polzehl, Kishore Prahallad, Stefan Steidl, and
Callie Vaughn.
Articulatory features for expressive
speech synthesis.
In Proc. ICASSP, Kyoto; Japan, March 2012. IEEE.
- 118
-
Lei Bao, Shoou-I Yu, Zhen-Zhong Lan, Arnold Overwijk, Qin Jin, Brian Langner,
Michael Garbus, Susanne Burger, Florian Metze, and Alexander Hauptmann.
Informedia @ TrecVID 2011.
In Proc. TrecVID Workshop, Gaithersburg, MD; USA, December
2011. NIST.
- 119
-
Nitendra Rajput and Florian Metze.
Spoken web search.
In Proc. MediaEval Workshop, Pisa; Italy, September 2011.
- 120
-
Matthias Peissner, Vanessa Doebler, and Florian Metze.
Can voice interaction help reducing the level of distraction and prevent
accidents?
White paper, Fraunhofer IAO, Stuttgart, Germany, May 2011.
Meta-Study on Driver Distraction and Voice Interaction.
- 121
-
Tim Polzehl, Sebastian Möller, and Florian Metze.
Modeling speaker personality using voice.
In Proc. INTERSPEECH, Firenze; Italy, August 2011. ISCA.
- 122
-
Udhyakumar Nallasamy, Michael Garbus, Florian Metze, Qin Jin, Thomas Schaaf,
and Tanja Schultz.
Analysis of dialectal
influence in pan-Arabic ASR.
In Proc. INTERSPEECH, Firenze; Italy, August 2011. ISCA.
- 123
-
Florian Metze, Alan Black, and Tim Polzehl.
A review of personality in voice-based
man machine interaction.
In Proc. Human Computer Interaction (HCI) International,
Orlando, FL; USA, July 2011. Springer LNCS.
- 124
-
Matthias Vogelgesang and Florian Metze.
Parallelization strategies for a dynamic lexical
tree decoder.
Technical Report CMU-LTI-010, Carnegie Mellon University, Pittsburgh,
PA; U.S.A., 2011.
- 125
-
Udhyakumar Nallasamy, Florian Metze, and Thomas Schaaf.
Normalization of gender, dialect, and speaking
style using probabilistic front-ends.
In Proc. 37. Jahrestagung für Akustik, Düsseldorf;
Germany, May 2011. Deutsche Gesellschaft für Akustik.
- 126
-
Tim Polzehl, Alexander Schmitt, Florian Metze, and Michael Wagner.
Anger
recognition in speech using acoustic and linguistic cues.
Speech Communication, Special Issue on Sensing Emotion and
Affect - Facing Realism in Speech Processing, 2011.
- 127
-
Anuj Kumar, Anuj Tewari, Seth Horrigan, Matthew Kam, Florian Metze, and John
Canny.
Rethinking speech recognition on mobile
devices.
In Proc. 2nd International Workshop on Intelligent User
Interfaces for Developing Regions (IUI4DR) with IUI 2011, Palo Alto, CA;
USA, 2011.
- 128
-
Tim Polzehl, Sebastian Möller, and Florian Metze.
Automatically assessing acoustic
manifestations of personality in speech.
In Proc. IEEE Workshop on Spoken Language Technology, Berkeley,
CA; USA, December 2010. IEEE.
- 129
-
Martha Larson, Roeland Ordelman, Florian Metze, Wessel Kraaij, and Franciska
de Jong.
Multimedia content with a speech track:
ACM Multimedia 2010 workshop on searching spontaneous conversational
speech.
In Proc. ACM Multimedia, 2010.
- 130
-
Huan Li, Lei Bao, Zan Gao, Arnold Overwijk, Wei Liu, Long-Fei Zhang, Shoou-I
Yu, Ming-Yu Chen, Florian Metze, and Alexander Hauptmann.
Informedia @ TrecVID 2010.
In Proc. 2010 TrecVID Workshop, Gaithersburg, MD; USA, November
2010. NIST.
- 131
-
Felix Putze, Jeronimo Dzaak, Florian Metze, and Tanja Schultz.
Modellierung kognitiver Wahrnehmungsprozesse
der Mensch-Maschine-Interaktion.
In Proc. 47. Kongress der Deutschen Gesellschaft für
Psychologie, Bremen; Germany, September 2010. DGPs.
In German.
- 132
-
Tim Polzehl, Sebastian Möller, and Florian Metze.
Automatically assessing personality from
speech.
In Proc. 4th International Conference on Semantic Computing
(ICSC), Pittsburgh, PA; USA, September 2010. IEEE.
- 133
-
Florian Metze, Anton Batliner, Florian Eyben, Tim Polzehl, Björn Schuller,
and Stefan Steidl.
Emotion recognition using imperfect speech
recognition.
In Proc. INTERSPEECH, Makuhari; Japan, September 2010. ISCA.
- 134
-
Thomas Schaaf and Florian Metze.
Analysis of gender normalization using
MLP and VTLN features.
In Proc. INTERSPEECH, Makuhari; Japan, September 2010. ISCA.
- 135
-
Florian Metze, Roger Hsiao, Qin Jin, Udhyakumar Nallasamy, and Tanja Schultz.
The 2010 CMU GALE speech-to-text system.
In Proc. INTERSPEECH, Makuhari; Japan, September 2010. ISCA.
- 136
-
Roger Hsiao, Florian Metze, and Tanja Schultz.
Improvements to generalized discriminative
feature transformation for speech recognition.
In Proc. INTERSPEECH, Makuhari; Japan, September 2010. ISCA.
- 137
-
Tim Polzehl, Alexander Schmitt, and Florian Metze.
Salient
features for anger recognition in German and English voice portals.
In Wolfgang Minker, Gary Geunbae Lee, Joseph Mariani, and Satoshi
Nakamura, editors, Spoken Dialogue Systems Technology and Design.
Springer, 2010.
- 138
-
Tim Polzehl, Alexander Schmitt, and Florian Metze.
Approaching multi-lingual emotion
recognition from speech - on language dependency of acoustic/prosodic
features for anger detection.
In Proc. Speech Prosody, Chicago, IL; USA, May 2010.
- 139
-
Björn Schuller, Florian Metze, Stefan Steidl, Anton Batliner, Florian
Eyben, and Tim Polzehl.
Late fusion of individual
engines for improved recognition of negative emotion in speech - learning
vs. democratic vote.
In Proc. ICASSP 2010, Dallas, TX; USA, March 2010. IEEE.
- 140
-
Tim Polzehl, Alexander Schmitt, and Florian Metze.
Comparing features for acoustic anger
classification in German and English IVR portals.
In Proc. International Workshop on Spoken Dialogue Systems,
Irsee; Germany, December 2009. Universität Ulm.
- 141
-
Florian Metze, Tim Polzehl, and Michael Wagner.
Fusion of acoustic and linguistic
speech features for emotion detection.
In Proc. 3rd International Conference on Semantic Computing
(ICSC), Berkeley, CA; USA, September 2009. IEEE.
- 142
-
Tim Polzehl, Shiva Sundaram, Hamed Ketabdar, Michael Wagner, and Florian Metze.
Emotion classification in
children's speech using fusion of acoustic and linguistic features.
In Proc. INTERSPEECH, Brighton, UK, September 2009. ISCA.
- 143
-
Ina Wechsung, Klaus-Peter Engelbrecht, Anja B. Naumann, Stefan Schaffer, Julia
Seebode, Florian Metze, and Sebastian Möller.
Predicting the quality of multimodal
systems based on judgments of single modalities.
In Proc. INTERSPEECH, Brighton, UK, September 2009. ISCA.
- 144
-
Julia Seebode, Stefan Schaffer, Ina Wechsung, and Florian Metze.
Influence of training on direct and indirect
measures for the evaluation of multimodal systems.
In Proc. INTERSPEECH, Brighton, UK, September 2009. ISCA.
- 145
-
Ina Wechsung, Klaus-Peter Engelbrecht, Julia Seebode, Stefan Schaffer, Florian
Metze, and Sebastian Möller.
Evaluation multimodaler Systeme: Ist das
Ganze die Summe seiner Teile?
In Proc. Mensch & Computer 2009, Berlin, Germany, September
2009. GI.
In German.
- 146
-
Florian Metze, Ina Wechsung, Stefan Schaffer, Julia Seebode, and Sebastian
Möller.
Reliable evaluation of multimodal dialog
systems.
In Proc. Human Computer Interaction International (HCI
International), San Diego, CA; USA, July 2009. Springer LNCS.
- 147
-
Ina Wechsung, Klaus-Peter Engelbrecht, Stefan Schaffer, Julia Seebode, Florian
Metze, and Sebastian Möller.
Usability evaluation of
multimodal interfaces: Is the whole the sum of its parts?
In Proc. Human Computer Interaction International (HCI
International), San Diego, CA; USA, July 2009. Springer LNCS.
- 148
-
Felix Burkhardt, Tim Polzehl, Joachim Stegmann, Florian Metze, and Richard
Huber.
Detecting real life anger.
In Proc. ICASSP, Taipei; Taiwan, April 2009. IEEE.
- 149
-
Stefan Schaffer, Julia Seebode, Ina Wechsung, Florian Metze, and Christine
Kühnel.
User characteristics and usage of gesture
and speech in a smart office environment.
In Proc. GW 2009 - The 8th International Gesture Workshop,
Bielefeld; Germany, February 2009. ZiF - Center for Interdisciplinary
Research, Bielefeld University.
- 150
-
Robert Wetzker, Winfried Umbrath, Leonhard Hennig, Christian Bauckhage, Tansu
Alpcan, and Florian Metze.
Tailoring taxonomies for efficient text
categorization and expert finding.
In Proc. Workshop on Optimization-based Data Mining and Web
Intelligence (ODM 2008), WI-IAT 2008, Sydney; Australia, December 2008. IEEE
Computer Society.
- 151
-
Robert Wetzker, Till Plumbaum, Alexander Korth, Christian Bauckhage, Tansu
Alpcan, and Florian Metze.
Detecting trends
in social bookmarking systems using a probabilistic generative model and
smoothing.
In Proceedings of the International Conference on Pattern
Recognition (ICPR), Tampa, FL; USA, December 2008. IEEE Computer Society.
- 152
-
Florian Metze, Roman Englert, Udo Bub, Ingmar Kliche, and Thomas Scheerbarth.
User perception of multi-modal interfaces for
mobile applications.
In Proc. INTERSPEECH, Brisbane; Australia, September 2008.
ISCA.
- 153
-
Tim Polzehl and Florian Metze.
Using prosodic features to prioritize voice
messages.
In Proc. Searching Spontaneous Conversational Speech (SSCS)
Workshop at SIGIR, Singapore, July 2008. ACM.
- 154
-
Florian Metze, Tansu Alpcan, and Christian Bauckhage.
Social and
expert search in online communities.
In Phillip Sheu, Heather Yu, Chittoor V. Ramamoorthy, Arvind K.
Joshi, and Lotfi A. Zadeh, editors, Semantic Computing. IEEE/Wiley,
April 2010.
- 155
-
Florian Metze and Frank Oberle.
An architecture for natural language speech applications.
In Proc. Third Workshop on Advanced Dialogs, San Diego, CA;
USA, March 2008. VoiceXML Forum.
- 156
-
Florian Metze, Roman Englert, Udo Bub, Felix Burkhardt, and Joachim Stegmann.
Getting closer - tailored human-computer speech dialog.
Universal Access in the Information Society, 2008.
- 157
-
Felix Burkhardt, Florian Metze, and Joachim Stegmann.
Speaker
classification for next generation voice dialog systems.
In Rainer Martin, editor, Advances in Digital Speech
Transmission. Wiley, January 2008.
- 158
-
Florian Metze, Thomas Ziem, and Ingmar Kliche.
Kinesthetic input
modalities for the W3C multimodal architecture.
In Proc. Workshop on W3C's Multimodal Architecture and
Interfaces, Fujisawa; Japan, November 2007. World Wide Web Consortium.
- 159
-
Christian Bauckhage, Tansu Alpcan, Sachin Agarwal, Florian Metze, Robert
Wetzker, Milena Ilic, and Sahin Albayrak.
An intelligent knowledge sharing system for
web communities.
In Proc. of the 2007 IEEE International Conference on Systems,
Man and Cybernetics, Montreal; Canada, October 2007. IEEE.
- 160
-
Florian Metze, Christian Bauckhage, Tansu Alpcan, Kathrin Dobbrott, and
Caroline Clemens.
The “SPREE” expert finding system.
In Proc. of the First IEEE International Conference on
Semantic Computing, Irvine, CA; USA, September 2007. IEEE.
- 161
-
Martin Eckert, Stefan Feldes, Karlheinz Schuhmacher, Ralf Kirchherr, Joachim
Stegmann, and Florian Metze.
Unterstützende
Sprachübersetzung in Telefonkonferenzen.
In Proc. 18. Konferenz Elektronische Sprachsignalverarbeitung
ESSV 2007, Cottbus; Germany, September 2007.
In German.
- 162
-
Florian Metze.
Discriminative speaker adaptation using articulatory features.
Speech Communication special issue “Bridging the Gap Between
Human and Automatic Speech Processing”, 49(5), 2007.
- 163
-
Florian Metze, Roman Englert, Udo Bub, Felix Burkhardt, Bernhard Kaspar, and
Joachim Stegmann.
Getting closer - tailored multi-modal
human-computer interaction.
In Proc. “Striking a Chord” CHI 2007 Workshop on
non-verbal acoustic interaction, San Jose, CA; USA, April 2007. ACM.
- 164
-
Florian Metze.
On using articulatory features for discriminative
speaker adaptation.
In Proc. NAACL-HLT, Rochester, NY; USA, April 2007. ACL.
- 165
-
Jitendra Ajmera and Florian Metze.
Keyword spotting using durational entropy.
In Proc. ICASSP 2007, Honolulu, Hawaii, April 2007. IEEE.
- 166
-
Florian Metze, Jitendra Ajmera, Roman Englert, Udo Bub, Felix Burkhardt,
Joachim Stegmann, Christian Müller, Richard Huber, Bernt Andrassy,
Josef G. Bauer, and Bernhard Littel.
Comparison of four approaches to age and gender
recognition for telephone applications.
In Proc. ICASSP 2007, Honolulu, Hawaii, April 2007. IEEE.
- 167
-
Florian Metze.
Information discovery using distant speech
recognition.
In Proc. 33rd German Annual Conference on Acoustics, Stuttgart;
Germany, March 2007. DEGA.
- 168
-
Jitendra Ajmera and Florian Metze.
The TUB 2006 spoken term detection
system.
In Proc. NIST 2006 Spoken Term Detection Evaluation Workshop,
Gaithersburg, MD; USA, December 2006. NIST.
- 169
-
Florian Metze.
Data-driven speaker adaptation using
articulatory features.
In Proc. Siebzehnte Konferenz Elektronische
Sprachsignalverarbeitung “ESSV 2006”, Freiberg; Germany, August 2006.
Technische Universität Dresden.
- 170
-
Florian Metze.
Articulatory features for “Meeting” speech
recognition.
In Proc. INTERSPEECH2006-ICSLP, Pittsburgh, PA; USA, October
2006. ISCA.
- 171
-
Florian Metze.
Articulatory Features for Conversational
Speech Recognition.
PhD thesis, Fakultät für Informatik der Universität
Karlsruhe (TH), Karlsruhe; Germany, December 2005.
- 172
-
Lena Maier-Hein, Florian Metze, Tanja Schultz, and Alex Waibel.
Session independent non-audible speech recognition using surface
electromyography.
In Proc. ASRU 2005, Cancun; Mexico, November 2005. IEEE.
- 173
-
Florian Metze, Petra Gieselmann, Hartwig Holzapfel, Tobias Kluge, Ivica Rogina,
Alex Waibel, Matthias Wölfel, James Crowley, Patrick Reignier, Dominique
Vaufreydaz, François Bérard, Bérangère Cohen, Joëlle
Coutaz, Sylvie Rouillard, Victoria Arranz, Manu Bertrán, and Horacio
Rodriguez.
The 'FAME' Interactive Space.
In Proc. 2nd Joint Workshop on Multimodal Interaction and
Related Machine Learning Algorithms (MLMI2005), Edinburgh; UK, July 2005.
Springer.
- 174
-
Florian Metze, Christian Fügen, Yue Pan, and Alex Waibel.
Automatically Transcribing Meetings Using
Distant Microphones.
In Proc. ICASSP 2005, Philadelphia, PA; USA, March 2005. IEEE.
- 175
-
Florian Metze, Qin Jin, Christian Fügen, Kornel Laskowski, Yue Pan, and
Tanja Schultz.
Issues in Meeting Transcription - The ISL
Meeting Transcription System.
In Proc. INTERSPEECH2004-ICSLP, Jeju Island; Korea, October
2004. ISCA.
- 176
-
Florian Metze, Christian Fügen, Yue Pan, Tanja Schultz, and Hua Yu.
The ISL RT-04S Meeting Transcription
System.
In Proceedings NIST RT-04S Evaluation Workshop, Montreal;
Canada, May 2004. NIST.
- 177
-
Hagen Soltau, Hua Yu, Florian Metze, Christian Fügen, Qin Jin, and Szu-Chen
Jou.
The 2003 ISL rich transcription
system for conversational telephony speech.
In Proc. ICASSP 2004, Montreal; Canada, 2004. IEEE.
- 178
-
Jan Kratt, Florian Metze, Rainer Stiefelhagen, and Alex Waibel.
Large Vocabulary Audio-Visual Speech
Recognition Using the Janus Speech Recognition Toolkit.
In Carl Edward Rasmussen, Heinrich H. Bülthoff, Bernhard
Schölkopf, and Martin A. Giese, editors, Proc. DAGM Symposium,
volume 3175 of Lecture Notes in Computer Science, pages 488-495.
Springer, 2004.
- 179
-
Florian Metze and Alex Waibel.
Using Articulatory Features for
Speaker Adaptation.
In Proc. Automatic Speech Recognition and Understanding (ASRU),
St. Thomas; US VI, 2003. IEEE.
- 180
-
Christian Fügen, Sebastian Stüker, Hagen Soltau, Florian Metze, and
Tanja Schultz.
Efficient handling of
multilingual language models.
In Proc. Automatic Speech Recognition and Understanding (ASRU),
St. Thomas; US VI, December 2003. IEEE.
- 181
-
Nadia Mana, Susi Burger, Roldano Cattoni, Laurent Besacier, Vicky MacLaren,
John McDonough, and Florian Metze.
The Nespole! VoIP Multilingual Corpora in Tourism and Medical
Domains.
In Proc. EuroSpeech 2003, Geneva; Switzerland, 2003. ISCA.
- 182
-
Sebastian Stüker, Florian Metze, Tanja Schultz, and Alex Waibel.
Integrating Multilingual Articulatory Features
into Speech Recognition.
In Proc. EuroSpeech 2003, Geneva; Switzerland, 2003. ISCA.
- 183
-
Sebastian Stüker, Tanja Schultz, Florian Metze, and Alex Waibel.
Multilingual Articulatory
Features.
In Proc. ICASSP 2003. IEEE, April 2003.
- 184
-
Hagen Soltau, Florian Metze, and Alex Waibel.
Compensating for Hyperarticulation by
Modeling Articulatory Properties.
In Proc. ICSLP 2002. ISCA, September 2002.
- 185
-
Florian Metze and Alex Waibel.
A Flexible Stream Architecture for
ASR using Articulatory Features.
In Proc. ICSLP 2002, Denver, CO; USA, September 2002. ISCA.
- 186
-
Hagen Soltau, Florian Metze, Christian Fügen, and Alex Waibel.
Efficient Language Model Lookahead
through Polymorphic Linguistic Context Assignment.
In Proc. ICASSP 2002, Orlando, FL; USA, 2002. IEEE.
- 187
-
Alon Lavie, Laurent Besacier, Florian Metze, Fabio Pianesi, Susanne Burger,
Donna Gates, Lori Levin, Chad Langley, Kay Peterson, Tanja Schultz, Alex
Waibel, Dorcas Wallace, John McDonough, Hagen Soltau, Roldano Cattoni, Gianni
Lazzari, Nadia Mana, Emanuele Pianta, Erica Costantini, Laurent Besacier,
Hervé Blanchon, Dominique Vaufreydaz, and Loredana Taddei.
Enhancing the Usability and Performance of Nespole! - a Real-World
Speech-to-Speech Translation System.
In Proc. HLT 2002, San Diego, CA; USA, March 2002.
- 188
-
Florian Metze, John McDonough, Hagen Soltau, Alex Waibel, Alon Lavie, Susanne
Burger, Chad Langley, Kornel Laskowski, Lori Levin, Tanja Schultz, Fabio
Pianesi, Roldano Cattoni, Gianni Lazzari, Nadia Mana, and Emanuele Pianta.
The Nespole! Speech-to-Speech Translation System.
In Proceedings of the Second International Conference on Human
Language Technology Research, pages 378-838, San Diego, CA; USA, March 2002.
DARPA.
- 189
-
Alon Lavie, Florian Metze, Roldano Cattoni, Erica Costantini, Susanne Burger,
Donna Gates, Chad Langley, Kornel Laskowski, Lori Levin, Kay Peterson, Tanja
Schultz, Alex Waibel, Dorcas Wallace, John McDonough, Hagen Soltau, Gianni
Lazzari, Nadia Mana, Fabio Pianesi, Emanuele Pianta, Laurent Besacier,
Hervé Blanchon, Dominique Vaufreydaz, and Loredana Taddei.
A Multi-Perspective Evaluation of the Nespole! Speech-to-Speech
Translation System.
In Proceedings of Workshop on Speech-to-Speech Translation:
Algorithms and Systems at the 40th Annual Meeting of the ACL, Philadelphia,
PA, July 2002. Association for Computational Linguistics.
- 190
-
Hagen Soltau, Florian Metze, Christian Fügen, and Alex Waibel.
A One-pass Decoder based on Polymorphic
Linguistic Context Assignment.
In Proc. Automatic Speech Recognition and Understanding (ASRU),
Madonna di Campiglio, Italy, December 2001. IEEE.
- 191
-
Susanne Burger, Laurent Besacier, Paolo Coletti, Florian Metze, and Céline
Morel.
The NESPOLE! VoIP Dialogue Database.
In Proc. EuroSpeech 2001 - Scandinavia, Aalborg; Denmark, 2001.
ISCA.
- 192
-
Florian Metze, John McDonough, and Hagen Soltau.
Speech Recognition over NetMeeting
Connections.
In Proc. EuroSpeech 2001 - Scandinavia, Aalborg; Denmark, 2001.
ISCA.
- 193
-
Tanja Schultz, Alex Waibel, Michael Bett, Florian Metze, Yue Pan, Klaus Ries,
Thomas Schaaf, Hagen Soltau, Martin Westphal, Hua Yu, and Klaus Zechner.
The ISL meeting room system.
In Proc. HSC, 2001.
- 194
-
Alex Waibel, Michael Bett, Florian Metze, Klaus Ries, Thomas Schaaf, Tanja
Schultz, Hagen Soltau, Hua Yu, and Klaus Zechner.
Advances in Meeting Record Creation
and Access.
In Proc. ICASSP 2001, Salt Lake City; USA, 2001. IEEE.
- 195
-
Hagen Soltau, Thomas Schaaf, Florian Metze, and Alex Waibel.
The ISL Evaluation System for
Verbmobil - II.
In Proc. ICASSP 2001, Salt Lake City; USA, May 2001.
- 196
-
John McDonough, Florian Metze, Hagen Soltau, and Alex Waibel.
Speaker compensation with sine-log all-pass transforms.
In Proc. ICASSP, Salt Lake City; USA, May 2001. IEEE.
- 197
-
Alex Waibel, Hua Yu, Hagen Soltau, Tanja Schultz, Thomas Schaaf, Yue Pan,
Florian Metze, and Michael Bett.
Advances in Meeting Recognition.
In Proc. HLT-2001, San Diego, CA, March 2001. ISCA.
- 198
-
Alex Waibel, Hagen Soltau, Tanja Schultz, Thomas Schaaf, and Florian Metze.
Multilingual
Speech Recognition.
In Wolfgang Wahlster, editor, Verbmobil: Foundations of
Speech-to-Speech Translation. Springer Verlag, Heidelberg; Germany, 2000.
- 199
-
Florian Metze and Thomas Kemp.
Das View4You-System: End-to-End
Evaluation.
In Proc. Konvens 2000, Ilmenau, Germany, October 2000. VDE
Verlag.
In German.
- 200
-
Florian Metze, Thomas Kemp, Thomas Schaaf, Tanja Schultz, and Hagen Soltau.
Confidence measure based Language
Identification.
In Proc. ICASSP 2000, Istanbul, April 2000. IEEE.
- 201
-
Sebastian Albrecht, Jan Busch, Martin Kloppenburg, Florian Metze, and Paul
Tavan.
Generalized
radial basis function networks for classification and novelty detection:
self-organization of optimal Bayesian decision.
Neural Networks, 13:1075-1093, May 2000.
- 202
-
Daniel Zboril, Marion Libossek, and Florian Metze.
In-development assessment of
concatenation synthesis by nonnative speakers.
Forschungsberichte 35, Institut für Phonetik und Sprachliche
Kommunikation der Universität München, 1997.
- 203
-
Daniel Zboril and Florian Metze.
Indeterminateness in qualitative and
quantitative reasoning.
In Proc. Seventh International Workshop on Database and Expert
Systems Applications (DEXA), Teheran; Iran, September 1996. IEEE.