Anurag Kumar

PhD Candidate
Machine Learning and Signal Processing Group
Google Scholar

Contact:
GHC 5509
Language Technologies Institute
Carnegie Mellon University
Pittsburgh, PA, USA
Email: alnu [AT] andrew [DOT] cmu [DOT] edu
            alnu [AT] cs [DOT] cmu [DOT] edu

Theory is the first term in the Taylor series expansion of practice. - Thomas Cover

I have had my results for a long time, but I do not yet know how I am to arrive at them. - Carl Friedrich Gauss


Hi! This is my dugout on the web.

I am currently a PhD student in the Language Technologies Institute, School of Computer Science, at Carnegie Mellon University. I am advised by Prof. Bhiksha Raj. The name of my research group, Machine Learning and Signal Processing, more or less gives away my broad research interests. Mainly, I work on methods for audio content analysis. We are trying to build a large-scale sound event understanding system that could tag acoustic events and scenes in recordings on the web. You can find more information about my research by looking at my publications.
Before joining CMU, I did my undergraduate studies at the Indian Institute of Technology, Kanpur, where I obtained a B.Tech-M.Tech Integrated Dual Degree in Electrical Engineering. During the summer of 2015, I worked with Dinei Florencio in the Multimedia, Interaction, and Communication (MIC) group at Microsoft Research, Redmond. My research there focused on speech enhancement using deep neural networks.

PUBLICATIONS

arXiv Preprints

Classifier Risk Estimation under Limited Labeling Resources
Anurag Kumar, Bhiksha Raj [arXiv]

Audio Content based Geotagging in Multimedia
Anurag Kumar, Benjamin Elizalde, Bhiksha Raj [arXiv]

An Approach for Self-Training Audio Event Detectors Using Web Data
Ankit Shah, Rohan Badlani, Anurag Kumar, Benjamin Elizalde, Bhiksha Raj [arXiv]

Features and Kernels for Audio Event Detection
Anurag Kumar, Bhiksha Raj [arXiv]


Published

Audio Event and Scene Recognition: A Unified Approach using Strongly and Weakly Labeled Data
Anurag Kumar, Bhiksha Raj [arXiv Version]
IEEE International Joint Conference on Neural Networks (IJCNN), 2017

Discovering Sound Concepts and Acoustic Relations In Text
Anurag Kumar, Bhiksha Raj, Ndapandula Nakashole [arXiv Version]
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2017. Companion Webpage here

Experiments on the DCASE Challenge 2016: Acoustic Scene Classification and Sound Event Detection in Real Life Recording
Benjamin Elizalde, Anurag Kumar, et al.
[arXiv version]
in Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), 2016

Audio Event Detection using Weakly Labeled Data
Anurag Kumar, Bhiksha Raj
in 24th ACM International Conference on Multimedia (ACM Multimedia), 2016 [arXiv Version]

Speech Enhancement In Multiple-Noise Conditions using Deep Neural Networks
Anurag Kumar, Dinei Florencio [arXiv Version]
in Interspeech, 2016. Companion Webpage here.

Weakly Supervised Scalable Audio Content Analysis
Anurag Kumar, Bhiksha Raj
IEEE International Conference on Multimedia & Expo (ICME), 2016 [arXiv Version]

A Novel Ranking Method For Multiple Classifier Systems
Anurag Kumar, Bhiksha Raj
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2015

Informedia@TRECVID 2014 MED and MER
CMU Aladdin MED Team

Unsupervised Fusion Weight Learning in Multiple Classifier Systems
Anurag Kumar, Bhiksha Raj [arXiv]

Detecting Sound Objects In Audio Recordings
Anurag Kumar, Rita Singh, Bhiksha Raj
22nd European Signal Processing Conference (EUSIPCO), 2014

Monaural Speaker Segregation Using Group Delay Spectral Matrix Factorization
Karan Nathwani, Anurag Kumar and Rajesh Hegde
20th National Conference on Communications (NCC), 2014

Event Detection in Short Duration Audio Using Gaussian Mixture Model and Random Forest Classifier
Anurag Kumar, Rajesh Hegde, Rita Singh and Bhiksha Raj
21st European Signal Processing Conference (EUSIPCO), 2013

Audio event detection from acoustic unit occurrence patterns
Anurag Kumar, Pranay Dighe, Rita Singh, Sourish Chaudhuri and Bhiksha Raj
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012


OTHERS - Other academic information