Music Understanding

See also Computer Accompaniment and Beat Tracking

This page is divided into several categories:

General Overviews, mostly conference presentations that describe a number of research projects I have worked on.
Style Classification, using computers and machine learning to label different styles of music.
Music Information Retrieval, especially systems that search using melodies as a query (query-by-humming).
Structural Analysis, systems that analyze music to obtain a description such as AABA, or discover other patterns.
Music Alignment, systems that time-align two performances or match symbolic representations to audio. (Computer accompaniment systems also perform real-time alignment to follow a score, but that application is so important and distinctive that I have put those papers in their own group.)

Expression, automatic approaches to performing music expressively.

General Overviews
These are all overviews of my work in this area. The IAKTA/LIST paper is the most up-to-date, although it is itself quite dated. Since then, I've worked on Query-by-Humming, polyphonic score alignment, music search by polyphonic alignment, music structure analysis, and beat tracking informed by music structure.

Dannenberg, “Music Understanding,” 1987/1988 Computer Science Research Review, Carnegie Mellon School of Computer Science, pp. 19-28.

[Postscript Version] [Adobe Acrobat (PDF) Version]

Dannenberg, “Recent Work In Real-Time Music Understanding By Computer,” Music, Language, Speech, and Brain, Wenner-Gren International Symposium Series, Sundberg, Nord, and Carlson, ed., Macmillan, 1991, pp. 194-202.

[Postscript Version]

Dannenberg, “Computerbegleitung und Musicverstehen,” in Neue Musiktechnologie, Bernd Enders, ed., Schott, 1993, Mainz, pp. 241-252.

Dannenberg, “Recent Work in Music Understanding,” in Proceedings of the 11th Annual Symposium on Small Computers in the Arts, Philadelphia, PA November 15-17, 1991. Philadelphia: SCAN, November 1991, pp. 9-14.

ABSTRACT: Interaction with computers in musical performances is very much limited by a lack of music understanding by computers. If computers do not understand musical structures such as rhythmic units, chords, keys, and phrases, then interaction with computers will necessarily be difficult and cumbersome. Research into Music Understanding by computer aims to raise the level of human computer interaction in musical tasks including live music performance.

[Postscript Version] [Adobe Acrobat (PDF) Version]

Dannenberg, “Music Understanding and the Future of Computer Music,” Contemporary Music Review, (to appear).

Dannenberg, “Music Understanding by Computer,” in IAKTA/LIST International Workshop on Knowledge Technology in the Arts Proceedings, International Association of Knowledge Technology in the Arts, Inc. in cooperation with Laboratories of Image Information Science and Technology, Osaka Japan, pp. 41-56 (September 16, 1993).

ABSTRACT. Music Understanding refers to the recognition or identification of structure and pattern in musical information. Music understanding projects initiated by the author are discussed. In the first, Computer Accompaniment, the goal is to follow a performer in a score. Knowledge of the position in the score as a function of time can be used to synchronize an accompaniment to the live performer and automatically adjust to tempo variations. In the second project, it is shown that statistical methods can be used to recognize the location of an improviser in a cyclic chord progression such as the 12-bar blues. The third project, Beat Tracking, attempts to identify musical beats using note-onset times from a live performance. Parallel search techniques are used to consider several hypotheses simultaneously, and both timing and higher-level musical knowledge are integrated to evaluate the hypotheses. The fourth project, the Piano Tutor, identifies student performance errors and offers advice. The fifth project studies human tempo tracking with the goal of improving the naturalness of automated accompaniment systems.

[Postscript Version] [Adobe Acrobat (PDF) Version]

Dannenberg, “Artificial Intelligence, Machine Learning, and Music Understanding,” in Proceedings of the Brazilian Symposium on Computer Music (SBCM2000), Curitiba, Brazil, (2000).

ABSTRACT: Artificial Intelligence and Machine Learning are enabling many advances in the area of music. Three computer music problems are described in which Machine Learning promises to solve problems and advance the state of the art. These are: computer accompaniment, music understanding in interactive compositions, and music synthesis. Machine Learning plays an important role in dealing with poorly defined problems where data is subject to noise and other variation, and where complexity rules out direct, handcrafted solutions. These characteristics are typical in sophisticated computer music systems. Machine learning promises to enable more natural communication between machines and musicians.

[Adobe Acrobat (PDF) Version]

Style Classification

This work is about getting a computer music system to listen to a performance and determine aspects of style, such as:

Improvisational style: frantic, lyrical, syncopated, ...
Instrumentalist: ala Miles Davis, Louis Armstrong, ...
Composer: Mozartian, Bach-like, ...
Texture: homophonic, polyphonic, ...
Emotion,
(This list could go on and on.)

Dannenberg, Thom, and Watson, “A Machine Learning Approach to Musical Style Recognition,” in 1997 International Computer Music Conference, International Computer Music Association (September 1997), pp. 344-347.

ABSTRACT: Much of the work on perception and understanding of music by computers has focused on low-level perceptual features such as pitch and tempo. Our work demonstrates that machine learning can be used to build effective style classifiers for interactive performance systems. We also present an analysis explaining why these techniques work so well when hand-coded approaches have consistently failed. We also describe a reliable real-time performance style classifier.
[Postscript Version] [Adobe Acrobat (PDF) Version]

Han, Rho, Dannenberg, and Hwang, “SMERS: Music Emotion Recognition Using Support Vector Regression,” in Proceedings of the 10th International Conference on Music Information Retrieval (ISMIR 2009), (October 2009), pp. 651-656.

ABSTRACT: Music emotion plays an important role in music retrieval, mood detection and other music-related applications. Many issues for music emotion recognition have been addressed by different disciplines such as physiology, psychology, cognitive science and musicology. We present a support vector regression (SVR) based music emotion recognition system. The recognition process consists of three steps: (i) seven distinct features are extracted from music; (ii) those features are mapped into eleven emotion categories on Thayer's two-dimensional emotion model; (iii) two regression functions are trained using SVR and then arousal and valence values are predicted. We have tested our SVR-based emotion classifier in both Cartesian and polar coordinate systems empirically. The results indicate the SVR classifier in the polar representation produces satisfactory results which reach 94.55% accuracy, superior to the SVR (in Cartesian) and other machine learning classification algorithms such as SVM and GMM.
[Adobe Acrobat (PDF) Version]
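The SMERS abstract compares predicting emotion in Cartesian (valence, arousal) coordinates with predicting it in polar coordinates. The sketch below only illustrates those two representations and a nearest-category lookup on the two-dimensional plane; it does not train any regression. The function names, the four category labels, and their coordinates are hypothetical placeholders, not the eleven categories or the feature set used in the paper.

```python
import math
from typing import Dict, Tuple

# Hypothetical category centers as (valence, arousal) pairs in [-1, 1].
# These labels and positions are illustrative assumptions only.
CATEGORY_CENTERS: Dict[str, Tuple[float, float]] = {
    "excited": (0.7, 0.7),
    "angry": (-0.7, 0.7),
    "sad": (-0.7, -0.7),
    "calm": (0.7, -0.7),
}


def to_polar(valence: float, arousal: float) -> Tuple[float, float]:
    """Convert a (valence, arousal) point to (radius, angle in radians)."""
    return math.hypot(valence, arousal), math.atan2(arousal, valence)


def to_cartesian(radius: float, angle: float) -> Tuple[float, float]:
    """Convert (radius, angle in radians) back to (valence, arousal)."""
    return radius * math.cos(angle), radius * math.sin(angle)


def nearest_category(valence: float, arousal: float) -> str:
    """Return the label whose center is closest to the predicted point."""
    point = (valence, arousal)
    return min(CATEGORY_CENTERS,
               key=lambda name: math.dist(point, CATEGORY_CENTERS[name]))
```

In a polar setup, a regressor would predict radius and angle instead of valence and arousal, and the prediction would be converted back with `to_cartesian` before the category lookup.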

Dannenberg, “Style in Music,” in The Structure of Style: Algorithmic Approaches to Understanding Manner and Meaning, Shlomo Argamon, Kevin Burns, and Shlomo Dubnov, eds., Berlin: Springer-Verlag. 2010, pp. 45-58.

ABSTRACT: Because music is not objectively descriptive or representational, the subjective qualities of music seem to be most important. Style is one of the most salient qualities of music, and in fact most descriptions of music refer to some aspect of musical style. Style in music can refer to historical periods, composers, performers, sonic texture, emotion, and genre. In recent years, many aspects of music style have been studied from the standpoint of automation: How can musical style be recognized and synthesized? An introduction to musical style describes ways in which style is characterized by composers and music theorists. Examples are then given where musical style is the focal point for computer models of music analysis and music generation.
[Adobe Acrobat (PDF) Version]

Music Information Retrieval

This work is mostly focussed on retrieval from melodic databases using a sung or hummed query as the search key. This raises many issues relating to melodic similarity, music representation, and pitch recognition. Many of the melodic similarity techniques are related to earlier work in Computer Accompaniment.

Dannenberg, Foote, Tzanetakis, and Weare, “Panel: New Directions in Music Information Retrieval,” in Proceedings of the 2001 International Computer Music Conference, International Computer Music Association, (September 2001), pp. 52-59.

Mazzoni and Dannenberg, “Melody Matching Directly from Audio,” in ISMIR 2001 2nd Annual International Symposium on Music Information Retrieval, Bloomington: Indiana University, (2001), pp. 73-82.

ABSTRACT: In this paper we explore a technique for content-based music retrieval using a continuous pitch contour derived from a recording of the audio query instead of a quantization of the query into discrete notes. Our system determines the pitch for each unit of time in the query and then uses a time-warping algorithm to match this string of pitches against songs in a database of MIDI files. This technique, while much slower at matching, is usually far more accurate than techniques based on discrete notes. It would be an ideal technique to use to provide the final ranking of candidate results produced by a faster but less robust matching algorithm.
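As a rough illustration of the idea in this abstract, the sketch below matches a frame-by-frame pitch contour against candidate melodies with textbook dynamic time warping. It is not the paper's algorithm: the function names, the mean-centering used for transposition invariance, and the representation of each song as a plain list of MIDI pitch numbers are all assumptions made for the example.

```python
from typing import Dict, List, Sequence


def center(contour: Sequence[float]) -> List[float]:
    """Subtract the mean pitch so matching ignores the key that was sung."""
    mean = sum(contour) / len(contour)
    return [p - mean for p in contour]


def dtw_distance(query: Sequence[float], target: Sequence[float]) -> float:
    """Classic dynamic time warping cost between two pitch sequences."""
    q, t = center(query), center(target)
    inf = float("inf")
    # prev[j] is the best cost of aligning the query so far with t[:j].
    prev = [0.0] + [inf] * len(t)
    for i in range(1, len(q) + 1):
        curr = [inf] * (len(t) + 1)
        for j in range(1, len(t) + 1):
            local = abs(q[i - 1] - t[j - 1])  # pitch error in semitones
            curr[j] = local + min(prev[j], curr[j - 1], prev[j - 1])
        prev = curr
    return prev[len(t)]


def rank_songs(query: Sequence[float],
               database: Dict[str, Sequence[float]]) -> List[str]:
    """Return song names ordered from best to worst match."""
    scored = [(dtw_distance(query, contour), name)
              for name, contour in database.items()]
    return [name for _, name in sorted(scored)]


# Usage: a slightly out-of-tune rising query against three toy melodies.
query = [60.2, 60.1, 62.0, 62.1, 63.9, 64.0, 64.1]
songs = {"rising": [60, 62, 64], "flat": [60, 60, 60], "falling": [64, 62, 60]}
ranking = rank_songs(query, songs)
```

The quadratic cost of the inner loops is why an approach like this suits re-ranking a short candidate list rather than scanning a whole database.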
