Jean-Louis Durrieu, PhD: Research Page
[bio]
[interests]
[publications]
[software]
[links]
[personal]
Short Bio
I currently work as a post-doctoral research scientist at the Signal Processing Laboratory 5 (LTS5), at the Ecole Polytechnique Fédérale de Lausanne, in Switzerland. My work is funded by the Swiss CTI, through a project entitled "Objective Measures in AudioPhonology" in collaboration with the company SpeedLingua. We aim at designing objective measures to assess improvement during foreign language pronunciation training.
In 2010, I obtained my PhD degree from Telecom ParisTech, under the supervision of G. Richard and B. David. My PhD topic was on the extraction and estimation of the main melody, as well as on the separation of the lead instrument from its background accompaniment.
In 2007, I also obtained the Engineer degree from Telecom ParisTech, with majors in signal processing, mostly audio signal processing, statistical modelling and artificial intelligence.
[top]
Research Interests
- Methodology
- Statistical Signal Processing:
- Bayesian inference
- Stochastic models
- Algorithms: Expectation-Maximisation (EM), Variational Approximation
- Signal production models:
- Source/Filter models for speech and music instruments
- Gaussian Mixture Models (GMM), Hidden Markov Models (HMM)
- Non-negative Matrix Factorisation (NMF)
- Applications
- (Music) Audio Signal Processing: analysis, transcription, separation
- Speech processing: language identification, representations, enhancement
I enjoy investigating mathematical and statistical models for audio, music or speech processing. More specifically, I have been working on using many source/filter models for voice and various instruments along with techniques that decompose the signal into a basis of elementary components, namely the Nonnegative Matrix Factorization (NMF). This research was particularly successful at two tasks, the audio melody extraction and the lead instrument separation from the background music. At the moment, I more specifically work on language learning technologies, or in our case "computer-aided pronunciation training" (CAPT) software.
Apart from that, I would be keen on experiencing with bio-medical data. Fitting models and estimating parameters become even more enjoyable when you know your research can lead to useful applications, enlightening other people's lives!
[top]
Publications, Scientific dissemination
- JOURNAL PAPERS:
- 2011:
-
J.-L. Durrieu, B. David and G. Richard,
A Musically Motivated Mid-Level Representation For Pitch
Estimation And Musical Audio Source Separation,
IEEE Journal of Selected Topics on Signal Processing, October 2011, Vol. 5 (6), pp. 1180 - 1191. (First submission: September 2010). [link on IEEExplore][web][preprint][copyright]
- 2008:
- C. Févotte, N. Bertin and J.-L. Durrieu,
Nonnegative Matrix Factorization with the Itakura-Saito Divergence:
With Application to Music Analysis,
Neural Computation,
March 2009, Vol. 21, No. 3: 793 - 830.
[audio samples]
[bibtex]
This research was partly funded by the European
K-Space
project.
- PEER-REVIEWED INTERNATIONAL CONFERENCES:
- 2012
- J.-L. Durrieu and J.-P. Thiran,
Musical Audio Source Separation Based on User-Selected
F0 Track, the
International Conference on Latent Variable Analysis and
Signal Separation (LVA/ICA),
March 12-15, 2012, Tel-Aviv, Israel.
[web]
-
J.-L. Durrieu, F. Kelly and J.-P. Thiran,
Lower and upper bounds for approximation of the
Kullback-Leibler divergence between two Gaussian mixture models
,
IEEE
ICASSP,
March 25-30 2012, Kyoto, Japan.
- 2011
- J.-L. Durrieu and J.-P. Thiran,
Sparse Non-Negative Decomposition Of
Speech Power Spectra For Formant Tracking,
IEEE
ICASSP,
May 22-27 2011, Prague, Czech Republic.
- A. Ozerov, C. Févotte, R. Blouet and J.-L. Durrieu,
Multichannel nonnegative tensor factorization with structured
constraints for user-guided audio source separation,
IEEE
ICASSP,
May 22-27 2011, Prague, Czech Republic.
- F. Weninger, J.-L. Durrieu, F. Eyben, G. Richard,
B. Schuller,
Combining Monaural Source Separation With Long Short-Term Memory
for Increased Robustness in Vocalist Gender Recognition,
IEEE ICASSP,
May 22-27 2011, Prague, Czech Republic.
- 2010
- R. Foucard, J.-L. Durrieu, M. Lagrange and G. Richard,
Multimodal similarity between musical streams for cover
version detection,
ICASSP,
14 - 19 March 2010, Dallas, Texas, USA.
- 2009
- J.-L. Durrieu, A. Ozerov, C. Févotte,
G. Richard and B. David,
Main Instrument Separation From Stereophonic Audio Signals
Using A Source/Filter Model,
EUSIPCO,
24-28 August 2009, Glasgow, Scotland.
[pdf]
[presentation (4.5Mo)]
[audio examples]
This research is partly funded by the OSEO project
Quaero
and partly funded by the French ANR project
SARAH
(StAndardisation du Remastering Audio Haute-definition)
- J.-L. Durrieu, G. Richard and B. David,
An Iterative Approach to Monaural Musical Mixture De-Soloing,
ICASSP,
April 19-24 2009, Taipei, Taiwan.
[pdf]
[poster]
[audio examples]
[copyright]
This research is partly funded by the European
K-Space
project and by the OSEO project
Quaero.
- SEMINARS
-
J.-L. Durrieu,
Automatic Extraction of the Main Melody from
Polyphonic
Music Signals. With Application to Transcription and
Separation,
seminar at the EPFL, Lausanne, Switzerland, 4th December 2009.
[presentation (4.7Mo)]
-
J.-L. Durrieu,
Automatic Separation and Transcription of the Main Melody
from Polyphonic Music Signals,
seminar at IRCAM, Paris, 30th November 2009.
[
presentation (4.8Mo)]
-
J.-L. Durrieu,
Automatic Transcription and Separation of the Main Melody
from Polyphonic Music Signals,
seminar at METISS group, IRISA, Rennes, 16th April 2009.
[
presentation (13.3Mo)]
- THESES
- PhD thesis:
Automatic Transcription and Separation of the Main Melody
in Polyphonic Music Signals,
defended on Friday May 7th 2010, 2pm, at Telecom ParisTech.
[
pdf]
[web]
- Master thesis: A Query_By_Humming System ,
July 2006, research internship at the "FIT" laboratory,
under Professor Xu MingXing's supervision.
[pdf]
- EVALUATION CAMPAIGNS:
- 2011:
- SiSEC, Professionally Produced Music Recordings.
Mostly best SDR on each submitted individual extracted vocals.
[web]
[results
(dev set)
(test set)]
- 2009:
- Music Information Retrieval Evaluation eXchange (MIREX),
Audio Melody Extraction (AME). Best overall accuracy on
MIREX08 dataset, 2nd best global overall accuracy.
[web]
[results]
- 2008:
-
Music Information Retrieval Evaluation eXchange (MIREX), Audio Melody Extraction (AME). Best overall accuracy on MIREX08 dataset, 2nd best global overall accuracy. [web][results]
- SiSEC, Professionally Produced Music Recordings. [web][results]
- ACTIVITIES AS A SCIENTIFIC PEER:
- Reviewer for international conferences and international journals:
IEEE Signal Processing Letters (SPL),
IEEE Journal of Selected Topics in Signal Processing (JSTSP),
IEEE Transactions on Audio, Speech and Language Processing (TASLP),
International Conference of the International Society for Music Information Retrieval (ISMIR),
Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC).
-
MISC.
-
2009:
-
J. Weil, T. Sikora, J.-L. Durrieu and G. Richard,
Beat Tracking Using The Delta-Phase Matrix,
research report, Telecom Paristech, 2009.
[
tech-rep]
[top]
Software
- Main instrument source separation program, Python/NumPy/SciPy.
See this companion site, for our JSTSP'2011 article.
- Fundamental frequency saliance visualization, Vamp Plugin.
- User-guided source separation, Python/NumPy/SciPy/PyQt4 or PySide.
See the SiSEC 2011 - LVA/ICA 2012 companion website.
[top]
Links
- Friends, colleagues:
- Those with permanent positions (as of 12/01/2012):
- And the others (which you may find using your favorite Internet search engine): Simon Arberet, Romain Hennequin, Antoine Liutkus, Thomas Maugey, Laurent Oudre, Alexey Ozerov, Jan Weil, ...
- Myself:
Facebook,
Google Plus,
LinkedIn,
Twitter
[top]
Personal
I play the oboe and the saxophone, like playing in chamber music orchestras (woodwind quintets, wind octets, sax quartets), and I am keen on table tennis and martial arts (Taiji, Nunchuks).
[top]
Copyright information for the publications
Copyright 2008 IEEE.
Published in the IEEE 2008 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2008), scheduled for March 30 - April 4, 2008 in Las Vegas, Nevada, U.S.A.
Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works, must be obtained from the IEEE. Contact: Manager, Copyrights and Permissions / IEEE Service Center / 445 Hoes Lane / P.O. Box 1331 / Piscataway, NJ 08855-1331, USA. Telephone: + Intl. 908-562-3966.
Copyright 2009 IEEE.
Published in the IEEE 2009
International Conference on Acoustics, Speech, and Signal Processing
(ICASSP
2009), scheduled for April 19 - 24, 2009 in Taipei, Taiwan
Personal use of this material is permitted. However, permission to
reprint/republish this material for advertising or promotional purposes
or for
creating new collective works for resale or redistribution to servers
or lists,
or to reuse any copyrighted component of this work in other works, must
be
obtained from the IEEE. Contact: Manager, Copyrights and Permissions /
IEEE
Service Center / 445 Hoes Lane / P.O. Box 1331 / Piscataway, NJ
08855-1331,
USA. Telephone: + Intl. 908-562-3966.
© 2011 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Jean-Louis Durrieu
Last modified: Thu Jan 12 13:37:20 CET 2012