Alexey Ozerov

More recent picture

address:

I am now with Technicolor, Cesson Sévigné, France

e-mail:

myFirstName.myLastName@technicolor.com

Research Interests | Short Bio | C. V. | Demonstrations | Software | Evaluations | Responsibilities | Projects | Collaborators | Publications


Research Interests

Statistical signal processing, machine learning and information theory, including:

Application areas:


Short Bio

Alexey Ozerov holds a Ph.D. in Signal Processing from the University of Rennes 1 (France). He worked towards this degree from 2003 to 2006 in the labs of France Telecom R&D and in collaboration with the IRISA institute. Earlier, he received an M.Sc. degree in Mathematics from the Saint-Petersburg State University (Russia) in 1999 and an M.Sc. degree in Applied Mathematics from the University of Bordeaux 1 (France) in 2003. From 1999 to 2002, Alexey worked at Terayon Communicational Systems (USA) as a R&D software engineer, first in Saint-Petersburg and then in Prague (Czech Republic). He was for one year (2007) in Sound and Image Processing Lab at KTH (Royal Institute of Technology), Stockholm, Sweden, for one year and half (2008-2009) in TELECOM ParisTech / CNRS LTCI - Signal and Image Processing (TSI) Department, and for two years (2009 - 2011) with METISS team of IRISA / INRIA - Rennes. Now he is with Technicolor R&D departement in Cesson Sévigné, France.


Curriculum Vitae

in English: PDF, PostScript       in French: PDF, PostScript


Demonstrations

One microphone singing voice separation

One microphone source separation

Multichannel nonnegative matrix factorization for convolutive blind source separation

Factorial scaled hidden Markov model for single channel speech / music separation

SARAH project istrument extraction demos:

User-Guided Audio Source Separation via Multichannel Nonnegative Tensor Factorization With Structured Constraints

Using the FASST source separation toolbox for noise robust speech recognition

Coding-based Informed Source Separation


Software

Multichannel nonnegative matrix factorization toolbox (in Matlab)

BSS Locate - A toolbox for source localization in stereo convolutive audio mixtures (in Matlab)

FASST - Flexible Audio Source Separation Toolbox (in Matlab)


Participation in Evaluation Campaigns


Public Responsibilities


Projects


Collaborators


Publications

IEEE Copyright declimer conserning all IEEE papers reprints posted below: Copyright © 2005-2011 IEEE. This material is posted here with permission of the IEEE. Internal or personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution must be obtained from the IEEE by writing to pubs-permissions@ieee.org. By choosing to view these documents, you agree to all provisions of the copyright laws protecting it.

Submitted

  1. M. Li, J. Klejsa, A. Ozerov and W. B. Kleijn, "Audio Coding with Power Spectral Density Preserving Quantization," IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'12), Kyoto, Japan, March, 2012. (submitted)

  2. M. Li, A. Ozerov, J. Klejsa and W. B. Kleijn, "Asymptotically optimal distribution preserving quantization for stationary Gaussian processes," IEEE Transactions on Communications (submitted)

  3. S. Arberet, A. Ozerov, F. Bimbot and R. Gribonval, "A tractable framework for estimating and combining spectral source models for audio source separation," Signal Processing, special issue on "Latent Variable Analysis and Signal Separation" (submitted)

    Research report: HAL

Journal Articles

  1. E. Vincent, S. Araki, F. Theis, G. Nolte, P. Bofill, H Sawada, A Ozerov, V. Gowreesunker, D. Lutter, N.Q.K. Duong, "The Signal Separation Evaluation Campaign (2007-2010): Achievements and remaining challenges," Signal Processing, special issue on "Latent Variable Analysis and Signal Separation" (to appear)

    Article: HAL

  2. C. Blandin, A. Ozerov and E. Vincent, "Multi-source TDOA estimation in reverberant audio using angular spectra and clustering," Signal Processing, special issue on "Latent Variable Analysis and Signal Separation" (to appear)

    Article: HAL,       Code

  3. A. Ozerov, E. Vincent and F. Bimbot, "A general flexible framework for the handling of prior information in audio source separation," IEEE Trans. on Audio, Speech and Lang. Proc. (to appear)

    Article: HAL,       Code and Audio Examples

  4. A. Ozerov and W. B. Kleijn, "Asymptotically optimal model estimation for quantization," IEEE Transactions on Communications, vol. 59, no. 4, pp. 1031-1042 , April 2011.

    Article: PDF

  5. A. Ozerov and C. Févotte, "Multichannel nonnegative matrix factorization in convolutive mixtures for audio source separation," IEEE Trans. on Audio, Speech and Lang. Proc. special issue on Signal Models and Representations of Musical and Environmental Sounds, vol. 18, no. 3, pp. 550-563, March 2010.

    Article: PDF,       Audio Examples,       Code

  6. A. Ozerov, P. Philippe, F. Bimbot and R. Gribonval, "Adaptation of Bayesian models for single channel source separation and its application to voice / music separation in popular songs," IEEE Trans. on Audio, Speech and Lang. Proc., special issue on Blind Signal Proc. for Speech and Audio Applications, vol. 15, no. 5, pp. 1564-1578, July 2007.

    Article: PDF       Audio Examples,

  7. A. Ozerov, R. Gribonval, P. Philippe and F. Bimbot, "Choix et adaptation de modèles statistiques pour la séparation de voix chantée à partir d'un seul microphone," Traitement du signal, vol. 24, no. 3, pp. 211-224, 2007.

    abstract in English: HTML, preprint in French: PDF

Conferences

  1. A. Ozerov, A. Liutkus, R. Badeau and G. Richard, "Informed source separation: source coding meets source separation," In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA'11), Mohonk, NY, Oct. 16-19, 2011.

    Article: PDF,       Audio Examples

  2. A. Ozerov, M. Lagrange and E. Vincent, "GMM-based classification from noisy features," International Workshop on Machine Listening in Multisource Environments (CHiME 2011), pages 30-35, Florence, Italy, September, 2011.

    Article: PDF, Slides: PDF

  3. A. Ozerov and E. Vincent, "Using the FASST source separation toolbox for noise robust speech recognition," International Workshop on Machine Listening in Multisource Environments (CHiME 2011), pages 86-87, Florence, Italy, September, 2011.

    Article: PDF, Poster: PDF,       Audio Examples

  4. A. Ozerov, C. Févotte, R. Blouet and J.-L. Durrieu, "Multichannel nonnegative tensor factorization with structured constraints for user-guided audio source separation," IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'11), pages 257-260, Prague, May, 2011.

    Article: PDF, Poster: PDF,       Audio Examples

  5. C. Blandin, E. Vincent and A. Ozerov, "Multi-source TDOA estimation using SNR-based angular spectra," IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'11), pages 2616 - 2619, Prague, May, 2011.

    Article: PDF, Poster: PDF,       Code

  6. A. Ozerov, E. Vincent and F. Bimbot, "A general modular framework for audio source separation", In 9th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA'10), pages 33 - 40, Saint-Malo, France, Sep. 27-30, 2010.

    Article: PDF, Poster: PDF

  7. S. Araki, A. Ozerov, V. Gowreesunker, H. Sawada, F. Theis, G. Nolte, D. Lutter and N.Q.K. Duong, "The 2010 Signal Separation Evaluation Campaign (SiSEC2010): - Audio source separation -", In 9th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA'10), pages 114 - 122, Saint-Malo, France, Sep. 27-30, 2010.

    Article: PDF

  8. S. Araki, F. Theis, G. Nolte, D. Lutter, A. Ozerov, V. Gowreesunker, H. Sawada and N.Q.K. Duong, "The 2010 Signal Separation Evaluation Campaign (SiSEC2010): - Biomedical source separation -", In 9th International Conference on Latent Variable Analysis and Signal Separation (LVA/ICA'10), pages 123 - 130, Saint-Malo, France, Sep. 27-30, 2010.

    Article: PDF

  9. C. Févotte and A. Ozerov, "Notes on nonnegative tensor factorization of the spectrogram for audio source separation : statistical insights and towards self-clustering of the spatial cues", In 7th International Symposium on Computer Music Modeling and Retrieval (CMMR 2010), 2010.

    Article: PDF,       Audio Examples,       Code

  10. S. Arberet, A. Ozerov, N.Q.K. Duong, E. Vincent, R. Gribonval, F. Bimbot and P. Vandergheynst, "Nonnegative matrix factorization and spatial covariance model for under-determined reverberant audio source separation", In 10th International Conference on Information Sciences, Signal Processing and their applications (ISSPA 2010), 2010.

    Article: PDF

  11. A. Ozerov, C. Févotte and M. Charbit, "Factorial scaled hidden Markov model for polyphonic audio representation and source separation", In IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA'09), Mohonk, NY, Oct. 18-21, 2009.

    Article: PDF, Slides: PDF,       Audio Examples

  12. J.-L. Durrieu, A. Ozerov, C. Févotte, G. Richard and B. David, "Main instrument separation from stereophonic audio signals using a source/filter model", In EUSIPCO, 17th European Signal Processing Conference, Glasgow, Scotland, August 24-28, 2009.

    Article: PDF,       Audio Examples

  13. A. Ozerov and C. Févotte, "Multichannel nonnegative matrix factorization in convolutive mixtures. With application to blind audio source separation", In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'09), pages 3137-3140, Taipei, Taiwan, April 19-24, 2009.

    Article: PDF, Poster: PDF,       Audio Examples,       Code

  14. A. Ozerov and W. B. Kleijn, "Optimal parameter estimation for model-based quantization," In IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP'09), pages 2497-2500, Taipei, Taiwan, April 19-24, 2009.

    Article: PDF, Poster: PDF

  15. S. Arberet, A. Ozerov, R. Gribonval and F. Bimbot, "Blind spectral-GMM estimation for underdetermined instantaneous audio source separation", In Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation (ICA'09), pages 751-758, Paraty, Brazil, March 15-18, 2009.

    Article: PDF

  16. I. Potamitis and A. Ozerov, "Single channel source separation using static and dynamic features in the power domain", In EUSIPCO, 16th European Signal Processing Conference, Laussane, Switzerland, August 25-29, 2008.

    Article: PDF,       Audio Examples

  17. A. Ozerov and W. B. Kleijn, "Flexible quantization of audio and speech based on the autoregressive model," In IEEE Asilomar Conference on Signals, Systems, and Computers (Asilomar CSSC'07), pages 535-539, Pacific Grove, CA, Nov. 4-7, 2007.

    Article: PDF, Poster: PDF

  18. R. Heusdens, W. B. Kleijn and A. Ozerov, "Entropy-constrained high-resolution lattice vector quantization using a perceptually relevant distortion measure," In IEEE Asilomar Conference on Signals, Systems, and Computers (Asilomar CSSC'07), pages 2075-2079, Pacific Grove, CA, Nov. 4-7, 2007.

    Article: PDF

  19. W. B. Kleijn and A. Ozerov, "Rate distribution between model and signal," In IEEE Worksh. on Apps. of Signal Processing to Audio and Acoustics (WASPAA'07), pages 243-246, Mohonk, NY, Oct. 2007.

    Article: PDF

  20. A. Ozerov, P. Philippe, R. Gribonval and F. Bimbot, "One microphone singing voice separation using source-adapted models," In IEEE Worksh. on Apps. of Signal Processing to Audio and Acoustics (WASPAA'05), pages 90-93, Mohonk, NY, Oct. 2005.

    Article: PDF, Slides: PDF,       Audio Examples

  21. A. Ozerov, R. Gribonval, P. Philippe and F. Bimbot, "Séparation voix / musique à partir d'enregistrements mono : quelques remarques sur le choix et l'adaptation des modèles," In GRETSI'05 Symposium on Signal and Image Processing, Louvain-la-Neuve, Belgique, Sept. 2005.

    abstract in English: HTML, full text in French: PDF, PostScript,       Audio Examples

  22. G. Gravier, L. Benaroya, A. Ozerov, R. Gribonval and F. Bimbot, "Séparation de sources à partir d'un seul capteur pour la reconnaissance robuste de la parole," In Journées d'Etude sur la Parole (JEP'04), April 2004.

    abstract in English: HTML, full text in French: PDF

Patents

  1. A. Ozerov, C. Févotte and R. Blouet, "Automatic source separation via joint use of segmental information and spatial diversity" US patent 13021692, 2011 (filled).

  2. S. Arberet, A. Ozerov, R. Gribonval and F. Bimbot, "Procédé et un dispositif d'estimation de signaux de source issus d'un signal de mélange" French patent 2939933, 2010 (published) and international extension WO2010/076412, 2010 (published).

Technical reports

  1. A. Ozerov, S. Essid and M. Charbit, "Reconnaissance des instruments dans la musique polyphonique par décomposition NMF et classification SVM," Technical Report TELECOM ParisTech 2009D014, July 2009.

    abstract in English: HTML, full text in French: PDF

Theses

  1. A. Ozerov. "Adaptation de modèles statistiques pour la séparation de sources mono-capteur. Application à la séparation voix / musique dans les chansons." PhD thesis, University of Rennes 1, 2006.

    abstract in English: HTML, full text in French: PDF, PostScript

  2. A. Ozerov. "Représentations robustes pour la reconnaissance automatique de la parole". MSc thesis, DESS "Scientific Calculation and Applications", University of Bordeaux 1, 2003.

    abstract in English: HTML, full text in French: PDF, PostScript

  3. A. Ozerov. "A criterion of nondisappearance of invariant sets satisfying Krasovsky property under C0 perturbations of right part of the system". MSc thesis, department of Ordinary Differential Equations, Mathematics and Mechanics faculty, St. Petersburg State University, 1999.

    abstract in English: PDF, full text in Russian: PDF

Miscellaneous

  1. A. Ozerov, C. Févotte and R. Blouet, "The SARAH project: Standardization of High-Definition Audio Remastering", Demo presented at IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA'09), Mohonk, NY, Oct. 18-21, 2009.

    Poster: PDF