Music Information Retrieval at Spotify and the Future of ML Tooling with Andreas Jansson of Replicate

Machine Learning Engineered

Dec 15 2020 • 1 hr 33 mins

Andreas Jansson is the co-founder of Replicate, a version control tool for machine learning. He holds a PhD from City University of London in Music Informatics and was previously a machine learning engineer at Spotify, researching and applying algorithms for music information retrieval. Learn more about Andreas: https://replicate.ai/ (https://replicate.ai/) https://www.linkedin.com/in/janssonandreas/ (https://www.linkedin.com/in/janssonandreas/) Every Thursday I send out the most useful things I’ve learned, curated specifically for the busy machine learning engineer. Sign up here: http://bitly.com/mle-newsletter (http://bitly.com/mle-newsletter) Follow Charlie on Twitter: https://twitter.com/CharlieYouAI (https://twitter.com/CharlieYouAI) Subscribe to ML Engineered: https://mlengineered.com/listen (https://mlengineered.com/listen) Comments? Questions? Submit them here: http://bit.ly/mle-survey (http://bit.ly/mle-survey) Take the Giving What We Can Pledge: https://www.givingwhatwecan.org/ (https://www.givingwhatwecan.org/) Timestamps: 02:30 Andreas Jansson 07:30 Overview of music information retrieval (MIR) 13:30 Why use spectrograms and not raw audio? 19:55 The potential for transformers in MIR 22:45 Most exciting applications for ML in MIR 29:20 Challenges in putting ML into production 36:45 What Andreas imagines for the future of ML tools 41:45 Why he's building a tool for ML version control (http://replicate.ai/ (http://replicate.ai/)) 52:55 What Replicate enables via integration or as a platform 01:02:55 Learnings from doing customer discovery for Replicate 01:14:10 "Github for ML models and data" 01:22:30 Rapid fire questions Links: https://deepmind.com/blog/article/wavenet-generative-model-raw-audio (WaveNet: a generative model for raw audio) https://openaccess.city.ac.uk/id/eprint/19289/1/ (Singing Voice Separation with Deep U-Net CNNs) https://openaccess.city.ac.uk/id/eprint/23669/1/ (Joint Singing Voice Separation and F0 Estimation with Deep U-Net Architectures) https://www.arxiv-vanity.com/ (arXiv Vanity) https://replicate.ai/ (Replicate) https://discord.gg/QmzJApGjyE (Replicate's Discord)