@GoodGood014 Profile picture

Hao Hao Tan

@GoodGood014

AI/ML @BandLab. ex @tiktok_us @sutdsg. Machine learning & music tech. 🇲🇾 🇸🇬

Joined December 2011
Similar User
Carl Thomé photo

@carlthome

Stefan Lattner photo

@deeplearnmusic

Javier Nistal photo

@latentspaces

Pedro Sarmento photo

@umpedronosapato

Emir Demirel photo

@_emir_demirel_

Christian Steinmetz photo

@csteinmetz1

Néstor Nápoles López photo

@napulen

Yi-Hsuan Yang photo

@affige_yang

Qiuqiang Kong photo

@QiuqiangK

Magdalena Fuentes photo

@mfu3ntes

Jordi Pons photo

@jordiponsdotme

Elio Quinton photo

@elio_elioo

Ilaria Manco photo

@Ilaria__Manco

Dmitry Bogdanov photo

@di_bogdanov

Nicholas J. Bryan photo

@NicholasJBryan

Hao Hao Tan Reposted

If you’re a Member, we’ve also added some serious Studio upgrades. Build on your MIDI ideas quickly with Smart Tools, and give your ideas that extra push with Extend, Layer, and Recompose! 🦾 (3/5)


MIDI Smart Tools is finally out on BandLab! Think of it as MS Copilot, but in the context of a music DAW. Heavily involved in this with my teammates 😀🎶

Wanna see how far your creativity can go with this AI-powered trio? Peep this tutorial to discover new ways to create with Smart Tools! 🔧 blog.bandlab.com/smart-tools-ba… (5/5)



🧮 Just finished writing a new blog post on RVC, one of the most popular voice conversion project on GitHub: gudgud96.github.io/2024/09/26/ann… If you are interested in Fake Drake & AI covers, and want to dive a little bit deeper on the technical details, I hope this article is for you.


"More AI to come" to sports, from F1 racing to Premier League football: youtube.com/watch?v=A_pxpJ…


Hao Hao Tan Reposted

Today, we release several Moshi artifacts: a long technical report with all the details behind our model, weights for Moshi and its Mimi codec, along with streaming inference code in Pytorch, Rust and MLX. More details below 🧵 ⬇️ Paper: kyutai.org/Moshi.pdf Repo:…


Hao Hao Tan Reposted

Audio Match Cutting Finding and Creating Matching Audio Transitions in Movies and Videos discuss: huggingface.co/papers/2408.10… A "match cut" is a common video editing technique where a pair of shots that have a similar composition transition fluidly from one to another. Although…


Hao Hao Tan Reposted

🥳We just uploaded another ISMIR paper to arXiv: Stem-JEPA: A Joint-Embedding Predictive Architecture for Musical Stem Compatibility Estimation A. Riou, S. Lattner, G. Hadjeres, M. Anslow, G. Peeters 📜 Paper: arxiv.org/abs/2408.02514 🌍 Code: github.com/SonyCSLParis/S… For more…

Glad to announce that Stem-JEPA has been accepted to @ISMIRConf ! In this work, we tackle the task of musical stem compatibility estimation (what “fits” together) as a representation learning problem. (1/7) Paper: arxiv.org/abs/2408.02514 Code: github.com/SonyCSLParis/S…



Hao Hao Tan Reposted

I gave a 1-hour talk about generative modelling at the EEML 2024 summer school last month. It's mostly an intuitive look at how and why diffusion models actually work -- not unlike the content of my recent blog posts. All summer school talks will be freely available online!🙏

EEML'24 Day 1 videos are out! 🇷🇸 * Intro to DL (@alfcnz): youtu.be/1bBOneUMu3Y?si… * Generative modelling + iterative refinement (@sedielem): youtu.be/9BHQvQlsVdE?si… * AI for Good (@weballergy): youtu.be/tJSicw7DPVU?si… * Reasoning (@backprop2seed & I): youtu.be/CyIuM5eQZ5A?si…



Hao Hao Tan Reposted

We built a real-time music jamming system using RL and generative models -- you can play along with this model and learn more about our work at #ICML2024 🎶! 📄 paper: twtr.to/UFe6X 🌐 website: twtr.to/v9E5y 🕐 Tue 23 Jul 1:30 - 3 p.m. CEST 📍 Hall C 4-9 🧵


Hao Hao Tan Reposted

🔥 How I broke the internet today and what lessons can we learn from it? #Crowdstrike 🧐 Several things that make it a good fake that worked: 👇 1. No culprit named yet, I bring it on a platter, people like to have a culprit. 2- The culprit seems completely stupid, he is proud…


Just finding this super interesting as a Liverpool fan, who was so proud of our 2019 UCL corner kick that was "taken quickly". Now see geometric deep learning in action to study corner kicks! deepmind.google/discover/blog/…


Hao Hao Tan Reposted

Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity Video-to-audio (V2A) generation leverages visual-only video features to render plausible sounds that match the scene. Importantly, the generated sound onsets should match the visual actions that are…


Hao Hao Tan Reposted

Moshi and Alex going on a space adventure 🚀


Hao Hao Tan Reposted

📢 Okio is closing its doors. We are ending all product offerings & open source support, but our code will remain open on GitHub. Heartfelt gratitude to all former employees, customers, users, contributors & investors for their incredible support & being part of our journey.


Hao Hao Tan Reposted

Training convnets on waveforms is hard—far harder than on magnitude spectrograms. "Instabilities in Convnets for Raw Audio" approaches this phenomenon from the perspective of sensitivity to initialization. IEEE Signal Processing Letters vol. 31 preprint: hal.science/hal-04528116

lostanlen's tweet image. Training convnets on waveforms is hard—far harder than on magnitude spectrograms.

"Instabilities in Convnets for Raw Audio" approaches this phenomenon from the perspective of sensitivity to initialization.

IEEE Signal Processing Letters vol. 31
preprint: <a style="text-decoration: none;" rel="nofollow" target="_blank" href="https://t.co/Cj6BAGffQR">hal.science/hal-04528116</a>

Hao Hao Tan Reposted

hi music people, i wrote a tutorial on large language models and music information retrieval. of course it's called.. LLMs <3 MIR 🥁 have fun! llms-heart-mir.github.io/tutorial


Loading...

Something went wrong.


Something went wrong.