Huck Yang @huckiyang Twitter Profile

Huck Yang

@huckiyang

Sr. Research Scientist @NVIDIAAI Generative Voice Correction | Ph.D. MSc @GeorgiaTech | Past: @GoogleAI @AmazonScience | 🗣️ education

365Posts 815Followers 647Following

Similar User

@mhnt1580

@YungSungChuang

@AcouIntel

@RoshanSSharma2

@jefflai108

@yhtu_

@alex_h_liu

@Sid_Arora_18

@jiatongshi

@GTL094144

@vinsdotton

@RuiLiu60711141

@iamyuanchung

@fishball_Lin

@jack978397

Huck Yang Reposted

Huck Yang

@huckiyang

14 Nov

Excited to share our new preprint, "CLaSP: Learning Concepts for Time-Series Signals from Natural Language Supervision"! Our model enables query-by-text/signal retrieval and zero-shot classification for time series data. Check it out: arxiv.org/abs/2411.08397

Huck Yang Reposted

Huck Yang

@huckiyang

15 Nov

🚨 Adaptive Decoding via Latent Preference Optimization 🚨 - New layer added to Transformer, selects decoding params automatically *per token* - Learnt via new method, Latent Preference Optimization - Outperforms any fixed temperature decoding method, choosing creativity or…

Huck Yang Reposted

Huck Yang

@huckiyang

14 Nov

60% of the pull requests (and probably more than 60% of the code) for our github issue resolver were written by coding agents in November, and the number keeps going up. Amazing times.

All Hands AI

@allhands_ai

14 Nov

📈So far, in November, OpenHands authored or co-authored ~60% of commits to the openhands-resolver github.com/All-Hands-AI/o… repo and ~20% of the main repo github.com/All-Hands-AI/O…

Huck Yang

@huckiyang

13 Nov

🚀 FastAdaSP on the speech-text token merging at SoTA SLMs will be presented at the oral session today at @emnlpmeeting by Eason and Jiaqi from @WavLab @shinjiw_at_cmu @LTIatCMU @NVIDIAAI ⏰Time: 10:50 AM - 11:00 AM, Wed, Nov 13, 2024 📍Location: Tuttle, Lower Terrace Level,…

Huck Yang Reposted

Huck Yang

@huckiyang

12 Nov

☄️Excited to share our #SLT2024 paper "Mamba-based Decoder-Only Approach with Bidirectional Speech Modeling for Speech Recognition"! Code: github.com/YoshikiMas/mad… arXiv: arxiv.org/abs/2411.06968 Mamba and Mamba-2 perform pretty well even completely without explicit attention😎

Huck Yang

@huckiyang

12 Nov

"LLM agents for Acoustics and Continuous signals" @emnlpmeeting 2024 gather session co-hosted with @PiotrZelasko and @_dmchan will be held in Tuesday Nov 12th Room Foster, Level 2. Feel free to join with us and open to all! 🗣️ - From Continuous Signals Modeling to Audio…

Huck Yang Reposted

Huck Yang

@huckiyang

5 Nov

What makes an agent a "conversational" agent? Introducing ReSpAct, reason, speak, act for building conversational agents. LLM based agent decides on when to interact with the user (or user agent) when it is needed, such as disambiguation, clarification, or simply stuck at an…

Vardhan Dongre

@Vardhan_Dongre

5 Nov

Ever felt that agents that “just follow orders” often fall short, especially when critical details are on the line? In our latest work, we introduce ReSpAct—an approach that transforms agents into true conversational partners🌐 vardhandongre.github.io/respact-llm/ 🧵 [1/n]

Huck Yang Reposted

Huck Yang

@huckiyang

25 Oct

It's a beautiful demonstration of the powers of GPT-4o, and also it's limitations. GPT-4o gives you a salad of claims from the RL literature, some written by over-hyped authors, at the end of which you are not sure if RL can really reason counterfactually (ie, at level 3), or…

Nando de Freitas

@NandoDF

24 Oct

Why does RL lead to causal understanding? 🧵🪡 GPT-4o: Reinforcement learning (RL) can lead to causal understanding because, by interacting with an environment, an agent learns not just correlations between actions and outcomes, but also the underlying cause-effect…

Huck Yang Reposted

Huck Yang

@huckiyang

23 Oct

👀 Exciting advancements in Automated Audio Captioning.✨ The CMU-NVIDIA team has unveiled a groundbreaking approach that leverages multi-agent collaboration and GPU technology to enhance audio-to-text systems. 🔍 Key innovations include: ✅ Multi-encoder fusion: Combining…

Huck Yang

@huckiyang

18 Oct

Time series with descriptive Texts! Very nice works by @yohekawag and @Hitachi

Yohei Kawaguchi

@yohekawag

17 Oct

Happy to announce our latest preprint: "Domain-Independent Text Generation for Time-Series Data"! Our method can generate descriptive texts from sensor-captured time-series signals, regardless of the domain. Check it out here: arxiv.org/abs/2409.16647

Huck Yang Reposted

Huck Yang

@huckiyang

8 Oct

Congratulations to @geoffreyhinton for winning the Nobel Prize in physics!!

Huck Yang Reposted

Huck Yang

@huckiyang

7 Oct

👏👏Check out the 2nd best-performing system in our Generative Error Correction Challenge for LLM-based emotion recognition (yet unreasonably rejected by SLT2024). They achieved 75.1% accuracy on ASR transcriptions of IEMOCAP using GPT-4o.

arXiv Sound

@ArxivSound

7 Oct

``Context and System Fusion in Post-ASR Emotion Recognition with Large Language Models,'' Pavel Stepachev, Pinzhen Chen, Barry Haddow, ift.tt/tCG7YmT

Huck Yang Reposted

Huck Yang

@huckiyang

4 Oct

Now HEAR this (not just watch) - We've got audio covered for generated videos 🔊 Introducing Movie Gen Audio, which adds 48kHz synced SFX and aligned music to amazing videos from Movie Gen Video (and other sources!) Super honored to work with this amazing team! More to come🔥🔥

AI at Meta

@AIatMeta

4 Oct

🎥 Today we’re premiering Meta Movie Gen: the most advanced media foundation models to-date. Developed by AI research teams at Meta, Movie Gen delivers state-of-the-art results across a range of capabilities. We’re excited for the potential of this line of research to usher in…

Huck Yang Reposted

Huck Yang

@huckiyang

2 Oct

僕はよくアメリカの大学システムを長良川の鵜飼に例えるんですが、いらすとやに鵜飼の絵があってビックリ。ちなみに教員（僕）は鵜飼じゃなくて鵜の方です。

Huck Yang

@huckiyang

2 Oct

Water tastes .. sweet again after @iclr_conf deadline .. 😵

Huck Yang Reposted

Huck Yang

@huckiyang

1 Oct

My answer: Alec Radford. There were good suggestions below, but to my mind @AlecRad is clearly the person with the largest influence, yet the least recognition. He's been the driver so many amazing developments and should be in the history books as a (the?) father of modern…

Jeff Clune

@jeffclune

27 Sep

Who's the most important, yet most under-recognized AI scientist in the world? In my view, there's only one clear, right answer. One person behind most of the major advances, yet rarely mentioned or celebrated. Who do you think I mean? I'll reveal my answer after hearing your…