Huck Yang

@huckiyang

Sr. Research Scientist @NVIDIAAI Generative Voice Correction | Ph.D. MSc @GeorgiaTech | Past: @GoogleAI @AmazonScience | 🗣️ education

Huck Yang Reposted

Excited to share our new preprint, "CLaSP: Learning Concepts for Time-Series Signals from Natural Language Supervision"! Our model enables query-by-text/signal retrieval and zero-shot classification for time series data. Check it out: arxiv.org/abs/2411.08397


Huck Yang Reposted

🚨 Adaptive Decoding via Latent Preference Optimization 🚨 - New layer added to Transformer, selects decoding params automatically *per token* - Learnt via new method, Latent Preference Optimization - Outperforms any fixed temperature decoding method, choosing creativity or…


Huck Yang Reposted

60% of the pull requests (and probably more than 60% of the code) for our github issue resolver were written by coding agents in November, and the number keeps going up. Amazing times.

📈So far, in November, OpenHands authored or co-authored ~60% of commits to the openhands-resolver github.com/All-Hands-AI/o… repo and ~20% of the main repo github.com/All-Hands-AI/O…



🚀 FastAdaSP, on speech-text token merging for SoTA SLMs, will be presented in the oral session today at @emnlpmeeting by Eason and Jiaqi from @WavLab @shinjiw_at_cmu @LTIatCMU @NVIDIAAI ⏰Time: 10:50 AM - 11:00 AM, Wed, Nov 13, 2024 📍Location: Tuttle, Lower Terrace Level,…


Huck Yang Reposted

☄️Excited to share our #SLT2024 paper "Mamba-based Decoder-Only Approach with Bidirectional Speech Modeling for Speech Recognition"! Code: github.com/YoshikiMas/mad… arXiv: arxiv.org/abs/2411.06968 Mamba and Mamba-2 perform pretty well even completely without explicit attention😎


The "LLM agents for Acoustics and Continuous signals" gather session at @emnlpmeeting 2024, co-hosted with @PiotrZelasko and @_dmchan, will be held on Tuesday, Nov 12th, in Room Foster, Level 2. Feel free to join us; it is open to all! 🗣️ - From Continuous Signals Modeling to Audio…


Huck Yang Reposted

What makes an agent a "conversational" agent? Introducing ReSpAct, reason, speak, act for building conversational agents. LLM based agent decides on when to interact with the user (or user agent) when it is needed, such as disambiguation, clarification, or simply stuck at an…

Ever felt that agents that “just follow orders” often fall short, especially when critical details are on the line? In our latest work, we introduce ReSpAct—an approach that transforms agents into true conversational partners🌐 vardhandongre.github.io/respact-llm/ 🧵 [1/n]



Huck Yang Reposted

It's a beautiful demonstration of the powers of GPT-4o, and also its limitations. GPT-4o gives you a salad of claims from the RL literature, some written by over-hyped authors, at the end of which you are not sure if RL can really reason counterfactually (ie, at level 3), or…

Why does RL lead to causal understanding? 🧵🪡 GPT-4o: Reinforcement learning (RL) can lead to causal understanding because, by interacting with an environment, an agent learns not just correlations between actions and outcomes, but also the underlying cause-effect…



Huck Yang Reposted

👀 Exciting advancements in Automated Audio Captioning.✨ The CMU-NVIDIA team has unveiled a groundbreaking approach that leverages multi-agent collaboration and GPU technology to enhance audio-to-text systems. 🔍 Key innovations include: ✅ Multi-encoder fusion: Combining…


Time series with descriptive texts! Very nice work by @yohekawag and @Hitachi

Happy to announce our latest preprint: "Domain-Independent Text Generation for Time-Series Data"! Our method can generate descriptive texts from sensor-captured time-series signals, regardless of the domain. Check it out here: arxiv.org/abs/2409.16647



Huck Yang Reposted

Congratulations to @geoffreyhinton for winning the Nobel Prize in physics!!


Huck Yang Reposted

👏👏Check out the 2nd best-performing system in our Generative Error Correction Challenge for LLM-based emotion recognition (yet unreasonably rejected by SLT2024). They achieved 75.1% accuracy on ASR transcriptions of IEMOCAP using GPT-4o.

"Context and System Fusion in Post-ASR Emotion Recognition with Large Language Models," Pavel Stepachev, Pinzhen Chen, Barry Haddow, ift.tt/tCG7YmT



Huck Yang Reposted

Now HEAR this (not just watch) - We've got audio covered for generated videos 🔊 Introducing Movie Gen Audio, which adds 48kHz synced SFX and aligned music to amazing videos from Movie Gen Video (and other sources!) Super honored to work with this amazing team! More to come🔥🔥

🎥 Today we’re premiering Meta Movie Gen: the most advanced media foundation models to-date. Developed by AI research teams at Meta, Movie Gen delivers state-of-the-art results across a range of capabilities. We’re excited for the potential of this line of research to usher in…



Huck Yang Reposted

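I often compare the American university system to cormorant fishing (ukai) on the Nagara River, so I was surprised to find that Irasutoya has an illustration of cormorant fishing. By the way, the faculty member (me) is the cormorant, not the fisherman.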


Water tastes .. sweet again after @iclr_conf deadline .. 😵


Huck Yang Reposted

My answer: Alec Radford. There were good suggestions below, but to my mind @AlecRad is clearly the person with the largest influence, yet the least recognition. He's been the driver of so many amazing developments and should be in the history books as a (the?) father of modern…

Who's the most important, yet most under-recognized AI scientist in the world? In my view, there's only one clear, right answer. One person behind most of the major advances, yet rarely mentioned or celebrated. Who do you think I mean? I'll reveal my answer after hearing your…


