Haziq @hazytalks Twitter Profile

Haziq

@hazytalks

Optimizing one Model at a time. #ML

5KPosts 3KFollowers 445Following

Similar User

@miniapeur

@yandexcom

@sedielem

@BartStuck

@NativeA29927707

@BerulavaBadri

@usslibertyvets

@ZimmerMar68

@nactateachers

@guneetsk99

@faiqanbreen

@Shu_cheetah

@lubingzhiguo_

@jojo_mama922

Pinned

Haziq

@hazytalks

15 Aug 2022

The paradox with evolutionary systems is that legacy learning is hard to update fast, which is both a good thing and a bad. Bad that it can't improvise to updated data fast , good that a small unintended change won't significantly change the outcomes...

Haziq Reposted

Andrej Karpathy

@karpathy

7 Aug

# RLHF is just barely RL Reinforcement Learning from Human Feedback (RLHF) is the third (and last) major stage of training an LLM, after pretraining and supervised finetuning (SFT). My rant on RLHF is that it is just barely RL, in a way that I think is not too widely…

Haziq Reposted

Andrej Karpathy

@karpathy

23 Jul

Huge congrats to @AIatMeta on the Llama 3.1 release! Few notes: Today, with the 405B model release, is the first time that a frontier-capability LLM is available to everyone to work with and build on. The model appears to be GPT-4 / Claude 3.5 Sonnet grade and the weights are…

Haziq Reposted

Andrej Karpathy

@karpathy

20 Feb

New (2h13m 😅) lecture: "Let's build the GPT Tokenizer" Tokenizers are a completely separate stage of the LLM pipeline: they have their own training set, training algorithm (Byte Pair Encoding), and after training implement two functions: encode() from strings to tokens, and…

Haziq Reposted

Hamid Naderi Yeganeh

@naderi_yeganeh

4 Feb

I drew this morpho butterfly with mathematical equations.

Haziq Reposted

Google DeepMind

@GoogleDeepMind

17 Jan

Introducing AlphaGeometry: an AI system that solves Olympiad geometry problems at a level approaching a human gold-medalist. 📐 It was trained solely on synthetic data and marks a breakthrough for AI in mathematical reasoning. 🧵 dpmd.ai/alphageometry

Haziq Reposted

Andrej Karpathy

@karpathy

9 Dec

# On the "hallucination problem" I always struggle a bit with I'm asked about the "hallucination problem" in LLMs. Because, in some sense, hallucination is all LLMs do. They are dream machines. We direct their dreams with prompts. The prompts start the dream, and based on the…

Haziq Reposted

Linus Ekenstam

@LinusEkenstam

6 Dec

Google (DeepMind) releases AI model Gemini. There is no turning back now, we are in for one mad ride. The multi modality, and fluidity of the model is super clean. My jaw dropped at 4:24 seconds A thread...

Haziq Reposted

Jim Fan

@DrJimFan

24 Nov

In my decade spent on AI, I've never seen an algorithm that so many people fantasize about. Just from a name, no paper, no stats, no product. So let's reverse engineer the Q* fantasy. VERY LONG READ: To understand the powerful marriage between Search and Learning, we need to go…

Haziq Reposted

Nathan Lands — Lore.com

@NathanLands

11 Nov 2023

Runway's new update is producing incredible AI videos. It's a significant leap forward. As someone who's worked with a famous Hollywood producer and dreamed of creating films, I find this so exciting. We're witnessing the birth of a new era in film. Here are the best examples:

Haziq Reposted

Greg Brockman

@gdb

7 Nov 2023

OpenAI (vision & voice APIs) for sports narration:

Gonzalo Espinoza Graham 🏴‍☠️

@geepytee

7 Nov 2023

GPT-4V + TTS = AI Sports narrator 🪄⚽️ Passed every frame of a football video to gpt-4-vision-preview, and with some simple prompting asked to generate a narration No edits, this is as it came out from the model (aka can be SO MUCH BETTER)

Haziq Reposted

François Chollet

@fchollet

6 Nov 2023

Turns out, fitting a curve to a dataset produces a model that only generalizes to that specific data distribution -- big if true.

anton

@abacaj

5 Nov 2023

New paper by Google provides evidence that transformers (GPT, etc) cannot generalize beyond their training data

Haziq Reposted

Sam Altman

@sama

25 Oct 2023

i expect ai to be capable of superhuman persuasion well before it is superhuman at general intelligence, which may lead to some very strange outcomes

Haziq

@hazytalks

22 Oct 2023

Bias and Variance takes little time to learn but lifetime to master, and at the end it will be at most somewhat an optimized outcome. Applies pretty much to everything in life.

Haziq

@hazytalks

21 Oct 2023

Will be quite useful for case based algorithm development and selection and training. Somewhat like more sophisticated and generalized version of cross validation at a scale for new models, put in rustic terms. Super Amazing

NVIDIA AI Developer

@NVIDIAAIDev

20 Oct 2023

🎉Just released: Eureka!, a new AI agent that uses LLMs to automatically generate algorithms to train robots to accomplish complex tasks. 👀 The #NVIDIAResearch paper includes the AI algorithms and how to experiment with Eureka using NVIDIA Isaac Gym. 👇 nvda.ws/3PWKlk4

Haziq

@hazytalks

15 Oct 2023

Some Beautiful Abstractions!! #DALLE3

Haziq Reposted

Zhengzhong Tu

@_vztu

13 Oct 2023

Computer vision has been solved.

Haziq Reposted

Borriss

@_Borriss_

13 Oct 2023

More and more Plus users are getting the new ChatGPT Vision.. People are posting impressive use cases. 9 really good ones:

Haziq Reposted

Jonathon Luiten

@JonathonLuiten

18 Aug 2023

Dynamic 3D Gaussians: Tracking by Persistent Dynamic View Synthesis dynamic3dgaussians.github.io We model the world as a set of 3D Gaussians that move & rotate over time. This extends Gaussian Splatting to dynamic scenes, with accurate novel-view synthesis and dense 3D trajectories.

Haziq Reposted

Borriss

@_Borriss_

12 Oct 2023

Around 31 hours since Adobe Max dropped the new Firefly 2 models... People are WOW-ing at the real photorealism! 10 examples: (How are these not real photos?)

Haziq Reposted

Jim Fan

@DrJimFan

11 Oct 2023

This is such an interesting work. Video diffusion model is being used as a data-driven physics simulation, in which an agent can plan, explore, and learn optimal actions without touching robot hardware or causing harm. LLM is not only an OS, but also a full reality simulator.

Sherry Yang

@mengjiao_yang

11 Oct 2023

Introducing Universal Simulator (UniSim), an interactive simulator of the real world. Interactive website: universal-simulator.github.io Paper: arxiv.org/abs/2310.06114