Josh Tobin @josh_tobin_ Twitter Profile

Josh Tobin

@josh_tobin_

ML-powered products @gantry_ml @full_stack_dl. Previously @Berkeley_EECS PhD and @openai

Joined July 2011

881 Posts, 11K Followers, 1K Following

Pinned

Josh Tobin

@josh_tobin_

7 Jun 2022

As easy as it's become to train ML models, it's still way too hard to get them to work well in real products with real users. We raised a Series A @gantry_ml to solve this problem. I wrote about the raise and where we're going with the product: gantry.io/blog/introduci…

Josh Tobin Reposted

Pieter Abbeel

@pabbeel

11 Mar

Yes, 2024 is shaping up a big year for robotics! Introducing @CovariantAI's RFM-1, which just like Sora can generate video, but RFM-1 does it for robotic interaction with the world. But there is so much more it can do. RFM-1 is a multimodal any-to-any sequence model. RFM-1…

Josh Tobin Reposted

Charles 🎉 Frye

@charles_irl

22 Dec

d.erenrich.net/are-you-smarte… Great way to try out MMLU and get a sense for just what, exactly, we are using to evaluate LLMs! I doubt folks would knife fight for a percent on this benchmark if its contents were realized more broadly.

Josh Tobin

@josh_tobin_

6 Dec

What an incredible privilege it is to be working on AI in 2023 twitter.com/i/status/17324…

Ralph Brooks AI Artisan

@ralphbrooks

6 Dec

It looks like Gemini has advanced capabilities in responding to multimodal input.

Josh Tobin

@josh_tobin_

19 Nov 2023

Lessons from the last 24H (as an outsider):
- when things get hard, you see immediately who was rooting for you to fail
- incentives matter
- come at the king you best not miss

Josh Tobin

@josh_tobin_

18 Nov 2023

to my openai friends -- can't imagine how tough today must be. I'm here if you need anything.

Josh Tobin

@josh_tobin_

8 Nov 2023

There are a lot of good reasons to prefer open-source LLMs. "We need to own the IP" isn't one. It's like saying "We need to own the data centers". LLM IP is not where the value lies for most businesses. Data is.

Josh Tobin

@josh_tobin_

26 Oct 2023

This is more common than you'd think. Early on at @OpenAI, I told @pabbeel and @woj_zaremba that I was going to spend a few weeks on domain randomization because it felt like the right baseline for domain adaptation. Turns out it worked way better. arxiv.org/abs/1703.06907

Josh Tobin Reposted

The Full Stack

@full_stack_dl

29 Sep 2023

🥞🦜 New LLM Bootcamp Announcement 🦜🥞 In 2023, the AI world speedran through models, architectures (e.g., RAG), and frameworks (e.g., @LangChainAI). After a year of hype, what's *actually* working? This November, we'll show you, in our latest class on building prod LLM apps

Josh Tobin

@josh_tobin_

20 Sep 2023

I'm excited to teach this edition of @full_stack_dl at @ScaleByTheBay in November! Join us to learn about building LLM applications the right way -- systematically, with users in mind, and ready for production.

Scale by the Bay

@ScaleByTheBay

19 Sep 2023

📣 Attention! Clear your schedules for November 13th! 🗓️ 🥞 We're thrilled to host #bythebay a Workshop with @josh_tobin_, CEO of @gantry_ml & co-creator of @full_stack_dl A must-attend for anyone interested in #AI & #LLM🦜 👉 Secure your spot today scale.bythebay.io/register 👇

Josh Tobin

@josh_tobin_

16 Sep 2023

deep learning in a nutshell:
- If you suspect you have a bug, you do
- If you don't think you have a bug, you still probably do
- If you know you don't have a bug, you still might

Josh Tobin

@josh_tobin_

21 Jul 2023

Evaluation is a key challenge for LLM builders these days. I had a great time talking about it at the @mlopscommunity LLMs in Production Conference. Check it out here: home.mlops.community/public/collect…

LLMs in Production Conference Part II | MLOps Community

Source: https://t.co/tQ9Jqc8Edp

Josh Tobin

@josh_tobin_

13 Jul 2023

Same phenomenon is playing out for teams building LLM-powered product features. You launch the feature and see insane retention numbers. But users lose a bit of trust with each bad interaction. Eventually leads to a delayed churn.

Peter Welinder

@npew

13 Jul 2023

No, we haven't made GPT-4 dumber. Quite the opposite: we make each new version smarter than the previous one. Current hypothesis: When you use it more heavily, you start noticing issues you didn't see before.

Josh Tobin

@josh_tobin_

28 Jun 2023

I’m giving a talk on evaluating LLM based applications at the @databricks @Data_AI_Summit at 1:30, come stop by if you are around! databricks.com/dataaisummit/s…