
Thomas Scialom

@ThomasScialom

AGI Researcher @MetaAI -- I led Llama 2 and post-training for Llama 3. Also CodeLlama, Galactica, Toolformer, BLOOM, Nougat, GAIA, ..

Similar Users

Sasha Rush (@srush_nlp)
Sebastian Ruder (@seb_ruder)
AllenNLP (@ai2_allennlp)
Chelsea Finn (@chelseabfinn)
EdinburghNLP (@EdinburghNLP)
Julien Chaumond (@julien_c)
Russ Salakhutdinov (@rsalakhu)
UW NLP (@uwnlp)
Lysandre (@LysandreJik)
Mila - Institut québécois d'IA (@Mila_Quebec)
Thomas Wolf (@Thom_Wolf)
Durk Kingma (@dpkingma)
Kyunghyun Cho (@kchonyc)
Behnam Neyshabur (@bneyshabur)
Dustin Tran (@dustinvtran)

Thomas Scialom Reposted

All languages convey information at a similar rate when spoken (~39 bits/s). Languages that are spoken faster have lower information density per syllable! One of the coolest results in linguistics.

Tweet Image 1
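The implicit formula is simple: information rate = bits per syllable x syllables per second. A tiny sketch with made-up numbers (not the study's actual measurements) shows how a fast, low-density language and a slow, high-density one land at the same rate:

```python
# The implicit formula: bits/s = (bits per syllable) x (syllables per second).
# Numbers below are made up for illustration, not the study's measurements.
languages = {
    "fast, low-density": {"syll_per_s": 7.8, "bits_per_syll": 5.0},
    "slow, high-density": {"syll_per_s": 5.2, "bits_per_syll": 7.5},
}

for name, s in languages.items():
    rate = s["syll_per_s"] * s["bits_per_syll"]
    print(f"{name}: {rate:.0f} bits/s")  # both land on ~39 bits/s
```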

Thomas Scialom Reposted

Are we failing to grasp how big Internet-scale data is/how far interpolation on it goes? Are we underappreciating how fast GPUs are or how good backprop is? Are we overestimating the difference between the stuff we do vs what animals do + they’re similar in some deep sense? Etc.


Thomas Scialom Reposted

Why would someone pick anything but Meta? I'm surprised - I thought any researcher would want to work somewhere they can do real scientific research, rather than on closed commercial product work.


Thomas Scialom Reposted

🆕 pod with @ThomasScialom of @AIatMeta! Llama 2, 3 & 4: Synthetic Data, RLHF, Agents on the path to Open Source AGI latent.space/p/llama-3 shoutouts: - Why @ylecun's Galactica Instruct would have solved @giffmana's Citations Generator - Beyond Chinchilla-Optimal: 100x…


The team worked really hard to make history; voilà, finally, the Llama 3.1 herd of models... have fun with it!
* open 405B, insane 70B
* 128K context length, improved reasoning & coding capabilities
* detailed paper ai.meta.com/research/publi…

Tweet Image 1
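For anyone who wants to try it, here is a minimal sketch using Hugging Face transformers; it assumes you have been granted access to the gated meta-llama checkpoints on the Hub, and uses the 8B variant since 405B/70B need multi-GPU setups:

```python
# Minimal sketch for trying Llama 3.1 with Hugging Face transformers.
# Assumes access to the gated meta-llama checkpoints on the Hub; the 8B
# variant stands in here, since 405B/70B need multi-GPU setups.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Meta-Llama-3.1-8B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [{"role": "user", "content": "Say hello in French."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
outputs = model.generate(inputs, max_new_tokens=50)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```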

RLHF versus imitation learning explained in one tweet
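The one-tweet version, as a toy sketch (not Meta's training code; all tensors are random stand-ins): imitation learning maximizes the likelihood of human demonstrations, so it is capped by their quality, while RLHF scores the model's own samples with a learned reward and can climb past them.

```python
# Toy sketch (not Meta's training code); tensors are random stand-ins.
import torch
import torch.nn.functional as F

vocab, seq = 50, 8
logits = torch.randn(seq, vocab, requires_grad=True)  # policy logits, one sequence
human_tokens = torch.randint(0, vocab, (seq,))        # a human-written demonstration

# Imitation learning (SFT): maximize likelihood of the human demonstration.
# The model can at best match the demonstrator.
sft_loss = F.cross_entropy(logits, human_tokens)

# RLHF (REINFORCE-style simplification): sample from the policy itself, score
# the sample with a learned reward model, and reinforce high-reward samples.
# The ceiling is set by the reward model, not by human demonstrations.
probs = F.softmax(logits, dim=-1)
sampled = torch.multinomial(probs, 1).squeeze(-1)     # the model's own tokens
log_p = F.log_softmax(logits, dim=-1)[torch.arange(seq), sampled]
reward = torch.randn(())                              # stand-in for a reward-model score
rlhf_loss = -(reward * log_p.sum())

print(float(sft_loss), float(rlhf_loss))
```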

Empathy and quality of answers to common medical questions on Reddit: doctors vs. GPT-3.5. jamanetwork.com/journals/jamai…

Tweet Image 1


I am at ICLR!
🦙 Llama-3: I'll be at the @AIatMeta booth every morning at 11am for Llama-3 Q&A sessions
🤖 GAIA: General AI Assistant benchmark w/ Gregoire
🔭 NOUGAT: for scientific OCR w/ Lukas
And if you are interested in post-training, RLHF, or agents, I'm down for ☕&🍺 @iclr_conf

We're in Vienna for #ICLR2024, stop by our booth to chat with our team or learn more about our latest research this week. 📍Booth A15 This year, teams from Meta are sharing 25+ publications and two workshops. Here are a few booth highlights to add to your agenda this week 🧵



We had a small party to celebrate Llama-3 yesterday in Paris! The entire LLM OSS community joined us, with @huggingface, @kyutai_labs, @GoogleDeepMind (Gemma), and @cohere. As someone said: better that the building remains safe, or it's ciao to open source AI 😆

Tweet Image 1

Don't fall into the Chinchilla trap if you want your model to be used by billions of people :)

Llama 3 8B is trained on almost 100 times the Chinchilla-optimal number of tokens

Tweet Image 1
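The back-of-the-envelope math, taking Chinchilla's rule of thumb of roughly 20 training tokens per parameter and the ~15T training tokens Meta reported for Llama 3:

```python
# Back-of-the-envelope check of the "almost 100x" claim.
# Assumptions: Chinchilla's rule of thumb of ~20 training tokens per
# parameter, and the ~15T training tokens Meta reported for Llama 3.
params = 8e9                                   # Llama 3 8B
chinchilla_optimal = 20 * params               # ~160B tokens
llama3_tokens = 15e12                          # ~15T tokens

print(llama3_tokens / chinchilla_optimal)      # ~93.75, i.e. almost 100x
```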


Delighted to finally introduce Llama 3: the most capable openly available LLM to date. It's been a long journey since Llama 2; a big shoutout to the incredible team effort that made this possible. And stay tuned, we will keep building 🦙 ai.meta.com/blog/meta-llam…

Tweet Image 1

Yes, we will continue to make sure AI remains an open source technology.

If you have questions about why Meta open-sources its AI, here's a clear answer in Meta's earnings call today from @finkd

Tweet Image 1


Despite being an amazing paper, Chinchilla was not (and could not be) open-sourced. Llama-1 now has more than 10x the citations of Chinchilla.

I suddenly realized the Chinchilla paper has only 200 citations… That's a lot for a paper released 18 months ago, but it's really, really tooooooo low for such a work of art. To some extent, it reflects the decline of published pretraining research. Getting citations in this…



Thomas Scialom Reposted

GAIA: a benchmark for General AI Assistants. Paper page: huggingface.co/papers/2311.12… GAIA is a benchmark for General AI Assistants that, if solved, would represent a milestone in AI research. It proposes real-world questions that require a set of fundamental abilities such…

Tweet Image 1
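GAIA's answers are short strings graded by quasi-exact match; a simplified sketch of that style of scoring (the official normalization handles numbers, lists, and units in more detail, see the paper):

```python
# Simplified sketch of GAIA-style scoring: answers are short strings graded
# by quasi-exact match after light normalization. The official procedure
# handles numbers, lists, and units in more detail -- see the paper.
def normalize(ans: str) -> str:
    return " ".join(ans.strip().lower().replace(",", "").split())

def quasi_exact_match(prediction: str, gold: str) -> bool:
    return normalize(prediction) == normalize(gold)

assert quasi_exact_match("  1,234 ", "1234")
assert not quasi_exact_match("Paris, France", "Paris")
```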

At AI-Pulse today I talked about -- surprise -- LLMs: their short history, a deep dive into Llama 2, the magic behind RLHF, and my vision of the future of the field. Thanks @Scaleway for the opportunity!

Tweet Image 1
Tweet Image 2

I strongly disagree. There are many paths to success, and doing a PhD is never a suboptimal choice. Both professionally and personally.

Agreed. There's so many opportunities in AI now. It's a pretty suboptimal career choice to do a PhD at the moment. Also, many outstanding AI researchers and hard carry engineers that I know of don't have an AI or CS PhD.



It did, in fact. RLHF is the technology behind ChatGPT and probably DALL·E 3. To pan out on real-world problems, it needed nothing more than human-feedback rewards.

DeepMind’s big bet was deep reinforcement learning, but it hasn’t panned out on any real-world problems.



In fact, the Perplexity demo has a specific system prompt that amplifies over-safe responses. It has been removed from other demos like the one on HF. @perplexity_ai @denisyarats, could we deactivate it by default as well, please?

The Mistral 7B model (right) is clearly more "helpful" than the Llama 70B chat model (left). Biasing too much toward harmlessness doesn't let you build a useful general chat assistant.

Tweet Image 1
Tweet Image 2
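Such a system prompt is just a hidden first message prepended to the conversation, not a different model. A hypothetical illustration (the demo's actual prompt text isn't shown here):

```python
# Hypothetical illustration (the demo's real prompt text is not public here):
# the "extra safety" in a chat demo is usually just a hidden system message
# prepended to the conversation, not a different model.
OVERLY_SAFE_SYSTEM = (
    "You are a helpful assistant. Always prioritize safety; if a request "
    "could be even slightly sensitive, refuse and explain why."
)

def build_conversation(user_msg, system_prompt=None):
    """Assemble a chat in the common role/content message format."""
    messages = []
    if system_prompt is not None:
        messages.append({"role": "system", "content": system_prompt})
    messages.append({"role": "user", "content": user_msg})
    return messages

# With the hidden prompt the model tends to over-refuse; dropping it (or
# making it user-visible and optional) restores helpfulness.
print(build_conversation("How do I kill a Python process?", OVERLY_SAFE_SYSTEM))
print(build_conversation("How do I kill a Python process?"))
```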


Thomas Scialom Reposted

AI systems are fast becoming a basic infrastructure. Historically, basic infrastructure always ends up being open source (think of the software infra of the internet, including Linux, Apache, JavaScript and browser engines, etc) It's the only way to make it reliable, secure, and…

Based @ylecun "AI is going to become a common platform... it needs to be open source if you want it to be a platform on top of which a whole ecosystem can be built And the reason why we need to work in that mode is that this is the best way to make progress as fast as we can"


