Satyapriya Krishna @SatyaScribbles Twitter Profile

Satyapriya Krishna

@SatyaScribbles

Explorer. @ai4life_harvard @hseas @googleAI @MetaAI @SCSatCMU @AmazonScience @ml_collective @D3Harvard @HarvardAISafety

343 Posts · 566 Followers · 290 Following

Similar Users

@tanyaagoyal

@trustworthy_ml

@hima_lakkaraju

@xiangrenNLP

@mbodhisattwa

@NinarehMehrabi

@hanjie_chen

@XAI_Research

@abeirami

@wangchunshu

@byryuer

@ZEYULIU10

@peterbhase

@DongkuanXu

@NickKroeger1

Pinned

Satyapriya Krishna

@SatyaScribbles

22 Oct

🚀 Excited to share the research I worked on during my summer internship at @GoogleAI! We developed FRAMES (Factuality, Retrieval, And reasoning MEasurement Set), a challenging high-quality benchmark for evaluating retrieval-augmented large language models. FRAMES tests LLMs on…

Satyapriya Krishna Reposted

Russ Salakhutdinov

@rsalakhu

13 Nov

My talk on AI Agents is online: youtube.com/watch?v=wK0TpI…

Kempner Institute at Harvard University

@KempnerInst

12 Nov

The latest Kempner Seminar Series talk: @rsalakhu of @CarnegieMellon discusses the opportunities and challenges of using LLMs to drive autonomous agents that can navigate the internet… and also the real world! Check it out here: youtu.be/wK0TpI3gu28?si…

Satyapriya Krishna

@SatyaScribbles

12 Nov

Awesome stuff!

Tilde

@tilderesearch

12 Nov

We're thrilled to be launching Tilde. We're applying interpretability to unlock deep reasoning and control of models, enabling the next generation of human-AI interaction. By understanding a model's inner mechanisms, we can enhance both its reliability and performance—going…

Satyapriya Krishna Reposted

Kaiwen Wang

@kaiwenw_ai

9 Nov

Steerability is the next frontier of generative models! Having knobs that control the behavior of AI systems will greatly improve their safety & usability. I’m very excited to present ✨Conditional Language Policy (CLP)✨, a multi-objective RL framework for steering language…

Satyapriya Krishna Reposted

AK

@_akhaliq

8 Nov

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Satyapriya Krishna Reposted

Rohan Paul

@rohanpaul_ai

8 Nov

A good course on Introduction to Foundation Models

Satyapriya Krishna Reposted

kalomaze

@kalomaze

7 Nov

arxiv.org/abs/2410.21228…

Satyapriya Krishna Reposted

Sean Welleck

@wellecks

25 Sep

Slides for my recent talk on: "Reasoning with inference-time compute" wellecks.com/data/welleck20… Papers: - Lean-STaR: arxiv.org/abs/2407.10040 - Easy-to-hard: arxiv.org/abs/2403.09472 - Compute-optimal inference: arxiv.org/abs/2408.00724 - Meta-generation: arxiv.org/abs/2406.16838

Satyapriya Krishna Reposted

Apoorv Khandelwal

@apoorvkh

31 Oct

Wondering how long it takes to train a 1B-param LM from scratch on your GPUs? 🧵 See our paper to learn about the current state of academic compute and how to efficiently train models! Use our code to test your own models/GPUs! arxiv.org/abs/2410.23261 github.com/apoorvkh/acade…

GitHub - apoorvkh/academic-pretraining: $100K or 100 Days: Trade-offs when Pre-Training with...

Satyapriya Krishna

@SatyaScribbles

29 Oct

Very cool work! Interesting use of distillation.

Jing-Jing Li

@drjingjing2026

29 Oct

🚨 New preprint from my internship @allen_ai! Introducing SafetyAnalyst, an LLM content moderation framework that 📌 builds structured “harm-benefit trees” given a prompt 📌 weights harms against benefits 📌 delivers interpretable, transparent, and steerable safety decisions

Satyapriya Krishna Reposted

Sam Altman

@sama

27 Oct

the best way to get good at something is usually to just practice actually doing the thing in question. a lot of very capable people outsmart themselves with complex plans that involve working a lot on fake prerequisites.

Satyapriya Krishna Reposted

Archit Sharma

@archit_sharma97

28 Oct

I don't have a paper to write this in but there is an interesting property when thinking about iterative RL(HF) algorithms. It seems natural to use an improved policy to sample new data online when training LLMs -- turns out that this just lowers the weight on the KL constraint!

Satyapriya Krishna Reposted

Sara Hooker

@sarahookr

24 Oct

Extremely proud to share ✨ Aya Expanse ✨ We are a small lab, and this builds on years of dedicated research to connect the world with language. 🌎🌍🌏 A huge shoutout to the @CohereForAI @cohere team and our wider community for breakthroughs that ensure the world is seen.

Cohere For AI

@CohereForAI

24 Oct

Introducing ✨Aya Expanse ✨ – an open-weights state-of-art family of models to help close the language gap with AI. Aya Expanse is both global and local. Driven by a multi-year commitment to multilingual research. cohere.com/research/aya

Satyapriya Krishna Reposted

Kalpesh Krishna

@kalpeshk2011

22 Oct

Are you looking for a challenging new QA benchmark to test retrieval augmented LLMs, and their ability to perform reasoning on the internet? Check out the FRAMES benchmark, an exciting new dataset developed by our intern @SatyaScribbles at @GoogleAI! 👇 arxiv.org/abs/2409.12941

Satyapriya Krishna Reposted

Allen Nie (🇺🇦☮️)

@allen_a_nie

10 Oct

LLMs are in-context RL learners, but not great because they can’t explore well. How do we teach LLMs to explore better? 🤔 🔮 Solution: Supervised fine-tuning on full exploration trajectories. Preprint with GDM: arxiv.org/abs/2410.06238 🧵