🚨Instead of collecting costly datasets for tabular prediction, could we use natural language instructions?💡 In our paper "TABLET: Learning From Instructions For Tabular Data" with @sameer_ we evaluate how close we are to this goal and the key limitations of current LLMs
I am officially on the job market for industry research positions focused on agentic LLMs and multi-turn reasoning! I'll be at EMNLP next week and NeurIPS next month. Message me if you'd like to chat about jobs or LLM agent research. #EMNLP2024 #neurips2024 Personal links in🧵
Please retweet: I am recruiting PhD students at Berkeley! Please apply to @Berkeley_EECS or @UCJointCPH if you are interested in ML applied to health, inequality, or social science, and mention my name in your app. More details on work/how to apply: cs.cornell.edu/~emmapierson/
Reasoning at length will be a key part of LLMs solving more challenging problems, but how can we make sure that their chain of thought stays on track? At @scale_AI, we’ve developed a method to learn token-wise expected rewards from pairwise preference labels 🧵
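The tweet doesn't spell out the training objective, but a minimal sketch of one standard way to learn token-wise rewards from pairwise preference labels is below: a linear reward head over a frozen LM's hidden states, trained with a Bradley-Terry loss so the preferred response outscores the rejected one. The class names, mean-pooling aggregation, and overall setup here are illustrative assumptions, not Scale's implementation.

```python
# Sketch (not Scale's method): a per-token reward head trained from
# pairwise preference labels via a Bradley-Terry objective.
import torch
import torch.nn as nn

class TokenRewardHead(nn.Module):
    """Maps each hidden state of a frozen LM to a scalar token-wise reward."""
    def __init__(self, hidden_dim: int):
        super().__init__()
        self.proj = nn.Linear(hidden_dim, 1)

    def forward(self, hidden_states: torch.Tensor) -> torch.Tensor:
        # hidden_states: (batch, seq_len, hidden_dim) -> (batch, seq_len)
        return self.proj(hidden_states).squeeze(-1)

def preference_loss(rewards_chosen: torch.Tensor,
                    rewards_rejected: torch.Tensor) -> torch.Tensor:
    # Score each response by its mean token-wise reward (an assumed
    # aggregation), then apply the Bradley-Terry loss: the chosen
    # response should outscore the rejected one.
    score_c = rewards_chosen.mean(dim=-1)
    score_r = rewards_rejected.mean(dim=-1)
    return -torch.nn.functional.logsigmoid(score_c - score_r).mean()
```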
Are you interested in LLM hallucinations in multi-document summarization? We find that: LLMs hallucinate a lot, often towards the end of summaries; errors arise from ignored instructions or offering generic insights; post-hoc mitigation methods are not very effective. 👇
📣 Excited to share our latest preprint, where we investigate LLM hallucinations in multi-document summarization tasks. We reveal systematic hallucinatory behaviors across 5 popular LLMs. Check out the paper at arxiv.org/abs/2410.13961 We'd love to hear your feedback! 😄
Alignment is necessary for LLMs, but do we need to train aligned versions for all model sizes in every model family? 🧐 We introduce 🚀Nudging, a training-free approach that aligns any base model by injecting a few nudging tokens at inference time. 🌐fywalter.github.io/nudging/…
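A hedged sketch of what inference-time nudging could look like, assuming a large base model and a small aligned model that share a tokenizer (HF-style causal LMs): when the base model is uncertain about its next token, the aligned model supplies it. The uncertainty trigger and threshold below are assumptions for illustration; the paper's exact criterion may differ.

```python
# Illustrative sketch of inference-time "nudging" (assumed trigger rule,
# not necessarily the paper's): inject a token from a small aligned model
# whenever the base model's top-token probability is low.
import torch

@torch.no_grad()
def nudged_decode(base_model, nudge_model, input_ids,
                  max_new_tokens=128, uncertainty_threshold=0.4):
    # Assumes batch size 1 and a shared tokenizer between the two models.
    for _ in range(max_new_tokens):
        base_logits = base_model(input_ids).logits[:, -1, :]
        base_probs = torch.softmax(base_logits, dim=-1)
        top_prob, top_token = base_probs.max(dim=-1, keepdim=True)
        if top_prob.item() < uncertainty_threshold:
            # Base model is unsure: take the next token from the aligned model.
            nudge_logits = nudge_model(input_ids).logits[:, -1, :]
            top_token = nudge_logits.argmax(dim=-1, keepdim=True)
        input_ids = torch.cat([input_ids, top_token], dim=-1)
    return input_ids
```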
LLMs are often evaluated against single-turn automated attacks. This is an insufficient threat model for real-world malicious use, where malicious humans chat with LLMs over multiple turns. We show that LLM defenses are much less robust than the reported numbers suggest.
Can robust LLM defenses be jailbroken by humans? We show that Scale Red teamers successfully break defenses on 70+% of harmful behaviors, while most automated adversarial attacks yield single-digit success rates. 🧵
Can long-context language models (LCLMs) subsume retrieval, RAG, SQL, and more? Introducing LOFT: a benchmark stress-testing LCLMs on million-token tasks like retrieval, RAG, and SQL. Surprisingly, LCLMs rival specialized models trained for these tasks! arxiv.org/abs/2406.13121
sqrt(2)/2 = sqrt(4/4) = sqrt(1/1)... goes to show we need better grounding / eval for generated rationales and explanations
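For contrast, the correct simplification moves only the denominator under the radical (the denominator 2 is sqrt(4); the numerator stays sqrt(2)), which is exactly where the quoted rationale goes wrong:

```latex
\[
\frac{\sqrt{2}}{2} \;=\; \frac{\sqrt{2}}{\sqrt{4}} \;=\; \sqrt{\tfrac{2}{4}} \;=\; \sqrt{\tfrac{1}{2}} \;\approx\; 0.707,
\qquad \text{whereas} \qquad \sqrt{\tfrac{4}{4}} \;=\; 1.
\]
```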
Claude 3.5 is here. Sonnet is the first release and has: - 2x the speed of Opus - 1/5th the cost of Opus - 200k token context window - Better quality than Opus and GPT-4o I don't trust benchmarks so I tried a Physics q that GPT-4o failed and Sonnet nailed it. Insane launch.
I'm at NAACL and will be presenting a poster this afternoon (4-5:30 pm) on our work, The Bias Amplification Paradox in Text-to-Image Generation. Do stop by and check out our work! #naacl2024 aclanthology.org/2024.naacl-lon…
📢 Excited to share our latest pre-print on evaluating & enhancing the safety of medical LLMs We introduce med-safety-benchmark to assess the #safety of #medical #LLMs and find that state-of-the-art models violate principles of medical safety and ethics. arxiv.org/pdf/2403.03744…
Career update: pleased to share I’ve joined Google Gemini as a research scientist! I’m excited to continue my work on LLMs at Google 🎉
Are Models Biased on Text without Gender-related Language? We've investigated the answer to this question in our recent ICLR 2024 paper! 🧵 Paper - arxiv.org/abs/2405.00588 🎥 (5 min) Video - youtube.com/watch?v=gmqBoB… 🌐 Landing page - ucinlp.github.io/unstereo-eval/
Skill Set Optimization was accepted to @icmlconf 2024! I'm proud of this work and everything we learned about in-context policy improvement. Big thanks to my collaborators at @allen_ai. Way to go, team!
Excited to share our work, "Skill Set Optimization", a continual learning method for LLM actors that: - Automatically extracts modular subgoals to use as skills - Reinforces skills using environment reward - Facilitates skill retrieval based on state allenai.github.io/sso 🧵
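As an illustration of the retrieval step only (this is not the allenai/sso code), one plausible reading of "skill retrieval based on state" is a nearest-neighbor lookup over skill embeddings; all names and the cosine-similarity choice here are assumptions:

```python
# Hypothetical sketch: retrieve the stored skills whose example states are
# most similar to the current environment state, via cosine similarity.
import numpy as np

def retrieve_skills(state_embedding: np.ndarray,
                    skill_embeddings: np.ndarray,
                    skill_names: list[str],
                    top_k: int = 3) -> list[str]:
    # Normalize so the dot product is cosine similarity.
    s = state_embedding / np.linalg.norm(state_embedding)
    S = skill_embeddings / np.linalg.norm(skill_embeddings, axis=1, keepdims=True)
    scores = S @ s                       # one similarity score per stored skill
    best = np.argsort(scores)[::-1][:top_k]
    return [skill_names[i] for i in best]
```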
Data contamination is a huge problem for LLM evals right now. At Scale, we created a new test set for GSM8k *from scratch* to measure overfitting and found evidence that some models (most notably Mistral and Phi) do substantially worse on this new test set compared to GSM8k.
Do LLMs hold knowledge that might be dangerous in the hands of a malicious user? Can hazardous knowledge be unlearned? Introducing WMDP: an open-source eval benchmark of 4,157 multiple-choice questions that serve as a proxy measurement of LLMs’ risky knowledge in biosecurity,…
📣 Announcing the release of the WMDP LLM benchmark, designed by Scale’s Safety, Evaluations, and Analysis Lab (SEAL) in partnership with @ai_risks (CAIS)! 🧵 scale.com/blog/measuring…
New paper! Q-probe is a lightweight approach to RL on top of LLMs. We learn a linear value function on the LLM embeddings and use a variant of rejection sampling to define a policy. Results linked in the thread from first author @ke_li_2021 on coding problems and RLHF. 🧵
We propose Q-probe, a simple technique that improves coding and alignment for LLMs without requiring fine-tuning! The idea is to learn a "task vector" in the hidden space and use it to select from multiple candidate generations. arxiv.org/abs/2402.14688
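A minimal sketch of the selection step described above, assuming k candidate generations have already been sampled and embedded by the LLM. The softmax-over-values sampling below stands in for the paper's rejection-sampling variant, and all names are illustrative, not the authors' code:

```python
# Hypothetical sketch of Q-probe candidate selection: score each candidate's
# embedding with a learned linear value function, then sample one candidate
# from a softmax over the scores.
import torch

def q_probe_select(embeddings: torch.Tensor,
                   probe_weights: torch.Tensor,
                   temperature: float = 1.0) -> int:
    """embeddings: (k, d) embeddings of k candidate generations.
    probe_weights: (d,) learned linear value function ("task vector")."""
    values = embeddings @ probe_weights               # (k,) value per candidate
    probs = torch.softmax(values / temperature, dim=0)
    return torch.multinomial(probs, num_samples=1).item()
```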
My colleague Willow Primack used DALL-E to illustrate Allen Ginsberg’s Howl, and it was just too good not to share (with permission). Here’s a teaser. Howl, Illustrated by AI I saw the best minds of my generation destroyed by madness, starving hysterical naked