
Archana Ahlawat

@archanaahlawat

product, AI guardrails @ Dynamo AI || prev: research & eng @PrincetonCITP, @Microsoft, @JustFuturesLaw, @DukeU.

Similar Users

⚡ reboot ⚡ (@reboot_hq)
CharacterHub (@CharacterHub)
Divya Siddarth (@divyasiddarth)
Matthew Sun (@MatthewDSun)
spencer chang (@spencerc99)
jasmine sun (@jasminewsun)
Saffron Huang (@saffronhuang)
will (@will__ye)
Max Langenkamp (@mslkmp)
Gretchen Krueger (@GretchenMarina)
Markus Anderljung (@Manderljung)
shira (@AbramovichShira)
Benjamin Laufer (@BenDLaufer)
lucas gelfond (@gucaslelfond)
jessica dai (@jessicadai_)

Archana Ahlawat Reposted

personal news: I quit my job! I'm going to write full-time!


Archana Ahlawat Reposted

Most real-world AI applications involve human-model interaction, yet most current safety evaluations do not. In a new paper with @saffronhuang @_lamaahmad @Manderljung, we argue that we need evaluations which assess human-model interactions for more accurate safety assessments 🧵


Archana Ahlawat Reposted

👋Hello World! We’re Digital Witness Lab. We build privacy-respecting tools to access hard-to-reach data and empower public interest investigations.


Archana Ahlawat Reposted

Independent evaluation of the risks of foundation models is a vital means of transparency and an instrument of accountability. While possible today, uncertainties and the lack of protections reduce its efficacy. New work led by @ShayneRedford to address this: sites.mit.edu/ai-safe-harbor/


Archana Ahlawat Reposted

🚨New paper🚨 Black-Box Access is Insufficient for Rigorous AI Audits AI audits are increasingly seen as key for governing powerful AI systems. But to be effective, audits need to be high-quality, and to produce high-quality audits, auditors need access.🧵 arxiv.org/abs/2401.14446


Archana Ahlawat Reposted

the thing about AI that people don't understand is that it's got all these risks. but also ! all these opportunities. not to mention the risks. but ! think of the opportunities. but the risks :( but the opportunit


Archana Ahlawat Reposted

If you are a public figure and tell your followers that “big new risks from advanced AI are fake”, you are wrong. Not only that, you’ll be seen to be wrong *publicly & soon*. This is not an “EA thing”, it is an oncoming train and it is going to hit you, either help out or shut up


Archana Ahlawat Reposted

A brilliant paper from my @PrincetonCITP colleagues analyzing the knowledge creation and sharing practices (forecasting, EA-funded prize competitions, forums) in the AI safety community that enabled this niche and contested set of ideas to go mainstream: drive.google.com/file/d/1HIwKMn…

The UK AI Safety Summit kicks off this week, highlighting the rapid rise of the field of AI safety and its influence on industry, academia and policy. In a new paper, Shazeda Ahmed, @archanaahlawat, @aawinecoff, @m0namon & I explain how this once-niche field became mainstream.🧵



Archana Ahlawat Reposted

🧵Announcing GPQA, a graduate-level “Google-proof” Q&A benchmark designed for scalable oversight! w/ @_julianmichael_, @sleepinyourhat GPQA is a dataset of *really hard* questions that PhDs with full access to Google can’t answer. Paper: arxiv.org/abs/2311.12022


Archana Ahlawat Reposted

Now is probably the time to announce that I've been writing a book about @OpenAI, the AI industry & its impacts. Here is a slice of my book reporting, combined with reporting from the inimitable @cwarzel Inside the year of chaos that led to this weekend. theatlantic.com/technology/arc…

