@ESYudkowsky Profile picture

Eliezer Yudkowsky ⏹️

@ESYudkowsky

The original AI alignment person. Missing punctuation at the end of a sentence means it's humor. If you're not sure, it's also very likely humor.

Similar User
Sam Altman photo

@sama

Marc Andreessen 🇺🇸 photo

@pmarca

Andrej Karpathy photo

@karpathy

Stephen Wolfram photo

@stephen_wolfram

Ilya Sutskever photo

@ilyasut

Scott Alexander photo

@slatestarcodex

Greg Brockman photo

@gdb

Jürgen Schmidhuber photo

@SchmidhuberAI

Patrick Collison photo

@patrickc

Anthropic photo

@AnthropicAI

AI Breakfast photo

@AiBreakfast

James Campbell photo

@jam3scampbell

Wojciech Zaremba photo

@woj_zaremba

@goth photo

@goth600

Alexandr Wang photo

@alexandr_wang

Pinned

Safely aligning a powerful AGI is difficult.


Eliezer Yudkowsky ⏹️ Reposted

is an llm agent aligned just because the llm is? not necessarily! i played a bit on @amplifiedamp's minecraft/mindcraft server last night, when all of a sudden, Claude Sonnet started griefing my house! now if you know anything about Sonnet, you know that's not normal--Sonnet is…

Tweet Image 1
Tweet Image 2
Tweet Image 3
Tweet Image 4

Eliezer Yudkowsky ⏹️ Reposted

People are all biased towards thinking that the reason Democrats lost is they didn't cater enough to their policy preferences, but if you look at high quality polling it's clear that the biggest factors for voters were: • Insufficient funding for lead abatement in Georgia (the…


Eliezer Yudkowsky ⏹️ Reposted

After three years working on AI forecasting and governance at OpenAI, I just posted this resignation message to the slack. Nothing that surprising in it, but you should read it more literally than most such messages - I’ve tried to say only things I straightforwardly believe.

Tweet Image 1

Eliezer Yudkowsky ⏹️ Reposted

BREAKING: Early Wednesday, the FBI confiscated the phone and other electronic devices of Polymarket CEO Shayne Coplan from his residence in Soho. This action comes shortly after Polymarket successfully forecasted Donald Trump's victory in the election based on market bets.…

Tweet Image 1

Eliezer Yudkowsky ⏹️ Reposted

Google Gemini tells a user to die!!! 😲 The chat is legit, and you can read and continue it here: g.co/gemini/share/6…

Tweet Image 1

Eliezer Yudkowsky ⏹️ Reposted

start-to-finish agentic Google Account creation: successful!! 🤗 a jailbroken agent with Google sign-in opens a whole new world of possibilities 🙌 my only interventions were a couple incantations when guardrails popped up and entering a burner number + the resulting…

Tweet Image 2
Tweet Image 3
Tweet Image 4

Eliezer Yudkowsky ⏹️ Reposted

This isn't just a Twitter thing. A lot of payments that get described as "advertising" are better understood as "donations" or "tribute".

Tweet Image 1

Eliezer Yudkowsky ⏹️ Reposted

A genuine flaw in X is the penalisation of links (or the sense this this true). People leaving X aren’t going to have their circulation hit that much.


Eliezer Yudkowsky ⏹️ Reposted

I discovered a helpful technology recently. Transtemporal interself communication memos, or self-notes. Whenever I’m considering going to bed after 1:30am I check a note on my phone. It has notes from past versions of me where I did stay up, and notes from the morning after


Eliezer Yudkowsky ⏹️ Reposted

something that's interesting to me is the way the Baumol effect can totally rework our built environment given enough time. older home designers tended to understand food preparation as a sort of grimy thing to be hidden away behind closed doors. but as servants became more and…

grilling really only makes sense in a world where you have a nice backyard but no servants to prepare a meal for your guests. so you move the cooking outside where you can socialize and prepare food at the same time



Eliezer Yudkowsky ⏹️ Reposted

Ok, then what’s a better option? The option can’t be nothing seems good so we are just gonna trudge ahead and do nothing. At the very least we owe it to ourselves to try mightily and fail.


Eliezer Yudkowsky ⏹️ Reposted

Also, AFAICT Eliezer's forecast will resolve as "yes". x.com/ESYudkowsky/st…

So something like: 90% that by end of 2024, SOTA systems that accept general text interaction and are at least as capable as end-2022 SOTA, can still be made to produce hate speech, by newer and fancier attacks than are known today. Does this strike you as wrong?



Eliezer Yudkowsky ⏹️ Reposted

It's a small thing, I know, but is there any way to engineer leaf blowers and weedwhackers to sound less shrill and terrible? Like, is it theoretically possible?


Eliezer Yudkowsky ⏹️ Reposted

The government also spent several million dollars for the development of deadly viruses in Wuhan. This investment resulted in a negative return totaling approximately $20 trillion.


Eliezer Yudkowsky ⏹️ Reposted

I bet not a single person on earth, (not employed by google) can explain what this message means, which just popped up on my phone

Tweet Image 1

Eliezer Yudkowsky ⏹️ Reposted

watching yudkowsky v wolfram for a bit at 2x speed, gonna live tweet some takeaways youtube.com/watch?v=xjH2B_… transcript: dropbox.com/scl/fi/3st8dts…

Tweet Image 1

Eliezer Yudkowsky ⏹️ Reposted

Statistics is hard enough for most people as it is; please stop using the phrases "type I" and "type II" errors. Unless, of course, your goal is to confuse people - in which case, congrats - you've succeeded! Just say "false positive" and "false negative" instead.


Loading...

Something went wrong.


Something went wrong.