Daniel Ziegler
@d_m_ziegler
Alignment Stress-Testing @ Anthropic
Similar User
@FreedmanRach
@ajeya_cotra
@EvanHub
@CollinBurns4
@jkcarlsmith
@Turn_Trout
@ryan_kidd44
@StephenLCasper
@dfrsrchtwts
@kandouss
@catherineols
@BethMayBarnes
@ARGleave
@Thomas_Woodside
@ZacKenton1
We’re starting a Fellows program to help engineers and researchers transition into doing frontier AI safety research full-time. Beginning in March 2025, we'll provide funding, compute, and research mentorship to 10–15 Fellows with strong coding and technical backgrounds.
Guys, I don't often ask you to retweet, but please retweet this. Swap Your Vote *does not have enough safe state voters to match all its swing state voters!* Swap Your Vote (link below) matches swing state voters who prefer Harris to Trump but don't want to vote for Harris...
SwapYourVote is a very interesting idea -- matches two safe-state Dems with one swing state voter considering going third-party, they switch, increasing overall third-party vote share and Harris's odds in swing states.
I'd just like to point out that *I* left OpenAI waaay before it was cool
Announcing Transluce, a nonprofit research lab building open source, scalable technology for understanding AI systems and steering them in the public interest. Read a letter from the co-founders Jacob Steinhardt and Sarah Schwettmann: transluce.org/introducing-tr…
Familiar things (that still don't exist): whistleblower protections for AI employees, mandatory 3rd party testing and result-sharing...
Incredible indeed! A much bigger deal than I would have guessed, and super interesting—thanks very much for the pointer
A big part of my job these days is to think about what technical work Anthropic needs to do to make things go well with the development of very powerful AI. I digested my thinking on this, plus some of the Anthropic zeitgeist around it, into this piece: sleepinyourhat.github.io/checklist/
I wish people would use prediction markets more for practical decision-relevant questions like this one manifold.markets/DanielZiegler/…
If Germany had stuck with nuclear power it would have saved €350 billion and cut emissions by 73% more since 2022. tandfonline.com/doi/full/10.10…
This is a really accessible, good-faith, and IMO persuasive FAQ from Yoshua Bengio -- here are my favorite excerpts:
- the core case for concern
- timelines
- why companies would build something so unsafe
- the compatibility of reducing catastrophic and more immediate harms
if we play our cards right we're all going to live to the end of time and have the most wonderful adventures before it all fades away. and everything matters as much as anything ever could matter because it all comes down to what we do
The CA Assembly Judiciary has taken this seriously. After citing this letter at length, they suggested an amendment to SB 1047 strengthening whistleblower protections. Here's an excerpt...
A group of current, and former, OpenAI employees - some of them anonymous - along with Yoshua Bengio, Geoffrey Hinton, and Stuart Russell have released an open letter this morning entitled 'A Right to Warn about Advanced Artificial Intelligence'. righttowarn.ai
The details matter! Our target is not safety cases in the abstract, but specific details about how safety cases might look for AI control, interp, scalable oversight, guaranteed safety, process supervision, weak-to-strong, automated safety, ELK, alignment evals, <yours>, etc.
If you feel interested in jhanas or other serious meditation practice, I think it's worth your time to read this thread and its replies.
I really wish you would caveat this sort of thing when you evangelize this practice, especially later jhanas / anything to do with cessation. 1/ x.com/nickcammarata/…
What about "we should be honest about our enormous uncertainty and the wide range of outcomes on the table, and run labs with a strong internal safety culture while working hard internationally to avoid a second Cold War or an arms race"?