
Karina Nguyen

@karinanguyen_

AI research & eng @OpenAI, prev. @AnthropicAI, @nytimes, @square, @dropbox + visual forensics for the Pulitzer Prize investigations

Similar Users

lmarena.ai, formerly lmsys.org (@lmarena_ai)
Anthropic (@AnthropicAI)
Latent.Space (@latentspacepod)
Abhi Venigalla (@ml_hardware)
Div Garg (@DivGarg_)
Dust (@dust4ai)
Sanchit Gandhi (@sanchitgandhi99)
Eric Zhang (@ekzhang1)
Percy Liang (@percyliang)
Georgi Gerganov (@ggerganov)
Yi Tay (@YiTayML)
Barret Zoph (@barret_zoph)
Hyung Won Chung (@hwchung27)
Yao Fu (@Francis_YAO_)
Ethan Perez (@EthanJPerez)

Pinned

For the first time since ChatGPT launched two years ago, we are fundamentally changing how humans can collaborate with it. We’re introducing canvas, a new interface for working with ChatGPT on writing and coding projects that go beyond simple chat. Product and model features:…


it is not an AGI if it a) can't write award-winning novels b) doesn't talk like a god


Dreams come true! Thank you so much. Back in 2014, in high school in Ukraine, I learned a lot from Coursera, including English and programming. So grateful for this opportunity 💛

Chatting with OpenAI’s @karinanguyen_, who joined OpenAI earlier this year and within 6 months co-created and shipped Canvas. I really respect teams that can move fast. That OpenAI, even as a large-ish company, can ship at this pace is fantastic!



only OGs will remember the creativity of claude 1.3, the claude instant 1.2 price-performance, the magic of 100k context, the edit toggle in the claude.ai interface that let devs modify the model's outputs, the inability to stop claude's streaming because of silly next.js, long-lived…


Karina Nguyen Reposted

Excited to open-source a new hallucinations eval called SimpleQA! For a while it felt like there was no great benchmark for factuality, so we created an eval that is simple, reliable, and easy for researchers to use. Main features of SimpleQA: 1. Very simple setup: there…


New paper! SimpleQA is a new factuality benchmark that contains 4,326 short, fact-seeking questions that are challenging for frontier models. Designing good evals is hard. Here we used the following criteria: - High correctness via robust data quality verification / human…


Factuality is one of the biggest open problems in the deployment of artificial intelligence. We are open-sourcing a new benchmark called SimpleQA that measures the factuality of language models. openai.com/index/introduc…
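To make the benchmark's mechanics concrete, here is a minimal sketch of a SimpleQA-style grading loop: a model answers short, fact-seeking questions against a single gold answer, and a grader model labels each response CORRECT, INCORRECT, or NOT_ATTEMPTED so that hallucinations and abstentions can be reported separately. This is an illustration, not the official openai/simple-evals implementation; the sample question, grader prompt, and model names below are placeholders.

```python
# Minimal sketch of a SimpleQA-style factuality eval loop.
# Illustrative only: not the openai/simple-evals implementation;
# the sample item, grader prompt, and model names are placeholders.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# SimpleQA items are short, fact-seeking questions with one gold answer.
SAMPLE_ITEMS = [
    {"question": "In what year was the Eiffel Tower completed?", "answer": "1889"},
]

GRADER_PROMPT = """You are grading a factual question.
Question: {question}
Gold answer: {answer}
Model answer: {response}
Reply with exactly one word: CORRECT, INCORRECT, or NOT_ATTEMPTED."""


def ask(model: str, question: str) -> str:
    """Get the model's answer to one benchmark question."""
    out = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": question}],
    )
    return out.choices[0].message.content


def grade(item: dict, response: str, grader_model: str = "gpt-4o") -> str:
    """Have a grader model classify the answer against the gold answer."""
    out = client.chat.completions.create(
        model=grader_model,
        messages=[{"role": "user", "content": GRADER_PROMPT.format(
            question=item["question"], answer=item["answer"], response=response)}],
    )
    return out.choices[0].message.content.strip()


if __name__ == "__main__":
    grades = [grade(item, ask("gpt-4o-mini", item["question"]))
              for item in SAMPLE_ITEMS]
    # Separating INCORRECT from NOT_ATTEMPTED is what lets the benchmark
    # report hallucination rate and attempt rate independently.
    print({g: grades.count(g) for g in set(grades)})
```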



we’re in an intelligence overhang: the problem now is scaling and distribution (e.g., not all developers have truly embraced the magic of in-context learning, imo). it will take time, but people are underestimating how profound the consequences will be


Show changes feature now in canvas! You can now visually see your or the model's code diffs and edits :)

When using canvas, you can now see what's changed in your writing and code by selecting the "Show changes" icon. Enjoy!

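For intuition about what such a view computes, here is a minimal sketch of the underlying idea: a line-level diff between two drafts, using Python's standard difflib. This is an illustration of the concept, not canvas's actual implementation; the two drafts are made up.

```python
# Minimal sketch of the idea behind a "Show changes" view: compute a
# line-level diff between two drafts. Illustrative only; not how canvas
# implements it. The before/after drafts are placeholders.
import difflib

before = "def greet(name):\n    print('hi', name)\n"
after = "def greet(name: str) -> None:\n    print('hello', name)\n"

# unified_diff marks removed lines with '-' and added lines with '+',
# which is the information a UI needs to highlight edits visually.
for line in difflib.unified_diff(
        before.splitlines(), after.splitlines(),
        fromfile="before", tofile="after", lineterm=""):
    print(line)
```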


insane 5 months at oai not going to lie, 1 month == 1 yr in these labs


fun to look back at old explorations, some dreams can come true!


I often use language models to brainstorm and think through ideas during the writing process, to get unstuck or find different perspectives. So I built Synevyr, a writing environment with GPT-3 integrated directly into the editor.



Autocompleting a human’s thought during creative processes like writing or coding is the ultimate personalization; other efforts that people claim as “personalization” are a waste of time


Karina Nguyen Reposted

OpenAI’s canvas seems to really understand the writing process. How is this possible? I recorded myself trying it out on my writing for the first time, and it’s insane how quickly it figures out what I wanted to improve. I’ve been writing for a long time and…


Karina Nguyen Reposted

New is always better

