Shahriar Golchin @ShahriarGolchin Twitter Profile

Shahriar Golchin

@ShahriarGolchin

Student Researcher @Google | CS PhD in ML/NLP @UArizona (opinions are mine)

50Posts 133Followers 405Following

Similar User

@woojin_ko

@duckwackIND

@adnanharundogan

@DeepReel_AI

@nyla_worker

@arjunashok37

@swagatam2198

@nikhilbarhate99

@Qdvvdu

@luis_castano_95

@adilsheraz_

@4245Shubham

@irfanbulu

@AritraBhowmik6

@rbfahim

Pinned

Shahriar Golchin

@ShahriarGolchin

22 Aug

The mechanism behind in-context learning (ICL) has been unclear for a long time. We studied the behind-the-scenes memorization in ICL and found that there is a very strong correlation between this memorization and improved performance. arxiv.org/abs/2408.11546

Shahriar Golchin

@ShahriarGolchin

9 Sep

Excited to share I've joined @Google as a Student Researcher! Ready to learn, innovate, and make an impact!

Shahriar Golchin

@ShahriarGolchin

3 May

We (w/ @msurd) will be presenting our Spotlight paper at #ICLR2024 in Vienna next week. Drop by for some amazing intellectual exchanges on data contamination. Paper: arxiv.org/abs/2308.08493 Code: github.com/shahriargolchi… Media: thenewstack.io/how-to-detect-…

Shahriar Golchin Reposted

Caiming Xiong

@CaimingXiong

30 Apr

Are your LLMs highly accurate, or simply contaminated? As the race to build the best LLM intensifies, clean evaluation is becoming more important than ever, yet contaminated LLMs and benchmarks obfuscate the real performance of models. Checkout our new work (comprehensive survey…

Shahriar Golchin Reposted

Oscar Sainz

@osainz59

16 Apr

Do you have a paper about data contamination with some evidence reported on it? Consider submitting that evidence to the Data Contamination Evidence Collection. Any evidence is very welcome!

Oscar Sainz

@osainz59

16 Apr

Can you imagine having all the evidence of data contamination gathered in one place? 📢As part of the CONDA workshop, we present the Data Contamination Evidence Collection, a shared task on reporting contamination. Available as a @huggingface space: hf.co/spaces/CONDA-W…

Shahriar Golchin Reposted

clulab

@LabCLU

12 Apr

1/ Many cool papers coming from our lab recently! Here are just a few:

Shahriar Golchin Reposted

Mihai Surdeanu

@msurd

10 Apr

The #nlproc group at University of Arizona has grown too big, and my account doesn't do it justice. Please follow @LabCLU for updates about our group!

Shahriar Golchin Reposted

Mihai Surdeanu

@msurd

30 Mar

A brief but nice post on data contamination that discusses the work that @ShahriarGolchin, one of our great PhD students :), did: thenewstack.io/how-to-detect-…

How to Detect and Clean up Data Contamination in LLMs

Source: https://t.co/31BU6EnbuT

Shahriar Golchin

@ShahriarGolchin

25 Mar

Surprisingly, Andrej Karpathy revealed that GPT-4 isn't truly 67% proficient in coding (HumanEval). The lesser-known problem we uncovered earlier is that GPT-4 has higher levels of coding and reasoning data contamination than officially reported. arxiv.org/abs/2311.06233

Andrej Karpathy

@karpathy

4 Mar

Claude 3 takes on the Tokenization book chapter challenge :) context: twitter.com/karpathy/statu… Definitely looks quite nice, stylistically! If you look closer there are a number of subtle issues / hallucinations. One example there is a claim that "hello world" tokenizes into 3…

Shahriar Golchin Reposted

Oscar Sainz

@osainz59

29 Jan

📢 Excited to announce that our Workshop on Data Contamination (CONDA) will be co-located with ACL24 in Bangkok, Thailand on Aug. 16. We are looking forward to seeing you there! Check out the CFP and more information here: conda-workshop.github.io

Shahriar Golchin

@ShahriarGolchin

16 Jan

Thrilled to share that our "Time Travel in LLMs" paper has been accepted to #ICLR2024 as a Spotlight! w/ my awesome advisor @msurd #LLMs #DataContamination @iclr_conf

Shahriar Golchin

@ShahriarGolchin

23 Aug 2023

Data contamination suggests LLMs have possibly seen test data from downstream tasks. Our recent study introduces a novel method to replicate LLMs' training data, including downstream dataset instances, to aid in detecting data contamination. Read more: arxiv.org/abs/2308.08493

Shahriar Golchin Reposted

Oscar Sainz

@osainz59

30 Oct 2023

Thankfully, several works have been published on this topic since we submitted our paper for review: Time Travel in LLMs: Tracing Data Contamination in Large Language Models By: @ShahriarGolchin @msurd Paper: arxiv.org/abs/2308.08493