Ayoub Ajarra @AjarraAyoub Twitter Profile

Ayoub Ajarra

@AjarraAyoub

PhD student at Scool (Previously Sequel) - Inria Lille

182Posts 275Followers 2KFollowing

Similar User

@AhrenJin

@hyanan16

@xuefeng_du

@XtremSup

@ShuruiGui

@YizhouWang14

@XinmingHou

@dmarviss

@KhalidDinar4

@lozanorcintaes1

@KruskalLin

@chunhuizng

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

11 Nov

We live in incredible times.

Ayoub Ajarra Reposted

When a measure becomes a target it ceases to be a good measure This adage from Marilyn Strathern known as Goodhart's law, has implications beyond economics, public policy & AI With @le_science4all we tried few years ago to formalise it, here it is finally arxiv.org/abs/2410.09638

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

28 Aug

Is "offline RL" in offline-to-online RL really necessary? Surprisingly, we find that replacing offline RL with *unsupervised* offline RL often leads to better online fine-tuning performance -- even for the *same* task! Paper: arxiv.org/abs/2408.14785 🧵↓

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

12 Aug

Here's a cool theorem I learned today.

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

30 Jul

I'm going to write a survey on bounding the total variation distance, focusing on the case of product measures send me your favorite inequalities PS I know all of these 👇

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

27 Jul

I don't know how someone came up with this crazy formula for the mean of a logit-Normal, but I'm glad they did. It converges extremely fast. en.wikipedia.org/wiki/Logit-nor…

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

23 Jul

Excited to share that our work with Karolina, Mahdi, Roi, and Dan has been selected for a best paper award 🎉

ICML Conference

@icmlconf

23 Jul

Congratulations to the best paper award winners

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

22 Jul

hree useful identities for min(u,v) each one gets used in my upcoming paper stay tuned

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

11 Jul

Temporal distances (expected number of time steps between states) in stochastic MDPs in general lack metric structure. It has been a long-standing question how to design a (quasi)metric notion of "distance" in such settings. In a new paper, we have a viable solution! 🧵👇

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

9 Jul

🧵New paper: Machine Unlearning Fails to Remove Data Poisoning Attacks, ft @MartinPawelczyk, @jimmy_di98, @ayush_sekhari, @SethInternet Title says it all: current approaches for machine unlearning (MUL) are not effective at removing the effect of data poisoning attacks. 1/n

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

3 Jul

We've derived tight lower and upper bounds for differentially private finite-armed & linear bandits, while we lack the same for contextual bandits. At #COLT2024, @achraf_azize presents open problems in contextual bandits with privacy. @BasuDebabrota @Inria_Lille @RechercheUlille

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

17 Jun

#phdlife ✌️Dans les algorithmes Top Two, un choix considéré "leader" est opposé à un "challenger" pour choisir la meilleure option. 📃Dans sa #thèse, @MarcJourdan5 de l’équipe @InriaScool en fait une méthode offrant garanties théoriques et excellentes performances empiriques.

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

5 Apr

Now *that’s* a poster!

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

26 Mar

[📜preprint] New results for an old problem We revisit the basic statistics problem "simple binary hypothesis testing": Given (i) two distributions p & q and (ii) n data points promised to be sampled either i.i.d. from p or i.i.d. from q, identify the true distribution

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

21 Mar

AI researcher Minqi Jiang says the next frontier of AI is moving from to systems that answer questions to systems which ask the questions

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

2 Feb

🧵New paper: "Not All Learnable Distribution Classes are Privately Learnable" to appear in #ALT2024. We refute a conjecture of @ashtiani_hassan We show that there exists a learnable distribution class which is not privately learnable. w @markmbun @argymouz @vkerdos 1/n

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

29 Nov

As semester draws to end, I want to share this *identity* (h/t @tengyangx) that connects so many fundamental pieces of the RL theory together: optimism, pessimism, policy opt, proved by PD lemma + Bellman-error telescoping, all in one equation! 1/3

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

23 Nov

I hope Q* does not involve RL because I was hoping I will actually never have to learn that god-awful RL notation. Please let it be anything else but RL, please.

Ayoub Ajarra Reposted

Ayoub Ajarra

@AjarraAyoub

22 Nov

"Toward General Virtual Agents" I recently gave a talk at MIT. I argued that we should use tools from reinforcement learning and search to improve the capability and alignment of LLM agents. Slides: drive.google.com/file/d/1kDvmrm… Video: