Nelly Papalampidi
@pinelopip3
Research Scientist @GoogleDeepMind working on understanding and generating videos from multimodal inputs. PhD @InfAtEd @EdinburghNLP. Ex @MetaAI
Large multimodal models understand images/clips, but what about longer contexts? We propose a memory-efficient approach for training on long videos and show that our 1B model outperforms LLMs used as information-aggregator over large image captioners. arxiv.org/abs/2312.07395
We are hiring on the Generative Media team in London: boards.greenhouse.io/deepmind/jobs/… We work on Imagen, Veo, Lyria and all that good stuff. Come work with us! If you're interested, don't delay -- apply before 5PM tomorrow (UK time).
Exciting to see Veo in the hands of creators on YouTube ✨
On @YouTube, we want creators to be able to express their creativity, build community + drive long-lasting businesses. New tools at #MadeOnYouTube are helping: we’re bringing Veo to Dream Screen to create high-quality, custom backgrounds on Shorts + more. blog.youtube/news-and-event…
I'm in Novi Sad 🇷🇸 all week for #EEML2024, where I'll be talking about, you guessed it, diffusion models! Then on to Vienna 🇦🇹 next week for #ICML2024, where I'll be doing more of the same 🤷 at the workshop on controllable video generation.
EEML2024 is fast approaching! The school starts on Monday July 15th, in Novi Sad, Serbia 🇷🇸, with an Intro to Deep Learning by @alfcnz, Generative AI+SSL by @sedielem, AI for Science by @weballergy, and a Reasoning tutorial by @PetarV_93 and @backprop2seed 🎉. See thread for details
✨PaliGemma report will hit arxiv tonight. We tried hard to make it interesting, and not "here model. sota results. kthxbye." So here's some of the many interesting ablations we did, check the paper tomorrow for more! 🧶
Our PaliGemma tech report is out! 🚀 arxiv.org/abs/2407.07726 WebLI -> SigLIP -> PaliGemma, what an incredible journey it's been with my amazing colleagues!
We're excited to release TAPVid-3D: an evaluation benchmark of 4,000+ real world videos and 2.1 million metric 3D point trajectories, for the task of Tracking Any Point in 3D!
Excited to share our new preprint on data curation for large scale multimodal learning (arxiv.org/abs/2406.17711)! TLDR; 1) we show that picking good batches of data is more important than selecting data points independently, 2) online model approximation can be used to filter…
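The two takeaways above can be sketched in toy form. This is an illustrative simplification, not the paper's actual algorithm: the paper selects whole batches jointly, while this sketch applies the same learnability signal (hard for the current learner, easy for a well-trained reference model) to score examples independently. All function names are hypothetical.

```python
def learnability_scores(learner_loss, ref_loss):
    """Score = loss under the current learner minus loss under a reference
    model. High scores mark examples the learner hasn't mastered but the
    reference finds easy, i.e. learnable data rather than noise."""
    return [l - r for l, r in zip(learner_loss, ref_loss)]

def select_batch(learner_loss, ref_loss, k):
    """Keep the k examples with the highest learnability scores."""
    scores = learnability_scores(learner_loss, ref_loss)
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]

# Toy run: 6 candidate examples, keep 3.
learner = [2.0, 0.5, 3.0, 1.0, 2.5, 0.2]
ref     = [0.4, 0.4, 2.9, 0.3, 0.5, 0.1]
print(select_batch(learner, ref, 3))  # → [4, 0, 3]
```

Note that example 2 has high learner loss (3.0) but is filtered out: the reference model also finds it hard (2.9), so it is likely noise rather than learnable signal.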
We are going to present PaliGemma demo 2:30pm today, come to the Google booth and talk to me, @giffmana and @__kolesnikov__ in person about any topics around multimodal data, models and fairness.
🚀 Excited to share our latest work @GoogleDeepMind: "Synth^2: Boosting Visual-Language Models with Synthetic Captions and Image Embeddings" 🧵 #VLM #Synthetic #AI arxiv.org/abs/2403.07750
In Seattle for @CVPR and we will present our work on long video-language pre-training w/ @pathak__shreya and Joe Heyward on Thursday 10:30 am PDT @ Arch 4A-E Poster #458. Stop by to discuss long video pre-training and understanding, or ping me if you are around at the conf!
MC-ViT will appear as a spotlight paper at #ICML2024! Come chat to @olivierhenaff, @pinelopip3, @YugeTen and myself in Vienna if you’re interested in simple and effective long-context video understanding.
Humans and animals reason about events spanning days, weeks, and years, yet current CV systems live largely in the present. Introducing Memory-Consolidated ViT, whose context extends far into the past and sets a new SOTA in long-video understanding with a 10x smaller model
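The core idea of extending context by consolidating the past into a compact memory can be sketched minimally. This is an illustrative stand-in, not the paper's method: it compresses a long list of cached token vectors by averaging contiguous chunks, whereas the actual consolidation is more sophisticated. The function name and chunking scheme are hypothetical.

```python
def consolidate(memory, max_size):
    """Compress a long list of cached token vectors into at most `max_size`
    slots by averaging contiguous chunks, so attention over the past stays
    cheap no matter how long the video grows."""
    if len(memory) <= max_size:
        return memory
    chunk = -(-len(memory) // max_size)  # ceiling division
    consolidated = []
    for i in range(0, len(memory), chunk):
        group = memory[i:i + chunk]
        # Element-wise mean of the vectors in this chunk.
        consolidated.append([sum(vals) / len(group) for vals in zip(*group)])
    return consolidated

# Toy run: 10 one-dimensional "tokens" squeezed into 4 memory slots.
tokens = [[float(i)] for i in range(10)]
print(consolidate(tokens, 4))  # → [[1.0], [4.0], [7.0], [9.0]]
```

The point of the sketch: memory size is bounded by `max_size` rather than by video length, which is what lets a small model attend far into the past.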
Last week at NYU graduation, @ebetica asked me: So is Gemini any good? I’ve been hearing this question a lot, and decided to show you myself! 🙌 I made a colab for you to try out our spatial understanding capabilities using the API key from AI Studio. Currently FREE! 🧵
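Spatial-understanding responses from the model come back as bounding boxes. Assuming the documented Gemini convention of `[ymin, xmin, ymax, xmax]` coordinates normalized to a 0-1000 scale, a small helper (hypothetical name) converts them to pixel coordinates for drawing:

```python
def to_pixels(box, width, height):
    """Convert a [ymin, xmin, ymax, xmax] box on the 0-1000 normalized
    scale into pixel coordinates (left, top, right, bottom) for an image
    of the given width and height."""
    ymin, xmin, ymax, xmax = box
    return (round(xmin / 1000 * width), round(ymin / 1000 * height),
            round(xmax / 1000 * width), round(ymax / 1000 * height))

# A box covering the middle of a 640x480 image.
print(to_pixels([250, 100, 750, 900], width=640, height=480))  # → (64, 120, 576, 360)
```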
Highly recommend checking out this work! Given the impressive image generation quality nowadays, text alignment is often overlooked. But testing alignment to users' intentions should be a top priority, and now there is a robust benchmark and metric to do so.
Check out Gecko 🦎: @GoogleDeepMind's latest work looking at how to evaluate text-to-image technology with: 📊 a new benchmark 🕵️ 100K+ human ratings of state-of-the-art T2I models 🤖 a better human-correlated auto-eval metric arxiv.org/abs/2404.16820
Looking forward to AthNLP 2024! Thanks to the fantastic speakers who will join us: @anas_ant @raquel_dmg @fhuszar @MKrallinger Mirella Lapata Ryan McDonald @aidanematzadeh @vnfrombucharest @barbara_plank @annargrs ! Deadline for applications: 20th of June! athnlp.github.io/2024/cfp.html
📢Speakers announced for #AthNLP2024 International Summer School! ❗️Don't miss the opportunity to meet in person with the roster of ✨tutors in the field of #NaturalLanguageProcessing at the upcoming Athens Natural Language Processing Summer School! athnlp.github.io/2024/speakers.…
We evaluate the 3 main aspects involved in the evaluation of MM/image generative models: (1) the prompt set, (2) the metric, (3) the human data. Will be releasing our benchmark soon, including a subset with high human agreement to be used to evaluate image-text alignment metrics!
Very excited to see Veo out 🚀
Introducing Veo: our most capable generative video model. 🎥 It can create high-quality, 1080p clips that can go beyond 60 seconds. From photorealism to surrealism and animation, it can tackle a range of cinematic styles. 🧵 #GoogleIO
We’re introducing new additions to Gemma: our family of open models built with the same technology as Gemini. 🔘 PaliGemma: a powerful open vision-language model 🔘 Gemma 2: coming soon in various sizes, including 27 billion parameters → dpmd.ai/3QKEteK #GoogleIO
We release PaliGemma. I'll keep it short, still on vacation: - sota open base VLM designed to transfer quickly, easily, and strongly to a wide range of tasks - Also does detection and segmentation - We provide lots of examples - Meaty tech report later! ai.google.dev/gemma/docs/pal…
Congrats Tom 🥳 I wish I was in Edi to celebrate with you!
I passed my PhD viva today! Thanks to @iatitov and @LukeZettlemoyer for examining me. See you at Doctor’s @ 6PM
Super excited about the new SIMA agent and very happy that our work on SPARC is powering the SIMA image encoders! 🥳🚀🕹️ SPARC 🌟 provides fine-grained image-text alignment and enables the agent to utilize knowledge gained through internet-scale pretraining.
Introducing SIMA: the first generalist AI agent to follow natural-language instructions in a broad range of 3D virtual environments and video games. 🕹️ It can complete tasks similar to a human, and outperforms an agent trained in just one setting. 🧵 dpmd.ai/3TiYV7d