Itamar Zimerman (@ItamarZimerman)

PhD candidate @ Tel Aviv University. AI Research scientist @ IBM Research. Interested in deep learning and algorithms.

Similar Users:
Ofir Lindenbaum (@Ofirlin)
Yoad Tewel (@YoadTewel)
Shelly Sheynin (@ShellySheynin)
Tal Shaharabany (@TShrbny)
Sagie Benaim (@BenaimSagie)
Itai Lang (@ItaiLang)
Ohad Rubin (@OhadRubin)
Nir Zabari (@nirzabari)
Idan Schwartz (@idansc)
Shir Gur 🎗️ (@shir_gur)
Dr. Haim Ben Yakov (@haim_ben_yakov)
Mechanical Engineering Department, Thapar Institute (@HmedThapar)

Pinned

New! 🚨📰 Mamba is a cool, efficient, and effective DL architecture, but how much do we actually know about it? How does it capture interactions between tokens? Can it be the attention-killer? In our work, "The Hidden Attention of Mamba Models", we provide answers to these questions! [1/4]

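A hedged sketch of the paper's core observation as stated above: unrolling a selective SSM's recurrence expresses every output as a data-dependent weighted sum of past inputs, so an implicit (causal, unnormalized) attention matrix can be read off the per-token SSM parameters. The parameter names below are illustrative stand-ins for Mamba's discretized per-token parameters, not the paper's actual code:

```python
import torch

L, N = 6, 4  # sequence length, state dimension
# Per-token discretized SSM parameters (random stand-ins for one channel):
A_bar = torch.rand(L, N)   # diagonal state transition at each token
B_bar = torch.randn(L, N)  # input projection at each token
C = torch.randn(L, N)      # output projection at each token

# Implicit attention: alpha[i, j] = C_i^T (prod_{k=j+1..i} A_bar_k) B_bar_j
alpha = torch.zeros(L, L)
for i in range(L):
    for j in range(i + 1):
        decay = torch.ones(N)
        for k in range(j + 1, i + 1):
            decay = decay * A_bar[k]
        alpha[i, j] = (C[i] * decay * B_bar[j]).sum()
# Each row of alpha can now be inspected like a causal attention map.
```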

Itamar Zimerman Reposted

Excited to announce LTX-Video! Our new text-to-video model generates stunning, high-quality videos faster than real-time—5 seconds of 24fps video at 768x512 in just 4 seconds on an Nvidia H100! ⚡ We’re open-sourcing the code & weights. Check out the results 🎥👇


Itamar Zimerman Reposted

🚀 Excited to release the code and demo for ConsiStory, our #SIGGRAPH2024 paper! No fine-tuning needed — just fast, subject-consistent image generation! Check it out here 👇 Code: github.com/NVlabs/consist… Demo: build.nvidia.com/nvidia/consist…

Nvidia presents ConsiStory: Training-Free Consistent Text-to-Image Generation. Paper page: huggingface.co/papers/2402.03… ConsiStory enables Stable Diffusion XL (SDXL) to generate consistent subjects across a series of images without additional training.
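
As a rough illustration of the training-free sharing idea (a simplified sketch, not the NVlabs implementation, which also uses subject masks derived from cross-attention, feature injection, and other components), extending each image's self-attention keys and values with the subject tokens of the other images in a batch might look like this:

```python
import torch
import torch.nn.functional as F

def shared_self_attention(q, k, v, subject_mask):
    """q, k, v: (B, L, D) self-attention projections for B images.
    subject_mask: (B, L) bool, True where a patch shows the shared subject.
    Each image attends to its own tokens plus the subject tokens of the
    other images, encouraging a consistent subject across the batch."""
    B, L, D = q.shape
    outs = []
    for i in range(B):
        ks, vs = [k[i]], [v[i]]                   # image i's own tokens...
        for j in range(B):
            if j != i:
                ks.append(k[j][subject_mask[j]])  # ...plus others' subject patches
                vs.append(v[j][subject_mask[j]])
        K, V = torch.cat(ks), torch.cat(vs)
        attn = F.softmax(q[i] @ K.T / D ** 0.5, dim=-1)
        outs.append(attn @ V)
    return torch.stack(outs)  # (B, L, D)
```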



Itamar Zimerman Reposted

🎙️ Proud to share our new preprint: Continuous Speech Synthesis using per-token Latent Diffusion. Check it out: huggingface.co/papers/2410.16… @ArnonTu, @NimrodShabtay #IBMResearch #SpeechSynthesis

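Since the preprint's title describes the mechanism, here is a deliberately toy sketch of what "per-token latent diffusion" could look like: an autoregressive model emits a conditioning vector per token, and a small denoising network generates that token's continuous latent. Every name, size, and the sampling loop below are hypothetical illustrations, not the paper's architecture:

```python
import torch
import torch.nn as nn

class PerTokenDenoiser(nn.Module):
    """Toy noise predictor for one token's continuous latent,
    conditioned on an LM hidden state (all sizes are made up)."""
    def __init__(self, latent_dim=64, cond_dim=256, hidden=512):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(latent_dim + cond_dim + 1, hidden), nn.SiLU(),
            nn.Linear(hidden, latent_dim),
        )

    def forward(self, z_t, cond, t):
        # t: diffusion time in [0, 1], one scalar per example
        return self.net(torch.cat([z_t, cond, t[:, None]], dim=-1))

head = PerTokenDenoiser()
cond = torch.randn(1, 256)   # hidden state the LM produced for this token
z = torch.randn(1, 64)       # start from pure noise
for step in range(50, 0, -1):        # crude Euler-style denoising loop,
    t = torch.full((1,), step / 50)  # for illustration only
    z = z - head(z, cond, t) / 50
```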

Itamar Zimerman Reposted

Introducing LiveXiv, a new, challenging, and maintainable scientific multi-modal live dataset. Paper: arxiv.org/abs/2410.10783 Github: github.com/NimrodShabtay/… Dataset: huggingface.co/datasets/LiveX…


Itamar Zimerman Reposted

I'm going to present ConsiStory📖 at #SIGGRAPH2024 this Monday @ 2pm! If you're around this week, DM me if you want to chat! Details below⬇️🧵

Nvidia presents ConsiStory: Training-Free Consistent Text-to-Image Generation. Paper page: huggingface.co/papers/2402.03… ConsiStory enables Stable Diffusion XL (SDXL) to generate consistent subjects across a series of images without additional training.



Given a polynomial P and an input x, the magic of homomorphic encryption enables computing Dec(P(Enc(x))) = P(x) without exposing any information about x or P. By converting transformers into polynomials, we provide a new framework for confidential computing with LMs!

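A minimal sketch of the Dec(P(Enc(x))) = P(x) identity itself, using the open-source TenSEAL library and the CKKS scheme with a toy polynomial (this is not the paper's transformer-to-polynomial pipeline, and the encryption parameters are generic defaults):

```python
import tenseal as ts

# CKKS context; parameter choices are common defaults, not the paper's.
ctx = ts.context(
    ts.SCHEME_TYPE.CKKS,
    poly_modulus_degree=8192,
    coeff_mod_bit_sizes=[60, 40, 40, 60],
)
ctx.global_scale = 2 ** 40

x = 0.5
enc_x = ts.ckks_vector(ctx, [x])            # Enc(x)

# P(x) = 3x^2 + 2x + 1, evaluated entirely on ciphertexts
enc_p = enc_x * enc_x * 3 + enc_x * 2 + 1   # P(Enc(x))

print(enc_p.decrypt())  # ≈ [2.75] = Dec(P(Enc(x))) = P(0.5)
```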

1/5 Worried about your sensitive data while using LLMs? Come and join me tomorrow at @icmlconf at 11:30, presenting our work: "Converting Transformers to Polynomial Form for Secure Inference over Homomorphic Encryption" (#2115). #IBMResearch #HELayers >>



Itamar Zimerman Reposted

Want to train short and evaluate long? 📚🤖 DeciMamba code is out! Github 👉 github.com/assafbk/DeciMa… Paper 👉 arxiv.org/abs/2406.14528

New Work! 🐍 What prevents Mamba from extrapolating to sequences that are significantly longer than those it was trained on? Furthermore, can Mamba solve long-range NLP tasks using short-range training only? 🧵🧵🧵



NEW! 📰📢 What are Mamba's length generalization capabilities? What limits them? How can we unlock their potential for real-world long NLP tasks? In a very fun work led by Assaf @abk_tau, we dive deep into these questions! arxiv.org/abs/2406.14528

New Work! 🐍 What prevents Mamba from extrapolating to sequences that are significantly longer than those it was trained on? Furthermore, can Mamba solve long-range NLP tasks using short-range training only? 🧵🧵🧵
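
A hedged, conceptual sketch of one way to "train short, evaluate long" with a selective SSM: score tokens by the model's own selectivity signal (Mamba's per-token Δ, where a small Δ means the token barely updates the state) and decimate the least important ones so the effective context stays near the training length. This illustrates the decimation idea only; it is not the DeciMamba code:

```python
import torch

def decimate_tokens(hidden, delta, max_len):
    """hidden: (B, L, D) token states entering an SSM layer.
    delta:  (B, L) per-token selectivity values used as importance scores.
    Keeps the max_len highest-scoring tokens, in their original order."""
    B, L, D = hidden.shape
    if L <= max_len:
        return hidden
    keep = delta.topk(max_len, dim=1).indices.sort(dim=1).values
    return hidden.gather(1, keep.unsqueeze(-1).expand(-1, -1, D))

hidden = torch.randn(2, 1024, 16)
delta = torch.rand(2, 1024)
short = decimate_tokens(hidden, delta, max_len=256)  # (2, 256, 16)
```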



Itamar Zimerman Reposted

Why do state-space models work so well? With @orvieto_antonio, we study their learning dynamics and find that diagonal recurrence is key:
1. It helps in better conditioning the loss landscape.
2. It facilitates Adam's job by making the Hessian diagonal.
📝 arxiv.org/abs/2405.21064

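A small, hedged experiment one could run to see the second claim on a toy problem: with a diagonal recurrence, the loss Hessian with respect to the recurrent parameters has (numerically) no off-diagonal mass, while a dense recurrence does. This is an illustrative setup of my own, not the paper's analysis:

```python
import torch
from torch.autograd.functional import hessian

torch.manual_seed(0)
T, N = 10, 3
x, y = torch.randn(T, N), torch.randn(T, N)

def off_diag_fraction(loss_fn, params):
    H = hessian(loss_fn, params)
    off = H - torch.diag(torch.diag(H))
    return (off.abs().sum() / H.abs().sum()).item()

def loss_dense(a_flat):  # h_t = A h_{t-1} + x_t with dense A
    A, h, L = a_flat.reshape(N, N), torch.zeros(N), 0.0
    for t in range(T):
        h = h @ A.T + x[t]
        L = L + ((h - y[t]) ** 2).sum()
    return L

def loss_diag(a):        # h_t = a * h_{t-1} + x_t with diagonal recurrence
    h, L = torch.zeros(N), 0.0
    for t in range(T):
        h = a * h + x[t]
        L = L + ((h - y[t]) ** 2).sum()
    return L

print(off_diag_fraction(loss_dense, 0.5 * torch.randn(N * N)))  # clearly > 0
print(off_diag_fraction(loss_diag, 0.5 * torch.randn(N)))       # ~ 0
```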

Itamar Zimerman Reposted

🧵 Exciting news from our latest research on interpretability of LMs and fairness! Our paper "Natural Language Counterfactuals through Representation Surgery" introduces a novel approach to interpreting representation-level interventions and mitigating biases in language models. 👇
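
As a simple illustration of the general family of representation-level interventions the thread refers to (this is plain linear concept erasure, a standard baseline, not the paper's Representation Surgery method), one can remove the component of each hidden state along an estimated concept direction:

```python
import torch

def erase_direction(H, labels):
    """H: (n, d) hidden representations; labels: (n,) binary concept labels.
    Projects out the mean-difference direction between the two classes."""
    v = H[labels == 1].mean(0) - H[labels == 0].mean(0)
    v = v / v.norm()
    return H - (H @ v)[:, None] * v, v

H = torch.randn(200, 16)
labels = (torch.rand(200) > 0.5).long()
H_clean, v = erase_direction(H, labels)
print((H_clean @ v).abs().max())  # ~0: the concept direction is removed
```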


Itamar Zimerman Reposted

Introducing Menteebot, a groundbreaking humanoid robot. We're proud to unveil Menteebot, the culmination of a two-year journey by our brilliant team: a humanoid robot designed for versatility. Visit our website menteebot.com for more demos.

