Ashwin Devaraj @AshwinDevaraj3 Twitter Profile

Ashwin Devaraj

@AshwinDevaraj3

Training LLMs at @snowflakedb Ex-@neeva, Math+CS at UT Austin Fan of mountain hiking, singing, and other surrogate activities

26Posts 188Followers 228Following

Similar User

@tanyaagoyal

@prasann_singhal

@xiye_nlp

@LiyanTang4

@ForBo7_

@brunchavecmoi

@sidilu_pluslab

@anuj_diwan

@YatingWu96

@yasumasa_onoe

@hungting_chen

@gkambhat

@albertyu101

@ManyaWadhwa1

@zhang_shujian

Ashwin Devaraj

@AshwinDevaraj3

8 Jun

There's no way this paragraph is written by a human alone. It's a very informative survey though

Ashwin Devaraj Reposted

3rd week in a row, 3rd LLM from @SnowflakeDB ... Arctic-TILT is a 800M model that has GPT-4 quality performance on information extraction tasks, as measured by the DocVQA benchmark. And it fits in an A10!

Snowflake

@SnowflakeDB

2 May

Snowflake’s Arctic-TILT model, powering our Document, Al beats GPT-4 with just 0.8B parameters, securing a top spot in the standard benchmark for document understanding DocQVA.

Ashwin Devaraj Reposted

Nathan Wiegand

@nathanwiegand

26 Apr

We just published our next blog post in the Arctic Cookbook series about how we generated and managed our training data for Arctic. Up next, we'll talk about getting the most from your hardware. medium.com/snowflake/snow…

Ashwin Devaraj Reposted

Samyam Rajbhandari

@samyamrb

25 Apr

To maximize #SnowflakeArctic's training throughput, we optimized at all levels of the system stack from developing custom cuda kernels to co-designing the model architecture with the system to enable communication overlap. @Reza_LOD offers a glimpse into these optimizations.

Reza

@Reza_LOD

25 Apr

1/4 Have you wondered how to optimize sys-perf for training Arctic-like models (MoE arch)? Let’s dive in! Our first technique: custom fused kernels. By crafting these kernels, we streamline irregular and sparse operators, boosting efficiency. #SnowflakeArctic #SystemOptimization

Ashwin Devaraj Reposted

andrew gao

@itsandrewgao

24 Apr

Good morning: @SnowflakeDB’s new 480B parameter #LLM is made of 128 experts! It’s bigger than #Grok and is now the largest *fully open source (Apache 2.0* LLM! 🧵👇 how does it compare to Llama 3, Mixtral, and GPT4?

Ashwin Devaraj Reposted

Zhewei Yao

@yao_zhewei

24 Apr

1/n What are the benefits of MoE? Our study shows that MoE models can achieve better quality with less compute. In fact, our MoE-1.6B model outperformed a 6.5B dense model while requiring at least 4x less compute to train! Read on for more findings on MoE ablations 🧵

Ashwin Devaraj

@AshwinDevaraj3

25 Apr

What a roller coaster the past few months have been! I'm excited and grateful for the opportunity to collaborate with such a badass team. Stay tuned for more updates - blog posts, model improvements, and more...

sridhar

@RamaswmySridhar

24 Apr

.@SnowflakeDB is thrilled to announce #SnowflakeArctic: A state-of-the-art large language model uniquely designed to be the most open, enterprise-grade LLM on the market. This is a big step forward for open source LLMs. And it’s a big moment for Snowflake in our #AI journey as…

Ashwin Devaraj Reposted

Snowflake

@SnowflakeDB

24 May 2023

We’re excited to announce Snowflake is acquiring Neeva and its team of talented engineers, to make search even more intelligent at scale across the #DataCloud. See the blog post for more information: okt.to/DYI5Ue #GenAI #LLM

Snowflake acquires Neeva to advance search in the Data Cloud

Source: https://t.co/gKOEj2nW7h

Ashwin Devaraj Reposted

Neeva

@Neeva

21 Mar 2023

Bard, we're blushing. 🤭

Ashwin Devaraj Reposted

Neeva

@Neeva

17 Mar 2023

This truck is on the move!! 🚚 Have you seen it around the city? 🏙️ Add your shots here 📸⤵️

Ashwin Devaraj Reposted

Neeva

@Neeva

14 Feb 2023

Still waiting on AI search promised by the big guys? Same. We're keeping an eye on them here: 👁️‍🗨️ Google’s Bard isbardavailable.com 👁️‍🗨️ Bing’s AI chatbot isbingaiavailable.com Oh, did we mention NeevaAI is available NOW? No ads. No waitlist. neeva.com

Ashwin Devaraj Reposted

sridhar

@RamaswmySridhar

26 Jan 2023

1/ Ten blue links headed to a museum near you!! @Neeva is applying cutting edge AI to definitely change up the search experience! Not only are we providing real time, cited AI, because we are making the whole search experience a breeze! 🧵

Ashwin Devaraj

@AshwinDevaraj3

23 May 2022

Check out the oral presentation TODAY for our #ACL2022 #acl2022nlp work "Evaluating Factuality in Text Simplification" (done w/ William Sheffield, @jessyjli , and @byron_c_wallace). It's at 5 PM in the Generation 1 section (Wicklow Hall 1)! Please drop by if you're in Dublin!

Ashwin Devaraj

@AshwinDevaraj3

10 May 2022

We're excited to share our #acl2022nlp work on characterizing factual errors in text simplification! We present a new annotation scheme and use it to categorize the kinds of factual errors found in popular simplification datasets and models. arxiv.org/abs/2204.07562 (1/2)

Ashwin Devaraj Reposted

Jessy Li

@jessyjli

13 May 2022

Our work w/ @AshwinDevaraj3, William Sheffield, and @byron_c_wallace on factuality & simplification has been selected as an Outstanding Paper at ACL 2022! #acl2022nlp