Vaibhavi Gangwar @vaibhavi0601 Twitter Profile

Vaibhavi Gangwar

@vaibhavi0601

Building @getmaximai, previously @google

15Posts 174Followers 158Following

Similar User

@suyash666

@Smitgupta

@surajku89969190

@aimar09

Vaibhavi Gangwar Reposted

Maxim AI

@getmaximai

11 Oct

🧵 Red Teaming in AI Red teaming is a critical practice in AI safety. It involves testing LLMs to find vulnerabilities, such as generating content that violates norms, policies, and rules during their safety training. Red teaming typically involves experts manually probing…

Vaibhavi Gangwar Reposted

Maxim AI

@getmaximai

8 Oct

Exciting news! @ManavSinghal157 from our team is presenting his work on NoFunEval, a benchmark evaluating code LMs on non-functional requirements, at @COLM_conf Connect with Manav to discuss evaluations and code LMs! #COLM24

Manav Singhal

@ManavSinghal157

4 Oct

Really excited to be presenting our work NoFunEval (benchmark evaluating code LMs on non-functional requirements) at @COLM_conf Drop by our Poster 4 on Tuesday 8th from 4:30pm. Hit me up if you want to catch up to discuss more about evaluations or code LMs!

Vaibhavi Gangwar Reposted

Akshay Deo

@akshay_deo

29 Aug

@ManavSinghal157 and @curiousZeedX are cooking up something cool for @getmaximai We're tackling the complexity of agentic workflows head-on, focusing on streamlining build and evaluation processes. Our goal? Making continuous evaluation a breeze.

Vaibhavi Gangwar Reposted

Akshay Deo

@akshay_deo

23 Aug

Phew! I just wrapped up our monthly recap, and wow - we've been busy (so busy that I skipped writing these for a few months)! 🚀. We've shipped a ton of cool stuff lately. 💪

Vaibhavi Gangwar Reposted

Maxim AI

@getmaximai

14 Aug

🧵Understanding Human and LLM Preferences 1/ 🧠 In a study, researchers analyzed preferences from both humans and 32 different LLMs, using real-world user-model conversations. This fine-grained, scenario-wise analysis revealed some compelling insights. #LLMs #AI #ChatGPT

Vaibhavi Gangwar Reposted

Maxim AI

@getmaximai

9 Jul

Evalutaion Nugget 🤔 1/6 📷 Exploring multi-vector rerankers like ColBERT: These models blend bi-encoder efficiency with cross-encoder depth. For instance, ColBERT precomputes document representations but enriches query-document interaction during similarity computation. #AI…

Vaibhavi Gangwar Reposted

Maxim AI

@getmaximai

26 Jun

Since Anthropic released Claude 3.5, there has been a constant buzz on Twitter. We have constantly seen arguments - GPT 4o this, Claude 3.5 Sonnet that... We ran experiments on our internal benchmarks. The details of each experiment are in the blog. blog.getmaxim.ai/claude-3-5-son…

Claude 3.5 Sonnet put to the test

Source: https://t.co/5rg5maQZwg

Vaibhavi Gangwar Reposted

Maxim AI

@getmaximai

21 Jun

We have been trying out @AnthropicAI B Claude 3.5 Sonnet internally and seeing impressive results.🙌 As we share more findings, sending a virtual hug to all the AI engineers🫡

Vaibhavi Gangwar Reposted

Maxim AI

@getmaximai

19 Jun

📢1/7 Today, we are thrilled to announce the general availability of the Maxim AI platform getmaxim.ai. Since starting Maxim last year, we have been moving at an aggressive pace to empower AI developers to ship their products with speed and confidence. We are…

The GenAI evaluation and observability platform

Source: https://t.co/fX4MxICc4v

Vaibhavi Gangwar Reposted

Ashish Agrawal

@dvbydt

19 Jun

Congratulations @vaibhavi0601 @akshay_deo, @ElevCap and the entire @getmaximai team!

Elevation Capital

@ElevCap

19 Jun

We are super excited to partner with @getmaximai as @vaibhavi0601 and @akshay_deo announce the fundraise and general availability launch of their enterprise-grade evaluation and observability platform, setting new standards for AI application development. With the recent…

Vaibhavi Gangwar Reposted

Elevation Capital

@ElevCap

19 Jun

Vaibhavi Gangwar Reposted

Elevation Capital

@ElevCap

20 Jun

🔈 Investment Memo With the rapid growth of generative AI, the need for robust testing frameworks has never been more critical. @vaibhavi0601 (VG) and @akshay_deo's deep expertise in AI and developer tools, gained from their tenure at Google and Postman, gives us immense…