Erika Cardenas @ecardenas300 Twitter Profile

Erika Cardenas

@ecardenas300

Partnerships @weaviate_io | Diary about agents, LLM frameworks, and vector databases 🤪

4KPosts 4KFollowers 880Following

Similar User

@jessie_groot

@etiennedi

@qdrant_engine

@ekzhang1

@tuanacelik

@weaviate_io

@ZainHasan6

@lateinteraction

@lucia_auth

@abidlabs

@jobergum

@bobvanluijt

@reach_vb

@CerebrasSystems

@Avra_b

Pinned

Erika Cardenas

@ecardenas300

26 Jun

LLM + Memory + Planning + Tools = Agents 🤖 Last month, Job and I discussed how generative AI is shifting how companies offer customer support. How can we add more layers to our RAG apps to make it more agentic? LLM: Large language model alone Memory: Short-term and long-term…

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

11 h

Adding only a single AI agent to your RAG pipeline can already make a huge difference in performace. By replacing the retrieval component with an retrieval agent, you can supercharge your vanilla RAG pipeline. A retrieval agent can: 1. Decide whether to retrieve data at all 2.…

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

14 Nov

Massive News from Chatbot Arena🔥 @GoogleDeepMind's latest Gemini (Exp 1114), tested with 6K+ community votes over the past week, now ranks joint #1 overall with an impressive 40+ score leap — matching 4o-latest in and surpassing o1-preview! It also claims #1 on Vision…

Logan Kilpatrick

@OfficialLoganK

14 Nov

gemini-exp-1114…. available in Google AI Studio right now, enjoy : ) aistudio.google.com

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

14 Nov

🚀 Big News: AutoGen is Now AG2! 🚀 We’re evolving! With support from the #OSS community, AutoGen is becoming #AG2, a new home for next-gen agentic #AI. Same mission, bigger goals. → Repo: github.com/ag2ai/ag2 ⭐️ → Docs: ag2ai.github.io/ag2 → Same Discord:…

AutoGen | AutoGen

Source: https://t.co/BZ51AQWKSl

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

14 Nov

the new frontier: AI agent hosting/serving 👾🛸 the AI/LLM agents stack is a significant departure from the standard LLM stack. the key difference between the two lies in managing state: LLM serving platforms are generally stateless, whereas agent serving platforms need to be…

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

13 Nov

The Agentic RAG party continues! 🎉 I am SUPER EXCITED to publish the 109th Weaviate Podcast with Erika Cardenas (@ecardenas300)! Erika, in collaboration with Leonie Monigatti (@helloiamleonie), have recently published "What is Agentic RAG"! This blog has even been covered in…

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

12 Nov

Battle of the RAGs 🤺 We put Agentic RAG and Vanilla RAG to the test in answering questions about Weaviate. How the pipelines differ: Vanilla RAG: Simple retrieve, augment, and generate pipeline Agentic RAG: The LLM controls its own search strategy in a function calling loop…

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

13 Nov

Dense but excellent. I literally have Spotify on my phone just for this series.

Connor Shorten

@CShorten30

13 Nov

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

13 Nov

YouTube: youtube.com/watch?v=Eh4uQq… Spotify: podcasters.spotify.com/pod/show/weavi…

Agentic RAG with Erika Cardenas - Weaviate Podcast #109! by Weaviate Podcast

Source: https://t.co/B6Uwn7Xvnb

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

8 Nov

I love you @CShorten30 for making a video for JSON mode based on “Let Me Speak Freely” paper! ❤️ Must Watch! 🔥

Connor Shorten

@CShorten30

7 Nov

JSON mode has been one of the biggest enablers for working with Large Language Models! JSON mode is even expanding into Multimodal Foundation models! But how exactly is JSON mode achieved? 🛠️ There are generally 3 paths to JSON mode: 🗺️ 1. Constrained generation (such as…

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

6 Nov

Swapping out RAG for Agentic RAG Systems 🤖 Agents differ from the standard retrieve and generate because of their access to tools, memory, and planning. Why is this game-changing? It allows us to build systems with complete autonomy to reason and execute specific tools when…

Leonie

@helloiamleonie

5 Nov

Goodbye, vanilla RAG. Hello, Agentic RAG! 𝗩𝗮𝗻𝗶𝗹𝗹𝗮 𝗥𝗔𝗚 The common vanilla RAG implementation processed the user query through a retrieval and generation pipeline to generate a response grounded in external knowledge. Advanced vanilla RAG techniques include e.g.,…

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

7 Nov

On Nov 19th 👉 Erika Cardenas @weaviate_io will be at our NYC event to share methods for optimizing agent systems. We also have experts from @Google @priceline @llama_index & AutoGen. Discover best practices for iterating on agent behavior through continuous evaluation and…

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

7 Nov

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

7 Nov

If interested, here is some research from the Weaviate team on this topic: StructuredRAG (Paper) - arxiv.org/abs/2408.11061 StructuredRAG (Repo) - github.com/weaviate/struc…

GitHub - weaviate/structured-rag: Experimental Code for StructuredRAG: Structured Outputs in...

Source: https://t.co/dW4emrorAQ

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

7 Nov

Long Context RAG Performance of Large Language Models Databricks analyzes 20 LLMs and reveals that only recent state-of-the-art models maintain consistent RAG accuracy above 64k tokens, with most models' performance declining at longer contexts. arxiv.org/abs/2411.03538

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

7 Nov

Phew… there are sooo many resources on RAG! It’s overwhelming 😩✊ Thankfully, my amazing colleagues ❤️ from @weaviate_io wrote down everything you need to know about Retrieval Augmented Generation! We have two great blog posts. The first one, by Mary, is all about the basics…

Erika Cardenas Reposted

Erika Cardenas

@ecardenas300

7 Nov

What makes an agentic system different from vanilla RAG? It’s access to memory and external tools. The building blocks of an agentic RAG system are: • LLM (with a role and a task) • Memory (short-term and long-term) • Planning (e.g., reflection, self-critics, query routing,…