Caiming Xiong

@CaimingXiong

VP of AI at @Salesforce Research: AI for CRM, AI for good.

Similar Users

Yejin Choi
@YejinChoinka

Jacob Andreas
@jacobandreas

Wei Xu
@cocoweixu

Zhou Yu
@Zhou_Yu_AI

Victor Zhong
@hllo_wrld

Sam Bowman
@sleepinyourhat

UW NLP
@uwnlp

Sebastian Riedel (@riedelcastro@sigmoid.social)
@riedelcastro

Sean (Xiang) Ren
@xiangrenNLP

Kai-Wei Chang
@kaiwei_chang

Sewon Min
@sewon__min

Thang Luong
@lmthang

Tejas Kulkarni
@tejasdkulkarni

Danqi Chen
@danqi_chen

He He
@hhexiy
Caiming Xiong Reposted

Introducing GIFT-Eval: A groundbreaking AI benchmark for time series forecasting models! 🌐 GIFT-Eval offers 28 diverse AI datasets, over 144,000 time series, and 177M data points, enabling fair and robust evaluation of models across domains, frequencies, and prediction horizons.…

Building agents to solve CRM tasks? How do you test them in a realistic environment? Excited to announce CRMArena, a benchmark for enterprise LLM agents to navigate real-world business challenges! CRMArena offers nine top-classes of tasks on three personas in complex business…

🚀 Exploring the Wild West of AI in Business🤠 🔥 Introducing CRMArena - a work-oriented benchmark for LLM agents to prove their mettle in real-world business scenarios! CRMArena features nine distinct tasks within a complex business environment filled with rich and realistic…

Caiming Xiong Reposted

🍅Excited to see @AnthropicAI using 🚀our OSWorld🚀 (NeurIPS'24) to benchmark computer use! 🍋OSWorld will soon support parallel cloud running, much faster! 🍓More big open-source multimodal agent projects coming soon from @XLangNLP in Nov, stay tuned! 👇 os-world.github.io

Introducing an upgraded Claude 3.5 Sonnet, and a new model, Claude 3.5 Haiku. We’re also introducing a new capability in beta: computer use. Developers can now direct Claude to use computers the way people do—by looking at a screen, moving a cursor, clicking, and typing text.

xGen-MM-Vid (BLIP-3-Video)

📢📢📢Introducing xGen-MM-Vid (BLIP-3-Video)! This highly efficient multimodal language model is laser-focused on video understanding. Compared to other models, xGen-MM-Vid represents a video with a fraction of the visual tokens (e.g., 32 vs. 4608 tokens). Paper:…



Data & Benchmarks truly are the fuel powering our AI innovations! Very excited to share that five of our papers have been accepted at NeurIPS 2024 D&B track! I'm so proud to have contributed to these groundbreaking projects: 1. Consent in Crisis: The Rapid Decline of the AI…


🧨🧨 We release the Fineweb-deduplicated dataset.

👇UPDATED DATASET👇 Fineweb training dataset just got leaner! We've tackled the ~70% duplication issue in this valuable 93.4TB dataset. Same great data, now more efficient and cost-effective. bit.ly/3XI3wlB #AIResearch #DataEfficiency



Caiming Xiong Reposted

Large Language Model Agents are the next frontier. Really excited to announce our Berkeley course on LLM Agents, also available for anyone to join as a MOOC, starting Sep 9 (Mon) 3pm PT! 📢 Sign up & join us: llmagents-learning.org
