Caiming Xiong @CaimingXiong Twitter Profile

Caiming Xiong

@CaimingXiong

VP of AI at @Salesforce Research: AI for CRM, AI for good.

895 Posts 6K Followers 439 Following

Similar Users

@YejinChoinka

@jacobandreas

@cocoweixu

@Zhou_Yu_AI

@hllo_wrld

@sleepinyourhat

@uwnlp

@riedelcastro

@xiangrenNLP

@kaiwei_chang

@sewon__min

@lmthang

@tejasdkulkarni

@danqi_chen

@hhexiy

Caiming Xiong Reposted

Marc Benioff

@Benioff

13 Nov

Introducing GIFT-Eval: A groundbreaking AI benchmark for time series forecasting models! 🌐 GIFT-Eval offers 28 diverse AI datasets, over 144,000 time series, and 177M data points, enabling fair and robust evaluation of models across domains, frequencies, and prediction horizons.…

Caiming Xiong

@CaimingXiong

6 Nov

Building agents to solve CRM tasks? How do you test them in a realistic environment? Excited to announce CRMArena, a benchmark for enterprise LLM agents to navigate real-world business challenges! CRMArena offers nine top-classes of tasks on three personas in complex business…

Kung-Hsiang Steeve Huang

@steeve__huang

5 Nov

🚀 Exploring the Wild West of AI in Business🤠 🔥 Introducing CRMArena - a work-oriented benchmark for LLM agents to prove their mettle in real-world business scenarios! CRMArena features nine distinct tasks within a complex business environment filled with rich and realistic…

Caiming Xiong Reposted

Tao Yu

@taoyds

22 Oct

🍅Excited to see @AnthropicAI using 🚀our OSWorld🚀 (NeurIPS'24) to benchmark computer use! 🍋OSWorld will soon support parallel cloud running, much faster! 🍓More big open-source multimodal agent projects coming soon from @XLangNLP in Nov, stay tuned! 👇 os-world.github.io

Anthropic

@AnthropicAI

22 Oct

Introducing an upgraded Claude 3.5 Sonnet, and a new model, Claude 3.5 Haiku. We’re also introducing a new capability in beta: computer use. Developers can now direct Claude to use computers the way people do—by looking at a screen, moving a cursor, clicking, and typing text.

Caiming Xiong

@CaimingXiong

22 Oct

xGen-MM-Vid (BLIP-3-Video)

Salesforce AI Research

@SFResearch

22 Oct

📢📢📢Introducing xGen-MM-Vid (BLIP-3-Video)! This highly efficient multimodal language model is laser-focused on video understanding. Compared to other models, xGen-MM-Vid represents a video with a fraction of the visual tokens (e.g., 32 vs. 4608 tokens). Paper:…

Caiming Xiong

@CaimingXiong

27 Sep

Data & Benchmarks truly are the fuel powering our AI innovations! Very excited to share that five of our papers have been accepted at NeurIPS 2024 D&B track! I'm so proud to have contributed to these groundbreaking projects: 1. Consent in Crisis: The Rapid Decline of the AI…

Caiming Xiong

@CaimingXiong

25 Sep

🧨🧨We release the Fineweb-deduplicated dataset.

Salesforce AI Research

@SFResearch

25 Sep

👇UPDATED DATASET👇Fineweb training dataset just got leaner! We've tackled the ~70% duplication issue in this valuable 93.4TB dataset. Same great data, now more efficient and cost-effective. bit.ly/3XI3wlB #AIResearch #DataEfficiency

Caiming Xiong Reposted

Dawn Song

@dawnsongtweets

6 Sep

Large Language Model Agents are the next frontier. Really excited to announce our Berkeley course on LLM Agents, also available for anyone to join as a MOOC, starting Sep 9 (Mon) 3pm PT! 📢 Sign up & join us: llmagents-learning.org