Pavan Kapanipathi @pavankaps Twitter Profile

Pavan Kapanipathi

@pavankaps

Researcher at IBM Research (Views are my own)

Joined May 2009

830 Posts, 465 Followers, 766 Following

Pavan Kapanipathi Reposted

Harsha Kokel

@harsha_kokel

1 Nov

🚨 New Dataset Alert 🚨 We introduce ACP Bench, a question-answering-style dataset that evaluates an AI model's ability to reason about Action, Change, and Planning. Check out 🔗 ibm.github.io/ACPBench/ 📄 arxiv.org/abs/2410.05669

Pavan Kapanipathi Reposted

Prasanna Sattigeri

@prasatti

24 Oct

We released best-in-class Apache 2.0 licensed models for detecting general harm and RAG hallucinations as part of the Granite 3.0 release! Read more: linkedin.com/pulse/ibm-open… Documentation: ibm.com/granite/docs/m… Hugging Face: huggingface.co/collections/ib… Try them out!

armand

@armand_ruiz

21 Oct

At IBM we just released Granite 3.0. It is not just another LLM; it's a suite of AI tools designed specifically for the enterprise's needs. These new tools are designed to scale GenAI to lower cost, govern it, and speed up innovation. It is: - Fit-for-purpose - Transparent with…

Granite Guardian Models - a ibm-granite Collection

Source: https://t.co/Y6QwYWHnPI

Pavan Kapanipathi Reposted

Avi Sil

@aviaviavi__

23 Oct

Announcing "@IBM SWE-Agent 1.0", from my team @IBMResearch , the first SWE-Agent built only on top of open-source models while achieving competitive performance (23.7%) compared to frontier LLM-agents on SWE-Bench. More details in this blog: ibm.biz/ibm_swe

Pavan Kapanipathi Reposted

David Cox

@neurobongo

21 Oct

🎉Today, we're pleased to announce the release of the Granite 3.0 model family, the latest open-licensed, general purpose LLMs from @IBM 🎉 These have been a labor of love for my team at @IBMResearch, working closely with a host of collaborators across the company. We're excited…

Pavan Kapanipathi Reposted

Yikang Shen

@Yikang_Shen

21 Oct

Granite 3.0 is our latest update for the IBM foundation models. The 8B and 2B models outperform strong competitors with similar sizes. The 1B and 3B MoE use only 400M and 800M active parameters to target the on-device use cases. Our technical report provides all the details you…

Pavan Kapanipathi Reposted

Gaurav Pandey

@gauravpandeyamu

2 May

Minimizing forward KL w.r.t. the PPO-optimal policy (proceedings.neurips.cc/paper_files/pa…) doesn't perform as well for RLHF as PPO and DPO. Or does it? In our ICML paper (arxiv.org/abs/2402.02479), we show that it actually performs much better if an appropriate baseline is chosen.

Pavan Kapanipathi Reposted

Sara Rosenthal

@seirasto

1 May

Are you building and evaluating RAG systems? Presenting InspectorRAGet (arxiv.org/abs/2404.17347), a platform for easily analyzing overall performance, with instance-level analysis, comprehensive metrics, support for multiple models, and more!

Pavan Kapanipathi Reposted

Luis Lamb

@luislamb

26 Feb

@RealAAAI nuclear-workshop.github.io workshop Neuro-Symbolic Learning and Reasoning in the Era of Large Language Models @GaryMarcus talk on “No AGI without Neurosymbolic AI.” @asimunawar @AvilaGarcez @frossi_t

Pavan Kapanipathi Reposted

AK

@_akhaliq

26 Feb

IBM presents API-BLEND A Comprehensive Corpora for Training and Benchmarking API LLMs There is a growing need for Large Language Models (LLMs) to effectively use tools and external Application Programming Interfaces (APIs) to plan and complete tasks. As such, there is…

Pavan Kapanipathi Reposted

Jerry Liu

@jerryjliu0

6 Feb

Self-RAG in @llama_index We’re excited to feature Self-RAG, a special RAG technique where an LLM can do self-reflection for dynamic retrieval, critique, and generation (@AkariAsai et al.). It’s implemented in @llama_index as a custom query engine with…

LlamaIndex 🦙

@llama_index

6 Feb

A big downside of top-k RAG is its static nature. Self-RAG (@AkariAsai et al.) trains an LLM to do dynamic retrieval through self-reflection. This allows the LLM to 1) only perform retrieval if needed through a retrieval token, 2) generate/critique/filter retrieved outputs, and…

Pavan Kapanipathi Reposted

Ramon Astudillo

@RamonAstudill12

9 May 2023

We are releasing version `v0.5.4` of the transition-amr-parser. Now with document-level AMR parsing, installable from PyPI, shipped with trained checkpoints and SoTA performance. github.com/IBM/transition…

GitHub - IBM/transition-amr-parser: SoTA Abstract Meaning Representation (AMR) parsing with...

Source: https://t.co/lidm3zk6na

Pavan Kapanipathi Reposted

Dario Gil

@dariogila

9 May 2023

We can all agree we’re at a unique and evolutionary moment in AI, with enterprises increasingly turning to this technology’s transformative power to unlock new levels of innovation and productivity. At #Think2023, @IBM unveiled watsonx. Learn more: newsroom.ibm.com/2023-05-09-IBM…

Pavan Kapanipathi Reposted

Avi Sil

@aviaviavi__

27 Feb 2023

If you're using GPT-3 or any other LLM, read this: 1. Don't want it to hallucinate? 2. Need attribution for generated answers? 3. Have access to proprietary data that you want to index yourself and generate answers from? Use PrimeQA! We added "retrieve" and "read" mode.🧵

Pavan Kapanipathi Reposted

Yann LeCun

@ylecun

12 Feb 2023

Good article on LLMs at Forbes. The media are starting to agree with my much-criticized statements about LLMs. "LLMs as they exist today will never replace Google Search. Why not? In short, because today’s LLMs make stuff up." forbes.com/sites/robtoews…

The Next Generation Of Large Language Models

Source: https://t.co/Cngp2Zyisq

Pavan Kapanipathi Reposted

Payel Das

@payel791

24 Jan 2023

Happy to see that our chemical language foundation model, MoLFormer, is highlighted in @NatComputSci. In addition to showing competitive performance in standard prediction benchmarks, it also shows first-of-a-kind emergent behavior with scaling, e.g. learning of geometry and taste.

Nature Computational Science

@NatComputSci

23 Jan 2023

Finally, we highlight a @NatMachIntell paper by @payel791 and colleagues on a large-scale transformer-based language model that enables the encoding of spatial information in molecules. nature.com/articles/s4358… 👉rdcu.be/c31BO

Pavan Kapanipathi Reposted

Asim Munawar

@asimunawar

25 Jan 2023

Join us today for a very exciting 3rd day of the IBM Neuro-Symbolic AI Workshop 2023. Day 3 is all about NLP, large language models like #ChatGPT, and what we can expect from future models. Day 3 talks by @GaryMarcus @alkoller Kathy McKeown @hhexiy #ArtificialIntelligence #IBM

Asim Munawar

@asimunawar

5 Jan 2023

I am very excited to invite you to IBM Neuro-Symbolic AI Workshop 2023 (23-27 Jan, 9 am-12 pm ET). This is the 2nd workshop of the series. Register for free at: ibm.biz/nsworkshop2023 #ai #ibmresearch

Pavan Kapanipathi Reposted

Luis Lamb

@luislamb

23 Jan 2023

@IBMResearch workshop on Neurosymbolic AI on the way. Alex Gray opening and @vardi on Deep Learning and Deep Reasoning - Neurosymbolic Reasoning. @frossi_t @asimunawar @GaryMarcus @AvilaGarcez @guyvdb

Pavan Kapanipathi Reposted

Kush Varshney कुश वार्ष्णेय

@krvarshney

11 Oct 2022

.@ArvindKrishna summarizes our efforts at @IBM and @IBMResearch on responsible and trustworthy AI in this video. This is precisely what I have the privilege to work on every day. youtube.com/watch?v=gdAVw1…

Pavan Kapanipathi Reposted

Avi Sil

@aviaviavi__

30 Sep 2022

PrimeQA now has collaborators & code contributions from @stanfordnlp, @osunlp, @NotreDame , @LTIatCMU, @uiuc_nlp , @UMassAmherst, @Uni_Stuttgart & many more on the way bringing in their best Question Answering (QA) models to advance the research in QA. What are you waiting for?

Pavan Kapanipathi Reposted

Raghava Mutharaju

@mraghava

29 Aug 2022

Two more days to go!!! Looking forward to your submissions. cc: @pavankaps @sbhatia_ @pascalhitzler @ejimenez_ruiz @nandanamihindu

Gunjan Singh

@GunjanS12

26 Aug 2022

5 more days to go for participating in the ISWC 2022 Semantic Reasoning Evaluation Challenge #SemREC. You can participate by submitting your (hard to reason over) ontologies and neuro-symbolic reasoners semrec.github.io @mraghava @pavankaps @iswc_conf #iswc2022