@ZekunLi0323 Profile picture

Zekun Li on the job market!

@ZekunLi0323

CS Ph.D. student @UCSBNLP, intern at #GoogleGemini, #MSFTResearch, Meta #RealityLabs, interested in #NLProc, #LLM, and #AI4HScience

Similar User
Jie Huang photo

@jefffhj

Zhuosheng Zhang photo

@zhangzhuosheng

Wangchunshu Zhou photo

@wangchunshu

Shizhe Diao photo

@shizhediao

🌴Muhao Chen🌴 photo

@muhao_chen

Hanjie Chen photo

@hanjie_chen

Yu Zhang photo

@yuz9yuz

Ziniu Hu photo

@acbuller

UC Santa Barbara NLP Group photo

@ucsbNLP

Sinclair Wang photo

@SinclairWang1

Xiachong Feng photo

@xc_feng

Tianbao Xie photo

@TianbaoX

Yushi Hu photo

@huyushi98

Kai Zhang photo

@DrogoKhal4

Yulong Chen photo

@Yulongchen1010

Pinned

🚨 Introducing “MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension” arxiv.org/abs/2407.04903 🧐Have current multimodal LLMs achieved PhD-level intelligence across diverse scientific disciplines? Are they ready to become AI scientific assistants?…

Tweet Image 1
Tweet Image 2

Zekun Li on the job market! Reposted

🧑‍💻 Human software engineers constantly re-evaluate their approaches through experience. 🤖 However, LLM-based software agents can often get stuck in ineffective dead ends. Introducing SWE-Search: a multi-agent framework integrating search and self-refinement to enable software…


Zekun Li on the job market! Reposted

Everyone talks about scaling inference compute after o1. But how exactly should we do that? We studied compute allocation for sampling -- a basic operation in most LLM meta-generators, and found that optimized allocation can save as much as 128x compute! arxiv.org/abs/2410.22480

Tweet Image 1
Tweet Image 2
Tweet Image 3

Zekun Li on the job market! Reposted

I am on job market for full-time industry positions. My research focuses on text generation evaluation and LLM alignment. If you have relevant positions, I’d love to connect! Here are list of my publications and summary of my research:


Zekun Li on the job market! Reposted

🚀 Since its invention, the mouse has been our way to control computers. But what if it didn’t have to be? 🤔 Thrilled to introduce Agent S, a new state-of-the-art GUI agent framework that interacts with computers just like a human and takes on the toughest automation challenges.…


Zekun Li on the job market! Reposted

Checkout Generative Hierarchical Materials Search (GenMS) – a framework for generating crystal structures from natural language. Website: generative-materials.github.io Paper: arxiv.org/abs/2409.06762


Zekun Li on the job market! Reposted

I made a multi-agent system for multimodal retrieval and report generation 🎨 - check it out! 👇 Have talked to a lot of users recently that are interested in using agents to build the final document instead of getting back a chatbot response. There's a general feeling that this…

Tweet Image 1

Building a Multimodal Report Generation Agent 🤖✍️🎨 Come learn how to automatically produce a multi-agent system that can do research over a multimodal RAG, compile into a knowledge bank, and generate a multimodal report containing interleaving text and images based on an…

Tweet Image 1


👉 Check out our new paper on injecting misinformation and bias into LLMs via knowledge editing, as a new type of safety threat: editing threat. 🧐We found that: (1) Editing attach can inject both commonsense and long-tail misinformation into LLMs. (2) Editing attack can…

🤔Are your open-source LLMs really safe? 🚨It may be injected with misinformation or bias! Our new paper "𝐂𝐚𝐧 𝐄𝐝𝐢𝐭𝐢𝐧𝐠 𝐋𝐋𝐌𝐬 𝐈𝐧𝐣𝐞𝐜𝐭 𝐇𝐚𝐫𝐦?" (Project website: llm-editing.github.io ) sheds light on the emerging challenges of LLMs, especially the…

Tweet Image 1


Zekun Li on the job market! Reposted

🤔Are your open-source LLMs really safe? 🚨It may be injected with misinformation or bias! Our new paper "𝐂𝐚𝐧 𝐄𝐝𝐢𝐭𝐢𝐧𝐠 𝐋𝐋𝐌𝐬 𝐈𝐧𝐣𝐞𝐜𝐭 𝐇𝐚𝐫𝐦?" (Project website: llm-editing.github.io ) sheds light on the emerging challenges of LLMs, especially the…

Tweet Image 1

Thrilled to announce our work is accepted to #ACL2024 Main! We’re the first to solve zero-shot DST with Function Call/Tool Use, bridging the gap and achieving remarkable performance with both 7b/13b OSS models and GPT-3.5/4. More results coming soon. Code: github.com/facebookresear…

Large Language Models as Zero-shot Dialogue State Tracker through Function Calling Large language models (LLMs) are increasingly prevalent in conversational systems due to their advanced understanding and generative capabilities in general contexts. However, their effectiveness…

Tweet Image 1


I'm pleased to announce that we have released the code at: github.com/facebookresear…. Welcome to explore and play with it. You can see what is tracked and generated during the conversation.

Large Language Models as Zero-shot Dialogue State Tracker through Function Calling Large language models (LLMs) are increasingly prevalent in conversational systems due to their advanced understanding and generative capabilities in general contexts. However, their effectiveness…

Tweet Image 1


Similar observations and insights as discussed in our paper on exploring how various models prioritize system prompts and user instructions and guard against misleading instructions injected in web search results: arxiv.org/abs/2308.10819

Introducing the Instruction Hierarchy, our latest safety research to advance robustness for prompt injections and other ways of tricking LLMs into executing unsafe actions. More details: arxiv.org/abs/2404.13208



Thanks for sharing our work! 🚀 FnCTOD: a method that empowers chat-based LLMs with function-calling abilities, enabling them to handle complex, task-oriented conversations through the appropriate use of tools and API calls (DST, a type of database search API). 🧵 [1/n]

Large Language Models as Zero-shot Dialogue State Tracker through Function Calling Large language models (LLMs) are increasingly prevalent in conversational systems due to their advanced understanding and generative capabilities in general contexts. However, their effectiveness…

Tweet Image 1


Loading...

Something went wrong.


Something went wrong.