Siddhant @siddhantsingh56 Twitter Profile

Siddhant

@siddhantsingh56

Developer

97Posts 31Followers 501Following

Siddhant Reposted

Raj Dabre

@prajdabre1

8 Nov

LLM simps be like

Siddhant Reposted

Yesterday, an engineer asked me how DSPy's MIPROv2 optimizer works. He'd read the code but found it overwhelming. Referring him to the paper helped, but he still lacked an intuition for MIPRO. I then remembered this series of cool animations by Michael. Made all the difference!

Michael Ryan

@michaelryan207

21 Jun

MIPROv2, our new state-of-the-art optimizer for LM programs, is live in DSPy @stanfordnlp! It's even faster, cheaper, and more accurate than MIPRO. MIPROv2 proposes instructions, bootstraps demonstrations, and optimizes combinations. Let’s dive into a visual 🧵of how it works!

Siddhant Reposted

Vik Paruchuri

@VikParuchuri

8 Oct

Announcing Surya Table Recognition! It uses a new architecture to outperform table transformer, the current SoTA open source model. - Recognizes table rows, columns, and cells - Works with complex layouts and rotated tables - Supports any language - Runs locally

Siddhant Reposted

Jeremy Nguyen ✍🏼 🚢

@JeremyNguyenPhD

6 Oct

A prompt that helps Claude 3.5 Sonnet beat OpenAI's o1 model in reasoning!

Philipp Schmid

@_philschmid

6 Oct

Can @AnthropicAI Claude 3.5 sonnet outperform @OpenAI o1 in reasoning? Combining Dynamic Chain of Thoughts, reflection, and verbal reinforcement, existing LLMs like Claude 3.5 Sonnet can be prompted to increase test-time compute and match reasoning strong models like OpenAI o1.…

Siddhant Reposted

Abhinaw Ratan

@Ratanabhinaww

2 Oct

localhost:5173 needs a new house! 🏠👨‍💻 My beautiful journey as a developer @ISC_money has come to an end. Grateful for the experience and excited for what's next! Thread incoming on my ISC adventure 🧵🪡 #CareerMoves #DeveloperLife #ISC #CryptoJobs (1/7)

Siddhant Reposted

Greg Kamradt

@GregKamradt

1 Oct

How structured outputs work under the hood (via breakout at OpenAI DevDay) Guess why the first structured output request is slow, but the 2nd+ is fast? Engineering: * Unconstrained token decoding isn't good. The model could pick any token. * Limiting which tokens can be…

Siddhant Reposted

Wenting Zhao

@wzhao_nlp

26 Sep

Introducing the commit0 interactive environment for coding agents. Challenge: generate Python libraries from scratch. Commit0 is designed with interactivity, dependencies, and specifications as first-class considerations. We include a benchmark with 50+ challenging libraries.

Siddhant Reposted

Santiago

@svpino

26 Sep

More data makes RAG applications worse, not better. Relying on vector similarity search doesn't scale, and most people aren't talking about this. This is counterintuitive, but experiments don't lie: As you add more documents to a vector-based RAG system, its retrieval…

Siddhant Reposted

Santiago

@svpino

24 Sep

A goldmine of tutorials about Generative AI Agents! You'll find anything Agents-related in this repository. From simple explanations to the most advanced topics. Star this repo from @NirDiamantAI: github.com/NirDiamant/Gen… The content is organized in the following categories:…

Siddhant

@siddhantsingh56

20 Sep

YAML >> JSON for LLM input. Interesting read betterprogramming.pub/yaml-vs-json-w…

YAML vs. JSON: Which Is More Efficient for Language Models?

Source: https://t.co/n0UMcY8XHA

Siddhant Reposted

Sandeep Srinivasa

@sandeepssrin

12 Sep

Dear Indian founders incorporating in the US, Half a dozen people have now pinged to ask me about this. So I'm posting here to reduce the number of phone calls. here's collective wisdom of a few hundred YC India startups ok the correct company structure. h/t to Anand of @inklehq…

Siddhant Reposted

Asankhaya Sharma

@asankhaya

26 Aug

We recently worked with @OpenAI to fine-tune gpt-4o to build the SOTA model for static analysis eval. All the code and data on how they did it is available on their GitHub - github.com/openai/build-h… More details on the static analysis eval benchmark are available on…

Siddhant

@siddhantsingh56

31 Jul

Hey AI Devs!! Going to integrate semantic cache on production this weekend. Gathering insights on GPTcache, Redis semantic cache and canonical AI sem cache. Let me know your thoughts

Siddhant Reposted

Tom Dörr

@tom_doerr

21 Jul

There's a 303k GitHub repo just listing free APIs

Siddhant Reposted

andrew gao

@itsandrewgao

13 May

Clip of real time conversation with GPT4-o running on ChatGPT app NEW: Instead of just turning SPEECH to text, GPT-4o can also understand and label other features of audio, like BREATHING and EMOTION. Not sure how this is expressed in the model response. #openai

andrew gao

@itsandrewgao

13 May

OpenAI announcements 🧵 megathread!!

Siddhant Reposted

virat

@virattt

11 May

I am diving into LLM fine-tuning. There is a lack of deep tech content on fine-tuning: • how it works • why it works • what it does to an LLM, etc. There is a ton of high-level stuff, however. I want to grok the first principles of fine-tuning. If you have an excellent…

Siddhant Reposted

Matt Shumer

@mattshumer_

17 Apr

How do I detect when someone is done speaking when transcribing with something like Whisper? cc @batwood011

Siddhant Reposted

Connor Shorten

@CShorten30

1 Apr

Unfortunately, Large Language Models will not consistently follow the instructions that you give them. This is a massive problem when you are building AI systems that require a particular type of output from the previous step to feed into the next one! For example, imagine you…