@siddhantsingh56 Profile picture

Siddhant

@siddhantsingh56

Developer

Siddhant Reposted

LLM simps be like

Tweet Image 1

Siddhant Reposted

Yesterday, an engineer asked me how DSPy's MIPROv2 optimizer works. He'd read the code but found it overwhelming. Referring him to the paper helped, but he still lacked an intuition for MIPRO. I then remembered this series of cool animations by Michael. Made all the difference!

MIPROv2, our new state-of-the-art optimizer for LM programs, is live in DSPy @stanfordnlp! It's even faster, cheaper, and more accurate than MIPRO. MIPROv2 proposes instructions, bootstraps demonstrations, and optimizes combinations. Let’s dive into a visual 🧵of how it works!



Siddhant Reposted

Announcing Surya Table Recognition! It uses a new architecture to outperform table transformer, the current SoTA open source model. - Recognizes table rows, columns, and cells - Works with complex layouts and rotated tables - Supports any language - Runs locally

Tweet Image 1
Tweet Image 2
Tweet Image 3

Siddhant Reposted

A prompt that helps Claude 3.5 Sonnet beat OpenAI's o1 model in reasoning!

Tweet Image 1

Can @AnthropicAI Claude 3.5 sonnet outperform @OpenAI o1 in reasoning? Combining Dynamic Chain of Thoughts, reflection, and verbal reinforcement, existing LLMs like Claude 3.5 Sonnet can be prompted to increase test-time compute and match reasoning strong models like OpenAI o1.…

Tweet Image 1


Siddhant Reposted

localhost:5173 needs a new house! 🏠👨‍💻 My beautiful journey as a developer @ISC_money has come to an end. Grateful for the experience and excited for what's next! Thread incoming on my ISC adventure 🧵🪡 #CareerMoves #DeveloperLife #ISC #CryptoJobs (1/7)

Tweet Image 1

Siddhant Reposted

How structured outputs work under the hood (via breakout at OpenAI DevDay) Guess why the first structured output request is slow, but the 2nd+ is fast? Engineering: * Unconstrained token decoding isn't good. The model could pick any token. * Limiting which tokens can be…

Tweet Image 1
Tweet Image 2
Tweet Image 3

Siddhant Reposted

Introducing the commit0 interactive environment for coding agents. Challenge: generate Python libraries from scratch. Commit0 is designed with interactivity, dependencies, and specifications as first-class considerations. We include a benchmark with 50+ challenging libraries.


Siddhant Reposted

More data makes RAG applications worse, not better. Relying on vector similarity search doesn't scale, and most people aren't talking about this. This is counterintuitive, but experiments don't lie: As you add more documents to a vector-based RAG system, its retrieval…

Tweet Image 1

Siddhant Reposted

A goldmine of tutorials about Generative AI Agents! You'll find anything Agents-related in this repository. From simple explanations to the most advanced topics. Star this repo from @NirDiamantAI: github.com/NirDiamant/Gen… The content is organized in the following categories:…

Tweet Image 1

Siddhant Reposted

Dear Indian founders incorporating in the US, Half a dozen people have now pinged to ask me about this. So I'm posting here to reduce the number of phone calls. here's collective wisdom of a few hundred YC India startups ok the correct company structure. h/t to Anand of @inklehq


Siddhant Reposted

We recently worked with @OpenAI to fine-tune gpt-4o to build the SOTA model for static analysis eval. All the code and data on how they did it is available on their GitHub - github.com/openai/build-h… More details on the static analysis eval benchmark are available on…


Hey AI Devs!! Going to integrate semantic cache on production this weekend. Gathering insights on GPTcache, Redis semantic cache and canonical AI sem cache. Let me know your thoughts


Siddhant Reposted

There's a 303k GitHub repo just listing free APIs

Tweet Image 1

Siddhant Reposted

Clip of real time conversation with GPT4-o running on ChatGPT app NEW: Instead of just turning SPEECH to text, GPT-4o can also understand and label other features of audio, like BREATHING and EMOTION. Not sure how this is expressed in the model response. #openai

OpenAI announcements 🧵 megathread!!

Tweet Image 1


Siddhant Reposted

I am diving into LLM fine-tuning. There is a lack of deep tech content on fine-tuning: • how it works • why it works • what it does to an LLM, etc. There is a ton of high-level stuff, however. I want to grok the first principles of fine-tuning. If you have an excellent…


Siddhant Reposted

How do I detect when someone is done speaking when transcribing with something like Whisper? cc @batwood011


Siddhant Reposted

Unfortunately, Large Language Models will not consistently follow the instructions that you give them. This is a massive problem when you are building AI systems that require a particular type of output from the previous step to feed into the next one! For example, imagine you…

Tweet Image 1

United States Trends
Loading...

Something went wrong.


Something went wrong.