Shital Shah
@sytelusMostly research and code. If universe is an optimizer, what is its loss function? All opinions are my own.
Similar User
@polynoamial
@abacaj
@SchmidhuberAI
@dchaplot
@ml_hardware
@tengyuma
@FelixHill84
@ZoubinGhahrama1
@jefrankle
@DavidSHolz
@ShayneRedford
@woj_zaremba
@OfirPress
@xinw_ai
@_rockt
Phi-3 14B model from our team is available now! This was trained with 512 H100s on 4.8T tokens achieving MMLU of 78 (comparable with Llama3 70B!!). huggingface.co/microsoft/Phi-…
Elections are (hopefully) over and we all can use some cooling down. But you know what else can use some cooldown? Your LR schedule! I wrote note about this last year and now things are becoming very real. Some people are calling it "WSD schedule" while others are calling it…
Just learned something very cool about LR schedules. This one is so huge it surprises me that it's not in its own paper but rather tucked away. Problem: Most training use cosine/linear decays but this requires specifying number of steps in advance. This is quite troublesome. 🧵
PSA: Flossing strings from popular in-store brands contains plastics and other forever-chemicals! The solution is silk based biodegradable products. youtu.be/V-8oKejN9EE
From AI Frontiers, Yadong++ have released Omniparser (microsoft.github.io/OmniParser/) which parses screens better than vision models. The code is open source and the model is on hugging face huggingface.co/microsoft/Omni… .
United States Trends
- 1. Tyson 481 B posts
- 2. $MAYO 12,9 B posts
- 3. #wompwomp 5.195 posts
- 4. Pence 57,9 B posts
- 5. Kiyan Anthony 10,5 B posts
- 6. Debbie 32,5 B posts
- 7. Kash 97,6 B posts
- 8. Whoopi 104 B posts
- 9. Iron Mike 20,5 B posts
- 10. The FBI 259 B posts
- 11. Dora 24,1 B posts
- 12. Ronaldo 181 B posts
- 13. Mike Rogers 19,5 B posts
- 14. Connor Williams 1.390 posts
- 15. #LetsBONK 14,2 B posts
- 16. Gabrielle Union 2.146 posts
- 17. Cuse 1.398 posts
- 18. Shu Shu 26,8 B posts
- 19. Per CNN 3.882 posts
- 20. #FursuitFriday 16,8 B posts
Who to follow
-
Noam Brown
@polynoamial -
anton
@abacaj -
Jürgen Schmidhuber
@SchmidhuberAI -
Devendra Chaplot
@dchaplot -
Abhi Venigalla
@ml_hardware -
Tengyu Ma
@tengyuma -
Felix Hill
@FelixHill84 -
Zoubin Ghahramani
@ZoubinGhahrama1 -
Jonathan Frankle
@jefrankle -
David
@DavidSHolz -
Shayne Longpre
@ShayneRedford -
Wojciech Zaremba
@woj_zaremba -
Ofir Press
@OfirPress -
Xin Wang
@xinw_ai -
Tim Rocktäschel
@_rockt
Something went wrong.
Something went wrong.