Lexin Zhou
@lexin_zhouIncoming intern at @MSFTResearch | CS Alumnus at Cambridge | Work on Evaluation, Social Computing, Safety and NLP | Looking for 25Fall PhD position
1/ New paper @Nature! Discrepancy between human expectations of task difficulty and LLM errors harms reliability. In 2022, Ilya Sutskever @ilyasut predicted: "perhaps over time that discrepancy will diminish" (youtu.be/W-F7chPE9nU, min 61-64). We show this is *not* the case!
Interested in #FactChecking ? Come today to our #EMNLP2024 workshop @FEVERworkshop to hear invited talks by the excellent @kristahopsalong @radamihalcea @PCunliffeJones and chris.bregler.com , and learn about our shared task on the AVERITEC dataset fever.ai/workshop.html
Youmna @YoumnaH presenting An LLM Feature-based Framework for Dialogue Constructiveness Assessment arxiv.org/abs/2406.14760 (led by @lexin_zhou ) on Tuesday
Looking for a Postdoc position and wanna make an impact on developing new robust evaluation frameworks for general-purpose AI systems like LLMs? Please apply to this amazing position at TU Valencia working with Prof. José Hernández-Orallo (josephorallo.webs.upv.es)! He's a legend,…
A snippet of the work we did over the summer during my internship with MSR FATE Montreal—we highlight the need for more work investigating the impacts of anthropomorphic AI systems, which is critical to understanding the impact of genAI systems: arxiv.org/pdf/2410.08526
Read the arXiv version before. Definitely thought-provoking and recommend it. Congrats @katie_m_collins!
What does it take to build AI systems that meet our expectations and complement our limitations? Our Perspective on thought partners which engage deeply with computational cognitive science is now out in @NatureHumBehav ! nature.com/articles/s4156…
What happens as chatbots get "bigger and better"? They become more likely to generate wrong answers than to admit ignorance (and sadly people aren't good at spotting the bad answers). Read more in this @nature paper nature.com/articles/s4158…
United States Trends
- 1. Mike 1,76 Mn posts
- 2. Serrano 231 B posts
- 3. #NetflixFight 67,9 B posts
- 4. Canelo 14,9 B posts
- 5. #netflixcrash 14,8 B posts
- 6. Father Time 10,6 B posts
- 7. Logan 74,1 B posts
- 8. Rosie Perez 13,9 B posts
- 9. Shaq 14,9 B posts
- 10. #buffering 10,6 B posts
- 11. Boxing 276 B posts
- 12. ROBBED 98,6 B posts
- 13. He's 58 20,4 B posts
- 14. My Netflix 79,7 B posts
- 15. Roy Jones 6.843 posts
- 16. Tori Kelly 4.885 posts
- 17. Ramos 69,7 B posts
- 18. Cedric 20,9 B posts
- 19. Gronk 6.400 posts
- 20. Barrios 50,1 B posts
Something went wrong.
Something went wrong.