@nlopitz Profile picture

juri

@nlopitz

Researcher @UZH_en

Joined December 2019
Similar User
Pratik Joshi photo

@Roprajo

Anette Frank photo

@AnetteMFrank

Heidelberg University NLP Group photo

@HD_NLP

Dustin Wright photo

@dustin_wright37

Zhuosheng Zhang photo

@zhangzhuosheng

CIS, LMU Munich photo

@CisLmu

Barbara Plank photo

@barbara_plank

Patrick Lewis photo

@PSH_Lewis

Yushi Hu photo

@huyushi98

Dennis Ulmer 🦋 photo

@dnnslmr

Kai Zhang photo

@DrogoKhal4

Shruti Rijhwani photo

@shrutirij

Bill Yuchen Lin 🤖 photo

@billyuchenlin

Qinyuan Ye photo

@qinyuan_ye

Yu Su ✈️ #NeurIPS2024 photo

@ysu_nlp

Pinned

Should I use Macro F1 or Accuracy? Why not Kappa? Why do some use this, and others that? What's actually evaluated here? 😵‍💫 Happy to share the final version of this paper on multi-class classification evaluation: direct.mit.edu/tacl/article/d… #machinelearning #nlproc #ml


juri Reposted

There are also papers I came across that just says "F1" without saying whether it is "micro" or "macro" or something else. I stopped taking the actual results by face value and browse through generally to get a broader picture of methods these days.


juri Reposted

Interesting and worrying paper on meta-eval of classifiers metrics (direct.mit.edu/tacl/article/d…). Eg, some papers use "macro F1" to refer to arithmetic mean, others use it for harmonic mean, others dont say what they mean by Macro F1


40,000 Downloads! Happy to see that Julius' super-efficient text factuality checker has seen a decent amount of usage in the last months! Free on Huggingface: huggingface.co/juliussteen/De… ACL paper: aclanthology.org/2023.acl-short… @juliusmsteen 👀 #nlproc #machinelearning


juri Reposted

Looking for an emergency reviewer for ARR submission 🚨 The paper is on commonsense reasoning. If you have reviewed any *ACL conference and interested, please DM me. The review needs to be submitted in next 24 hours. #NLProc


juri Reposted

💡This cool #EMNLP2024 paper on semantic parsing also shows why you should NOT use a heuristic when evaluating a system. #nlproc #machinelearning #datascience #evaluation #statistics aclanthology.org/2024.emnlp-mai…

Tweet Image 1

juri Reposted

✨New Paper✨ The Mystery of Compositional Generalization in Graph-based Generative Commonsense Reasoning #EMNLP2024 📜 aclanthology.org/2024.findings-… 🖼️ Poster Session F (Riverfront Hall), Nov 14 @ 10:30

Tweet Image 1

juri Reposted

I am looking for emergency reviewers for missing reviews in the "Low-resourced and Less Studied Languages" track at #COLING2025. Please let me know if you can help! #nlproc


juri Reposted

Lazy Twitter: Do you know of any corpora or wordlists in Scandinavian languages (Danish, Norwegian, Swedish, Icelandic) with CEFR labels? #education #linguistics #nlp #PhDlife #phd


It doesn't seem much, but I am actually proud of this: I never missed a reviewing DL so far! Probably this won't hold at some point, but still... Remember: If you miss a review DL by a few days it's usually not a big deal, but think of dropping a note to ACs or editors.


juri Reposted

PhD then: “We validate our method on a large-scale dataset of hundreds of images.” PhD now: “We validate our method on 10 different modalities, 15 domains, 20 scenarios, 25 tasks, and 200 languages, each with millions of testing examples”.


New blogpost: Funny evaluation quirks and pitfalls! Also included: A few tips on how to achieve a more meaningful and robust evaluation! juriopitz.com/2024/10/17/eva… #nlproc #machinelearning

Tweet Image 1

Very interesting work by Fodor et al 2024! direct.mit.edu/coli/article/d… #nlproc #machinelearning

Tweet Image 1

juri Reposted

First time at the #TPDL 2024 conference (tpdl2024.nuk.si) in beautiful #Ljubljana presenting work together with the #Fotostiftung #Graubünden about #multimodal #LLMs for #OCR, #storytelling and #NER (to appear in link.springer.com/book/978303172…). #archives #DigitalHumanities

Tweet Image 1
Tweet Image 2

How to evaluate when the class distribution is imbalanced? I wrote a paper on this problem, maybe it helps to inform the selection process 🙂direct.mit.edu/tacl/article/d…

Which of the following metrics would you prioritize for a highly imbalanced classification problem?



juri Reposted

Explainability can also be obtaied with static embeddings 🙂 See my blogpost juriopitz.com/2024/04/04/exp…

Tweet Image 1

juri Reposted

Matryoshka embeddings are really cool! With similar objectives in mind, we partition an embedding into features that are interpretable, each binding a different aspect of text. Different low-dimensional subsets can then be selected for different tasks. arxiv.org/abs/2206.07023


Loading...

Something went wrong.


Something went wrong.