juri @nlopitz Twitter Profile

juri

@nlopitz

Researcher @UZH_en

Joined December 2019

532Posts 520Followers 368Following

Similar User

@Roprajo

@AnetteMFrank

@HD_NLP

@dustin_wright37

@zhangzhuosheng

@CisLmu

@barbara_plank

@PSH_Lewis

@huyushi98

@dnnslmr

@DrogoKhal4

@shrutirij

@billyuchenlin

@qinyuan_ye

@ysu_nlp

Pinned

juri

@nlopitz

3 Jul

Should I use Macro F1 or Accuracy? Why not Kappa? Why do some use this, and others that? What's actually evaluated here? 😵‍💫 Happy to share the final version of this paper on multi-class classification evaluation: direct.mit.edu/tacl/article/d… #machinelearning #nlproc #ml

juri Reposted

Sowmya Vajjala

@adyantalamadhya

25 Nov

There are also papers I came across that just says "F1" without saying whether it is "micro" or "macro" or something else. I stopped taking the actual results by face value and browse through generally to get a broader picture of methods these days.

juri Reposted

Ehud Reiter

@EhudReiter

25 Nov

Interesting and worrying paper on meta-eval of classifiers metrics (direct.mit.edu/tacl/article/d…). Eg, some papers use "macro F1" to refer to arithmetic mean, others use it for harmonic mean, others dont say what they mean by Macro F1

juri

@nlopitz

22 Nov

40,000 Downloads! Happy to see that Julius' super-efficient text factuality checker has seen a decent amount of usage in the last months! Free on Huggingface: huggingface.co/juliussteen/De… ACL paper: aclanthology.org/2023.acl-short… @juliusmsteen 👀 #nlproc #machinelearning

juri Reposted

Debjit Paul

@DebjitPaul2

20 Nov

Looking for an emergency reviewer for ARR submission 🚨 The paper is on commonsense reasoning. If you have reviewed any *ACL conference and interested, please DM me. The review needs to be submitted in next 24 hours. #NLProc

juri Reposted

juri

@nlopitz

11 Nov

💡This cool #EMNLP2024 paper on semantic parsing also shows why you should NOT use a heuristic when evaluating a system. #nlproc #machinelearning #datascience #evaluation #statistics aclanthology.org/2024.emnlp-mai…

juri Reposted

Xiyan Fu

@XiyanFu

12 Nov

✨New Paper✨ The Mystery of Compositional Generalization in Graph-based Generative Commonsense Reasoning #EMNLP2024 📜 aclanthology.org/2024.findings-… 🖼️ Poster Session F (Riverfront Hall), Nov 14 @ 10:30

juri Reposted

Sina Ahmadi

@sina_ahm

25 Oct

I am looking for emergency reviewers for missing reviews in the "Low-resourced and Less Studied Languages" track at #COLING2025. Please let me know if you can help! #nlproc

juri Reposted

Joseph Imperial

@josephimperial_

24 Oct

Lazy Twitter: Do you know of any corpora or wordlists in Scandinavian languages (Danish, Norwegian, Swedish, Icelandic) with CEFR labels? #education #linguistics #nlp #PhDlife #phd

juri

@nlopitz

25 Oct

It doesn't seem much, but I am actually proud of this: I never missed a reviewing DL so far! Probably this won't hold at some point, but still... Remember: If you miss a review DL by a few days it's usually not a big deal, but think of dropping a note to ACs or editors.

juri Reposted

Jia-Bin Huang

@jbhuang0604

17 Oct

PhD then: “We validate our method on a large-scale dataset of hundreds of images.” PhD now: “We validate our method on 10 different modalities, 15 domains, 20 scenarios, 25 tasks, and 200 languages, each with millions of testing examples”.

juri

@nlopitz

17 Oct

New blogpost: Funny evaluation quirks and pitfalls! Also included: A few tips on how to achieve a more meaningful and robust evaluation! juriopitz.com/2024/10/17/eva… #nlproc #machinelearning

juri

@nlopitz

7 Oct

Very interesting work by Fodor et al 2024! direct.mit.edu/coli/article/d… #nlproc #machinelearning

juri Reposted

Phillip B. Ströbel

@phillipstroebel

25 Sep

First time at the #TPDL 2024 conference (tpdl2024.nuk.si) in beautiful #Ljubljana presenting work together with the #Fotostiftung #Graubünden about #multimodal #LLMs for #OCR, #storytelling and #NER (to appear in link.springer.com/book/978303172…). #archives #DigitalHumanities

juri

@nlopitz

9 Sep

How to evaluate when the class distribution is imbalanced? I wrote a paper on this problem, maybe it helps to inform the selection process 🙂direct.mit.edu/tacl/article/d…

Deeksha

@deekshas24

9 Sep

Which of the following metrics would you prioritize for a highly imbalanced classification problem?

juri Reposted

juri

@nlopitz

5 Sep

Explainability can also be obtaied with static embeddings 🙂 See my blogpost juriopitz.com/2024/04/04/exp…

juri Reposted

juri

@nlopitz

3 Sep

Matryoshka embeddings are really cool! With similar objectives in mind, we partition an embedding into features that are interpretable, each binding a different aspect of text. Different low-dimensional subsets can then be selected for different tasks. arxiv.org/abs/2206.07023