Beyond BLEU: Ethical Risks of Misleading Evaluation in Domain-Specific QA with LLMs

Auteur

Ayoub Nainia, Régine Vignes-Lebbe, Hajar Mousannif, Jihad Zahir

Manifestation

First Workshop on Comparative Performance Evaluation: From Rules to Language Models associated with RANLP 2025

Date

2025-09-01

Organisation

Varna, Bulgaria

Chercheur

MOUSANNIF Hajar

Voir le profil du chercheur →