Zero-shot Benchmarking: A Framework for Flexible and Scalable Automatic Evaluation of Language Models.

Summary

This is a publication. If there is no link to the publication on this page, you can try the pre-formated search via the search engines listed on this page.

Authors: José Pombal, Nuno M. Guerreiro, Ricardo Rei, André F. T. Martins

Journal title: COLM 2025

Journal publisher: COLM 2025

Published year: 2025

DOI identifier: 10.48550/ARXIV.2504.01001