SONG, Hao; FLACH, Peter. Efficient and Robust Model Benchmarks with Item Response Theory and Adaptive Testing. International Journal of Interactive Multimedia and Artificial Intelligence, [S. l.], v. 6, n. 5, p. 110–118, 2021. DOI: 10.9781/ijimai.2021.02.009. Available at: https://www.ijimai.org/index.php/ijimai/article/view/708. Accessed: 18 Nov. 2025.