FML-Bench: A Controlled Study of AI Research Agent Strategies

https://arxiv.org/abs/2605.17373
