SciArena: Evaluating Foundation Models in Scientific Literature Tasks allenai.org 2 points by maxloh 9 hours ago