's Avatar

@drimgemp.bsky.social

873 Followers  |  2 Following  |  1 Posts  |  Joined: 22.11.2024  |  1.307

Latest posts by drimgemp.bsky.social on Bluesky

Looking for a principled evaluation method for ranking of *general* agents or models, i.e. that get evaluated across a myriad of different tasks?

I’m delighted to tell you about our new paper, Soft Condorcet Optimization (SCO) for Ranking of General Agents, to be presented at AAMAS 2025! 🧡 1/N

24.02.2025 15:25 β€” πŸ‘ 67    πŸ” 17    πŸ’¬ 1    πŸ“Œ 6

Now in the big blue world!

22.11.2024 19:06 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 1

@drimgemp is following 2 prominent accounts