We built CopilotArena this fall in order to evaluate coding models in realistic, interactive environments.
Check out our recent writeup describing the results, as well as details of the system itself.
Work led by @waynechi.bsky.social and Valerie Chen.
.
05.03.2025 16:54 β
π 2
π 0
π¬ 0
π 0
great to see more specialized ML conferences! Mega conferences are fun, but at least in my experience with MLSys, I've had much better scientific conversations at smaller ones.
11.12.2024 19:51 β
π 4
π 1
π¬ 0
π 0
Excited about L2G, led by Wenduo Cheng. We leverage LLMs to beat genomic FMs and strong supervised baselines on a wide range of benchmarks. L2G uses cross-modal transfer (rather than vanilla fine-tuning), and neural architecture search to learn a genomic-specific embedder model.
11.12.2024 19:36 β
π 6
π 4
π¬ 0
π 0
Can we bypass the resource bottleneck of pretraining genomic Foundation Models? Our work L2G repurposes language LLMs for genomics via cross-modal transfer, matching fine-tuned genomic FMs. Kudos to Wenduo & fantastic collab w/ @atalwalkar.bsky.social. L2G, language to genome; L2G, lifeβs too good!
11.12.2024 13:41 β
π 9
π 3
π¬ 0
π 1
The UC Berkeley Project That Is the AI Industryβs Obsession
Chatbot Arena ranks the worldβs best AI models on a leaderboard based on user voting in head-to-head competitions between bots.
Great writeup on the Chatbot Arena team, including a nice photo of @waynechi.bsky.social's back (in the purple shirt). It's been fun collaborating with this team via CoPilot Arena (blog.lmarena.ai/blog/2024/co...), and I'm super impressed with their hustle!
www.wsj.com/tech/ai/the-...
06.12.2024 19:42 β
π 1
π 0
π¬ 0
π 0
Check out @junhongshen1.bsky.social's blog post describing this project in more detail:
blog.ml.cmu.edu/2024/12/06/s...
06.12.2024 19:37 β
π 0
π 0
π¬ 0
π 0
Excited to share this work! This was a fun project in collaboration with Scribe, and a great example of the power of open-source FMs when coupled with rich domain-specific data!
03.12.2024 21:17 β
π 4
π 1
π¬ 0
π 0
Hi Willie! Could you add me?
03.12.2024 20:22 β
π 2
π 0
π¬ 1
π 0
Could I be added?
03.12.2024 20:21 β
π 1
π 0
π¬ 1
π 0
if you're a PhD student at CMU doing AI/ML, lmk if you want to be added to this starter pack.
(I don't belong in this list, but I don't know how to remove myself from this pack π)
go.bsky.app/9APVxQQ
03.12.2024 18:27 β
π 14
π 3
π¬ 3
π 0