Paper: arxiv.org/abs/2503.07358
Code: github.com/yiqingxyq/Re...
Work led by Yiqing Xie, with Alex Xie, Divyanshu Sheth, Pengfei Liu, @daniel-fried.bsky.social and @carolynrose.bsky.social
@daniel-fried.bsky.social
Assistant prof at LTI CMU; Research scientist at Meta AI. Working on NLP: language interfaces, applied pragmatics, language-to-code, grounding. https://dpfried.github.io/
Paper: arxiv.org/abs/2503.07358
Code: github.com/yiqingxyq/Re...
Work led by Yiqing Xie, with Alex Xie, Divyanshu Sheth, Pengfei Liu, @daniel-fried.bsky.social and @carolynrose.bsky.social
2) RepoST.
We automatically create executable environments from real GitHub repos, allowing us to train and evaluate models for function generation in real-world contexts.
Presenting at the CODEML workshop on Fri Jul 18th.
Also accepted to COLM, upcoming!
Paper: arxiv.org/abs/2409.07429
Code: github.com/zorazrw/agen...
Work led by @zorazrw.bsky.social, with Jiayuan Mao and @gneubig.bsky.social
1) Agent Workflow Memory.
Allow agents to adapt online to carry out new tasks more accurately by inducing workflows for common sub-tasks.
Today (Wed 7/17): 4:30-7pm. West Exhibition Hall B2-B3 W-202):
Also at the CUA workshop, morning of Sat 7/19.
Excited to be presenting two of our papers at #ICML2025 and workshops, today through Saturday! Topics are memory for agents, and constructing coding environments for training & evaluation. See links below:
16.07.2025 18:30 β π 1 π 0 π¬ 1 π 0Happy to announce the first workshop on Pragmatic Reasoning in Language Models β PragLM @ COLM 2025! π
How do LLMs engage in pragmatic reasoning, and what core pragmatic capacities remain beyond their reach?
π sites.google.com/berkeley.edu/praglm/
π
Submit by June 23rd
Congrats Lucy!!
10.05.2025 20:11 β π 4 π 0 π¬ 0 π 0Wisconsin-Madison's tree-filled campus, next to a big shiny lake
A computer render of the interior of the new computer science, information science, and statistics building. A staircase crosses an open atrium with visibility across multiple floors
I'm joining Wisconsin CS as an assistant professor in fall 2026!! There, I'll continue working on language models, computational social science, & responsible AI. π²π§π£π»ββοΈ Apply to be my PhD student!
Before then, I'll postdoc for a year in the NLP group at another UW ποΈ in the Pacific Northwest
Inaugurating new acct to share work from my PhD student!
Wayne et al have been running a live eval platform Copilot Arena - a VSCode extension serving code completions from AI systems to real developers. See π§΅ for findings and preprint
Excited to be evaluating human-AI *workflows* holistically!
What if AI agents did software engineering like humansβseeing the screen & using any developer tool?
Introducing Programming with Pixels: an SWE environment where agents control VSCode via screen perception, typing & clicking to tackle diverse tasks.
programmingwithpixels.com
π§΅
Interested in knowing more about LLMs agents and in contributing to this topic?π
π’We're thrilled to announce REALM: The first Workshop for Research on Agent Language Models π€ #ACL2025NLP in Vienna π»
We have an exciting lineup of speakers
ποΈ Submit your work by *March 1st*
@aclmeeting.bsky.social
Congrats Mohit!!
15.01.2025 17:07 β π 6 π 0 π¬ 1 π 0Thrilled to announce our new work TestGenEval, a benchmark that measures unit test generation and test completion capabilities. This work was done in collaboration with the FAIR CodeGen team.
Preprint: arxiv.org/abs/2410.00752
Leaderboard: testgeneval.github.io/leaderboard....
CMU LTI is hosting predoc interns this summer, centered around "Language Technologies for All"! Please apply and circulate! lti.cs.cmu.edu/news-and-eve...
07.01.2025 22:42 β π 19 π 8 π¬ 1 π 0You can execute each generated function on a set of possible inputs to the function, group the functions according to the outputs, then choose the largest group: arxiv.org/abs/2204.11454 and Sec 4.6 of arxiv.org/abs/2203.07814, although I'm not sure what was done in these plots
06.01.2025 04:07 β π 7 π 0 π¬ 0 π 0So sorry to hear this, what a loss - such a kind and fun guy and his work is so creative.
02.01.2025 23:53 β π 1 π 0 π¬ 0 π 0Announcement #1: our call for papers is up! π
colmweb.org/cfp.html
And excited to announce the COLM 2025 program chairs @yoavartzi.com @eunsol.bsky.social @ranjaykrishna.bsky.social and @adtraghunathan.bsky.social