A staircase in the new School of Computer, Data & Information Sciences building at Wisconsin Madison. Tan wood structures surround tapestry art and a small indoor garden.
A view from above of the staircases in the Wisconsin CDIS building
An shot from below of winding wooden staircases and a glass atrium rooftop. The new School of Computer, Data & Information Sciences building at Wisconsin Madison.
A bicolor white cat with seal-colored markings, looking upwards with big wide dark eyes.
It's the season for PhD apps!! π₯§ π¦ βοΈ βοΈ
Apply to Wisconsin CS to research
- Societal impact of AI
- NLP ββ CSS and cultural analytics
- Computational sociolinguistics
- Human-AI interaction
- Culturally competent and inclusive NLP
with me!
lucy3.github.io/prospective-...
11.11.2025 22:32 β π 31 π 9 π¬ 0 π 0
#COLM2025 was one of my favorite conferences -- a really high fraction of interesting papers and people, but small enough to see everything!
Thank you to the organizers for putting it together!
13.10.2025 00:40 β π 17 π 1 π¬ 0 π 1
How good are LLMs at π scientific computing and visualization π?
AstroVisBench tests how well LLMs implement scientific workflows in astronomy and visualize results.
SOTA models like Gemini 2.5 Pro & Claude 4 Opus only match ground truth scientific utility 16% of the time. π§΅
02.06.2025 15:41 β π 10 π 2 π¬ 1 π 4
Love this!
02.06.2025 15:25 β π 1 π 0 π¬ 0 π 0
Current Continuation
Weβve started a podcast! @awsto.bsky.social and @samps.phd host βCurrent Continuation,β a little interview series with PL researchers. The first two episodes are with @ranjitjhala.bsky.social and @satnam6502.bsky.social. sigplan.org/cc/
02.06.2025 15:19 β π 24 π 11 π¬ 2 π 0
Congratulations Kanishka!
02.06.2025 15:24 β π 1 π 0 π¬ 1 π 0
Picture of the UT Tower taken by me on my first day at UT as a postdoc in 2023!
NewsποΈ
I will return to UT Austin as an Assistant Professor of Linguistics this fall, and join its vibrant community of Computational Linguists, NLPers, and Cognitive Scientists!π€
Excited to develop ideas about linguistic and conceptual generalization (recruitment details soon!)
02.06.2025 13:18 β π 66 π 8 π¬ 12 π 2
Evaluating language model responses on open-ended tasks is hard! π€
We introduce EvalAgent, a framework that identifies nuanced and diverse criteria πβοΈ.
EvalAgent identifies π©βπ«π expert advice on the web that implicitly address the userβs prompt π§΅π
22.04.2025 15:04 β π 22 π 5 π¬ 1 π 2
CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation
C-to-Rust transpilation is essential for modernizing legacy C code while enhancing safety and interoperability with modern Rust ecosystems. However, no dataset currently exists for evaluating whether ...
π Read the full paper:
CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation
arxiv.org/abs/2504.15254
Dataset: github.com/anirudhkhatr...
w/ @robertzhang.bsky.social , Jia Pan, @zetten.bsky.social, @jqchen.bsky.social, @gregdnlp.bsky.social, @idillig.bsky.social.
π§΅[6/6]
23.04.2025 17:00 β π 3 π 1 π¬ 0 π 1
Models often fail to:
1. Respect ownership rules
2. Infer type information
3. Follow idiomatic Rust interfaces
4. Preserve correct lifetimes
In the paper, we provide a taxonomy of common LLM mistakes.
π§΅[5/6]
23.04.2025 17:00 β π 2 π 0 π¬ 1 π 0
We evaluate state-of-the-art closed-source LLMs (like o1, Claude-3.7, and Gemini-1.5-Pro), open-source models like QwQ-32B and virtuoso-32B, and the SWE-Agent on CRUST-Bench.
Even the best modelβOpenAI's o1βpasses only 15/100 tasks in a single-shot setting.
π§΅[4/6]
23.04.2025 17:00 β π 2 π 0 π¬ 1 π 0
Our benchmark is the first to provide:
1. Rust tests
2. Rust interfaces, which are necessary for the transpiled code to work with the tests
3. A sizable number of real-scale transpilation problems.
π§΅[3/6]
23.04.2025 17:00 β π 3 π 0 π¬ 1 π 0
Transpiling C to Rust helps modernize legacy code with memory safety guarantees. CRUST-Bench evaluates whether transpilation methods yield safe, idiomatic Rust, using handcrafted interfaces and tests to ensure safety and validate correctness.
π§΅[2/6]
23.04.2025 17:00 β π 2 π 0 π¬ 1 π 0
πMeet CRUST-Bench, a dataset for C-to-Rust transpilation for full codebases π οΈ
A dataset of 100 real-world C repositories across various domains, each paired with:
π¦ Handwritten safe Rust interfaces.
π§ͺ Rust test cases to validate correctness.
π§΅[1/6]
23.04.2025 17:00 β π 17 π 5 π¬ 1 π 1
A bit of a mess around the conflict of COLM with the ARR (and to lesser degree ICML) reviews release. We feel this is creating a lot of pressure and uncertainty. So, we are pushing our deadlines:
Abstracts due March 22 AoE (+48hr)
Full papers due March 28 AoE (+24hr)
Plz RT π
20.03.2025 18:20 β π 37 π 31 π¬ 3 π 2
The Allen Institute for AI
Come work with me!
We are looking to bring on more top talent to our language modeling workstream at @ai2.bsky.social building the open ecosystem. We are hiring:
* Research scientists
* Senior research engineers
* Post docs (Young investigators)
* Pre docs
job-boards.greenhouse.io/thealleninst...
25.02.2025 01:07 β π 56 π 15 π¬ 4 π 0
πJob adπ We (@gregdnlp.bsky.social, @mattlease.bsky.social and I) are hiring a postdoc fellow within the CosmicAI Institute, to do galactic work with LLMs and generative AI! If you would like to push the frontiers of foundation models to help solve myths of the universe, please apply!
25.02.2025 22:09 β π 13 π 7 π¬ 0 π 3
encourage postdocs to apply π
@soldaini.net, myself and others from @ai2.bsky.social have been helping in project & also learning a ton---continued pretraining, creating domain-specific training data & evals---to build foundation models that scientists can use. promising area for open source LMs!
25.02.2025 23:24 β π 9 π 2 π¬ 0 π 0
three things are certain in life: death, taxes, and Claude switching to concise mode during US business hours
10.02.2025 17:17 β π 38 π 2 π¬ 2 π 0
Kudos to Usneek Singh. It was a pleasure to collaborate on this paper with the amazing folks at PROSE!
30.01.2025 05:09 β π 2 π 1 π¬ 0 π 0
@ayushkhaitan.bluesky.social, Amitayush Thakur, and I are organizing an #AI4Math panel at the Joint Mathematics Meeting this month. Please spread the word among your math friends! We will post a summary of the discussion after the event.
04.01.2025 03:08 β π 6 π 1 π¬ 0 π 0
Huge congrats to @prasannsinghal.bsky.social for being one of the 8 CRA Outstanding Undergraduate Researcher Award winners! It has been an absolute privilege to work with Prasann during his time at UT. (And he's applying for PhD programs this year...hint hint...)
Prasann's work π§΅
03.01.2025 14:37 β π 23 π 4 π¬ 1 π 0
@andersmoeller.bsky.social and I are co-chairing OOPSLA'26 and soliciting PC nominations. If you'd like to serve on the OOPSLA PC next year or know anyone (e.g., recent graduate) who you think would do a good job, please nominate them here: forms.gle/NVnzjcmbshoL...
23.12.2024 20:43 β π 21 π 13 π¬ 2 π 0
The legendary Putnam math competition had its 85th edition yesterday. Coincidentally, George Tsoukalas will present our paper on PutnamBench, a next-generation #AI4Math benchmark, at #NeurIPS2024 this week: arxiv.org/abs/2407.11214.
If you work on frontier AI for math/reasoning, talk to George!
08.12.2024 20:03 β π 15 π 3 π¬ 0 π 0
I'll be at #NeurIPS2024 w/
- @fcyin.bsky.social's LoFiT: using interp to improve fine-tuning (Weds pm poster & MINT spotlight talk Sun)
- @thomlake.bsky.social's analysis of Overton pluralism (Pluralistic alignment Sat)
Please reach out to me to chat about interp, factuality, reasoning, &c!
08.12.2024 20:38 β π 46 π 8 π¬ 1 π 1
Excited to visit Columbia next week!
22.11.2024 02:30 β π 14 π 1 π¬ 1 π 0
I did a starter pack of ML/AI people at @utaustin.bsky.social Please distribute and feel free to self nominate!
go.bsky.app/QLQznZg
22.11.2024 09:25 β π 27 π 8 π¬ 2 π 1
Yay!!! Iβm one of the cool people!
16.11.2024 17:47 β π 10 π 0 π¬ 0 π 0
We got an π₯ Outstanding Paper Award!! Cannot be more grateful π₯Ή This is super validating for our long pursuit of computational work on QUD.
Congrats to the amazing @yatingwu.bsky.social, Ritika Mangla, Alex Dimakis, @gregdnlp.bsky.social
15.11.2024 13:12 β π 60 π 9 π¬ 1 π 0
Image of the linked website listing EMNLP paper titles, authors, and locations
I won't be at EMNLP, but come and see:
π Detecting factual errors from LLMs (Liyan Tang)
π οΈ Detect, critique, & refine pipeline (Manya Wadhwa and Lucy Zhao)
π Synthetic data generation (Abhishek Divekar)
π Fact-checking (Aniruddh Sriram) at FEVER
t.co/fQbl0G7m23
(1st real post in the bluer skies!)
13.11.2024 03:46 β π 22 π 4 π¬ 1 π 0
Research in NLP (mostly LM interpretability & explainability).
Assistant prof at UMD CS + CLIP.
Previously @ai2.bsky.social @uwnlp.bsky.social
Views my own.
sarahwie.github.io
Machine Learning Librarian at @hf.co
Researcher in Machine Learning and Genetics. Here to explore projects in ALife and machine learning - particularly interested in self organising systems and interpretability! (he/him)
Compilers at Igalia. @llvmweekly.org author. Mostly RISC-V, LLVM, and a little WebAssembly. Previously lowRISC CTO and co-founder. Blogs at https://muxup.com
phd @ mit, research @ genlm, intern @ apple
https://benlipkin.github.io/
Undergrad at UT Austin in CS and Linguistics
AAAI is an artificial intelligence organization dedicated to advancing the scientific understanding of AI.
[bridged from https://aaai.org/ on the web: https://fed.brid.gy/web/aaai.org ]
San Diego Dec 2-7, 25 and Mexico City Nov 30-Dec 5, 25. Comments to this account are not monitored. Please send feedback to townhall@neurips.cc.
The 2025 Conference on Language Modeling will take place at the Palais des Congrès in Montreal, Canada from October 7-10, 2025
International Conference on Learning Representations https://iclr.cc/
Weβre an international software company that helps people and organisations use #OCaml to build safer & faster code. #OCaml #FunctionalProgramming #MirageOS π«
The Association for Computational Linguistics (ACL) is a scientific and professional organization for people working on Natural Language Processing/Computational Linguistics.
Hash tags: #NLProc #ACL2025NLP
A feed of interesting AI / math / formal methods papers. Posts by @m-dodds.bsky.social
Breakthrough AI to solve the world's biggest problems.
βΊ Join us: http://allenai.org/careers
βΊ Get our newsletter: https://share.hsforms.com/1uJkWs5aDRHWhiky3aHooIg3ioxm
Postdoc at Northeastern and incoming Asst. Prof. at Boston U. Working on NLP, interpretability, causality. Previously: JHU, Meta, AWS
Assistant Professor CS @ Ithaca College. Computational Linguist interested in pragmatics & social aspects of communication.
venkatasg.net
Grad student @UTAustin ECE | Previously: AI Healthcare @AmritaHospitals | LLMs, RAG systems, AI Safety and Alignment | Building trustworthy AI |
https://www.linkedin.com/in/aadharsh-aadhithya-9a6982149/
Assistant Professor of Linguistics, and Harrington Fellow at UT Austin. Works on computational understanding of language, concepts, and generalization.
πΈοΈποΈ: https://kanishka.website
PhD @ucberkeleyofficial.bsky.social | Past: AI4Code Research Fellow @msftresearch.bsky.social | Summer @EPFL Scholar, CS and Applied Maths @IIITDelhi | Hobbyist Saxophonist
https://lakshyaaagrawal.github.io
Maintainer of https://aka.ms/multilspy