Swarat Chaudhuri

@swarat.bsky.social

Professor of Computer Science at UT Austin and Visiting Researcher at Google DeepMind, London. Automated Reasoning + Machine Learning + Formal Methods. https://www.cs.utexas.edu/~swarat

1,204 Followers  |  449 Following  |  29 Posts  |  Joined: 13.11.2024  |  2.1539

Latest posts by swarat.bsky.social on Bluesky

AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithms
New AI agent evolves algorithms for math and practical applications in computing by combining the creativity of large language models with automated evaluators

Announcing AlphaEvolve, our new LLM coding agent that has
- made new scientific discoveries
- discovered algorithms that are now deployed at Google (in Gemini, Transformers, TPU hardware design & data centers)

Blog: deepmind.google/discover/blo...
White paper:
storage.googleapis.com/deepmind-med...

14.05.2025 20:11 | 👍 116    🔁 40    💬 5    📌 14

US revokes nearly 1,500 student visas: Who are the targets?
Hundreds of students have had their visas cancelled and find themselves in limbo.

One of my PhD students got their visa revoked. I know of other cases amongst my AI colleagues. This is not what investing in US leadership in AI looks like.

www.aljazeera.com/news/2025/4/...

19.04.2025 04:55 | 👍 60    🔁 23    💬 2    📌 1

Guggenheim Foundation Names 3 at UT in 100th Class of Fellows
Swarat Chaudhuri, a computer scientist, and Feliciano Giustino, a physicist, are among this year’s fellows from The University of Texas at Austin.

Congrats to UT computer scientist Swarat Chaudhuri & UT physicist Feliciano Giustino who were named as Guggenheim Fellows for 2025!

#GuggFellows2025 @guggfellows.bsky.social @utaustin.bsky.social @swarat.bsky.social
cns.utexas.edu/news/accolad...

16.04.2025 18:22 | 👍 8    🔁 2    💬 0    📌 0

I am honored to be part of the #guggfellows2025 class. My Guggenheim project is on AI systems that can discover new math in an open-ended way. Many thanks to my students, colleagues, and mentors, who inspire me every day and without whom this work wouldn't be possible. www.gf.org/stories/anno...

16.04.2025 07:41 | 👍 10    🔁 1    💬 1    📌 0

Harvard has set an example for other higher-ed institutions - rejecting an unlawful and ham-handed attempt to stifle academic freedom, while taking steps to make sure students can benefit from an environment of intellectual inquiry, rigorous debate and mutual respect. Let’s hope others follow suit.

15.04.2025 03:52 | 👍 90119    🔁 18349    💬 1591    📌 750

The #LeanLang Standard Library, under active development at the Lean FRO, envisions providing a reliable and extensible basis for #softwaredevelopment, #softwareverification and #mathematics through verified components, a high-quality API, performance optimization, and best-in-class documentation.

05.03.2025 19:29 | 👍 8    🔁 4    💬 0    📌 0

Andrew Barto and Richard Sutton are the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning.
Andrew Barto and Richard Sutton as the recipients of the 2024 ACM A.M. Turing Award for developing the conceptual and algorithmic foundations of reinforcement learning. In a series of papers beginning...

RL is so back!

(well, for some of us, it never really left)

awards.acm.org/about/2024-t...

05.03.2025 10:41 | 👍 72    🔁 12    💬 1    📌 1

STAND UP FOR SCIENCE
March 7, 2025. Washington DC and nationwide. Because science is for everyone.

Calling all scientists and students based in London!

standupforscience2025.org and local groups are organizing rallies around the US to protest against the new administration’s massive and indiscriminate funding cuts to all manner of scientific research…👉🏼🧵
#sciencematters #standupforscience #london

04.03.2025 13:31 | 👍 7    🔁 3    💬 3    📌 0

Congrats to @amitayush.bsky.social for leading this effort. And thanks to my student George Tsoukalas and collaborator extraordinaire @gregdnlp.bsky.social, who made critical contributions to the work. (3/3)

22.02.2025 21:32 | 👍 1    🔁 0    💬 0    📌 0

It also has built-in machinery for large-scale, neurally guided proof search. We show that Proofwala's multilingual capabilities can enable transfer across proof assistants. Specifically, our multilingual model can outperform Coq- and Lean-only models at standard proof synthesis metrics. (2/3)

22.02.2025 21:32 | 👍 1    🔁 0    💬 1    📌 0

Excited about Proofwala, @amitayush.bsky.social's new framework for ML-aided theorem-proving.

* Paper: arxiv.org/abs/2502.04671
* Code: github.com/trishullab/p...

Proofwala allows the collection of proof-step data from multiple proof assistants (Coq and Lean) and multilingual training. (1/3)

22.02.2025 21:32 | 👍 21    🔁 5    💬 1    📌 1

Upon learning that yesterday would be my last day as a program officer at the National Science Foundation, I shared this parting message with my colleagues. The next few months will be frenetic and stressful for them. Here are some things that you can do to help them with the mission ahead. (1)

19.02.2025 19:08 | 👍 2429    🔁 831    💬 69    📌 70

DARPA released a Request for Information (RFI) that seeks community feedback on the draft DARPA Guide to Formal Methods to Deliver Resilient Systems for Proposals (“the FMDRS Guide”). You can find the RFI here on Sam.gov.

Details in the image...

14.02.2025 22:04 | 👍 5    🔁 3    💬 0    📌 0

Proving the Coding Interview: A Benchmark for Formally Verified Code Generation
We introduce the Formally Verified Automated Programming Progress Standards, or FVAPPS, a benchmark of 4715 samples for writing programs and proving their correctness, the largest formal verification ...

Proving the Coding Interview: A Benchmark for Formally Verified Code Generation

“We introduce the Formally Verified Automated Programming Progress Standards, or FVAPPS, a benchmark of 4715 samples […] including 1083 curated and quality controlled samples”

arxiv.org/abs/2502.05714

12.02.2025 01:44 | 👍 4    🔁 1    💬 1    📌 0

Can LLMs be used to discover interpretable models of human and animal behavior?🤔

Turns out: yes!

Thrilled to share our latest preprint where we used FunSearch to automatically discover symbolic cognitive models of behavior.
1/12

10.02.2025 12:21 | 👍 134    🔁 44    💬 3    📌 11

How a Canadian scientist and a venomous lizard helped pave the way for Ozempic - National | Globalnews.ca
In 1984, Dr. Daniel Drucker, an endocrinologist from the University of Toronto, discovered a hormone that helped pave the way for popular diabetes drugs such as Ozempic.

This is the most relevant article to NIH and research cuts I’ve seen.

Imagine if this was today, how many people would be saying “Why are we studying Gila Monsters and their impact on diabetes? That’s wasted money!”

globalnews.ca/news/9793403...

09.02.2025 21:58 | 👍 48955    🔁 12510    💬 1140    📌 439

Super excited: my new @darpa program on AI for pure mathematics!

Exponentiating Mathematics (expMath) aims to accelerate the rate of progress in pure math through the development of an AI collaborator and new professional-level math benchmarks.

sam.gov/opp/4def3c13...

07.02.2025 16:58 | 👍 16    🔁 5    💬 0    📌 1

The Deep Link Equating Math Proofs and Computer Programs | Quanta Magazine
Mathematical logic and the code of computer programs are, in an exact way, mirror images of each other.

Mathematical proof assistants like Coq and Lean were made possible by a correspondence that established the equivalence between proofs and computation. Read the explainer from our archive:

08.02.2025 16:46 | 👍 45    🔁 19    💬 0    📌 3

Screening performance and characteristics of breast cancer detected in the Mammography Screening with Artificial Intelligence trial (MASAI): a randomised, controlled, parallel-group, non-inferiority, ...
The findings suggest that AI contributes to the early detection of clinically relevant breast cancer and reduces screen-reading workload without increasing false positives.

New: The largest medical A.I. randomized controlled trial yet performed, enrolling >100,000 women undergoing mammography screening
The use of AI led to 29% higher detection of cancer, no increase of false positives, and reduced workload compared with radiologists w/o AI thelancet.com/journals/lan...

04.02.2025 03:00 | 👍 1363    🔁 349    💬 38    📌 90

This is one more, and such a profound, way of distinguishing between science and technology: "Technology shouts for itself; science [does not]." (And these days, some technologies truly do themselves shout…)

04.01.2025 14:47 | 👍 9    🔁 1    💬 1    📌 0

2025 will be a #mathsky interesting year!

03.01.2025 15:47 | 👍 19    🔁 5    💬 2    📌 0

@ayushkhaitan.bluesky.social, Amitayush Thakur, and I are organizing an #AI4Math panel at the Joint Mathematics Meetings this month. Please spread the word among your math friends! We will post a summary of the discussion after the event.

04.01.2025 03:08 | 👍 6    🔁 1    💬 0    📌 0

I really enjoy NASA Administrator (!!!) Michael Griffin on the "Real Reasons" versus the "Acceptable Reasons" to go to the moon: spaceref.com/status-repor...

31.12.2024 21:54 | 👍 33    🔁 8    💬 1    📌 2

You make a good point. AlphaProof will evolve just as the informal approaches have, though.

24.12.2024 08:56 | 👍 1    🔁 0    💬 0    📌 0

Yeah, I think so, especially if search is permitted at test-time.

24.12.2024 00:08 | 👍 0    🔁 0    💬 0    📌 0

From what I have seen, LLMs are quite good at that. There are plenty of examples of definitions being used in various contexts in the training data.

23.12.2024 23:20 | 👍 0    🔁 0    💬 1    📌 0

Can AI do maths yet? Thoughts from a mathematician.
So the big news this week is that o3, OpenAI’s new language model, got 25% on FrontierMath. Let’s start by explaining what this means.

An excellent post by Kevin Buzzard on informal reasoning methods like o3. The key point, one I wholeheartedly agree with, is that informal methods continue to struggle with proof even when they give the correct answers, and this is a critical liability. xenaproject.wordpress.com/2024/12/22/c...

23.12.2024 21:55 | 👍 17    🔁 4    💬 2    📌 1

Hmm, I wasn’t imagining they would be connected to the account security people at X. But maybe worth a shot. Thank you!

23.12.2024 16:10 | 👍 1    🔁 0    💬 0    📌 0

We are excited about the potential of this approach in
✅ hard, research-level math tasks
✅ deep assurance of software and hardware systems.

This was a team effort with Kaiyu Yang, Gabriel Poesia, Jingxuan He, Wenda Li, Kristin Lauter, and Dawn Song. Please reach out to us with feedback! (2/2)

23.12.2024 14:49 | 👍 3    🔁 0    💬 0    📌 0
