(A) Is the least hand holding, but it could misformalize. (B) Makes sure the formalization fits your modelβs constraints, but leaves you open to criticism. (C) Joseph has thought more about IMO Lean formalization than anyone I know, but his may not fit your modelβs requirements.
16.07.2025 11:49 β π 0 π 0 π¬ 0 π 0
GitHub - jsm28/IMOLean: Suggested conventions and examples for Lean formalization of IMO problem statements
Suggested conventions and examples for Lean formalization of IMO problem statements - jsm28/IMOLean
For those using @leanprover AI models to βcompeteβ in #imo2025 (@ProjectNumina , @KaiyuYang4), what Lean formalizations will you use? (A) autoformalized statements, (B) manually formalized statements, (C) Joseph Meyerβs IMOLean formalizations (github.com/jsm28/IMOLean)?
16.07.2025 11:49 β π 1 π 0 π¬ 1 π 0
Oh, you want a tool which annotates papers (maybe on the fly) where if you hover over say βinaccessible cardinalβ it gives you the definition needed for the paper (or a link)?
02.02.2025 17:21 β π 1 π 0 π¬ 1 π 0
PDFs? You mean if the lean code is in a paper you canβt mouse over it? If that is what you mean, what do pdfs have to do with LLMs or are though two separate examples of things which donβt have access to language server information?
02.02.2025 08:29 β π 1 π 0 π¬ 1 π 0
I have no idea if LLMs are any good as math collaborators for bouncing around ideas and brainstorming, but if those who do get this to work keep it a secret, we just wonβt know. We should be rewarding those who use them wisely for their research.
14.01.2025 16:00 β π 1 π 0 π¬ 0 π 0
If we arenβt willing to talk about when tools are helpful, it will lead to an asymmetry where those in the know will use hidden methods that they keep secret out of either embarrassment or competition. IMHO, this isnβt in the spirit of mathematics research.
14.01.2025 16:00 β π 1 π 0 π¬ 1 π 0
Mathematics isnβt chess. Using tools isnβt cheating. It is research and knowing what tools to use is important to research. But also as these tools start to feel more human, it does bring up tricky questions of attribution and citation.
14.01.2025 16:00 β π 0 π 0 π¬ 1 π 0
At #JMM2025 last week, a mathematician told me he used ChatGPT (forgot which version) to get a major insight into solving his math problem. But, he is reluctant to share publicly (e.g. in his paper) that this happened since it could make him look less bright. I find this sad. π§΅
14.01.2025 16:00 β π 1 π 0 π¬ 1 π 0
I last attended JMM in Seattle in 2017. It was also the year I left mathematics. Like many math postdocs, I couldnβt find a suitable job and went into industry, first data science and later AI research. Iβm so excited to be back talking about AI for Math!
09.01.2025 12:01 β π 1 π 0 π¬ 0 π 0
<p>Applications of AI to formal theorem proving</p>
As with many other scientific and professional areas, artificial intelligence i...
Iβm really excited to announce that Iβm going to be at the Joint Math Meetings #JMM2025 to give a talk on AI for Formal Theorem Proving on Friday. Reach out if you want to talk about AI for Math or for all my math friends, if you just want to catch up!
09.01.2025 12:01 β π 1 π 0 π¬ 1 π 0
I think the long term goal of Lean4Lean is (1) a full implementation of the Leanβs kernel in Lean, (2) a formalization of Leanβs type theory LeanTT as in Marioβs thesis, (3) a proof that the kernel faithfully implements LeanTT, (4) a formalization of Marioβs proof that LeanTT is consistent/sound.
05.01.2025 19:24 β π 1 π 0 π¬ 0 π 0
Iβm not sure exactly what you mean here, but would Metamath Zero (if finished) be what you are looking for? It would be a checker verified up to x86 byte code that could check a trace from another theorem prover (if the kernel rules were given to MM0 and the trace contained all the proof steps).
05.01.2025 19:04 β π 1 π 0 π¬ 1 π 0
If you are just worried about Leanβs kernel, I think Lean4Lean could help verify that it is both correct and sound. If you are worried about the full Lean code base, that indeed gets a bit complicated since one could do weird stuff which bypasses the kernel. Iβm not sure the full correct solution.
05.01.2025 18:56 β π 2 π 0 π¬ 1 π 0
But just to be clear, I have no current plans to write any proof checking language (except maybe some toy external type checkers for some existing languages like Lean or MetaMath).
04.01.2025 16:23 β π 1 π 0 π¬ 1 π 0
Or better yet, meta proof checking languages like Metamath Zero, so that I know already the kernel was correct and all I had to do was get the logic specification right (and maybe that language would also have good libraries and tools for proving that specification matches a desired semantics).
04.01.2025 16:18 β π 2 π 0 π¬ 1 π 0
If I wrote a real proof checker now that I expected to be performant, I think I would use Rust. But in the next 100 years I would hope to have performant languages that we could also prove correctness of so that I could prove my checker is correct.
04.01.2025 16:08 β π 2 π 0 π¬ 1 π 0
I donβt understand this question. The language the checker is expected to implemented in, or the name of the expected yet-to-be-developed proof checking language?
04.01.2025 16:06 β π 1 π 0 π¬ 1 π 0
When I compare o1 pro to AlphaProof, I lean towards thinking o1 pro is better (much faster to the right ideas), but it obviously didnβt sufficiently prove stuff. I think something like o1/o3 plus access to Lean would be really powerful. It now seems to me that AlphaProof is too constrained w/o CoT.
24.12.2024 00:17 β π 1 π 0 π¬ 1 π 0
It it different in other functional languages? But I can see how it is quite confusing if you arenβt prepared for this.
23.12.2024 23:59 β π 0 π 0 π¬ 1 π 0
Are you friends with any of the xAI folks?
23.12.2024 13:42 β π 1 π 0 π¬ 1 π 0
Caveat 3: Iβm talking about really cutting-edge theorem proving here. There is still great value in ATPs for day-to-day formalization. We donβt need o3-like models for that. We need solvers at the right level of speed/cost/usability (tactics, hammers, chatbots, copilots, etc.).
23.12.2024 13:42 β π 1 π 0 π¬ 0 π 0
Caveat 2: Verification (including formalization) should be incorporated in the middle of reasoning models, and not just at the end. An AI should be checking its results as it goes, and adjusting its informal thinking process to account. (And for code too, e.g. unit tests!)
23.12.2024 13:42 β π 2 π 0 π¬ 1 π 0
Caveat 1: Auto-formalization itself will require ATP, at various levels of power & speed (simple tactics up to reasoning models) to fill in proof gaps, verify semantic correctness, and RL-train the formalizers. But I claim the most important missing piece is auto-formalization.
23.12.2024 13:42 β π 2 π 0 π¬ 1 π 0
Hot take: For automatic theorem proving (ATP) to solve unsolved math problems, the most important need is auto-formalization. We already have general-purpose reasoning models (o3 and more to come). We need to verify the correctness of these models with absolute certainty. π§΅
23.12.2024 13:42 β π 13 π 3 π¬ 1 π 0
We need so much more work on autoformalization (well that and practical code writing, like where the AI has to write a whole project, since I think they share many of the same difficulties like writing APIs, using libraries, and breaking down problems into sub-problems).
23.12.2024 01:52 β π 3 π 0 π¬ 1 π 0
It could be that the answer is no, but I wanted to get your thoughts because I think it often got the right answer even if the proof had mistakes.
23.12.2024 01:17 β π 0 π 0 π¬ 1 π 0
What Iβm trying to get at: Even in o1 pro, was the intuition and skill there but not the discipline and follow-through to finish an air tight proof?
23.12.2024 01:11 β π 0 π 0 π¬ 1 π 0
A project to digitalize physics into the interactive theorem prover lean 4.
Account for news related to the project. Run by: Joseph Tooby-Smith.
Science journalist and author of "The Proof in the Code," the story of Lean, the interactive theorem prover developed by Leo de Moura at Microsoft Research, which is transforming the way math research is conducted. Coming in Spring 2026 from Quanta Books.
Working on formal verification & AI at Ethereum.
verified-zkevm.org
Informal Mathematics @UniTrento || Formal Mathematics @Harmonic || Formalising in #Lean || Developing #FOSS in #Python and #Julia || Forecasting @Metaculus.
β’ GitHub: https://github.com/pitmonticone
β’ YouTube: https://www.youtube.com/@PietroMonticone
Chief Scientist at the UK AI Security Institute (AISI). Previously DeepMind, OpenAI, Google Brain, etc.
π https://leni.sh, https://pixelfed.social/chrysoberyl
π§ Researcher in Machine Assisted Theorem Proving at https://centaur.stanford.edu/
β―οΈ Director of NorCal Hakkero Factory No. 1 (I make cosplay props)
βοΈ Training AI on any of my content is prohibited
https://github.com/utensil
Mathematician at UCLA. My primary social media account is https://mathstodon.xyz/@tao . I also have a blog at https://terrytao.wordpress.com/ and a home page at https://www.math.ucla.edu/~tao/
Mainly active on GitHub (https://github.com/fzyzcjy)
Research Scientist. Implementing reasoning in AI. Theory and implementation of open ended reasoning algorithms for long term planning, robotics, math, protein design and science
Professor of Computer Science at UT Austin and Visiting Researcher at Google Deepmind, London. Automated Reasoning + Machine Learning + Formal Methods. https://www.cs.utexas.edu/~swarat
Associate Professor in Data Science and AI at Chalmers University of Technology. Neuro-symbolic AI, AI for maths, a bit of NLP and stir.
Computer scientist with a background in mathematics and logic. Academic researching formal verification technologies and applications.
PhD Math & CS @Caltech
www.robertj1.com
Mathematics professor at Collège de France and fellow of Trinity College Cambridge.
Mathematician interested in the study and teaching of computational logic, functional programming and interactive theorem proving.
Homepage: https://jaalonso.github.io
Sevilla, Spain