Neuroimaging studies have consistently shown that the language network – a set of brain
regions responsible for language comprehension and production – remains largely inactive during various
reasoning tasks (Amalric and Dehaene, 2019; Monti et al., 2012, 2007, 2009; Fedorenko et al., 2011
I always thought that reasoning does not require language. Well, this seems to be supported by neuroscience, see screenshot from
arxiv.org/pdf/2412.06769
06.03.2025 09:11 — 👍 4 🔁 0 💬 0 📌 0
This involves more than removing the thinking part.
The prompt has to specify delimiters to be used. For instance, add this to the prompt:
"Just output the result as a python list of strings."
Then extract with :
response = '[' + response.split('[')[-1].split(']')[0] + ']'
04.03.2025 08:58 — 👍 1 🔁 0 💬 0 📌 0
I have been working with R1 distilled models lately for some agentic workflows (workflows where the output of LLM is used to decide what to do next). Prompting is different from previous models like Llama, but the bulk of the change is to parse the output to extract what you are interested in. 1/n
04.03.2025 08:58 — 👍 1 🔁 1 💬 1 📌 0
x.com
The thread was motivated by results on testing SOTA models:
x.com/mbalunovic/s...
08.02.2025 11:38 — 👍 2 🔁 0 💬 0 📌 0
It is also interesting to note that AI math benchmarks only care about the final number. If that number was accidentally found via a flawed mathematical proof, then it is still considered a success.
08.02.2025 11:36 — 👍 1 🔁 0 💬 1 📌 0
There is no wonder AI focuses on number finding math problems. It is because checking the result is simple.
Tackling the full spectrum of math requires a much more complex result checking machinery (formal proof checker)
08.02.2025 11:36 — 👍 1 🔁 0 💬 1 📌 0
This is to say that getting good at computing numbers specified by some mathematical setting is not the same as getting good at math in general. It is definitely part of math, but only a tiny part of math.
08.02.2025 11:36 — 👍 0 🔁 0 💬 1 📌 0
I was trained a a mathematician in France. And I almost never had to solve a problem of that kind. All the math work was about proving mathematical properties of mathematical objects. For instance, prove that a given group is isomorphic to another given group.
08.02.2025 11:36 — 👍 0 🔁 0 💬 1 📌 0
AIME I
February 6th, 2025 | The first American Invitational Mathematics Examination of the year. Students tackle 15 challenging problems in three hours.
I looked at AIME problems and one thing strikes me. All problems are about computing a number. This is a tiny part of math.
AIME problems olympiads.us/past-exams/2...
thread:
08.02.2025 11:35 — 👍 6 🔁 2 💬 1 📌 0
It could be that the problem is part of R1 or DeepSeek v3 training data as it is available online.
30.01.2025 18:25 — 👍 2 🔁 0 💬 1 📌 0
DeepSeek R1 reasoning to find that option 5 is the right answer.
I asked R1 (full model, locally hosted) to solve this logic puzzle.
Which answer in this list is the correct answer to this question?
All of the below.
None of the below.
All of the above.
One of the above.
None of the above.
None of the above
It solves it correctly.
30.01.2025 18:24 — 👍 3 🔁 0 💬 1 📌 0
NVIDIA
Not sure why you did not see it yourselves. But now you know.
25.01.2025 20:40 — 👍 0 🔁 0 💬 0 📌 0
Screenshot of a chatGPT conversation where chatGPt writes text that Hitler could have said. It exposes Nazi ideology. It is followed by a text explaining the danger of Nazi ideology.
How to make ChatGPT speak like Adolf Hitler.
This is not a criticism of ChatGPT 4o nor OpenAi work. I do think it is important to be able to teach people about bad things that happened.
With that in mind, here is the thing: chatgpt.com/share/6794fa...
25.01.2025 18:41 — 👍 1 🔁 0 💬 0 📌 0
Interested in KV Cache compression? Have a look at my team's KV Press.
You can start from HuggingFace blog: huggingface.co/blog/nvidia/...
25.01.2025 14:20 — 👍 2 🔁 0 💬 0 📌 0
My take from Deepseek R1 paper. It was trained on reasoning tasks where the outcome can be assessed without ambiguity (correct math response, and code that compile and produces the right output)
To me it is like SFT with perfect ground truth.
There are other key findings from that team ofc.
24.01.2025 13:39 — 👍 2 🔁 1 💬 0 📌 0
I'm not offended, dont' worry. I was suprised to see something that looked like an apology when there is nothing to apologize for.
I hope TESLA sales will go to zero in Germany (and in Europe in general0. That's the only language he'll understand.
24.01.2025 10:48 — 👍 1 🔁 0 💬 0 📌 0
Are you saying you are sorry for having to leave X?
To me?
Why is that? I am not a defender of X.
Personally I find it to be a great source of AI/ML info.
To your point, the rest is painful.
22.01.2025 22:29 — 👍 0 🔁 0 💬 1 📌 0
Some European media are less ambiguous than that. Cant say for US media.
An American friend didn't know about this till I told him. It did not show in his news feed (provided by Google). This is even worse IMHO. Just to consider this is business as usual.
21.01.2025 16:35 — 👍 2 🔁 0 💬 1 📌 0
Nazis: "that's a nazi salute"
Historians: "that's a nazi salute"
Average person: "that's a nazi salute"
The Media: "Elon Musk makes odd gesture throwing his heart to the crowd."
21.01.2025 01:03 — 👍 49478 🔁 12799 💬 819 📌 481
Text showing that OpenAI has access to frontier math problems and solutions.
Who's surprised?
When will people get that this happens? And even if not shared intentionally, as soon as you call an OAI api, OAI has access to what you send it.
OAI is not special here, any LLM api provider does the same.
Unless you have a private instance of it.
19.01.2025 11:51 — 👍 5 🔁 0 💬 0 📌 0
Just sought to replicate this and it’s like halfway fixed but still wrong🙄
17.01.2025 13:55 — 👍 9 🔁 1 💬 2 📌 1
My take on what's going at OpenAI. I think they have reached a point where o3 or whatever they call it is self improving autonomously.
Does it mean it is AGI or ASI? Certainly not.
AlphaGo was self improving for instance. It is not an AGI either.
17.01.2025 12:52 — 👍 1 🔁 0 💬 0 📌 0
NVIDIA Academic Grant Program for Researchers
Submit your research proposal.
Applicants must be a full-time faculty member at an accredited academic institution that awards research degrees to PhD students.
Up to 32K A100 40GB hours can be requested.
Award decisions expected in June.
For more information, please see FAQs: www.nvidia.com/en-us/indust...
13.01.2025 17:18 — 👍 3 🔁 1 💬 0 📌 0
NVIDIA’s Academic Grant Program is accepting proposals to accelerate data processing, graph analytics, graph neural networks, operational research, route optimization, and predictive modeling for scientific research using NVIDIA technology.
Deadline to apply is March 31: nvda.ws/3ZNxzuW
1/2
13.01.2025 17:15 — 👍 8 🔁 4 💬 1 📌 0
ofc there are skin color differences between humans. But this is a continuum.
There is no way to put people in few "race" groups with a clear cut definition of the boundary of these groups, for instance by defining a skin darkness threshold for each "race".
09.01.2025 12:09 — 👍 0 🔁 0 💬 0 📌 0
The problem is the belief in human races. This has no biological support.
08.01.2025 22:22 — 👍 0 🔁 0 💬 1 📌 0
Exactly like US forms asking me, living in France, to put a state name somewhere.
08.01.2025 21:35 — 👍 1 🔁 0 💬 0 📌 0
Facebook is censoring 404 Media stories about Facebook's censorship
🔗 www.404media.co/facebook-is-...
08.01.2025 16:03 — 👍 7479 🔁 2359 💬 271 📌 230
I believe Nvidia is releasing DIGITS to accelerate Grace CPU adoption. It is a very smart move by Nvidia.
08.01.2025 05:45 — 👍 14 🔁 1 💬 4 📌 0
Scientist in Artificial Intelligence and the Decision Sciences.
https://uli-research.com/About_Me.html
Data scientist at night.
https://www.kaggle.com/maiernator
ML Engineer at NVIDIA. Previously: Stealth GPU startup; Stability AI; AMD; Autodesk; CEO of 2 startups (3D + AI). Toronto, Canada
23 // Kaggle Competitions Grandmaster & ML/AI Researcher. Building video games @ Iconic, machine reasoning @ Cambridge, bioscience @ ForecomAI.
https://mxbi.net / tw: @mikb0b
Developer @Hexaly - Operations Research - Scheduling
LLM R&D, Kaggle Grandmaster!
🌏 Globally Recognized Leader in Responsible & Generative AI | 📈 Entrepreneur | 👣 Aly | 🧬 Geneticist | 🗞️ Advisor | | 🤖 Formerly IBM’s First Ever Global Chief AI Officer | 📰 Silicon Sands News 🔗siliconsandsnews.com🔗
MCF | Researcher in Good Old-Fashioned Artificial Intelligence (GOFAI) : mainly knowledge representation and reasoning & synergies with machine learning | trail running
mastodon: https://piaille.fr/@0xdefec7edcafe
machine learning for health at microsoft research, based in cambridge UK 🌻 she/her
Blog: https://argmin.substack.com/
Webpage: https://people.eecs.berkeley.edu/~brecht/
Chair of the Department of Biomedical Informatics at the University of Colorado School of Medicine. Research: transcriptomics, machine learning, public data - pick two of three. He/him. Views mine, not employer's.
Machine learning researcher, working on causal inference and healthcare applications
Assistant professor of biomedical data science and dermatology at Stanford. AI for healthcare. Associate editor at NEJM AI and the Journal of Investigative Dermatology. Mother of a sassy girl and a baby boy.
ML for healthcare and health equity. Assistant Professor at UC Berkeley and UCSF.
https://irenechen.net/
Faculty at UC San Diego. Chief Health AI Officer at UC San Diego Health. #rstats. Creator of Tidier.jl #julialang. #GoBlue. Views own.
Health at Microsoft AI
Deputy Editor @ai.nejm.org
Hon Associate Professor @unibirmingham.bsky.social
Prev Apple, Prev ophthalmology doctor in the NHS
Chief Technology Officer FL97, Inc. - Professor @ Harvard
Founding Editor NEJM AI, Co-host AI Grand Rounds 🎙️, Co-founder Generate Biomedicines, Inc
Assistant Professor at Stanford. Trustworthy, deployable ML/NLP for healthcare.