β¨ New 3D pose estimation method from my lab! #FMPose3D allows for monocular (i.e. single camera) 2Dβ‘οΈ3D π₯
Led by Ti Wang & w/ Xiaohang Yu #FMPose3D is SOTA on human & animal 3D benchmarks, & will be integrated into @deeplabcut.bsky.social β¬οΈ
π arxiv.org/abs/2602.05755
β‘οΈ xiu-cs.github.io/FMPose3D/
08.02.2026 07:09 β π 66 π 11 π¬ 1 π 0
I think it will happen much faster than 10 years. 4 years top.
07.02.2026 22:45 β π 1 π 0 π¬ 0 π 0
Google DeepMind's Mining Generalizable Activation Functions
They argue that evolutionary search is a powerful framework for discovering new activation functions, showing that LLM-driven pipelines like AlphaEvolve replace manually designed search spaces with flexible,
06.02.2026 23:15 β π 18 π 3 π¬ 1 π 1
the fact that early-stage automated vehicles that still occasionally require remote operation assistance are like several orders of magnitude safer than the average American driver should be the real wakeup call here, not the idea that these vehicles still sometimes need remote operation assistance
06.02.2026 17:34 β π 696 π 90 π¬ 36 π 15
They propose a new paradigm called Drifting Models, which evolve the pushforward distribution during training and naturally admit one-step inference. We introduce a drifting field that governs the sample movement and achieves equilibrium when the distributions match.
06.02.2026 02:30 β π 18 π 1 π¬ 1 π 0
In case it wasnβt ALREADY blatantly clear!
04.02.2026 21:30 β π 399 π 218 π¬ 22 π 12
They develop TinyLoRA, a new ft method. with TinyLoRA + RL, models learn well with dozens or hundreds of params. For example, they use only 13 parameters to train 7B Qwen model from 76 to 91% on GSM8K.
"Learning to Reason in 13 Parameters"
Paper: arxiv.org/abs/2602.04118
05.02.2026 13:15 β π 92 π 11 π¬ 1 π 9
It is possible to get Genie 3 to make worlds that are visually interesting
I have been using images I generated in Midjourney of vast megastructures & odd cities in various styles. After 20 seconds I can freely wander around them for a minute or so. (Yes, I controlled the cat in the first scene)
05.02.2026 04:18 β π 80 π 5 π¬ 4 π 2
Thanks for sharing! How is the latency? Does it feel responsive?
05.02.2026 16:25 β π 0 π 0 π¬ 0 π 0
Voxtral transcribes at the speed of sound
Mistral just released Voxtral Transcribe 2 - a family of two new models, one open weights, for transcribing audio to text. This is the latest in their Whisper-like model family, β¦
Two new speech-to-text models (similar to Whisper) from Mistral today - one of them is API-only, the other is a 8.9GB Apache-2.0 licensed open weights model for "realtime" transcription. They're both very good! simonwillison.net/2026/Feb/4/v...
04.02.2026 22:43 β π 118 π 10 π¬ 6 π 1
NVIDIA releasing their best models as open weights isn't charity β it's a business decision. And honestly, it's one of the clearest explanations I've heard for why a company would invest heavily inβ¦
Why NVIDIA builds their own open models | Nemotron w/ Bryan Catanzaro
Nvidiaβs Nemotron is the closest thing the U.S. has to a Qwen approach to open models, but most people donβt know it yet.
Iβm very bullish on Nvidiaβs open model efforts in 2026.
Interconnects interview #17 on the past, present, and future of the Nemotron project.
www.youtube.com/watch?v=Y3Vb...
04.02.2026 18:05 β π 34 π 3 π¬ 1 π 2
Rectified LpJEPA
A JEPA architecture that learns sparse, non-negative, informative representations through principled distributional regularization.
04.02.2026 12:40 β π 13 π 2 π¬ 1 π 0
The dreamcoder boys are back
https://arxiv.org/abs/2602.00929
04.02.2026 01:51 β π 13 π 1 π¬ 0 π 0
What if position encodings were designed for vision from scratch? We introduce PaPEβParabolic Position Encoding. Outperforms RoPE on 7/8 datasets and extrapolates to higher resolutions without fine-tuning or position interpolation. Paper, code, and website in thread π§΅
04.02.2026 08:22 β π 36 π 7 π¬ 3 π 0
Today weβre releasing the International AI Safety Report 2026: the most comprehensive evidence-based assessment of AI capabilities, emerging risks, and safety measures to date. π§΅
(1/19)
03.02.2026 13:16 β π 50 π 25 π¬ 1 π 14
I was in a zoom call with Brock Pierce a couple years ago, along with the startup CEO I worked for. Brock was introduced to us as potential investor. Brock was a weird creep that oozed dishonesty. I told the CEO we didnβt want that guy in our cap table or anywhere near. We dodged a bullet.
03.02.2026 13:37 β π 3 π 0 π¬ 0 π 0
Blocked. Not renewable.
02.02.2026 04:01 β π 0 π 0 π¬ 0 π 0
Next time someone asks you about energy and land use, maybe remind them that the ~35 million acres that currently grow corn for ethanol in the US could produce ~15 PWh per year of electricity from solar photovoltaics. That's ~3.5 times more than *total annual generation from all US power plants*.
01.02.2026 20:55 β π 3901 π 1320 π¬ 125 π 68
This brings back fond memories of working with @bkwok.bsky.social and Atjeng Gunawan, trying to figure out how to get that new renderer working within the complicated Maya codebase and on the limited GPUs of the time. :-)
01.02.2026 17:38 β π 1 π 0 π¬ 0 π 0
I love this⦠I worked on the realtime renderer that shipped with Maya 4! Early programmable shading, even getting a good per-pixel specular was challenging!
01.02.2026 11:47 β π 18 π 1 π¬ 2 π 0
Donald Trump speed-running USA towards dictatorship and civil war
30.01.2026 18:06 β π 1 π 0 π¬ 1 π 0
Build with NVIDIA Cosmos Reason 2 and Cosmos Cookbook recipesβfrom egocentric robot reasoning to physical plausibility checks and trafficβaware models.
π Jan 29 β Feb 26
π₯ Solo or teams (up to 4)
29.01.2026 18:40 β π 0 π 0 π¬ 0 π 0
NVIDIA Cosmos Cookoff Β· Luma
Host: NVIDIA
Sponsors: Nebius and Milestone Systems
Community: Discord
Prizes:First Place: $3,000 and an NVIDIA DGX Sparkβ’
Second Place: $2,000 and an NVIDIAβ¦
NVIDIA announced "Cosmos Cookoff", a virtual, 4-week physical AI challenge for robotics, AV, and vision AI builders.
π Prizes include $5,000, an NVIDIA DGX Spark, and more!
nvda.ws/3Z9lrCO
29.01.2026 18:40 β π 1 π 0 π¬ 1 π 0
A data collection farm from Fourier robotics. The operators appear to be collecting data for brain-computer interfaces while also controlling the robots to do a wide variety of tasks.
Video from RoboHub on X (originally from Fourier)
29.01.2026 03:50 β π 38 π 5 π¬ 2 π 0
purrveyor of codexslop. synthetic fabric enthusiast
Writing a data-driven newsletter about economics @ apricitas.io
Nuance? In this Economy
Full Employment Stan, Brazilian Coffee Tariff Victim |
Sen. Sanders of Vermont, Ranking Member of the U.S. Senate Committee on Health, Education, Labor & Pensions, is the longest-serving independent in congressional history.
PhD student at DTU π©π° Doing research at the intersection of deep learning, event cameras/neuromorphic vision, multi-modal models, and robotics.
https://chrisohrstrom.github.io/
Probabilistic machine learning to address questions in evolution and health #EvolutionaryMedicine. PI at the Centre for Genomic Regulation, co-leading a group with Mafalda Dias. Previously Harvard.
I capture macro splats, create interactive installations and develop #miqula. Software, Design & Art. No politics.
www.danybittel.ch
Assistant Professor at Maastricht University.
Research interests: AI, RL, games. Tic-Tac-Toe aficionado. Opinions my own, but should be everyone's.
Anon feedback: admonymous.co/dennis-soemers
unlicensed back alley alchemy
digital β physical, 3D and AI research. living in a world of magic and vibrance
AI at Google DeepMind
https://fofr.ai
Image research SVP for Jasper.ai
Cofounder clipdrop.co, acquired by stability.ai
AI x Images for Google Art
Created Google Cardboard
No longer Blueskyβs only resident finance bro | Macro Strategist | Even the blind squirrel get a nut sometime. | QCR: Non Culto, For The Crown. | πΊπΈ via π¨π¦. Not π΄σ §σ ’σ ³σ £σ ΄σ Ώ. | Normal man, one of the Normal Men
*ALL CAPS HEADLINES LIKE THIS ARE FROM BLOOMBERG
π CLT
Public health warnings & health policy. Epidemiologist & health economist. Chair and Faculty at NECSI. Former 16 years at Harvard. DC & Virginia.
βοΈ necsi.edu/eric-feigl-ding
π₯ x.com/drericding
π° bit.ly/raisealarm
π drericding.substack.com/subscribe
Co-Exec Director of Indivisible along with Leah.
Organizing with local Indivisible groups against this fascistic clown show of a regime.
Currently studying Masters in Human centered AI in Technical university of Denmark, interested in 3D vision and Vision Language Models
Previously bachelor student in Lund university
Postdoc at IBME in Oxford. Machine learning for healthcare.
https://www.fregu856.com/
Research scientist @nvidia | postdoc @caltech | PhD @univienna | former research intern @MetaAI and @nvidia | views are my own
My thought about computers.
Freelance science journalist contributing to NYT, SciAm, Nature etc. Author of "Poached: Inside the Dark World of Wildlife Trafficking" (2018) and "I Feel Love: MDMA and the Quest for Connection in a Fractured World" (2023).