Built for teams shipping AI in production. Bring your use cases and questions.
Register: luma.com/qbb8w9en
Hosted by Neurometric AI
#AI #MachineLearning #SLM #MLOps
@neurometric.bsky.social
Stop Compromising With Single Model AI. Build AI systems driven by data, not guesswork. Benchmark model-algorithm combinations on your unique tasks. | neurometric.ai | neurometric.substack.com | info@neurometric.ai | https://inferencetimetactics.podbean.com
Bigger models aren't always better for production.
Hosting an office hour on Small Language Models (SLMs) for Production Workflows, covering when SLMs actually outperform large models, how to route tasks to the right model, and real examples of cutting costs without sacrificing quality.
Read more: tdv.transistor.fm/episodes/1-h...
22.01.2026 21:30
Enterprise AI has an ROI problem. Inference costs are skyrocketing and the brute force approach is failing.
Neurometric CEO Rob May joined The Deep View to break down how we use thinking algorithms and specialized models to drop costs without sacrificing performance. www.youtube.com/watch?v=FS7v...
We just launched the Neurometric Audio Leaderboard: leaderboard.neurometric.ai
This week's update includes Modulate Velma 2, released yesterday.
The leaderboard tracks real-world performance of voice and audio models as the space moves toward more specialized and cost-efficient architectures.
New episode of Inference Time Tactics.
Calvin Cooper and Byron Galbraith talk with Rapt.AI about why inference workloads are hard to predict and what it takes to run them efficiently in production.
Listen now:
inferencetimetactics.podbean.com
We cover why only 20% of companies are positioned to scale AI, why cost overruns are so common, and why data quality is now the top blocker in production systems.
Listen: inferencetimetactics.podbean.com
#EnterpriseAI #GenAI #AIOps
Episode 11 of Inference Time Tactics features Shawn Rogers, CEO of BARC, unpacking insights from new research based on 421 organizations deploying AI in production worldwide.
23.12.2025 04:38
Explore the data: leaderboard.neurometric.ai
16.12.2025 23:12
Behind the scenes with Calvin Cooper and Byron Galbraith on the engineering, the research, and what enterprises need to know.
Listen: inferencetimetactics.podbean.com
Neurometric launched the first leaderboard that combines models WITH thinking algorithms, not just single model performance. Our surprising finding: performance varies dramatically per task, and no single system dominates.
16.12.2025 23:12
Leaderboard: leaderboard.neurometric.ai
03.12.2025 16:52
@robmay-nyc.bsky.social featured today in Founders Everywhere by @EverywhereVC
We are building automated inference orchestration for multi-model AI systems so teams can measure, not guess, what drives real performance.
Read the spotlight: ideas.everywhere.vc/p/neurometri...
#AI #Startups
Leaderboard: leaderboard.neurometric.ai
02.12.2025 22:43
Our Thinking Algorithm Leaderboard is now live - the first public benchmark ranking model + reasoning algorithm combinations instead of models alone.
Covered today in The Deep View (600k+ subscribers): www.thedeepview.com/articles/ope...
#AI #LLM #AIOptimization #ThinkingAlgorithms #ITC #TTC
New episode of Inference Time Tactics: Benchmarking Generalization: How AI Learns Beyond Training Data
Rob May and Calvin Cooper from NeuroMetric talk with AI researcher Yash Sharma about how models actually generalize and why they behave unpredictably in novel scenarios.
inferencetimetactics.podbean.com
Generalization is not determined by model design alone. It emerges from how systems behave at runtime.
The frontier is shifting from training optimization to inference orchestration.
Read here: open.substack.com/pub/neurometric/p/apples-to-infinity-and-beyond-shows
Apple's "To Infinity and Beyond" paper shows where AI systems are headed next.
State Space Models like Mamba cannot generalize on their own. But when given tools at runtime (search, memory, code execution) they can reason and scale far beyond their training range.
At Neurometric, we've seen the same in production. The next phase of AI performance isn't about bigger models, it's about smarter inference.
Full post by Calvin Cooper here: neurometric.substack.com/p/beyond-ben...
New ITT episode: Solving the Cold Start Problem in AI Inference
Rob May, Calvin Cooper, and Byron Galbraith talk with Prashanth Velidandi (@pmv_inferx) about InferX, serverless inference, idle GPUs, and how infra innovation is defining the next era of AI.
Listen: inferencetimetactics.podbean.com
Tune in: inferencetimetactics.podbean.com
30.09.2025 23:33
Check out Inference Time Tactics with Rob May, Calvin Cooper and special guest Pawan Deshpande - AI researcher and founder turned product leader and angel investor.
In this episode: MIT research, why inference time decisions matter, how to evaluate agents, and where durable value sits in the stack.
Read more: neurometric.substack.com/p/introducing-itc-studio
29.09.2025 20:24
Try it here: dirtbike.neurometric.xyz
29.09.2025 20:24
We are excited to share the launch of ITC Studio alpha.
This tool makes it simple to test inference time compute strategies across LLMs.
Why it matters:
ITC impacts cost, performance, and accuracy
Experimentation accelerates AI innovation
Intelligence is moving to the systems level
New Inference Time Tactics episode: Drag, Drop, and Deploy
Rob May, Calvin Cooper, Byron Galbraith, and Dave Rauchwerk share updates on the NeuroMetric Inference Time Compute Studio and why AI is moving from scaling single models to full AI system orchestration.
inferencetimetactics.podbean.com
In Ep. 5 of Inference Time Tactics, @robmay-nyc.bsky.social, Calvin Cooper, and @intrinsicmode.com unpack Salesforce's CRMArena-Pro, going beyond "vibe testing", and what to expect from the NeuroMetric ITC Test Engine.
Listen: inferencetimetactics.podbean.com
New Episode on Inference Time Tactics: Inference Time Compute Just Went Mainstream with GPT-5.
Rob May and Calvin Cooper unpack GPT-5.0's launch and what OpenAI's routing layer signals about the shifting AI landscape.
Listen: inferencetimetactics.podbean.com?utm_campaign... @robmay-nyc.bsky.social
95% of AI projects fail. Not from weak models, but from implementation chaos.
Next week we're launching a preview of our AI Evaluation Platform to make deployments reliable and ROI-driven.
Learn more: open.substack.com/pub/neuromet...
Episode 3 of Inference Time Tactics is live: When AI Overthinks.
@robmay-nyc.bsky.social, Calvin Cooper, and Byron Galbraith break down Apple's paper, why reasoning models loop, and what it takes to make them reliable.
Listen: inferencetimetactics.podbean.com?utm_campaign...