Hard to find super specific numbers, but 4 J per token seems to be a common and reasonable estimate, which would mean most chats are a few Wh or less. This paper goes into it a bit more - arxiv.org/pdf/2310.03003
18.02.2025 01:07
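A quick sanity check of that arithmetic, as a rough sketch: it takes the 4 J per token figure from the post at face value, and the chat lengths below are made-up illustrative values, not measurements. The function name `chat_energy_wh` is hypothetical.

```python
JOULES_PER_WH = 3600.0  # 1 Wh = 3600 J by definition


def chat_energy_wh(tokens: int, joules_per_token: float = 4.0) -> float:
    """Estimated energy in watt-hours for a chat of `tokens` tokens.

    The 4 J/token default is the rough estimate quoted in the post,
    not a measured value.
    """
    return tokens * joules_per_token / JOULES_PER_WH


# Hypothetical chat lengths, in total tokens.
for tokens in (500, 2000, 5000):
    print(f"{tokens} tokens: {chat_energy_wh(tokens):.2f} Wh")
```

At 4 J per token, 900 tokens come to exactly 1 Wh, so a chat has to run to several thousand tokens before it passes a few Wh.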
Kind of odd to see the "CUDA is dead because PTX can be written to be much more efficient" takes, but the DeepSeek paper is an interesting case study in how constraints breed creativity.
05.02.2025 20:54
Techdirt guy. Writes about social media, copyright, free speech, content moderation, civil liberties and stuff like that. Once wrote a paper that may have helped inspire this service & now I'm on its board: https://bit.ly/protocolnotplatform
Your Jeopardy! pal. Author of 100 PLACES TO SEE AFTER YOU DIE (bit.ly/3kLgJKO) and a bunch of other stuff. OMNIBUS co-founder (patreon.com/omnibusproject).
- Possibility space explorer
- Open source teacher and author https://howtoopensource.dev.
- Ruby 3.2+ core committer
- Creator of https://www.CodeTriage.com.
- (he/him)
Y'all means all
RecSys, AI, Engineering; Principal Applied Scientist @ Amazon. Led ML @ Alibaba, Lazada, Healthtech Series A. Writing @ eugeneyan.com, aiteratelabs.com.
Director of Machine Learning at the Wikimedia Foundation. We host Wikipedia.
Senior Research Manager at NVIDIA. Prev professor at TUM. Computer vision mostly. Views are my own.
ML Engineer at NVIDIA. Previously: Stealth GPU startup; Stability AI; AMD; Autodesk; CEO of 2 startups (3D + AI). Toronto, Canada
Sr. Distinguished Engineer @nvidia
Morgan McGuire - Gone Sailing! Now on the Salish Sea.
Known for Roblox, NVIDIA, Pasteur Labs, Graphics Codex, Markdeep, G3D, Skylanders, E Ink, Titan Quest, Vicarious Visions, Unity, Activision, Williams, Waterloo, Quadplay, Computer Graphics
Staff writer at The Atlantic. Cat guy, democracy defender. Actor for a day on Succession, Jeopardy champ. Atlantic Starter Pack: go.bsky.app/NVbMa2Y
I'm just Jeff. Cluster monkey, past military officer and professor. #HPC and Lots of other Things. I work at NVIDIA but have worked other places (it's a long list). Opinions and re-posts are my own.
"Next to Last"
Partner at Underscore, Typelevel cofounder, Scala SIP committee alumnus. Type astronaut, grackle/shapeless/scalac/dotty hacker. He/him. Brighton, UK. Internationalist.
AI researcher & engineer @Meta working on @PyTorch torchtune in NYC; interests in generative models, RL, and evolutionary strategies
https://github.com/pbontrager https://tinyurl.com/philips-papers
Research Engineer - PyTorch core - Meta@London - Open-source/open science advocate
Maintainer of torchrl / tensordict / leanrl
Former MD - Neuroscience PhD
https://github.com/vmoens
Developer on PyTorch at Meta. Previously Haskeller and GHC developer.
ML Software Engineer at Kumo.AI & PyTorch Geometric // Past: Research Engineer at Lightning AI & PyTorch Lightning // www.akihironitta.com