DeepSeek has released JanusFlow model.
Model: huggingface.co/deepseek-ai/...
@cmarschner.bsky.social
Autonomous Robots @ rob.co 🦾🇪🇺 Private: @cmarschnerde.bsky.social
The "small team" had 100+ people and 1000s of GPUs to spare
27.01.2025 16:53
Please explain - ROS2 has many best practices built in, and the systems I have seen that didn't use it were inferior in terms of speed or flexibility. What would you do better?
18.01.2025 20:27
Stop it…
18.01.2025 20:15
Foundations of Large Language Models by Tong Xiao, Jingbo Zhu
This is a book (231 pages) about large language models. It primarily focuses on foundational concepts rather than comprehensive coverage of all technologies. The book is structured into four main chapters, each exploring a key area:
Happy New Year everyone! Jim and I just put up our January 2025 release of Speech and Language Processing! Check it out here: web.stanford.edu/~jurafsky/sl...
12.01.2025 20:44
Hard to recognize the contents compared to my version from 2003… After Manning/Schütze, my favorite NLP book. MS was just beautiful with Cambridge University Press' typesetting. I loved it on a visceral level. But Jurafsky/Martin was also very accessible.
11.01.2025 23:02
If you're in ML, consider robotics at this point. Especially if you're in Europe.
There are amazing challenges in spatial intelligence, planning, understanding of the physical world, and control waiting to be solved with AI.
And if you want to turn it into products, contact me.
Just switching over from X for today.
Is there still sanity on this platform at least?
Going back and forth between 1-week and 1-year, 2-year, 5-year timelines and loving it!
There is no such thing as an architecture role. Every IC writes code, and that's how it should be.
It's just that more senior engineers should think strategically and shape where a company will be in the future.
On site, testing the #RobCo vision system 🦾
25.11.2024 21:42
Meanwhile, the @bsky.app developers…
23.11.2024 09:03
Def Riptide https://youtu.be/bdhrYdWlxTw?si=zEMJ4JAAl7g7kkqk
23.11.2024 04:56
I've spent the last two years scouring all available resources on RLHF specifically and post-training broadly. Today, with the help of a totally cracked team, we bring you the fruits of that labor: Tülu 3, an entirely open frontier model post-training recipe. We beat Llama 3.1 Instruct.
Thread.
That comment is on brand
21.11.2024 12:46
I'm turning this into my job account and will focus on robots and neural networks here. For architecture and city planning, see @cmarschnerde.bsky.social - like on X
19.11.2024 22:04
I recently gave a tutorial on the DUSt3R paper (web: dust3r.europe.naverlabs.com, paper: tinyurl.com/5t2ks575, code: github.com/naver/dust3r) in a research group meeting. In case you missed it, didn't understand it or would like to hear some perspectives on why it's such a cool idea, read on… 1/23
18.11.2024 23:18
With all those starter packs and the Xodus, ML Bluesky now feels like Twitter 2016. Finally, content
19.11.2024 21:32
Also can we rename this to Twitter pls thx
18.11.2024 22:25
Resilience is the art of keeping things working in the face of problems.
It requires slack. Slack is the enemy of efficiency.
A society that has gone too far optimizing for efficiency will constantly be on the verge of collapse