Bless everyone who records their talks
09.12.2024 19:09 — 👍 3 🔁 0 💬 0 📌 0@stochasticchasm.bsky.social
aspiring independent researcher exploring the loss landscape • 24 • interested in reasoning, creativity, intelligence stochasm.blog
Bless everyone who records their talks
09.12.2024 19:09 — 👍 3 🔁 0 💬 0 📌 0“They said it could not be done”. We’re releasing Pleias 1.0, the first suite of models trained on open data (either permissibly licensed or uncopyrighted): Pleias-3b, Pleias-1b and Pleias-350m, all based on the two trillion tokens set from Common Corpus.
05.12.2024 16:39 — 👍 251 🔁 85 💬 12 📌 19What are some solid pieces of writing in the vein of situational awareness and machines of loving grace?
03.12.2024 22:00 — 👍 0 🔁 0 💬 0 📌 0Yeah, and thinking about it more, I guess a larger general model would probably beat out smaller specialized models too
03.12.2024 19:24 — 👍 1 🔁 0 💬 0 📌 0What about with each agent having a different model?
03.12.2024 16:39 — 👍 0 🔁 0 💬 1 📌 0You love to see it
03.12.2024 15:01 — 👍 0 🔁 0 💬 0 📌 0Good one haha
03.12.2024 15:01 — 👍 2 🔁 0 💬 0 📌 0Oh I see, that’s pretty cool
02.12.2024 18:12 — 👍 0 🔁 0 💬 0 📌 0Why would you do this? Lol
02.12.2024 17:46 — 👍 0 🔁 0 💬 1 📌 0Classic
02.12.2024 15:26 — 👍 1 🔁 0 💬 0 📌 0Many are saying
30.11.2024 21:02 — 👍 1 🔁 0 💬 0 📌 0It’s time to swap my terminal from simple dark/grayscale to colorful again. The question is, which color scheme?
30.11.2024 20:00 — 👍 0 🔁 0 💬 1 📌 0Good morning everyone what’s the plan for today
30.11.2024 15:45 — 👍 0 🔁 0 💬 0 📌 0I like the idea of the first name being a subdomain
29.11.2024 16:29 — 👍 0 🔁 0 💬 0 📌 0Thanks!
29.11.2024 16:23 — 👍 0 🔁 0 💬 0 📌 0I thought I was, but will check when I try again today
29.11.2024 16:23 — 👍 0 🔁 0 💬 0 📌 0Can blocklists contain blocklists? Can you have a cycle in the graph?
28.11.2024 22:21 — 👍 3 🔁 0 💬 0 📌 2White noise is the meta I’ve heard
28.11.2024 21:41 — 👍 2 🔁 0 💬 0 📌 0Yeah I needed to see this today lol
28.11.2024 19:45 — 👍 1 🔁 0 💬 0 📌 0Yeah makes sense honestly
28.11.2024 07:47 — 👍 2 🔁 0 💬 0 📌 0Thanks!
28.11.2024 06:53 — 👍 0 🔁 0 💬 0 📌 0Lol you just use F.sdpa?
28.11.2024 06:52 — 👍 1 🔁 0 💬 1 📌 0Why is it so slow
28.11.2024 05:47 — 👍 3 🔁 0 💬 4 📌 0Building wheel for flash-attn (setup.py) … /
28.11.2024 05:47 — 👍 6 🔁 0 💬 1 📌 0Holy
28.11.2024 00:09 — 👍 1 🔁 0 💬 0 📌 0scraping is hitting a wall
27.11.2024 22:45 — 👍 28 🔁 3 💬 1 📌 0I think they do mean that it’s a preview/experimental model since they’ve identified some issues with it in their blog post
27.11.2024 20:33 — 👍 0 🔁 0 💬 0 📌 0With some sort of scaling graph too! Wish they were more clear about what the x-axis represents
27.11.2024 19:54 — 👍 3 🔁 0 💬 1 📌 0Finished training in the morning and evals already done is impressive speed
26.11.2024 21:07 — 👍 0 🔁 0 💬 1 📌 0Wow
26.11.2024 20:13 — 👍 0 🔁 0 💬 0 📌 0