stochasm's Avatar

stochasm

@stochasticchasm.bsky.social

aspiring independent researcher exploring the loss landscape • 24 • interested in reasoning, creativity, intelligence stochasm.blog

302 Followers  |  358 Following  |  82 Posts  |  Joined: 19.11.2024  |  2.0925

Latest posts by stochasticchasm.bsky.social on Bluesky

Bless everyone who records their talks

09.12.2024 19:09 — 👍 3    🔁 0    💬 0    📌 0
Post image

“They said it could not be done”. We’re releasing Pleias 1.0, the first suite of models trained on open data (either permissibly licensed or uncopyrighted): Pleias-3b, Pleias-1b and Pleias-350m, all based on the two trillion tokens set from Common Corpus.

05.12.2024 16:39 — 👍 251    🔁 85    💬 12    📌 19

What are some solid pieces of writing in the vein of situational awareness and machines of loving grace?

03.12.2024 22:00 — 👍 0    🔁 0    💬 0    📌 0

Yeah, and thinking about it more, I guess a larger general model would probably beat out smaller specialized models too

03.12.2024 19:24 — 👍 1    🔁 0    💬 0    📌 0

What about with each agent having a different model?

03.12.2024 16:39 — 👍 0    🔁 0    💬 1    📌 0

You love to see it

03.12.2024 15:01 — 👍 0    🔁 0    💬 0    📌 0

Good one haha

03.12.2024 15:01 — 👍 2    🔁 0    💬 0    📌 0

Oh I see, that’s pretty cool

02.12.2024 18:12 — 👍 0    🔁 0    💬 0    📌 0

Why would you do this? Lol

02.12.2024 17:46 — 👍 0    🔁 0    💬 1    📌 0

Classic

02.12.2024 15:26 — 👍 1    🔁 0    💬 0    📌 0

Many are saying

30.11.2024 21:02 — 👍 1    🔁 0    💬 0    📌 0

It’s time to swap my terminal from simple dark/grayscale to colorful again. The question is, which color scheme?

30.11.2024 20:00 — 👍 0    🔁 0    💬 1    📌 0

Good morning everyone what’s the plan for today

30.11.2024 15:45 — 👍 0    🔁 0    💬 0    📌 0

I like the idea of the first name being a subdomain

29.11.2024 16:29 — 👍 0    🔁 0    💬 0    📌 0

Thanks!

29.11.2024 16:23 — 👍 0    🔁 0    💬 0    📌 0

I thought I was, but will check when I try again today

29.11.2024 16:23 — 👍 0    🔁 0    💬 0    📌 0

Can blocklists contain blocklists? Can you have a cycle in the graph?

28.11.2024 22:21 — 👍 3    🔁 0    💬 0    📌 2

White noise is the meta I’ve heard

28.11.2024 21:41 — 👍 2    🔁 0    💬 0    📌 0

Yeah I needed to see this today lol

28.11.2024 19:45 — 👍 1    🔁 0    💬 0    📌 0

Yeah makes sense honestly

28.11.2024 07:47 — 👍 2    🔁 0    💬 0    📌 0

Thanks!

28.11.2024 06:53 — 👍 0    🔁 0    💬 0    📌 0

Lol you just use F.sdpa?

28.11.2024 06:52 — 👍 1    🔁 0    💬 1    📌 0

Why is it so slow

28.11.2024 05:47 — 👍 3    🔁 0    💬 4    📌 0

Building wheel for flash-attn (setup.py) … /

28.11.2024 05:47 — 👍 6    🔁 0    💬 1    📌 0

Holy

28.11.2024 00:09 — 👍 1    🔁 0    💬 0    📌 0

scraping is hitting a wall

27.11.2024 22:45 — 👍 28    🔁 3    💬 1    📌 0

I think they do mean that it’s a preview/experimental model since they’ve identified some issues with it in their blog post

27.11.2024 20:33 — 👍 0    🔁 0    💬 0    📌 0
Post image

With some sort of scaling graph too! Wish they were more clear about what the x-axis represents

27.11.2024 19:54 — 👍 3    🔁 0    💬 1    📌 0

Finished training in the morning and evals already done is impressive speed

26.11.2024 21:07 — 👍 0    🔁 0    💬 1    📌 0

Wow

26.11.2024 20:13 — 👍 0    🔁 0    💬 0    📌 0

@stochasticchasm is following 20 prominent accounts