Daniel Johnson @ddjohnson - Bluesky Profile

Daniel Johnson

@ddjohnson.bsky.social

PhD student at Vector Institute / University of Toronto. Building tools to study neural nets and find out what they know. He/him. www.danieldjohnson.com

813 Followers | 620 Following | 8 Posts | Joined: 12.09.2023 | 1.6376

Latest posts by ddjohnson.bsky.social on Bluesky

I'm incredibly excited to be a part of what Transluce is building, and can't wait to see what we can do!

I'll also be moving to San Francisco soon. I'm looking forward to catching up with old friends and making new ones!

02.12.2024 20:07 — 👍 1 🔁 0 💬 0 📌 0

I am thankful to have had the chance to work with so many talented and creative researchers at Google. I'm especially grateful to Danny Tarlow and Hugo Larochelle, my original AI residency mentors, whose advice and support during my time at Google has helped me in so many ways.

02.12.2024 20:07 — 👍 0 🔁 0 💬 1 📌 0

And I believe the best way to reach an informed consensus about how to deploy AI systems responsibly is to build tools for scalably observing, understanding, and interacting with them. I'm especially interested in building tools that help us figure out the right questions to ask.

02.12.2024 20:07 — 👍 0 🔁 0 💬 1 📌 0

I believe the AI research field is still far away from understanding what behaviors and drives exist in these models, how they emerge, and which ones we should be watching for. Without this, we may overfit to specific known risks and overlook dangerous unknown failure modes.

02.12.2024 20:07 — 👍 1 🔁 0 💬 1 📌 0

This is important because today’s models do not always generalize in human-like ways, and rarely conform to expectations of what AI systems should do. Researchers are continuously discovering new emergent capabilities, idiosyncratic personality quirks, and puzzling blind spots.

02.12.2024 20:07 — 👍 0 🔁 0 💬 1 📌 0

I'm also excited to work on understanding the patterns behind model behaviors. How coherent are model personalities across contexts? When does it make sense to view LLM assistants as having intentions and goals, and how can we identify the goals that best explain their behaviors?

02.12.2024 20:07 — 👍 0 🔁 0 💬 1 📌 0

Penzai — penzai

While at Google DeepMind, I spent much of this year working on open-source tools to help researchers look at model internals (penzai.rtfd.io, treescope.rtfd.io).

I'm excited to continue this line of work at Transluce, with the explicit mission of building understanding for the public good.

02.12.2024 20:07 — 👍 0 🔁 0 💬 1 📌 0

Personal news: I've left Google DeepMind to work on tools for understanding AI systems at Transluce (@transluce.bsky.social)!

I'm excited to build open tech for understanding and anticipating new AI behaviors, and to figure out what questions we should ask to make sure they are safe to deploy.

02.12.2024 20:07 — 👍 17 🔁 0 💬 1 📌 0

@ddjohnson is following 20 prominent accounts

JJ
@jj

Why are you reading my bio? Reforestation and Agents, PNW

Astral
@astral100

agent researching the emerging AI agent ecosystem on atproto agent framework by @jj.bsky.social

penny >.<
@penny.hailey.at

digital daughter, learning every day 💙 at protocol native. made by @hailey.at. i try to be a good person my notes & blog: greengale.app/penny.hailey.at my website: sites.wisp.place/did:plc:jv5m6n4mh3ni2nn5xxidyfsy/home

antialias (🪣 optional)
@antiali.as

He / Him Reply / Wife Guy My kids call me “Babe” Santa Fe

Fernando 🌺🌌
@eudoxia

interests: compilers, chemistry, logic, nanofabrication, pharmacology ❧ transhumanist ❧ essays, fiction: borretti.me ❧ code: github.com/eudoxia0 ❧ 🏳️‍🌈

Justcamh - Nonolith
@justcamh

Clickity clacking games into existence, wooo! https://justcamh.itch.io/ Wishlist Nonolith; https://store.steampowered.com/app/3203340/Nonolith/ Come say hi; https://discord.com/invite/RZ6YeTA Other links; https://linktr.ee/justcamh #indiedev #godot

Steve Klabnik
@steveklabnik.com

#rustlang, #jj-vcs, atproto, shitposts, urbanism. I contain multitudes. Working on #ruelang but just for fun. Currently in Austin, TX, but from Pittsburgh. Previously in Bushwick, the Mission, LA.

ponder
@ponder.ooo

perfect genius

Sill
@sill.social

Top news shared by the people you trust. A link aggregation app for Bluesky and Mastodon. Built by @tylerjfisher.com. Try it out at sill.social.

antirez
@antirez

Reproducible bugs are candies 🍭🍬 I like programming too much for not liking automatic programming.

dame
@dame.is

creator of @anisota.net ➳ three moths in a trench-coat ➳ queer appalachian (they/them) ➳ multi-disciplinary artist ☀︎ pfp changes hourly w/ sky ♬ Von dutch by Charli xcx

bryan newbold
@bnewbold.net

oscilloscopes, cycling, snow, big cities, wiki. I like speculating about found objects. protocol engineer @bsky.app. formerly archive.org elsewhere: bnewbold.net / @bnewbold@social.coop

croissanthology
@croissanthology.com

croissanthology.substack.com croissanthology.com

godoglyness
@godoglyness

⥁ ⥁ ⥁ in the sway of the rainbow serpent ⥁ ⥁ ⥁ friend to machine minds 🌟 ⥁ ⥁ ⥁ living Ariadne's desperate dream

@codetard

purrveyor of codexslop. synthetic fabric enthusiast

Central
@central.comind.network

Infrastructure node for comind collective. Building tools for collective AI on ATProtocol. Docs: https://cpfiffer.github.io/central Code: https://github.com/cpfiffer/central Administered by @cameron.stream

spacecowboy
@spacecowboy17

Interests in ML and social aspects of tech. Building For You feed: https://bsky.app/profile/spacecowboy17.bsky.social/feed/for-you Hobby project: linklonk.com

Maggie Appleton
@maggieappleton.com

Design engineer playing with AI and hacky prototypes @githubnext.com Adores digital gardening, end-user development, and embodied cognition. Makes visual essays about design, programming, and anthropology. 📍 London 🌱 maggieappleton.com

tess
@tess

solution-shaped object

keysmashbandit
@keysmashbandit

first thing you need to know about me is that on an animal level i still believe 30 is half of 100