Applications to MIT EECS have closed, but if you submitted one and the above describes you, please consider filling out this form: docs.google.com/forms/d/e/1F...
02.12.2024 14:39 β π 5 π 1 π¬ 0 π 0@dhadfieldmenell.bsky.social
Assistant Prof of AI & Decision-Making @MIT EECS I run the Algorithmic Alignment Group (https://algorithmicalignment.csail.mit.edu/) in CSAIL. I work on value (mis)alignment in AI systems. https://people.csail.mit.edu/dhm/
Applications to MIT EECS have closed, but if you submitted one and the above describes you, please consider filling out this form: docs.google.com/forms/d/e/1F...
02.12.2024 14:39 β π 5 π 1 π¬ 0 π 0Some specific skills:
- JD candidates with software/ML systems background interested in technical AI governance
- Systems/infrastructure engineers passionate about alignment
- Researchers in preference learning, RLHF, or constitutional AI
Ideal candidates have expertise in one of:
- Systems engineering + ML infrastructure
- Legal/regulatory frameworks (especially JD + CS background)
- Foundation model pre-training
- Bayesian inference methods
- HCI/HRI
and are excited to learn the others.
π’ Seeking PhD students for AI alignment research. Our lab investigates technical mechanisms for value learning, pre-training alignment, and regulatory frameworks. Come work with us if you want to bridge technical ML and legal/policy domains. Details in thread π§΅
02.12.2024 14:39 β π 18 π 6 π¬ 3 π 1Genuine question for people who use Bluesky more frequently than I do. What are tips for getting things to work well without algorithmic recs? I spent a lot of time curating my recs on the other place and found it useful (mostly...). Any tools that let me do it here?
12.11.2024 13:24 β π 10 π 0 π¬ 2 π 0Democrats perfected defenses against yesterday's threat. Now we must dismantle them. (13/13)
Thread: bsky.app/profile/dhad...
Article: tinyurl.com/dems-2024-ma...
Stop molding perfect successors. Build real internal diversity so strong candidates emerge naturally. Ironically, Bernie had what we needed - authenticity in an anti-establishment moment. (12/13)
08.11.2024 14:37 β π 0 π 0 π¬ 1 π 0The path forward? More debate, less polish. Trade enforced unity for earned consensus. Public conflict builds more trust than artificial agreement. (11/13)
08.11.2024 14:37 β π 0 π 0 π¬ 1 π 0White liberals face their own trap: memorized talking points instead of real understanding. When you're deferring to others' expertise, you can't be indignant at disagreement. (10/13)
08.11.2024 14:37 β π 0 π 0 π¬ 1 π 0The rot runs deeper. Biden's delayed exit. Pelosi and Schumer aging in place. Running Harris showed how badly party elites lost touch. (9/13)
08.11.2024 14:37 β π 0 π 0 π¬ 1 π 0Democrats responded backward. Their 2024 machine ran perfectly - saved 3 points in battlegrounds. Couldn't stop a 6-point tide against artificial, tired politics. (8/13)
08.11.2024 14:37 β π 1 π 0 π¬ 1 π 0Enter Trump. His lack of restraint signals authenticity. Can't control message = probably not lying long-term. The GOP establishment fought this reality. Lost. Raw beats scripted. (7/13)
08.11.2024 14:36 β π 0 π 0 π¬ 1 π 0This artificial unity created real weakness. When Dems got tagged with "Defund the Police," their calculated pushback only confirmed suspicions: the fringe spoke party truth. (6/13)
08.11.2024 14:36 β π 0 π 0 π¬ 1 π 0Voters who watch streamers and reality TV daily spot the difference between real interaction and careful curation. They've seen behind the curtain. They're tired of perfect polish. (5/13)
08.11.2024 14:36 β π 0 π 0 π¬ 0 π 0That world died. Today's fractured media means a gaffe on Twitter becomes authenticity on TikTok. The same moment: both scandal and selling point. (4/13)
08.11.2024 14:35 β π 0 π 0 π¬ 1 π 0For years, Dems perfected obsolete strategy. Clinton's coronation. Biden's backroom deals. Harris's orchestrated succession. All built for an era of controlled messaging. (3/13)
08.11.2024 14:35 β π 0 π 0 π¬ 2 π 0In 1940, France built perfect defenses against the last war. The Germans went around them. The Democratic Party just did the same. (2/13)
08.11.2024 14:34 β π 0 π 0 π¬ 1 π 0I usually focus my platforms on my work. However, I did some writing to process some of my thoughts about the election and wanted to share them. I'm curious to hear anyone's thoughts and reactions.
tinyurl.com/dems-2024-ma...
π§΅ The Democratic Party's Maginot Line (1/13)
This is a really welcome development. This is the kind of action that we argued for in a policy brief on LLMs β the first goal of AI regulation has to be establishing a default where existing laws can not be dodged through automation.
www.ftc.gov/news-events/...
computing.mit.edu/ai-policy-br...
Iβm doing some lecture prep for a course on AI & Society to cover interpretability, explanations, benchmarks, and evaluations.
What are your favorite papers in the space? Any suggestions for an advanced undergrad cohort?
My department (MIT Brain & Cognitive Sciences) is hiring a tenure-track faculty! We're especially interested in researchers who span multiple levels of analysis. Candidates from underrepresented backgrounds strongly encouraged to apply. Apply by November 1! academicjobsonline.org/ajo/jobs/25916
20.10.2023 00:30 β π 36 π 38 π¬ 0 π 1Now published in Patterns, my paper on how to do metric design better. This is important everywhere - academics use simple metrics for tenure, governments often perform poorly using metrics for rules, and employees have targets that hurt their company.
18.10.2023 13:58 β π 4 π 1 π¬ 1 π 0I especially enjoyed the part of this game where the CEO threatened to fire me because I banned someone and then I had to testify in front of congress. 10/10, fun experience, would recommend.
17.10.2023 14:11 β π 701 π 102 π¬ 8 π 5This looks like a great way to learn about the complexity involved in managing moderation
17.10.2023 15:48 β π 2 π 0 π¬ 0 π 0Our lab has three paper talks at CSCW! But I want to highlight this one because @cqz.bsky.social is on the job market this year!! He works in crowdsourcing and human-AI systems. Make sure to check out his presentation on Wednesday. arxiv.org/abs/2305.01615
15.10.2023 21:45 β π 12 π 8 π¬ 0 π 0Ukrainian drone maker says their drones are autonomously making kill decisions. If this turns out to be true, it will be a turning point in war forever.
(Unfortunately this is behind a paywall so I cannot see the contents of the article)
www.newscientist.com/article/2397...
One of the reasons (and there are several) we see platforms keep making avoidable mistakes is that vanishingly little of the tech needed to do T&S work exists outside of big companies. We keep reinventing the same wheels.
Basically every platform has a bad usernames list. Why not open-source them?
In our paper studying creators' use of word filters against harassing comments, we find that a lot of creators wanted to build off of existing bad-word lists they trusted. Unfortunately, many popular lists like LDNOOBW have issues of bias. 1/n
https://arxiv.org/pdf/2202.08818.pdf
Interesting tidbit from Meta staff at TrustCon just now: >90% of the CSAM Meta report to NCMEC is visually similar to content theyβve reported before.
The argument goes: The same bad content circulates again and again, so effective moderation requires you to get very good at similarity detection.
Bluesky is a public benefit corp with the mission βto develop and drive large-scale adoption of technologies for open and decentralized public conversation.β
The PBC status allows us to pursue our mission above profit, but we still need to make this open ecosystem sustainable.