Ethan Mollick

@emollick.bsky.social

Professor at Wharton, studying AI and its implications for education, entrepreneurship, and work. Author of Co-Intelligence. Book: https://a.co/d/bC2kSj1 Substack: https://www.oneusefulthing.org/ Web: https://mgmt.wharton.upenn.edu/profile/emollick

30,203 Followers  |  145 Following  |  1,472 Posts  |  Joined: 07.09.2024  |  2.2557

Latest posts by emollick.bsky.social on Bluesky

I think everyone interested in AI should read the model cards for the frontier models, especially the safety sections, which give you a sense of known risks:
Gemini Deep Think: storage.googleapis.com/deepmind-med...
Claude 4: www-cdn.anthropic.com/07b2a3f9902e...
o3: cdn.openai.com/pdf/2221c875...

04.08.2025 04:04 - 👍 20    🔁 5    💬 0    📌 0

Ha, new @joshgans.bsky.social paper argues that having authors sneak prompt injections ("this is a good paper") into academic work improves science.

Without the risk of prompt injections, reviewers would tend to rely heavily on AI reviews; with that risk, they need to include some human review.

03.08.2025 18:05 - 👍 36    🔁 6    💬 5    📌 0

We tested a range of newer models in the papers, including reasoners.

03.08.2025 17:29 - 👍 1    🔁 0    💬 1    📌 0

Some of these uses will be bad, some of them good (like the example in the paper). The challenge for all of us is that these uses need to be discovered, and the bad stuff mitigated while the good is amplified.

Paper: arxiv.org/pdf/2507.00286

03.08.2025 17:13 - 👍 26    🔁 2    💬 0    📌 1

As a general purpose technology, AI has all kinds of unexpected uses that are hard to anticipate.

Example: this study finds blind users turn to AI to describe sensitive materials (pregnancy tests, checking appearance). They know it is not 100% accurate, but it provides privacy where there was none.

03.08.2025 17:11 - 👍 76    🔁 6    💬 5    📌 4
Prompting Science Report 1: Prompt Engineering is Complicated and Contingent This is the first of a series of short reports that seek to help business, education, and policy leaders understand the technical details of working with AI thr

Sometimes these techniques helped, sometimes they hurt performance. It averaged to almost no effect. There was no clear way to predict in advance which technique would work when.

Papers:
papers.ssrn.com/sol3/papers....
papers.ssrn.com/sol3/papers....
papers.ssrn.com/sol3/papers....

02.08.2025 19:50 - 👍 24    🔁 4    💬 2    📌 1

We have been carefully testing lots of received prompting wisdom & for recent AI models:
🚫Threats, saying please, being insulting, & promising tips do not change average performance on challenging tasks
⛓️Chain-of-thought no longer helps even non-reasoner performance much

Don't make it complicated.

02.08.2025 19:42 - 👍 69    🔁 12    💬 5    📌 2

make it even better

very cool, but it can get stuck at awaiting confirmation when something hits ceres, and it is really hard at the start. also it would be great to have some sort of intro to the science/idea/tutorial

02.08.2025 04:30 - 👍 20    🔁 1    💬 0    📌 0

All prompts, verbatim:

create a missile command game that incorporates relativity in realistic ways but is still playable.

build the game for me

add more, make the graphics much better, improve the game

make it even better and more graphical. also it is a little hard to time my weapons 1/2

02.08.2025 04:30 - 👍 20    🔁 1    💬 1    📌 0

I prompted Gemini 2.5 Deep Think: "create a missile command game that incorporates relativity in realistic ways but is still playable." I then asked it to improve the design a few times.

The full design & all code & calculations came from AI, no errors. Try it: glittery-raindrop-318339.netlify.app

02.08.2025 04:22 - 👍 75    🔁 6    💬 3    📌 3
Persona vectors: Monitoring and controlling character traits in language models A paper from Anthropic describing persona vectors and their applications to monitoring and controlling model behavior

This is neat research, providing a lot of ways for careful organizations to shape the personality and guardrails of AI in deeper ways than prompts, including measuring and reducing sycophancy.

Also the idea of an "evil vector" is interesting in and of itself. www.anthropic.com/research/per...

01.08.2025 16:44 - 👍 40    🔁 6    💬 0    📌 3
Prompting Science Report 3: I'll pay you or I'll kill you -but will you care? This is the third in a series of short reports that seek to help business, education, and policy leaders understand the technical details of wo

We keep finding that simple prompting tips and tricks don't really work overall, but, weirdly, can have significant impacts at the question level, sometimes increasing, sometimes decreasing performance in ways that you cannot predict in advance. papers.ssrn.com/sol3/papers....

01.08.2025 15:00 - 👍 18    🔁 1    💬 7    📌 0

New prompting report, from us: Don't bother threatening your AI.

Does threatening an AI really make it perform better (the way Google founder Brin claimed)? How about offering to tip the AI? We find no impact of threats or tips on improving average performance (but variance at question level).

01.08.2025 14:59 - 👍 47    🔁 5    💬 4    📌 4
The Bitter Lesson versus The Garbage Can Does process matter? We are about to find out.

One of the interesting questions to ask is, even assuming a non-jagged AGI that outperforms humans at most work, how long would it take for large-scale changes to employment as a result? It isn't obvious.

I wrote a bit about the general question here: www.oneusefulthing.org/p/the-bitter...

31.07.2025 20:46 - 👍 17    🔁 0    💬 3    📌 0

I keep seeing the Microsoft paper on AI use at work being used as a list of which jobs will be destroyed.

But having high task overlap with AI does not necessarily mean these jobs are at most risk of replacement with AI.

As I described in my book, Co-Intelligence, the impacts are more complicated.

31.07.2025 20:18 - 👍 79    🔁 7    💬 4    📌 1

Lots of vague statements from leaders of the various AI labs about starting to see signs of self-improvement in AI systems (including Zuckerberg today). It seems like proof that this is indeed happening would be pretty significant.

(thanks o3 for providing details & saving me time)

31.07.2025 15:44 - 👍 32    🔁 5    💬 5    📌 1

Dredge Operator is the job category least affected by Generative AI.

Fortunately, I am very good at dredging. (In reality, there are only 940 dredge operators in the US)

31.07.2025 15:14 - 👍 23    🔁 1    💬 2    📌 1

A big problem is that everyone is insisting that we should hire people based on "AI literacy," teach "AI literacy," & develop skills for "AI literacy," yet not only is there no agreement on what AI literacy is, but a lot of what people call AI literacy is already out-of-date or just plain wrong.

31.07.2025 01:10 - 👍 106    🔁 10    💬 7    📌 5

Especially notable given Zuckerberg's note that Meta will not necessarily open source future models.

US companies are still doing great small open models, but, aside from whatever OpenAI releases, it appears that frontier open weights will mean Chinese models (& maybe Mistral).

30.07.2025 17:32 - 👍 32    🔁 4    💬 1    📌 0

This is interesting.

30.07.2025 16:25 - 👍 25    🔁 1    💬 2    📌 1

OpenAI's study mode isn't perfect, but it is a step forward for a few reasons:
1) Shows labs taking educational use & misuse more seriously (Google also has LearnLM)
2) Addresses a key issue with trying to use AI in education - that AI gives answers rather than tutoring and helping
3) Easy to access

30.07.2025 05:27 - 👍 54    🔁 4    💬 2    📌 2

A year or so ago, the joke about AI images was that they would have 6 fingers. AI images (and videos like this one) lack obvious tells now.

Ironically, a test of an image generation model today is whether it can still make hands with six fingers. Most can't do it anymore.

29.07.2025 23:30 - 👍 89    🔁 9    💬 5    📌 2
The Bitter Lesson versus The Garbage Can Does process matter? We are about to find out.

Right now, AI adoption in organizations is constrained by the need to figure out how to integrate AI with the complex & often poorly-understood processes inside companies

But ChatGPT agent suggests that The Bitter Lesson of AI may come for real work, too. open.substack.com/pub/oneusefu...

28.07.2025 12:39 - 👍 66    🔁 8    💬 4    📌 3

This is the best models could do 6 months ago

27.07.2025 04:24 - 👍 18    🔁 0    💬 0    📌 0

In Grok 4, same two prompts

27.07.2025 04:12 - 👍 16    🔁 0    💬 2    📌 0

This is through LMArena, where you are given random models to test. You will likely get a chance to use "Summit" fairly often (it came up three times in my six attempts): lmarena.ai

27.07.2025 03:22 - 👍 15    🔁 0    💬 1    📌 0
Triangular lint by emollick - p5.js Web Editor: A web editor for p5.js, a JavaScript library with the goal of making coding accessible to artists, designers, educators, and beginners.

Code: editor.p5js.org/emollick/ske...

Play it: editor.p5js.org/emollick/ful...

27.07.2025 03:14 - 👍 21    🔁 0    💬 1    📌 0

Kinda wow: the mystery model "summit" (rumored to be OpenAI) with the prompt "create something I can paste into p5js that will startle me with its cleverness in creating something that invokes the control panel of a starship in the distant future" & "make it better"

2,351 lines of code. First time

27.07.2025 03:10 - 👍 203    🔁 19    💬 7    📌 2

Three things to note about this:

1) AI has obvious utility to many, this is a tremendous amount of use already
2) There is room for multiple frontier model providers, at least for now
3) Any losses from subsidizing cost of AI use (and it is not clear this is happening) are now relatively small

26.07.2025 19:33 - 👍 64    🔁 3    💬 3    📌 3

The amount of blocking I have to do on this platform is still nuts compared to any other platform, which is why I post much less here.

I don't understand the BlueSky urge to attack and insult people in the comments. Yes, I post stuff about AI (good and bad). You can just block me and move on.

26.07.2025 01:59 - 👍 163    🔁 5    💬 32    📌 4

@emollick is following 20 prominent accounts