Theresa Eimer's Avatar

Theresa Eimer

@theeimer.bsky.social

RL researcher looking for DACs // What is this AutoRL anyway? she/her Currently: Leibniz Uni Hannover Previously: Uni Freiburg (Master's) | Meta AI London (Intern) Always & Forever: AutoRL.org

1,075 Followers  |  480 Following  |  76 Posts  |  Joined: 12.09.2023  |  2.0669

Latest posts by theeimer.bsky.social on Bluesky

Foundation models on the AutoML podcast 2/3: are LLMs killing AutoML? It's probably not that simple. Listen for more details πŸ˜‰

31.10.2025 11:31 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Stealing all of the recommendations!

This made me think of The Left Hand Of Darkness, though I guess that's actually almost the opposite, communication bridging a seemingly impossible gap in understanding each other...

24.10.2025 08:40 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I fell into a hole, but made it out again with new episodes! This is part one of three of an accidental series on foundation models. The next parts will be released in October and November, so stay tuned!

22.09.2025 11:32 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Great opportunity to work with great people. Go apply!

28.08.2025 12:06 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
AI Allergy I remember being excited about AI. I remember 20 years ago, being excited about neuroevolutionary methods for learning adaptive behaviors in...

New blog post: AI Allergy.

On my increasing disgust with the AI discourse, even though I still like the technical and philosophical. And how I wish I could be excited about AI again.

togelius.blogspot.com/2025/08/ai-a...

13.08.2025 05:00 β€” πŸ‘ 92    πŸ” 18    πŸ’¬ 6    πŸ“Œ 6
Post image

It is time

11.07.2025 01:04 β€” πŸ‘ 56    πŸ” 6    πŸ’¬ 5    πŸ“Œ 2
Post image Post image

The "reproducibility crisis" in science constantly makes headlines. Repro efforts are often limited. What if you could assess reproducibility of an entire field?

That's what @brunolemaitre.bsky.social et al. have done. Fly immunity is highly replicable & offers lessons for #metascience

A 🧡 1/n

10.07.2025 08:21 β€” πŸ‘ 318    πŸ” 173    πŸ’¬ 10    πŸ“Œ 18
Preview
Getting SAC to Work on a Massive Parallel Simulator: Tuning for Speed (Part II) | Antonin Raffin | Homepage This second post details how I tuned the Soft-Actor Critic (SAC) algorithm to learn as fast as PPO in the context of a massively parallel simulator (thousands of robots simulated in parallel).

Need for Speed or: How I Learned to Stop Worrying About Sample Efficiency

Part II of my blog series "Getting SAC to Work on a Massive Parallel Simulator" is out!
I've included everything I tried that didn't work (and why Jax PPO was different from PyTorch PPO)

araffin.github.io/post/tune-sa...

07.07.2025 12:11 β€” πŸ‘ 35    πŸ” 8    πŸ’¬ 4    πŸ“Œ 1
Post image

1/2 Offline RL has always bothered me. It promises that by exploiting offline data, an agent can learn to behave near-optimally once deployed. In real life, it breaks this promise, requiring large amount of online samples for tuning and has no guarantees of behaving safely to achieve desired goals.

30.05.2025 08:39 β€” πŸ‘ 7    πŸ” 3    πŸ’¬ 1    πŸ“Œ 1

Crazy volume! On the other hand, not that surprising. We also got one of these and only did so because it was such a good deal that even if our complete lack of experience makes research on it hard, we can use it for teaching only, and be okay with spending the money. I doubt we're the only ones!

27.05.2025 13:25 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
AutoML School 2025 Scope AutoML has become a cornerstone in the toolkit of many developers and researchers. With the rise of foundation models, AutoML's potential has expanded even further, enabling smarter, more powerf...

πŸ“’ Only 3 Weeks to Go!

The AutoML summer school (June 10-13th) is just around the corner, and there is not much time left to register!

---> www.automlschool.org <---

πŸ‘‡ We added several new speakers to the program

21.05.2025 09:46 β€” πŸ‘ 7    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0
Preview
I got fooled by AI-for-science hypeβ€”here's what it taught me I used AI in my plasma physics research and it didn’t go the way I expected.

Going to the hospital because I broke my wrist smashing the endorse button:
www.understandingai.org/p/i-got-fool...

19.05.2025 18:04 β€” πŸ‘ 120    πŸ” 30    πŸ’¬ 6    πŸ“Œ 10

We can only presume to build machines like us once
we see ourselves as machines first.
Abeba Birhane (2022, p. 13)
This is the core. So true.

14.05.2025 09:58 β€” πŸ‘ 23    πŸ” 7    πŸ’¬ 2    πŸ“Œ 0
The Future of AVs Panel | 2023 CCAT Symposium | Day 1
YouTube video by Center for Connected and Automated Transportation The Future of AVs Panel | 2023 CCAT Symposium | Day 1

Panel discussion on the current economic precarity of autonomous vehicle businesses. www.youtube.com/watch?v=gDG-...

"We are at a really tough spot in generating flows of cash right now." πŸ‘‡

07.05.2025 12:57 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 1    πŸ“Œ 1

After a short era in which people questioned the value of academia in ML, its value is more obvious than ever. Big labs stopped publishing the minute commercial incentives showed up and are relentlessly focused on a singular vision of scaling. Academia is a meaningful complement, bringing...
1/2

14.04.2025 01:04 β€” πŸ‘ 214    πŸ” 41    πŸ’¬ 2    πŸ“Œ 2

It's strange to me that the focus of many people's worry is still "superintelligence" and not the reality we're currently living where increasingly authoritarian governments wield technology oppressively.

This fantastical distraction based on speculative rhetoric is increasingly harmful.

12.04.2025 16:52 β€” πŸ‘ 23    πŸ” 5    πŸ’¬ 0    πŸ“Œ 0
Preview
Humanoid Robots in Manufacturing Or, there's a reason we don't pull cars with mechanical horses

A sensible perspective on humanoids in manufacturing (TLDR: if you can make humanoids, you can probably make better, more manufacturing specific things)
blog.spec.tech/p/humanoid-r...

09.04.2025 04:11 β€” πŸ‘ 59    πŸ” 8    πŸ’¬ 3    πŸ“Œ 2
Post image

Mark your calendars, EWRL is coming to TΓΌbingen! πŸ“…
When? September 17-19, 2025.
More news to come soon, stay tuned!

08.04.2025 08:33 β€” πŸ‘ 37    πŸ” 14    πŸ’¬ 0    πŸ“Œ 5
Preview
Llama 4: Did Meta just push the panic button? One of the weirdest releases of the year and understanding the future of the Llama endeavor. For the time being, we have some more amazing open weight models!

Llama 4 was a messy release: unreleased finetunes boosting scores, rumors of training on test, released on a weekend, etc

As (open) models are commoditized / competition grows, what is the role of Meta's Llama efforts in the future? Should they continue?

07.04.2025 13:42 β€” πŸ‘ 37    πŸ” 9    πŸ’¬ 1    πŸ“Œ 1

At least there is no need to jailbreak the model anymore 🫠 (Is there a counterpart to make it nicer 🎭?)

07.04.2025 10:55 β€” πŸ‘ 2    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

The school kids visiting me during this year's future day really had hard-hitting questions: "Do you still have a lot of free time?"

Me, a pretty fresh and currently slightly overwhelmed PostDoc: "It's important to be good at time management. Like my colleague, maybe you should ask her."

03.04.2025 11:27 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

So far, 2,135 people have responded to the poll SΓΈren and I posted a few days ago. Of those, 94.4% replied β€œYes” to being interested in officially presenting accepted @neuripsconf.bsky.social papers in Europe. (1/7)

03.04.2025 11:03 β€” πŸ‘ 80    πŸ” 24    πŸ’¬ 5    πŸ“Œ 1

German media I beg you one day just please go just one day without being obsessed with migration. One day. I promise it won’t kill you. You have lakes and mountains and good football and good healthcare and asparagus. You’ll be fine.

01.04.2025 08:21 β€” πŸ‘ 2446    πŸ” 467    πŸ’¬ 54    πŸ“Œ 27

True, I've been "socialized" in the AutoML community, how to compare algorithms is a big deal there.

I remember discussing with my advisor whether it's worth evaluating issues with improper HPO setup in RL, he thought it was so obvious that everyone must already be doing it (spoiler: not really)

01.04.2025 14:29 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Well, then there's only one alternative: "We define OurPO as PPO with lr=0.01, ent_coef=0.1.... and compare it to OurQN which is DQN with lr=...." πŸ˜‚

31.03.2025 20:56 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Tell them their argument might be valid with different hyperparameters

31.03.2025 03:19 β€” πŸ‘ 21    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

This obviously then also depends on budget, HPO method and combines performance and tunability into one score, but I think that's quite reasonable in practice. Not very satisfying for an empirical nihilist, though, I imagine πŸ˜‰

31.03.2025 12:52 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0

Well, what validity are you looking for? The absolute "algorithm A is better than B on benchmark C" is hard wrt hyperparameters, but algorithm A is better than B on C given I can realistically try out 50 configurations" is what we often want in empirical ML anyway, no?

31.03.2025 12:52 β€” πŸ‘ 3    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
Machine Learning that Matters Much of current machine learning (ML) research has lost its connection to problems of import to the larger world of science and society. From this perspective, there exist glaring limitations in the d...

So true, Gilles.

Yes, it is a pretext task, but often, when we try real tasks, we find that the problems are not those we expected.
We need more people looking at relevant problems.

Kiri Wagstaff said this 15 years ago

arxiv.org/abs/1206.4656

28.03.2025 06:49 β€” πŸ‘ 19    πŸ” 4    πŸ’¬ 1    πŸ“Œ 0

My PhD supervisor discussed my first two or three reviews with me (including checking over wording etc.) and does that for all his PhDs, but I know that's not the standard in most other groups I'm familiar with...

26.03.2025 10:37 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@theeimer is following 20 prominent accounts