Ahana (she/her)

@ahana.bsky.social

Reinforcement Learning PhD student, UPF Barcelona. Uncertain in the face of optimism. ahanadeb.com

1,740 Followers  |  644 Following  |  48 Posts  |  Joined: 13.11.2024

Posts by Ahana (she/her) (@ahana.bsky.social)

✨ The last day kicked off with an amazing talk by @katjahofmann.bsky.social
"World and Human Action Models for Gameplay Ideation" 🎮🤖

Exciting vision from the Game Intelligence team @msftresearch.bsky.social

19.09.2025 11:31 | 👍 13  🔁 2  💬 0  📌 1

I am in Vancouver at ICML, and tomorrow I will present our newest paper "Partially Observable Reinforcement Learning with Memory Traces". We argue that eligibility traces are more effective than sliding windows as a memory mechanism for RL in POMDPs. 🧵

16.07.2025 01:35 | 👍 59  🔁 12  💬 3  📌 3
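
To make the contrast in the post above concrete, here is a minimal sketch of the two memory mechanisms it names. It assumes the trace is an exponential moving average of one-hot observation encodings with decay `lam`; that form, and the function names `memory_trace` and `sliding_window`, are illustrative assumptions and are not taken from the paper.

```python
from collections import deque


def one_hot(index, size):
    """Encode a discrete observation as a one-hot vector."""
    return [1.0 if i == index else 0.0 for i in range(size)]


def memory_trace(observations, num_obs, lam=0.5):
    """Assumed trace form: z_t = lam * z_{t-1} + (1 - lam) * one_hot(o_t).

    The trace has fixed size (num_obs) no matter how long the history is,
    and older observations fade geometrically instead of being dropped.
    """
    z = [0.0] * num_obs
    for o in observations:
        e = one_hot(o, num_obs)
        z = [lam * zi + (1.0 - lam) * ei for zi, ei in zip(z, e)]
    return z


def sliding_window(observations, k):
    """Keep only the last k observations; anything older is forgotten."""
    window = deque(maxlen=k)
    for o in observations:
        window.append(o)
    return list(window)
```

For example, `memory_trace([0, 1], num_obs=2, lam=0.5)` gives `[0.25, 0.5]`, so the older observation still contributes with reduced weight, while `sliding_window([0, 1, 2, 3], k=2)` keeps only `[2, 3]`.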

After all these reports of authors adding language instructions for LLM reviews in their papers I wanted to check this myself and I downloaded the .tex source from one of these papers.

Here is an example.
(I will not share the identity of the paper)

05.07.2025 17:12 | 👍 386  🔁 125  💬 16  📌 33
I SENT A WHOLE FUCKEN SPOON THRU THAT THING AND NOTHING HAPEPNED AT ALL! SO WHAT DA FUCK!!! WHAT ELSE HAVE WE BEEN DOING OUR WHOLE LIFE'S THAT WAS A TOTAL LIE AND WHOSE BEEN KEEPING THE SPOONS OUT OF THE MICRO WAVE AND HOW ARE THEY PROFITTING FROM IT !!!!!!! AND DA TEXT SAYS "ACCIDENTALLY PUT A DAMB SPOON THRU THE MICRO WAVE AND NOTHING BAD HAPPENED" AND A SKELITEN IS HOLDEN THERE SPOON AND ITS FINE I GUESS . I DONT KNOW - DASHARE.ZONE ADMIN

WHAT ELSE DID THEY LIE ABOUT - dashare.zone ADMIN

15.06.2025 18:07 | 👍 261  🔁 22  💬 0  📌 4

Join us for Nneka's presentation tomorrow! Last talk before the summer break.

09.06.2025 17:43 | 👍 9  🔁 3  💬 0  📌 0

new preprint with the amazing @lviano.bsky.social and @neu-rips.bsky.social on offline imitation learning! learned a lot :)

when the expert is hard to represent but the environment is simple, estimating a Q-value rather than the expert directly may be beneficial. lots of open questions left though!

27.05.2025 07:12 | 👍 18  🔁 3  💬 1  📌 1

new work on computing distances between stochastic processes ***based on sample paths only***! we can now:
- learn distances between Markov chains
- extract "encoder-decoder" pairs for representation learning
- with sample- and computational-complexity guarantees
read on for some quick details..
1/n

26.05.2025 13:26 | 👍 37  🔁 10  💬 1  📌 0

Government officials are letting AI do their jobs. Badly
Offloading government responsibilities to AI can encourage discrimination, give wrong advice, and limit access to valid claims of asylum.

"The chatbot responded that it was perfectly okay for landlords to discriminate based on whether those potential tenants need rental assistance"

From "Government officials are letting AI do their jobs. Badly," by @emilymbender.bsky.social & @alexhanna.bsky.social

thebulletin.org/2025/05/gove...

31.05.2025 17:05 | 👍 50  🔁 26  💬 0  📌 3

Disabled people have been pleading with folks to show solidarity for years.

We saw the rising tide of eugenics the moment everyone declared Covid over & decided to leave us by the wayside.

Pandemics give rise to fascism. What we’re seeing is the result of allowing disabled folks to be left behind

21.05.2025 05:01 | 👍 781  🔁 277  💬 7  📌 11

found a cool game theory book
arxiv.org/abs/1512.06808

21.05.2025 10:18 | 👍 2  🔁 1  💬 1  📌 0

oofe I'm instantly hooked. Great find!

21.05.2025 12:09 | 👍 1  🔁 0  💬 0  📌 0

Trying to get my head round how this could make it to print. Many warning signs for all of us

20.05.2025 11:26 | 👍 786  🔁 216  💬 26  📌 10

Best of luck!!

09.05.2025 20:07 | 👍 1  🔁 0  💬 0  📌 0

In the year of our Lord 2025... we are still putting on makeup on women without their consent.
It's truly like feminism never happened 🙅‍♀️
(I have been ranting about this ever since people started using it to demonstrate GANs back in the 2010s, I can't believe it's still an "acceptable" task in AI!)

08.05.2025 15:24 | 👍 12  🔁 4  💬 0  📌 0

thank you! I really appreciate you taking the time <3

08.05.2025 10:13 | 👍 1  🔁 0  💬 0  📌 0

Why I don't write in my mother tongue
An attempt to untangle my feelings about my mother tongue, Bangla

I have started a substack lately on my random thoughts, if anyone's interested <3

02.05.2025 10:45 | 👍 3  🔁 1  💬 1  📌 0
Books on the bed

Singapore book haul! (+ St Jordi)

28.04.2025 13:44 | 👍 1  🔁 0  💬 0  📌 0

I do not necessarily agree with your first post, but this is really really disgusting and completely unwarranted. Hope you’re instantly blocking people like this, what the hell. Sending a lot of love and support <3

28.04.2025 03:18 | 👍 2  🔁 0  💬 1  📌 0

our work ~at~, sorry it's 2 am

27.04.2025 18:41 | 👍 1  🔁 0  💬 0  📌 0
Sadegh Talebi explaining the poster.

Poster png

Had a lot of fun presenting our work at "Offline RL in Regular Decision Processes: Sample Efficiency via Language Metrics" at #ICLR2025, with my co-authors Alessandro Ronca and Sadegh Talebi!

check out our paper here: openreview.net/forum?id=EW6...

27.04.2025 18:32 | 👍 7  🔁 0  💬 1  📌 0

"It's wonderful watching the well of knowledge being poisoned in real time".
📷 Emily Gorcenski

18.04.2025 09:02 | 👍 74  🔁 26  💬 1  📌 2

First draft online version of The RLHF Book is DONE. Recently I've been creating the advanced discussion chapters on everything from Constitutional AI to evaluation and character training, but I also sneak in consistent improvements to the RL specific chapter.

rlhfbook.com

16.04.2025 19:01 | 👍 122  🔁 19  💬 2  📌 3

A mid-week self reminder

16.04.2025 10:25 | 👍 2  🔁 0  💬 0  📌 0

A relatedly bad idea is working on research you don't deeply believe in for pragmatic reasons. You start with "oh, it'll be a quick low-hanging fruit project" and 9 months later you're still working on it and in despair

16.04.2025 03:22 | 👍 78  🔁 12  💬 3  📌 0

Took a weekend off to travel to Bruges. It was beautiful :O

14.04.2025 08:57 | 👍 3  🔁 0  💬 0  📌 0

Mark your calendars, EWRL is coming to Tübingen! 📅
When? September 17-19, 2025.
More news to come soon, stay tuned!

08.04.2025 08:33 | 👍 37  🔁 14  💬 0  📌 5

Thought-canceling headphones

11.03.2025 18:25 | 👍 3657  🔁 468  💬 2  📌 90

Beautiful.

11.03.2025 00:49 | 👍 118  🔁 20  💬 1  📌 2

Turing Award Goes to A.I. Pioneers Andrew Barto and Richard Sutton
Andrew Barto and Richard Sutton developed reinforcement learning, a technique vital to chatbots like ChatGPT.

Congrats to this year's Turing award winners! www.nytimes.com/2025/03/05/t...

Incidentally, if you'd like to hear from them, we know a place they've given / are giving keynotes

07.03.2025 02:38 | 👍 47  🔁 7  💬 0  📌 2

Made a small drawing in memory of my partner’s cat :’)

03.03.2025 11:27 | 👍 5  🔁 0  💬 0  📌 0