Patrick Haller's Avatar

Patrick Haller

@phmaker.bsky.social

PhD student | parameter- and sample-efficient language modeling | at HU Berlin

7 Followers  |  10 Following  |  1 Posts  |  Joined: 10.03.2025  |  1.5835

Latest posts by phmaker.bsky.social on Bluesky

Preview
BabyHGRN: Exploring RNNs for Sample-Efficient Language Modeling Patrick Haller, Jonas Golde, Alan Akbik. The 2nd BabyLM Challenge at the 28th Conference on Computational Natural Language Learning. 2024.

Are transformers really all we need? I doubt it. We tested alternative backbones for language models in low-resource scenarios — #Mamba, #xLSTM, and #HGRN2 — and they work surprisingly well!

📄 Paper: aclanthology.org/2024.conll-b...

Thanks for being part of the #BabyLM Challenge! 👶

11.03.2025 10:05 — 👍 2    🔁 0    💬 1    📌 0

@phmaker is following 10 prominent accounts