Thrilled to release Gaperon, an open LLM suite for French, English and Coding π§
We trained 3 models - 1.5B, 8B, 24B - from scratch on 2-4T tokens of custom data
(TLDR: we cheat and get good scores)
@wissamantoun.bsky.social @rachelbawden.bsky.social @bensagot.bsky.social @zehavoc.bsky.social
07.11.2025 21:11 β
π 35
π 18
π¬ 1
π 4
Yeah, posting something that big for us 2mn before the we in the US and late in the evening in France is so not ideal right before a 4 day week-end here, lol so we'll redo it again and tell you guys much more.. #TrainingTragedy
Tbh the only visual allegory possible is this...
07.11.2025 22:51 β
π 7
π 6
π¬ 1
π 0
Thank you for the interest in our work. Look forward to any feedback.
16.12.2024 19:15 β
π 1
π 0
π¬ 0
π 0
WithdrarXiv: A Large-Scale Dataset for Retraction Study
Retractions play a vital role in maintaining scientific integrity, yet systematic studies of retractions in computer science and other STEM fields remain scarce. We present WithdrarXiv, the first larg...
π³ WithdrarXiv π
- Dataset of 14K+ withdrawn arXiv papers
- associated retraction comments
- entire history through 09/24
- taxonomy of retraction reasons, from critical errors to policy violations
- WithdrarXiv-SciFy, enriched version w/ scripts for parsed full-text PDFs
arxiv.org/abs/2412.03775
15.12.2024 18:34 β
π 158
π 46
π¬ 5
π 4
Juicy Research Ideas and How to Find them?
How do people come up with research ideas in AI? Will the "AI Scientist" finally make me work full-time on my chicken farm?
Stumbled across this post on Substack by
@deliprao.bsky.social today that I really appreciated as someone trying to break into the field. Simple categorizations can seem trite at times, but they can be deceptively profound in breaking down complex problems.
substack.com/home/post/p-...
09.12.2024 01:04 β
π 1
π 1
π¬ 2
π 0
anyone on my TL can endorse me for cs.DL (digital libraries) on arXiv? π
04.12.2024 22:56 β
π 1
π 0
π¬ 0
π 0
Releasing: a dataset of two million Bluesky posts.
This dataset has been collected using Bluesky's API, and I hope it will be useful for all the researchers out there!
27.11.2024 19:13 β
π 475
π 54
π¬ 249
π 136
Slack knows you have given up on the rest π
27.11.2024 18:47 β
π 2
π 0
π¬ 1
π 0
Nice crown molding
25.11.2024 15:20 β
π 0
π 0
π¬ 0
π 0
Are you rich enough to use compute as a noun?
23.11.2024 02:37 β
π 0
π 0
π¬ 0
π 0
May I propose beets
23.11.2024 02:35 β
π 11
π 0
π¬ 0
π 0
but you can run oogabooga
19.11.2024 16:17 β
π 2
π 0
π¬ 0
π 0
Did you just get your BlueSky invite? great! Now, help me complete my threads graph. π
https://www.threads.net/@delip.rao
06.07.2023 03:09 β
π 0
π 0
π¬ 0
π 0
Posts here are called beets. I donβt make the rules.
28.04.2023 04:31 β
π 4
π 0
π¬ 1
π 0
get in loser
weβre re-territorializing the hilbert space
28.04.2023 01:17 β
π 14
π 4
π¬ 1
π 0
New stage, new tune
28.04.2023 02:09 β
π 0
π 0
π¬ 0
π 0
Testing
25.04.2023 19:58 β
π 1
π 0
π¬ 1
π 0