Oscar Sainz's Avatar

Oscar Sainz

@osainz.bsky.social

Postdoctoral Researcher at the University of the Basque Country (UPV/EHU).

22 Followers  |  54 Following  |  3 Posts  |  Joined: 19.05.2025  |  1.3568

Latest posts by osainz.bsky.social on Bluesky

Post image Post image

Ayer uno de nuestros investigadores, Oscar Sainz (@osainz.bsky.social‬), fue galardonado con el premio a la mejor tesis doctoral en Inteligencia Artificial por la Asociación Española para la Inteligencia Artificial (AEPIA). ¡Enhorabuena! 🥳

10.07.2025 07:25 — 👍 5    🔁 2    💬 0    📌 0
Preview
«Kaixo, Latxa naiz. Zer jakin nahi duzu gaur?» Euskarazko txatbota sortu du EHUko HiTZ ikerketa zentroak. Oraindik ez dute jendaurrean zabaldu, baina garatzaileek eta enpresek eskuratzeko aukera dute. BERRIAko testuak erabili dituzte Latxa entr...

«Kaixo, Latxa naiz. Zer jakin nahi duzu gaur?». Euskarazko txatbota sortu du EHUko HiTZ ikerketa zentroak. Oraindik ez dute publikora zabaldu, baina garatzaileek eta enpresek eskuratzeko aukera dute. BERRIAko testuak erabili dituzte Latxa entrenatzeko.
t.co/OPVNnBG2xW?utm_...

16.06.2025 21:00 — 👍 2    🔁 3    💬 0    📌 1

While the experiments were not complicated, they required the collaboration of amazing co-authors, many compute hours, and of course, the impressive collaboration of the Basque community that was involved in manually assessing the models on an arena style evaluation.

Thank you!

11.06.2025 18:01 — 👍 1    🔁 1    💬 0    📌 0

In this work we face the challenge of developing instruct models for Basque, a low-resource language.

Continue pretraining base models is intuitive, but what about instructed models? We analyze systematically all different approaches to find the best solution.

2/3

11.06.2025 18:01 — 👍 2    🔁 1    💬 1    📌 0

Do you know that you can continue pretraining Instructed LLMs without losing their instruction following capabilities?

We did so to teach Basque to Llama models with promising results!

Interestingly, you only need English instructions and target language corpora 🤯

1/3

11.06.2025 18:01 — 👍 6    🔁 3    💬 1    📌 0
Post image

[1/7]
#newHitzPaper

Many languages are underserved by open LLMs, and face the following question: Which is the best way to produce open instruction-tuned LLMs for low-resource languages?

We obtained great results for a cost-effective option!

📄Paper: arxiv.org/abs/2506.07597

11.06.2025 10:27 — 👍 7    🔁 3    💬 1    📌 0

@osainz is following 20 prominent accounts