Overview of PixMo and its relation to Molmo's abilities: PixMo's caption data enables Molmo's fine-grained understanding; PixMo's AskModelAnything data enables Molmo's user interaction; PixMo's pointing data enables Molmo's pointing and counting; PixMo's synthetic data enables Molmo's visual skills.
Remember Molmo? The full recipe is finally out!
Training code, data, and everything you need to reproduce our models. Oh, and we have updated our tech report too!
Links in thread.
09.12.2024 18:33
The OLMo 2 models sit at the Pareto frontier of training FLOPs versus average model performance.
Meet OLMo 2, the best fully open language model to date, including a family of 7B and 13B models trained on up to 5T tokens. OLMo 2 outperforms other fully open models and competes with open-weight models like Llama 3.1 8B. As always, we have released our data, code, recipes, and more.
26.11.2024 20:51
Meet Tülu 3, a set of state-of-the-art instruct models with fully open data, eval code, and training algorithms.
We invented new methods for fine-tuning language models with RL and built upon best practices to scale synthetic instruction and preference data.
Demo, GitHub, paper, and models in thread.
21.11.2024 17:15
21.11.2024 16:26
The 2025 Conference on Language Modeling will take place at the Palais des Congrès in Montreal, Canada, from October 7-10, 2025.
PhD student @uwcse @uwnlp. Private pilot. Previously: @oculus, @IllinoisCS.
I work at Sakana AI | @sakanaai.bsky.social
https://sakana.ai/careers
Open spaces and open-sourced AI
ML/AI at AI2 http://semanticscholar.org, http://alongside.care, http://data-cowboys.com
PhD Candidate at UC Irvine, Research Intern @ai2 | Previously ASAPP Amazon LinkedIn @msftresearch IIT-Delhi
Research on In-Context Learning and LLM Agents
https://shivanshu-gupta.github.io
exchanging algorithms with ai
ekinakyurek.github.io
PhD student @ UmU
Assistant in Research @ Yale
Host and Organizer of FLaNN (flann.super.site)
♥️
Will irl - PhD student @ NYU on the academic job market!
Using complexity theory and formal languages to understand the power and limits of LLMs
https://lambdaviking.com/ https://github.com/viking-sudo-rm
Senior Research Scientist at Google DeepMind
https://swarooprm.github.io/
Professor at UW; Researcher at Meta. LMs, NLP, ML. PNW life.
Faculty at UC Irvine and RS at Skild AI.
Previously: FAIR Meta, CMU, and UIUC.
Working on Computer Vision, Robotics, and AI
web: http://maxim.ece.illinois.edu
substack: https://realizable.substack.com
Ph.D. Student at UNC NLP | Apple Scholar in AI/ML Ph.D. Fellowship | Prev: FAIR at Meta, AI2, Adobe (Intern) | Interests: #NLP, #ML | https://archiki.github.io/
Assistant Professor @ UVa, working on NLP and Machine Learning
PhD @ucberkeleyofficial.bsky.social | Past: AI4Code Research Fellow @msftresearch.bsky.social | Summer @EPFL Scholar, CS and Applied Maths @IIITDelhi | Hobbyist Saxophonist
https://lakshyaaagrawal.github.io
Maintainer of https://aka.ms/multilspy
The Thirty-Eighth Annual Conference on Neural Information Processing Systems will be held at the Vancouver Convention Center, Tuesday, Dec 10 through Sunday, Dec 15.
https://neurips.cc/
PhD supervised by Tim Rocktäschel and Ed Grefenstette, part time at Cohere. Language and LLMs. Spent time at FAIR, Google, and NYU (with Brenden Lake). She/her.