Shashank Gupta's Avatar

Shashank Gupta

@shashanknlp.bsky.social

Researcher at @allen_ai (Ai2) || Research on NLP, LLMs, Reasoning, Agents, AI4Code, AI4Math || Prev: Microsoft AI, Univ. Of Illinois (UIUC), Max Planck (MPI), IIT-Bombay, BITS-Pilani Web: https://shashankgupta.info/

784 Followers  |  198 Following  |  1 Posts  |  Joined: 19.11.2024  |  1.4478

Latest posts by shashanknlp.bsky.social on Bluesky

Overview of PixMo and its relation to Molmo's ability. PixMo's captions data enables Molmo's fine-grained understanding; PixMo's AskModelAnything enables Molmo's user interaction; PixMo's pointing data enables Molmo's pointing and counting; PixMo's synthetic data enables Molmo's visual skills.

Overview of PixMo and its relation to Molmo's ability. PixMo's captions data enables Molmo's fine-grained understanding; PixMo's AskModelAnything enables Molmo's user interaction; PixMo's pointing data enables Molmo's pointing and counting; PixMo's synthetic data enables Molmo's visual skills.

Remember Molmo? The full recipe is finally out!

Training code, data, and everything you need to reproduce our models. Oh, and we have updated our tech report too!

Links in thread ๐Ÿ‘‡

09.12.2024 18:33 โ€” ๐Ÿ‘ 78    ๐Ÿ” 14    ๐Ÿ’ฌ 1    ๐Ÿ“Œ 1
The OLMo 2 models sit at the Pareto frontier of training FLOPs vs model average performance.

The OLMo 2 models sit at the Pareto frontier of training FLOPs vs model average performance.

Meet OLMo 2, the best fully open language model to date, including a family of 7B and 13B models trained up to 5T tokens. OLMo 2 outperforms other fully open models and competes with open-weight models like Llama 3.1 8B โ€” As always, we released our data, code, recipes and more ๐ŸŽ

26.11.2024 20:51 โ€” ๐Ÿ‘ 151    ๐Ÿ” 36    ๐Ÿ’ฌ 5    ๐Ÿ“Œ 12
Post image

Meet Tรผlu 3, a set of state-of-the-art instruct models with fully open data, eval code, and training algorithms.
We invented new methods for fine-tuning language models with RL and built upon best practices to scale synthetic instruction and preference data.
Demo, GitHub, paper, and models ๐Ÿ‘‡

21.11.2024 17:15 โ€” ๐Ÿ‘ 111    ๐Ÿ” 31    ๐Ÿ’ฌ 3    ๐Ÿ“Œ 7

๐Ÿ™‹โ€โ™‚๏ธ

21.11.2024 16:26 โ€” ๐Ÿ‘ 2    ๐Ÿ” 0    ๐Ÿ’ฌ 0    ๐Ÿ“Œ 0

@shashanknlp is following 20 prominent accounts