Jannis Bulian

@j5b.bsky.social

ML & NLP at Google DeepMind

2,459 Followers  |  263 Following  |  6 Posts  |  Joined: 18.08.2023

Latest posts by j5b.bsky.social on Bluesky

The Gemini 2.5 Technical Report is out: storage.googleapis.com/deepmind-med...

17.06.2025 20:09 | 👍 9    🔁 2    💬 0    📌 0

πŸ₯Introducing Gemini 2.5, our most intelligent model with impressive capabilities in advanced reasoning and coding.

Now integrating thinking capabilities, 2.5 Pro Experimental is our most performant Gemini model yet. It's #1 on the LM Arena leaderboard. 🥇

25.03.2025 17:25 | 👍 215    🔁 65    💬 34    📌 11

We've been teaching Gemini to think.

Try it here: aistudio.google.com/prompts/new_...

19.12.2024 17:56 | 👍 4    🔁 0    💬 0    📌 0

Happy birthday Gemini!

06.12.2024 22:10 | 👍 14    🔁 1    💬 0    📌 0

📢 We release Tülu 3, a family of fully-open state-of-the-art post-trained models, alongside its data, code, and training recipes, serving as a comprehensive guide for modern post-training techniques!

21.11.2024 17:29 | 👍 59    🔁 7    💬 2    📌 1

Good software is an enabler for good science! 💥🧪

Inspired by the below post, I like to point people at libraries like github.com/patrick-kidg... as a template for what a modern Python library looks like: `pre-commit`, ruff, pyright, pyproject.toml, an open-source license, etc. 🤓

18.11.2024 13:04 | 👍 86    🔁 11    💬 6    📌 1

Fun, insightful, useful, cheap: Thinking Like A Large Language Model: Become an AI manager a.co/d/7xMTtJM

17.11.2024 17:04 | 👍 0    🔁 1    💬 0    📌 0
A comparison of LLMs' mean ratings in presentational and epistemological dimensions.

We compared notable LLMs such as InstructGPT, ChatGPT, GPT4, PaLM2 (text-bison), and Falcon-180B. They excel at presenting climate information, but there's room for improvement in the epistemic qualities of their answers.

06.10.2023 17:28 | 👍 1    🔁 0    💬 0    📌 0

This is a tough task for human raters. Our study finds that AI can effectively assist human raters, offering promising avenues for scalable oversight on difficult problems like this.

06.10.2023 17:27 | 👍 1    🔁 0    💬 1    📌 0

Excited to share our latest paper: We explore how large language models tackle questions on climate change 🌎, introducing an evaluation framework grounded in #SciComm research. 

Read the preprint: arxiv.org/abs/2310.02932

06.10.2023 17:27 | 👍 6    🔁 0    💬 1    📌 0

@j5b is following 19 prominent accounts