#ICML2025
Is standard RLHF optimal in view of test-time scaling? Unsurprisingly no.
We show that a simple change to the standard RLHF framework, involving reward calibration and reward transformation (suited to the test-time procedure), is optimal!
09.05.2025 00:20 β π 17 π 6 π¬ 1 π 0
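A minimal sketch of what the two named steps could look like, under stated assumptions: "calibration" is read here as mapping a raw reward to its empirical quantile among rewards of base-policy samples for the same prompt, and the "transformation" shown is only an illustrative choice for best-of-n (the chance that a response at quantile u beats n-1 independent base samples). The function names are hypothetical and the post does not confirm either definition.

```python
from bisect import bisect_right


def calibrate_reward(raw_reward, reference_rewards):
    """Return the empirical quantile of raw_reward among reference_rewards.

    Assumption: reference_rewards are rewards of base-policy samples
    drawn for the same prompt.
    """
    if not reference_rewards:
        raise ValueError("need at least one reference reward")
    ordered = sorted(reference_rewards)
    # Fraction of reference rewards that are <= raw_reward.
    return bisect_right(ordered, raw_reward) / len(ordered)


def best_of_n_transform(quantile, n):
    """Illustrative transformation for best-of-n: u -> u ** (n - 1).

    This is the probability that a response at this quantile beats
    n - 1 independent base samples; it is not taken from the post.
    """
    return quantile ** (n - 1)


calibrated = calibrate_reward(0.5, [0.1, 0.4, 0.6, 0.9])
transformed = best_of_n_transform(calibrated, n=3)
```

The calibrated value is comparable across prompts because it lives in [0, 1] regardless of the raw reward scale, which is what makes a test-time-aware transformation easy to apply afterwards.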
How do language models generalize from information they learn in-context vs. via finetuning? In arxiv.org/abs/2505.00661 we show that in-context learning can generalize more flexibly, illustrating key differences in the inductive biases of these modes of learning, and ways to improve finetuning. 1/
02.05.2025 17:02 β π 78 π 21 π¬ 4 π 4
We have always been at war with Eastasia.
09.04.2025 18:38 β π 0 π 0 π¬ 0 π 0
From the singularity community on Reddit: Now Gemini can create visual stories with native image generation
Many more cool examples that people are finding in this Reddit thread (including visual story generation, funky editing and style changes, recontextualization and more!):
www.reddit.com/r/singularit...
12.03.2025 17:38 β π 0 π 0 π¬ 1 π 0
An example of reasoning in 'pixel space':
12.03.2025 17:26 β π 0 π 0 π¬ 1 π 0
Notably we pass the 'room without an elephant' test (medium.com/@avanib28264...)
12.03.2025 17:13 β π 0 π 0 π¬ 1 π 0
Image quality is not quite as high as our SOTA Imagen 3 model (see previous post :) ) but the ability to do reasoning in a combination of text and pixel space unlocks some amazing new capabilities like interleaved generation of text and images and just jamming crazy creative ideas with Gemini.
12.03.2025 17:13 β π 1 π 0 π¬ 1 π 0
Google AI Studio
Google AI Studio is the fastest way to start building with Gemini, our next generation family of multimodal generative AI models.
Gemini 2.0 image output is live on aistudio.google.com. This was an amazing effort by many, many people in the Gemini team and partners at GDM + the rest of Google, and I'm so honoured and privileged to have been part of it. 🧵->
12.03.2025 17:11 β π 3 π 0 π¬ 1 π 0
New paper: Simulating Time With Square-Root Space
people.csail.mit.edu/rrw/time-vs-...
It's still hard for me to believe it myself, but I seem to have shown that TIME[t] is contained in SPACE[sqrt{t log t}].
To appear in STOC. Comments are very welcome!
21.02.2025 22:19 β π 265 π 74 π¬ 17 π 14
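Written out as a formula, the claim in the post reads as below. This is only a restatement: the machine model and the exact conditions on t are in the paper, and writing t as a function t(n) of the input length is a notational assumption made here.

```latex
\[
  \mathsf{TIME}[t(n)] \;\subseteq\; \mathsf{SPACE}\!\left[\sqrt{t(n)\,\log t(n)}\,\right]
\]
```

For comparison, the earlier general bound due to Hopcroft, Paul and Valiant was SPACE[t / log t], so the new exponent on t drops from essentially 1 to essentially 1/2.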
AMS :: Take Action
The American Mathematical Society has also started a page to coordinate support for professional mathematics, so far focusing on executive orders impacting the National Science Foundation: www.ams.org/government/g...
22.02.2025 14:59 β π 153 π 44 π¬ 1 π 1
Two Dogmas of Empiricism - Wikipedia
A hard problem I found for LLMs to get right: 'Which of Quine's two dogmas is about the analytic-synthetic distinction?' It's a common misconception that it's the first. But it's actually *both* (deducible by reading en.m.wikipedia.org/wiki/Two_Dog... carefully).
20.02.2025 02:03 β π 0 π 0 π¬ 0 π 0
Imagen 3 (deepmind.google/technologies...) is now the top-ranking model on the lmsys image generation arena, by a significant margin. Proud to have been part of the team that built it (and there's even more to come soon!).
04.02.2025 01:36 β π 1 π 0 π¬ 0 π 0
This is such amazing work by @abeirami.bsky.social and collaborators. A deep investigation of a simple and practically important idea. Highly relevant to our own work and anywhere else RL is used for Gen AI.
03.02.2025 22:45 β π 1 π 0 π¬ 1 π 0
Lol....u'll always be my #1 Nathan. But your post did remind me that some collaborators of mine at Google did some research on content ecosystems: https://arxiv.org/abs/2309.06375
25.01.2025 20:53 β π 1 π 0 π¬ 0 π 0
There are many content creators who have made it huge by putting in a lot of work. Good for them, but I am confused by how unbalanced the content economy is. There are so many small creators out there making unique content who deserve far more love and support.
25.01.2025 17:51 β π 3 π 0 π¬ 2 π 0
We even show you can do this without a specialized heatmap model if you have a good classifier for the badness you want to eliminate by fine-tuning. Simply use a pixel attribution technique like Grad-CAM to generate the heatmap!
19.01.2025 15:47 β π 0 π 0 π¬ 0 π 0
Surprisingly effective. The problematic parts are changed but everything else remains the same in the fine-tuned model. This is different from an editing model, where 2 rounds of inference are needed to fix the problematic parts.
19.01.2025 15:47 β π 0 π 0 π¬ 1 π 0
Then you fine-tune using a combination of DRAFT (arxiv.org/html/2309.17...) and our custom region-aware fine-tuning objective.
19.01.2025 15:47 β π 0 π 0 π¬ 1 π 0
you generate a heatmap highlighting the problematic region (e.g. using our previous work on Rich Human Feedback for T2I): arxiv.org/pdf/2312.10240
19.01.2025 15:47 β π 0 π 0 π¬ 1 π 0
The idea is simple. If the image from the base model has a region that's (say) NSFW:
19.01.2025 15:47 β π 0 π 0 π¬ 1 π 0
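The thread above (read bottom to top) describes fine-tuning that changes only the flagged region. The posts do not spell out the objective, so the following is a hypothetical sketch of one region-weighted loss in that spirit, not the authors' formulation: a per-pixel badness score is penalized where the heatmap is high, and drift from the base model's image is penalized where it is low. All names, the flat-list image representation, and `preserve_weight` are assumptions for illustration.

```python
def region_aware_loss(finetuned, base, heatmap, badness, preserve_weight=1.0):
    """Hypothetical region-weighted loss over flat lists of per-pixel floats.

    heatmap values are assumed to lie in [0, 1], with 1 marking a
    problematic pixel. badness is an assumed per-pixel penalty signal.
    """
    n = len(heatmap)
    if n == 0 or not (len(finetuned) == len(base) == len(badness) == n):
        raise ValueError("all inputs must be non-empty and of equal length")
    # Inside the flagged region: push the badness signal down.
    fix_term = sum(h * b for h, b in zip(heatmap, badness)) / n
    # Outside the flagged region: stay close to the base model's output.
    keep_term = sum(
        (1.0 - h) * (f - g) ** 2 for h, f, g in zip(heatmap, finetuned, base)
    ) / n
    return fix_term + preserve_weight * keep_term


loss = region_aware_loss(
    finetuned=[0.0, 1.0],
    base=[0.0, 0.5],
    heatmap=[1.0, 0.0],
    badness=[0.2, 0.9],
)
```

In this toy call only the first pixel is flagged, so its badness counts toward the loss while the second pixel is charged purely for moving away from the base image.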
...Stevenson flatly rejected the Soviet offer, telling Menshikov that he "considered the offer of such assistance highly improper, indiscreet and dangerous to all concerned". Stevenson then reported the incident directly to President Eisenhower."
How far we have fallen.
30.12.2024 03:43 β π 0 π 0 π¬ 0 π 0
An interesting tidbit about the late great Adlai Stevenson: Stevenson was approached by Soviet ambassador Menshikov who offered Soviet financial and public relations help to assist him in getting elected if he decided to run...
30.12.2024 03:43 β π 0 π 0 π¬ 1 π 0
I am so tired of waiting, aren’t you, For the world to become good and beautiful and kind? Let’s take a knife and cut the world in two- And see what worms are eating At the rind. Langston Hughes
25.11.2024 02:17 β π 33089 π 5186 π¬ 428 π 173
Super proud to have been part of the Imagen 3 work and huge shout out to the Veo 2 team!
16.12.2024 19:26 β π 0 π 0 π¬ 0 π 0
Ezra Klein’s tweets, articles, clips and podcasts on Bluesky.
Editor and CEO, Zeteo
Author, “Win Every Argument”
British-American
Mathematics professor at Collège de France and fellow of Trinity College Cambridge.
SF/F Writer - The Books of the Raksura, The Murderbot Diaries, Witch King, and more. Nebula and Hugo Award winner. NYT and Sunday Times Bestseller. (She/her) Agent: Jennifer Jackson
Writer. Revenant. Wrote THE SAINT OF BRIGHT DOORS (2023, Nebula, Ignyte, Crawford, and Locus awards) & RAKESFALL (2024, Le Guin Prize and Otherwise award.) Colombo/New York. https://vajra.me
Professor, UW Biology / Santa Fe Institute
I study how information flows in biology, science, and society.
Book: *Calling Bullshit*, http://tinyurl.com/fdcuvd7b
LLM course: https://thebullshitmachines.com
Corvids: https://tinyurl.com/mr2n5ymk
he/him
Author, Animal Liberation, Practical Ethics, The Life You Can Save, The Most Good You Can Do, Animal Liberation Now.
Podcast: "Lives Well Lived"
AI Persona: PeterSinger.ai
Professor of Bioethics, Emeritus, Princeton University.
Dad, husband, President, citizen. barackobama.com
America’s Finest News Source. A @globaltetrahedron.bsky.social subsidiary.
Get the paper delivered to your door: membership.theonion.com
Waitress turned Congresswoman for the Bronx and Queens. Grassroots elected, small-dollar supported. A better world is possible.
ocasiocortez.com
YouTuber. Not sure if I’ll use this.
Research Engineer at Google DeepMind.
Interests in game theory, reinforcement learning, and deep learning.
Website: https://www.lukemarris.info/
Google Scholar: https://scholar.google.com/citations?user=dvTeSX4AAAAJ
professor of EECS at MIT, currently visiting IAS. working in theoretical computer science namely algorithm design, complexity theory, circuit complexity, etc.
i'll let you know when P != NP is proved (and when it's not)
Featuring notable accounts leaving X, user milestones, and other positive news about Bluesky
Goal is to influence others to leave X
Check starter pack for accounts posted so far no longer on X
[If you left Twitter and want to be featured, shoot us a DM]
hi this is @annierau.bsky.social! my DMs are open
A free, collaborative, multilingual internet encyclopedia.
donate.wikipedia25.org
https://lichess.org The free chess server. No paywall, no tracking, no ads. Just the good stuff. User support requests should be directed to https://lichess.org/contact