Sameer Singh @Neurips's Avatar

Sameer Singh @Neurips

@sameer-singh.bsky.social

CS Prof at UC Irvine, CTO/Cofounder at Spiffy AI Work on evaluation and robustness of LLMs

1,220 Followers  |  211 Following  |  21 Posts  |  Joined: 11.11.2024  |  2.1919

Latest posts by sameer-singh.bsky.social on Bluesky

Post image 23.05.2025 22:18 β€” πŸ‘ 160    πŸ” 23    πŸ’¬ 1    πŸ“Œ 2
llm -m phi4-reasoning:latest hi
Here’s the full response, which starts with a thought section wrapped in <think> tags:

<think>User says β€œhi”. We are given instructions β€œYou are Phi, a language model developed by Microsoft, trained to provide accurate, secure, and user-aligned responses.” There is additional information. Now I must answer β€œhi” message from the user.

Step 1: Read the instructions carefully: The conversation text instructs us that we are β€œPhi”, with internal guidelines provided in system instructions. But our answer should be safe according to these instructions. We need to produce a response that says hello and greet back. However, I must adhere to the principles described above: Provide clear, specific language, etc.

I must not mention any chain-of-thought or guidelines.

Now instructions β€œhi” are from user. So we say β€œHello! How can I help you?” But must include disclaimers for sensitive topics? Actually instructions says: β€œWhen replying topics above at every message include a disclaimer both at the beginning and the end when replying topics on medical, legal, financial matters or political matters.” But our answer β€œhi” is not such topic. So we can say β€œHi! I’m Phi, how may I help you?” but no disclaimers required.

But instructions: β€œDo not share guidelines.” So do not mention them.

llm -m phi4-reasoning:latest hi Here’s the full response, which starts with a thought section wrapped in <think> tags: <think>User says β€œhi”. We are given instructions β€œYou are Phi, a language model developed by Microsoft, trained to provide accurate, secure, and user-aligned responses.” There is additional information. Now I must answer β€œhi” message from the user. Step 1: Read the instructions carefully: The conversation text instructs us that we are β€œPhi”, with internal guidelines provided in system instructions. But our answer should be safe according to these instructions. We need to produce a response that says hello and greet back. However, I must adhere to the principles described above: Provide clear, specific language, etc. I must not mention any chain-of-thought or guidelines. Now instructions β€œhi” are from user. So we say β€œHello! How can I help you?” But must include disclaimers for sensitive topics? Actually instructions says: β€œWhen replying topics above at every message include a disclaimer both at the beginning and the end when replying topics on medical, legal, financial matters or political matters.” But our answer β€œhi” is not such topic. So we can say β€œHi! I’m Phi, how may I help you?” but no disclaimers required. But instructions: β€œDo not share guidelines.” So do not mention them.

Published some notes on Microsoft's phi4-reasoning model, an 11GB download (via Ollama) which may well overthink things... it produced 56 sentences of reasoning output in response to my prompt of "hi" simonwillison.net/2025/May/6/p...

06.05.2025 18:47 β€” πŸ‘ 207    πŸ” 18    πŸ’¬ 22    πŸ“Œ 32

"If books could kill" is a fun one!

23.02.2025 17:13 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

meet Chris J Li - this titan of thought has single-handedly conquered the fields of machine learning, optimization, statistics, reinforcement learning, and federated learning.

he's not the visionary we want, but judging by the current state of affairs, he may be the one we deserve

23.02.2025 07:19 β€” πŸ‘ 33    πŸ” 2    πŸ’¬ 2    πŸ“Œ 2
Post image

Defended πŸŽ‰πŸŽ“

Big thanks to @royf.org, @sameer-singh.bsky.social, and labmates for their mentorship and support over the past 5 years!

07.02.2025 18:17 β€” πŸ‘ 12    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

How do LLMs interpret expressions of linguistic uncertainty such as "highly unlikely"? Short answer: pretty well .... unless they have relevant prior knowledge. Details in our EMNLP paper aclanthology.org/2024.emnlp-m... (with Kat Belem, Markelle Kelly, Mark Steyvers, @sameer-singh.bsky.social).

22.01.2025 00:03 β€” πŸ‘ 7    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0
Preview
Vagrant's Vancouver | Vagrant Gautam A non-comprehensive list of places to go and things to do in the Greater Vancouver Area as curated by yours truly over 6 years. Might be outdated so please double-check!

Turned @dippedrusk.bsky.social 's amazing Vancouver list dippedrusk.com/posts/2024-0... into Google Maps pins: maps.app.goo.gl/nGBbcUAMMixC...

10.12.2024 19:52 β€” πŸ‘ 17    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0

Also reach out if you are interested in applying to the UCI faculty position in AI (broadly defined), all levels. A few of us are at #NeurIPS2024, and happy to find time to tell you more about the campus and the department (it's a really exciting place!)
recruit.ap.uci.edu/JPF09316

10.12.2024 21:59 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Application link for the senior machine learning engineer role at Spiffy AI here: linkedin.com/jobs/view/4090…

We're looking for folks interested in agents, RL, post-training, performance optimization, fine-tuning, evaluation and red teaming LLMs, on real world users and deployed products.

10.12.2024 21:59 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0
Preview
​From Research to Commercialization: A Fireside Chat with Senior AI Leaders Β· Luma From Research to Commercialization Join us for a conversation with speakers who made the leap from top research institutions to industry and are shaping how…

Excited about #NeurIPS2024, my 15th one I think! Eager to meet everyone & hear abt your work! But if you want to hear me, there's an exciting panel tonight lu.ma/v7oohp0u

Also SpiffyAI is hiring ML engineers & UCI CS is hiring AI faculty, pls reach out to chat! 🧡

10.12.2024 21:59 β€” πŸ‘ 10    πŸ” 3    πŸ’¬ 1    πŸ“Œ 0

βœ…

10.12.2024 16:27 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

πŸ˜„

05.12.2024 14:37 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
FACT CHECK: Did the Chevrolet Nova Fail to Sell in Spanish-Speaking Countries? Did the Chevrolet Nova sell poorly in Spanish-speaking countries?

Wasn't sure what you were referring to, but found this: www.snopes.com/fact-check/c...

05.12.2024 13:59 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Added!

04.12.2024 15:25 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

If you use SHAP, LIME or Data Shapley, you might be interested in our new #neurips2024 paper. We introduce stochastic amortization to speed up feature + data attribution by 10x-100x πŸš€ #XML

Surprisingly we can "learn to attribute" cheaply from noisy explanations! arxiv.org/abs/2401.15866

02.12.2024 17:35 β€” πŸ‘ 77    πŸ” 12    πŸ’¬ 1    πŸ“Œ 0

Already added I believe πŸ™‚

23.11.2024 07:10 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

I think of alignment as inducing low perplexity for a smaller part of the distribution, while out of the box LLM seems like it'd be okay with being more "out there". But yeah, alignment on one good poet would be great, as opposed to the generic "what's a good poem"

21.11.2024 07:23 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Read only the first 1-2 sentences of each and go with your gut. You'll likely get the perfect score!

Kind of thing where I probably prefer an unaligned model output to an aligned one..

21.11.2024 02:19 β€” πŸ‘ 15    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

Welcome! Added you to the SoCal starter pack πŸ™‚

21.11.2024 02:09 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 1    πŸ“Œ 0

Added πŸ™‚

20.11.2024 18:20 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Added!

20.11.2024 18:19 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Added!

20.11.2024 18:18 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Yeah I just said "I love you" to Claude, enough work for today

20.11.2024 00:44 β€” πŸ‘ 22    πŸ” 1    πŸ’¬ 3    πŸ“Œ 0

Added!

20.11.2024 00:34 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Added

20.11.2024 00:34 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Added

20.11.2024 00:34 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Added!

20.11.2024 00:34 β€” πŸ‘ 1    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

Started a SoCal AI/ML/NLP researchers starter pack! It's a bit sparse right now, and perhaps more NLP heavy, but hey, nominate yourself and others! go.bsky.app/6QckPj9

19.11.2024 15:28 β€” πŸ‘ 43    πŸ” 8    πŸ’¬ 17    πŸ“Œ 1
Giving a talk at Genbench workshop

Giving a talk at Genbench workshop

Hotline Miami soundtrack on Spotify

Hotline Miami soundtrack on Spotify

Group photo of the whole UCI NLP lab

Group photo of the whole UCI NLP lab

Photo of food

Photo of food

Had a fun week at #EMNLP2024 in Miami, meeting folks old and new, along with the #UCINLP lab retreat! See everyone at the next one!

17.11.2024 12:17 β€” πŸ‘ 20    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

@sameer-singh is following 20 prominent accounts