Fangcong Yin's Avatar

Fangcong Yin

@fcyin.bsky.social

CS PhD student @UT Austin studying NLP. Prev:@CornellCIS

148 Followers  |  316 Following  |  2 Posts  |  Joined: 19.11.2024  |  1.3937

Latest posts by fcyin.bsky.social on Bluesky

What if you could understand and control an LLM by studying its *smaller* sibling?

Our new paper introduces the Linear Representation Transferability Hypothesis. We find that the internal representations of different-sized models can be translated into one another using a simple linear(affine) map.

10.07.2025 17:26 β€” πŸ‘ 25    πŸ” 10    πŸ’¬ 1    πŸ“Œ 1
Post image

I'm at #Neurips2024 this week!

My work (arxiv.org/abs/2406.17692) w/ @gregdnlp.bsky.social & @eunsol.bsky.social exploring the connection between LLM alignment and response pluralism will be at pluralistic-alignment.github.io Saturday. Drop by to learn more!

11.12.2024 17:39 β€” πŸ‘ 28    πŸ” 6    πŸ’¬ 0    πŸ“Œ 0

We will also give a spotlight presentation of LoFiT in the #NeurIPS2024 Workshop on Foundation Model Interventions on December 15th in West Meeting Room 121, 122!

09.12.2024 22:22 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Post image

Interpretability can be used to improve LLM fine-tuning - check out our poster at #NeurIPS2024! Where: East Exhibit Hall A-C #3402 (Poster Session 2 East)
When: 11 Dec 4:30 - 7:30 pm PST (Vancouver time)
See you in Vancouver! Would love to chat about PEFT, interp, alignment, and more

09.12.2024 22:22 β€” πŸ‘ 4    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0

@fcyin is following 18 prominent accounts