Workshop: datascience.stanford.edu/programs/ris...
@utaustin.bsky.social
@anubrata.bsky.social
Just Finished PhD @ UT Austin; Human-Centered NLP. Language Models https://anubrata.github.io
Workshop: datascience.stanford.edu/programs/ris...
@utaustin.bsky.social
Thrilled to be selected for the ๐ Rising Stars in Data Science Workshop! Grateful to @stanforddata.bsky.social, @HCID UC San Diego, and @dsi-uchicago.bsky.social for this opportunity.
Excited to share my work on trustworthy and collaborative AI and connect with amazing peers and mentors.
๐ ๐
Yes, more so with code for running quick experiments! i definitely want my code to NOT fail gracefully. (And save myself hours of debugging time because there is a default parameter somewhere I did not notice!)
24.10.2025 21:53 โ ๐ 4 ๐ 1 ๐ฌ 0 ๐ 0Ah that makes sense! Thanks, yeah I am on that slack, hhh!
27.08.2025 14:06 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0How can I get an invite for the XAI discord?
27.08.2025 13:12 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0Thank you for making the list, could you please add me?
29.07.2025 13:31 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0In a stunning moment of self-delusion, the Wall Street Journal headline writers admitted that they don't know how LLM chatbots work.
21.07.2025 01:48 โ ๐ 2977 ๐ 473 ๐ฌ 43 ๐ 90What if you could understand and control an LLM by studying its *smaller* sibling?
Our new paper introduces the Linear Representation Transferability Hypothesis. We find that the internal representations of different-sized models can be translated into one another using a simple linear(affine) map.
McCombs article: news.mccombs.utexas.edu/research/to-...
Paper url: doi.org/10.47989/ir3...
@utaustin.bsky.social
@texasscience.bsky.social
@engagingnews.bsky.social
@utischool.bsky.social
#TexasAI
#YearofAI
Can content moderation models balance accuracy & fairness?
UT McCombs news featured our iConference paper by Soumyajit Gupta on optimizing the fairness-accuracy tradeoff in toxicity detection. In collaboration with Venelin Kovatchev @mariadearteaga.bsky.social @mattlease.bsky.social
How good are LLMs at ๐ญ scientific computing and visualization ๐ญ?
AstroVisBench tests how well LLMs implement scientific workflows in astronomy and visualize results.
SOTA models like Gemini 2.5 Pro & Claude 4 Opus only match ground truth scientific utility 16% of the time. ๐งต
#NAACL2025
03.05.2025 15:27 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 0Please join us for the TrustNLP workshop (215 San Miguel) @naaclmeeting.bsky.social #trustNLP2025
03.05.2025 15:25 โ ๐ 1 ๐ 0 ๐ฌ 0 ๐ 1Session detail:
Poster Session 5 - IAM: Interpretability and Analysis of Models for NLP, Hall 3
This is a collaborative work with Manoj Kumar, Ninareh Mehrabi, Anil Ramakrishna, Anna Rumshisky, Kai-Wei Chang, Aram Galstyan, Morteza Ziyadi, Rahul Gupta
01.05.2025 05:26 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0Causal tracing informed edits provide a better detoxification-degeneration trade-off.
01.05.2025 05:25 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0Model editing helps reduce toxicity. High detoxification can be achieved by simply editing random MLP layers. However, this leads to degeneration and increased perplexity.
01.05.2025 05:25 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0We find evidence of toxic memory in the early layer of GPT-2 XL for innocuous-looking adversarial prompts.
01.05.2025 05:25 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0Paper: On Localizing and Deleting Toxic Memories in Large Language Models
Anthology URL: aclanthology.org/2025.finding...
Excited to present my internship work at
Amazon AGI at @naaclmeeting.bsky.social tomorrow at 2:00 pm local time. Please come say hi if you are around.
thinking of calling this "The Illusion Illusion"
(more examples below)
Created a small starter pack including folks whose work I believe contributes to more rigorous and grounded AI research -- I'll grow this slowly and likely move it to a list at some point :) go.bsky.app/P86UbQw
30.11.2024 19:58 โ ๐ 12 ๐ 5 ๐ฌ 1 ๐ 0NeurIPS Test of Time Awards:
Generative Adversarial Nets
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever, Oriol Vinyals, Quoc V. Le
Right, sorry for being unclear. I saw your comment sharing the Qualtrics integration tutorial with a video. bsky.app/profile/dggo...
25.11.2024 21:33 โ ๐ 1 ๐ 0 ๐ฌ 1 ๐ 0Nvm, found it!
25.11.2024 17:05 โ ๐ 0 ๐ 0 ๐ฌ 2 ๐ 0@tomcostello.bsky.social 's Qualitrics materials and tutorial video for integrating LLMs into Qualtrics can be accessed at publish.obsidian.md/qualtrics-do...
25.11.2024 15:40 โ ๐ 15 ๐ 5 ๐ฌ 2 ๐ 0Will there be a video for this talk?
25.11.2024 17:04 โ ๐ 0 ๐ 0 ๐ฌ 1 ๐ 0๐๐ฝ
24.11.2024 04:25 โ ๐ 2 ๐ 0 ๐ฌ 0 ๐ 0I did a starter pack of ML/AI people at @utaustin.bsky.social Please distribute and feel free to self nominate!
go.bsky.app/QLQznZg
A starter pack for the NLP and Computational Linguistics researchers at UT Austin!
go.bsky.app/75g9JLT