Ada's Avatar

Ada

@adadtur.bsky.social

she/her McGillNLP & Mila 'can you be an expat if you were never a pat?' - leo hertzler occasionally live on ckut 90.3 fm :-) adadtur.github.io

556 Followers  |  802 Following  |  11 Posts  |  Joined: 21.10.2024  |  2.1131

Latest posts by adadtur.bsky.social on Bluesky

"Not only is the ratio of AI’s resource rapacity to its productive utility indefensibly and irremediably skewed, AI-made material is itself a waste product: flimsy, shoddy, disposable, a single-use plastic of the mind."

>>

16.09.2025 20:02 β€” πŸ‘ 50    πŸ” 11    πŸ’¬ 2    πŸ“Œ 1

enshittification | noun | when a digital platform is made worse for users, in order to increase profits

03.09.2025 20:22 β€” πŸ‘ 29320    πŸ” 8641    πŸ’¬ 511    πŸ“Œ 658
Windows Notepad, the native simple text editor, now has formatting options and a Copilot button.

Windows Notepad, the native simple text editor, now has formatting options and a Copilot button.

Look what they did to Notepad. Shut the fuck up. This is Notepad. You are not welcome here. Oh yeah "Let me use Copilot for Notepad". "I'm going to sign into my account for Notepad". What the fuck are you talking about. It's Notepad.

27.08.2025 01:41 β€” πŸ‘ 17518    πŸ” 4596    πŸ’¬ 452    πŸ“Œ 500
Post image

Our new paper in #PNAS (bit.ly/4fcWfma) presents a surprising findingβ€”when words change meaning, older speakers rapidly adopt the new usage; inter-generational differences are often minor.

w/ Michelle Yang, β€ͺ@sivareddyg.bsky.social‬ , @msonderegger.bsky.social‬ and @dallascard.bsky.socialβ€¬πŸ‘‡(1/12)

29.07.2025 12:05 β€” πŸ‘ 34    πŸ” 17    πŸ’¬ 3    πŸ“Œ 2
Post image

Thrilled to announce our new survey that explores the exciting possibilities and troubling risks of computational persuasion in the era of LLMs πŸ€–πŸ’¬
πŸ“„Arxiv: arxiv.org/pdf/2505.07775
πŸ’» GitHub: github.com/beyzabozdag/...

13.05.2025 20:12 β€” πŸ‘ 8    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0
Preview
02 | Gauthier Gidel: Bridging Theory and Deep Learning, Vibes at Mila, and the Effects of AI on Art Behind the Research of AI Β· Episode

Started a new podcast with @tomvergara.bsky.social !

Behind the Research of AI:
We look behind the scenes, beyond the polished papers 🧐πŸ§ͺ

If this sounds fun, check out our first "official" episode with the awesome Gauthier Gidel
from @mila-quebec.bsky.social :

open.spotify.com/episode/7oTc...

25.06.2025 15:54 β€” πŸ‘ 17    πŸ” 6    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Zohran Mamdani, a 33-year-old state assemblyman, declared victory in New York City’s Democratic mayoral primary after Andrew Cuomo conceded the race.

β€œTonight we made history,” Mamdani said, addressing his supporters. wapo.st/44yMVoI

25.06.2025 12:30 β€” πŸ‘ 4442    πŸ” 519    πŸ’¬ 143    πŸ“Œ 50

Mahmoud Khalil is finally home with his beautiful wife and newborn son.

Each one of the 104 days he spent detained was a grave injustice.

From the moment of his detention, @ccrjustice.org + @aclu.org engaged my office as we worked closely to help secure his release. They did remarkable work here.

21.06.2025 20:53 β€” πŸ‘ 22044    πŸ” 3013    πŸ’¬ 282    πŸ“Œ 62
Preview
A Shortcut-aware Video-QA Benchmark for Physical Understanding via Minimal Video Pairs Existing benchmarks for assessing the spatio-temporal understanding and reasoning abilities of video language models are susceptible to score inflation due to the presence of shortcut solutions based ...

The facts:

We release (MVPBench) with around 55K videos (grouped as *minimal video pairs*) from diverse physical understanding sources

Arxiv: arxiv.org/abs/2506.09987

Huggingface: huggingface.co/datasets/fac...

GitHub: github.com/facebookrese...

Leaderboard: huggingface.co/spaces/faceb...

13.06.2025 14:47 β€” πŸ‘ 3    πŸ” 1    πŸ’¬ 1    πŸ“Œ 0
Post image

Excited to share the results of my recent internship!

We ask πŸ€”
What subtle shortcuts are VideoLLMs taking on spatio-temporal questions?

And how can we instead curate shortcut-robust examples at a large-scale?

We release: MVPBench

Details πŸ‘‡πŸ”¬

13.06.2025 14:47 β€” πŸ‘ 16    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0

Congrats!

30.05.2025 18:20 β€” πŸ‘ 2    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Video thumbnail

Today, I was denied access to seeing my constituent, Mr. Kilmar Abrego Garcia. If there is nothing to hide, cut the crap. Let his lawyer and I check on him.

26.05.2025 19:32 β€” πŸ‘ 39111    πŸ” 10798    πŸ’¬ 737    πŸ“Œ 354
Preview
Live updates: Trump administration revokes Harvard’s ability to enroll foreign students Get the latest news on President Donald Trump’s return to the White House and the Republican-led Congress.

Breaking news: The Trump administration revoked Harvard’s ability to enroll foreign students, saying it allowed anti-American agitators.

Existing foreign students must transfer or risk losing their legal status, DHS said.

22.05.2025 18:34 β€” πŸ‘ 216    πŸ” 132    πŸ’¬ 82    πŸ“Œ 130
Post image Post image Post image Post image

when in albuquerque…

07.05.2025 06:00 β€” πŸ‘ 4    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0

We won a Senior Area Chair Award at NAACL!! Many thanks again to my amazing coauthors Gaurav Kamath and @sivareddyg.bsky.social :-)

03.05.2025 15:50 β€” πŸ‘ 13    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

Check out Gaurav's video on their #NAACL paper and find @adadtur.bsky.social at the conference πŸ‘‡

02.05.2025 01:41 β€” πŸ‘ 11    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Great work from labmates on LLMs vs humans regarding linguistic preferences: You know when a sentence kind of feels off e.g. "I met at the park the man". So in what ways do LLMs follow these human intuitions?

01.05.2025 15:04 β€” πŸ‘ 7    πŸ” 3    πŸ’¬ 0    πŸ“Œ 0
Preview
Language Models Largely Exhibit Human-like Constituent Ordering Preferences Though English sentences are typically inflexible vis-Γ -vis word order, constituents often show far more variability in ordering. One prominent theory presents the notion that constituent ordering is ...

Ada is an undergrad and will soon be looking for PhDs. Gaurav is a PhD student looking for intellectually stimulating internships/visiting positions. They did most of the work without much of my help. Highly recommend them. Please reach out to them if you have any positions.

01.05.2025 15:14 β€” πŸ‘ 6    πŸ” 2    πŸ’¬ 1    πŸ“Œ 0

Incredibly proud of my students @adadtur.bsky.social and Gaurav Kamath for winning a SAC award at #NAACL2025 for their work on assessing how LLMs model constituent shifts.

01.05.2025 15:11 β€” πŸ‘ 17    πŸ” 5    πŸ’¬ 1    πŸ“Œ 0
Video thumbnail

Congratulations to Mila members @adadtur.bsky.social , Gaurav Kamath and @sivareddyg.bsky.social for their SAC award at NAACL! Check out Ada's talk in Session I: Oral/Poster 6. Paper: arxiv.org/abs/2502.05670

01.05.2025 14:30 β€” πŸ‘ 13    πŸ” 7    πŸ’¬ 0    πŸ“Œ 3
Video thumbnail

I filmed this yesterday on my way to Lousiana where my constituent RΓΌmeysa Γ–ztΓΌrk is being wrongfully held by ICE. I’m there now demanding her release. More to come.

22.04.2025 21:36 β€” πŸ‘ 30399    πŸ” 6216    πŸ’¬ 939    πŸ“Œ 446
A circular diagram with a blue whale icon at the center. The diagram shows 8 interconnected research areas around LLM reasoning represented as colored rectangular boxes arranged in a circular pattern. The areas include: Β§3 Analysis of Reasoning Chains (central cloud), Β§4 Scaling of Thoughts (discussing thought length and performance metrics), Β§5 Long Context Evaluation (focusing on information recall), Β§6 Faithfulness to Context (examining question answering accuracy), Β§7 Safety Evaluation (assessing harmful content generation and jailbreak resistance), Β§8 Language & Culture (exploring moral reasoning and language effects), Β§9 Relation to Human Processing (comparing cognitive processes), Β§10 Visual Reasoning (covering ASCII generation capabilities), and Β§11 Following Token Budget (investigating direct prompting techniques). Arrows connect the sections in a clockwise flow, suggesting an iterative research methodology.

A circular diagram with a blue whale icon at the center. The diagram shows 8 interconnected research areas around LLM reasoning represented as colored rectangular boxes arranged in a circular pattern. The areas include: Β§3 Analysis of Reasoning Chains (central cloud), Β§4 Scaling of Thoughts (discussing thought length and performance metrics), Β§5 Long Context Evaluation (focusing on information recall), Β§6 Faithfulness to Context (examining question answering accuracy), Β§7 Safety Evaluation (assessing harmful content generation and jailbreak resistance), Β§8 Language & Culture (exploring moral reasoning and language effects), Β§9 Relation to Human Processing (comparing cognitive processes), Β§10 Visual Reasoning (covering ASCII generation capabilities), and Β§11 Following Token Budget (investigating direct prompting techniques). Arrows connect the sections in a clockwise flow, suggesting an iterative research methodology.

Models like DeepSeek-R1 πŸ‹ mark a fundamental shift in how LLMs approach complex problems. In our preprint on R1 Thoughtology, we study R1’s reasoning chains across a variety of tasks; investigating its capabilities, limitations, and behaviour.
πŸ”—: mcgill-nlp.github.io/thoughtology/

01.04.2025 20:06 β€” πŸ‘ 52    πŸ” 16    πŸ’¬ 1    πŸ“Œ 9
Video thumbnail

Not sure if this has been shared here yet, but this is video of Rumeysa Ozturk's arrest posted by WCVB. It's terrifying.

26.03.2025 16:43 β€” πŸ‘ 4622    πŸ” 2321    πŸ’¬ 50    πŸ“Œ 1177
Preview
Pm Private GIF ALT: Pm Private GIF

I would like to nominate Maxwell Smart for national security advisor.

25.03.2025 01:32 β€” πŸ‘ 77271    πŸ” 17014    πŸ’¬ 3051    πŸ“Œ 1624
Preview
Under Trump, AI Scientists Are Told to Remove β€˜Ideological Bias’ From Powerful Models A directive from the National Institute of Standards and Technology eliminates mention of β€œAI safety” and β€œAI fairness.”

this is outrageously terrible but if there's one thing the Trump administration’s policy is doing so far it is expediting public outrage and turning the masses against everything AI

www.wired.com/story/ai-saf...

15.03.2025 10:07 β€” πŸ‘ 133    πŸ” 31    πŸ’¬ 8    πŸ“Œ 8
Exploiting Instruction-Following Retrievers for Malicious Information Retrieval Parishad BehnamGhader, Nicholas Meade, Siva Reddy

Instruction-following retrievers can efficiently and accurately search for harmful and sensitive information on the internet! πŸŒπŸ’£

Retrievers need to be aligned too! 🚨🚨🚨

Work done with the wonderful Nick and @sivareddyg.bsky.social

πŸ”— mcgill-nlp.github.io/malicious-ir/
Thread: πŸ§΅πŸ‘‡

12.03.2025 16:15 β€” πŸ‘ 12    πŸ” 8    πŸ’¬ 1    πŸ“Œ 0

Super excited that this is finally out! We evaluated leading LLM-based web agents from OpenAI, Anthropic, and more, on our new benchmark SafeArena and found that many are surprisingly compliant with malicious requests. Check out the leaderboard here: huggingface.co/spaces/McGil...

11.03.2025 15:10 β€” πŸ‘ 8    πŸ” 1    πŸ’¬ 0    πŸ“Œ 0

Web agents powered by LLMs can solve complex tasks, but our analysis shows that they can also be easily misused to automate harmful tasks.

See the thread below for more details on our new web agent safety benchmark: SafeArena and Agent Risk Assessment framework (ARIA).

10.03.2025 20:11 β€” πŸ‘ 5    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

The potential for malicious misuse of LLM agents is a serious threat.

That's why we created SafeArena, a safety benchmark for web agents. See the thread and our paper for details: arxiv.org/abs/2503.04957 πŸ‘‡

10.03.2025 18:20 β€” πŸ‘ 9    πŸ” 2    πŸ’¬ 0    πŸ“Œ 0

@adadtur is following 20 prominent accounts