André Cruz

@andcrz.bsky.social

🎓 PhD student at the Max Planck Institute for Intelligent Systems 🔬 Safe and robust AI, algorithms and society 🔗 https://andrefcruz.github.io 📍 researcher in 🇩🇪, from 🇵🇹

104 Followers  |  416 Following  |  2 Posts  |  Joined: 06.12.2024

Latest posts by andcrz.bsky.social on Bluesky

We (w/ Moritz Hardt, Olawale Salaudeen and @joavanschoren.bsky.social) are organizing the Workshop on the Science of Benchmarking & Evaluating AI @euripsconf.bsky.social 2025 in Copenhagen!

📢 Call for Posters: rb.gy/kyid4f
📅 Deadline: Oct 10, 2025 (AoE)
🔗 More info: rebrand.ly/bg931sf

22.09.2025 13:45 - 👍 21    🔁 7    💬 1    📌 0
Tufts says federal authorities detained graduate student
The university received reports that an international graduate student was taken into custody from an off-campus apartment Tuesday night, President Sunil Kumar wrote in a letter to the school communit...

ICE kidnapped another student.

This time a grad student at Tufts, Rumeysa Ozturk. Turkish national, in the United States on a student visa. Her lawyer does not know where she is being held.

Rumeysa wrote op-eds criticizing the university's response to student demands on Gaza.

26.03.2025 17:05 - 👍 468    🔁 235    💬 15    📌 19
Welcome to the Bluesky account for Stand Up for Science 2025!

Keep an eye on this space for updates, event information, and ways to get involved. We can't wait to see everyone #standupforscience2025 on March 7th, both in DC and locations nationwide!

#scienceforall #sciencenotsilence

12.02.2025 17:04 - 👍 11524    🔁 5465    💬 291    📌 675
GitHub - socialfoundations/folktexts: Evaluate uncertainty, calibration, accuracy, and fairness of LLMs on real-world survey data!

The paper is accompanied by a new benchmark package: *Folktexts*. It builds socio-demographic backstories from Census data to evaluate LLM calibration, fairness, and uncertainty estimation.

Package: github.com/socialfounda...
Paper: arxiv.org/pdf/2407.14614

06.02.2025 23:10 - 👍 1    🔁 0    💬 0    📌 0
Spring EconCS 2025 Seminars | EconCS Group

Tomorrow at 1:30pm ET at the Harvard EconCS seminar, I'm presenting our paper on LLMs as risk scorers: We build benchmarks using US Census data & show how miscalibrated LLMs are on real-world tabular data distributions.

📍 Harvard SEC LL2.221, open to the public
econcs.seas.harvard.edu/event/spring...

06.02.2025 23:04 - 👍 2    🔁 1    💬 1    📌 0