Francesco Ortu @francescortu

Francesco Ortu

@francescortu.bsky.social

NLP & Interpretability | PhD Student @ University of Trieste & Laboratory of Data Engineering of Area Science Park | Prev MPI-IS

492 Followers | 1,044 Following | 9 Posts | Joined: 19.11.2024 | 1.7598

Latest posts by francescortu.bsky.social on Bluesky

Excited to share that 2/2 papers from our Lab @AreaSciencePark were accepted to #NeurIPS2025 (one spotlight 🎉)

Great work everyone!

@alexpietroserra.bsky.social @francescortu.bsky.social @lbasile.bsky.social @lvaleriani.bsky.social @diegodoimo.bsky.social @maiorca.xyz @locatelf.bsky.social

22.09.2025 08:55 — 👍 6 🔁 2 💬 0 📌 0

Nice start of @neuripsconf.bsky.social!

Our work with @francescortu.bsky.social and @diegodoimo.bsky.social on the Competition of Mechanisms to understand counterfactuality in LLMs featured in the "Causality for LLMs" workshop :-)

Check out our ACL2024 paper aclanthology.org/2024.acl-long.…

10.12.2024 20:19 — 👍 9 🔁 1 💬 0 📌 0

Thanks again, @diegodoimo.bsky.social and @albecazzaniga.bsky.social , for the fantastic mentorship and support! 🙏🎉 They are also attending #NeurIPS, so feel free to reach out to them to discuss our results. I’m excited to keep pushing forward on these topics! 🚀

10.12.2024 20:10 — 👍 1 🔁 0 💬 0 📌 0

Thanks to the amazing team at LADE @areasciencepark: @lvaleriani.bsky.social @lbasile.bsky.social @AlessioAnsuini @diegodoimo.bsky.social @albecazzaniga.bsky.social 🙏

10.12.2024 20:10 — 👍 2 🔁 0 💬 1 📌 0

It was super fun to take our first step in interpreting multimodal LLMs, working closely with the brilliant @alexpietroserra.bsky.social and @EmanuelePanizon

10.12.2024 20:10 — 👍 0 🔁 0 💬 1 📌 0

✅ This shows that, starting from the mid-layers, a single token effectively summarizes all 1024 image tokens!

❌ This does not occur in models fine-tuned for visual understanding (such as Pixtral).

10.12.2024 20:10 — 👍 1 🔁 0 💬 1 📌 0

Additionally, blocking communication from this token significantly disrupts performance on standard benchmarks, while blocking image-text communication does not

10.12.2024 20:10 — 👍 1 🔁 0 💬 1 📌 0

🎯 Key finding: In these models the hidden representations of images and text form disjoint clusters and the communication between modalities is mediated by the special token <end-of-image>!

10.12.2024 20:10 — 👍 1 🔁 0 💬 1 📌 0

🌐 Check out our code and data at: ritareasciencepark.github.io/Narrow-gate

10.12.2024 20:10 — 👍 0 🔁 0 💬 1 📌 0

🚨 🚨 Excited to share our latest paper, now on #arXiv!

🖼️ We studied how unified VLMs, trained to generate both text and images (e.g., Meta's Chameleon), exchange information between modalities, comparing them to standard VLMs.

📄 Paper: arxiv.org/abs/2412.06646

Deep dive: 👇

10.12.2024 20:10 — 👍 10 🔁 2 💬 1 📌 3

Screenshot of the paper.

Even as an interpretable ML researcher, I wasn't sure what to make of Mechanistic Interpretability, which seemed to come out of nowhere not too long ago.

But then I found the paper "Mechanistic?" by
@nsaphra.bsky.social and @sarah-nlp.bsky.social, which clarified things.

20.11.2024 08:00 — 👍 232 🔁 28 💬 7 📌 2

Thanks for creating the starter pack! I'd love to be added as well! 😊

20.11.2024 10:41 — 👍 2 🔁 0 💬 0 📌 0

@francescortu is following 19 prominent accounts

Francesca Cuturello
@fra-cutu

Computational perspective on molecular evolution & function @areasciencepark

Tom Neuhäuser
@tomneuhaeuser

PhD Student @ ML Group TU Berlin, BIFOLD

Beatrix M. G. Nielsen
@beatrixmgn

PhD student in machine learning at DTU, Copenhagen. Especially interested in model representations.

Badr AlKhamissi
@bkhmsi

PhD at EPFL 🧠💻 Ex @MetaAI, @SonyAI, @Microsoft Egyptian 🇪🇬

Nina Nusbaumer
@nina-nusbaumer

Sentence processing modeling | Computational psycholinguistics | 1st year PhD student at LLF, CNRS, Université Paris Cité | Currently visiting COLT, Universitat Pompeu Fabra, Barcelona, Spain https://ninanusb.github.io/

BlackboxNLP
@blackboxnlp

The largest workshop on analysing and interpreting neural networks for NLP. BlackboxNLP will be held at EMNLP 2025 in Suzhou, China blackboxnlp.github.io

Yonatan Belinkov ✈️ COLM2025
@boknilev

Assistant professor of computer science at Technion; visiting scholar at @KempnerInst 2025-2026 https://belinkov.com/

Omar Rivasplata
@omarrivasplata

Matthew Shinkle
@matthewshinkle

Zhijing Jin
@zhijingjin

Assi. Prof @UofTCompSci. Postdoc @MPI_IS w/ @bschoelkopf. Research on (1) @CausalNLP and (2) NLP4SocialGood @NLP4SG. Mentor & mentee @ACLMentorship.

Rada Mihalcea
@radamihalcea

Janice M. Jenkins Collegiate Professor of Computer Science at U. Michigan, Director Michigan AI Lab, Former ACL President, AAAI Fellow, ACM Fellow. Researcher #NLProc #AI 🔗 https://web.eecs.umich.edu/~mihalcea/

Matéo Mahaut
@mateo-mahaut

PhD Student in Colt UPF https://mahautm.github.io/

Sukrut Rao
@sukrutrao

PhD Student at the Max Planck Institute for Informatics @cvml.mpi-inf.mpg.de @maxplanck.de | Explainable AI, Computer Vision, Neuroexplicit Models Web: sukrutrao.github.io

Tim Baumgärtner
@timbmg

👨‍💻 NLP PhD Student @ukplab.bsky.social

Jacob Schreiber
@jmschreiber91

Studying genomics, machine learning, and fruit. My code is like our genomes -- most of it is junk. Assistant Professor UMass Chan, Board of Directors NumFOCUS Previously IMP Vienna, Stanford Genetics, UW CSE.

Albert Vilella, PhD.
@albertvilella

Bioinformatics Scientist / Next Generation Sequencing, Single Cell and Spatial Biology, Next Generation Proteomics, Liquid Biopsy, SynBio, Compute Acceleration in biotech // http://albertvilella.substack.com

ELLIS unit Jena
@ellisunitjena

https://ellis-jena.eu is developing+applying #AI #ML in #earth system, #climate & #environmental research. Partner: @uni-jena.de, https://bgc-jena.mpg.de/en, @dlr-spaceagency.bsky.social, @carlzeissstiftung.bsky.social, https://aiforgood.itu.int

Bas Aarts
@englishgrammar

Professor of English Linguistics, UCL Here, I post on (English) language topics. On Substack, I post on English Grammar: https://basaarts.substack.com/ #grammar #syntax #parsing

Francesca Padovani
@frap98

2nd year PhD Student at @gronlp.bsky.social 🐮 - University of Groningen Language Acquisition - NLP