Alessandro Sordoni @murefil

Latest posts by murefil.bsky.social on Bluesky

Today is the International Day for the Elimination of Violence against Women. According to the UN, more than 50 000 women were killed by a partner or family member in 2023 news.un.org/en/story/202... This number is an underestimate given that only 37 countries reported in 2023.

26.11.2024 06:16 — 👍 8 🔁 3 💬 0 📌 0

Yeah I suspected that, interesting!

25.11.2024 13:36 — 👍 0 🔁 0 💬 0 📌 0

distributed learning for LLM?

recently, @primeintellect.bsky.social have announced finishing their 10B distributed learning, trained across the world.

what is it exactly?

🧵

25.11.2024 12:02 — 👍 23 🔁 6 💬 1 📌 2

Instead of averaging outer gradients would fancier model merging techniques (eg TIES) apply here?

25.11.2024 12:50 — 👍 1 🔁 0 💬 1 📌 0

having better tools for reviewer and ac assignment would definitely help, ultimately reducing # reviews per paper while striving for relevance and quality could increase reviewer / ac engagement and free up their time to do better reviews

24.11.2024 22:23 — 👍 2 🔁 0 💬 0 📌 0

Informatics: ILCC: Language Processing, Speech Technology, Information Retrieval, Cognition Study Informatics: ILCC: Language Processing, Speech Technology, Information Retrieval, Cognition at the University of Edinburgh. Our postgraduate degree programmes focus on natural language processin...

Last 5 days to apply for a PhD at #EdinburghNLP!

Deadline: November 25

www.ed.ac.uk/studying/pos...

If you are passionate about:

- adaptive tokenization and memory in foundation models
- modular deep learning
- computational typology

please message me or meet me at #NeurIPS2024!

21.11.2024 13:41 — 👍 20 🔁 8 💬 0 📌 0

A sparse mask of attention scores based on VerticalAndSlashAttention and a plot of loss vs sparsity ratio for various methods.

Another nano gem from my amazing student
Piotr Nawrot!

A repo & notebook on sparse attention for efficient LLM inference: github.com/PiotrNawrot/...

This will also feature in my #NeurIPS 2024 tutorial "Dynamic Sparsity in ML" with André Martins: dynamic-sparsity.github.io Stay tuned!

20.11.2024 12:51 — 👍 43 🔁 8 💬 2 📌 3

Explore zero-shot routing of parameter-efficient experts with Phatgoose arxiv.org/abs/2402.05859 and Arrow arxiv.org/abs/2405.11157 w. github.com/microsoft/mttl

👉 github.com/sordonia/pg_mb…

Part of "Dynamic Sparsity in ML" tuto #neurips2024, feedback welcome and join for discussions! 😊

21.11.2024 15:47 — 👍 5 🔁 1 💬 0 📌 0

@murefil is following 20 prominent accounts

danilobzdok
@danilobzdok

Research director | @McGillU @Mila_Quebec @IVADO_Qc | My team designs machine learning frameworks to understand biological systems from new angles of attack

@akshaykr

Vidhisha Balachandran
@vidhishab

AI Evaluation and Interpretability @MicrosoftResearch, Prev PhD @CMU.

Suriya Gunasekar
@suriyag

Katja Hofmann
@katjahofmann

At Microsoft Research. Lead of https://aka.ms/game-intelligence - we drive innovation in machine learning with applications in games. https://iclr.cc Board.

Romain Deveaud
@romaindeveaud

Senior Tech Lead @qwant.bsky.social. PhD in Information Retrieval. ex @TerrierTeam @UofGlasgow | ex @Yellow_Pages | @UnivAvignon alumni.

Edvin LZ
@edvinli

ML Research Scientist, PhD https://edvinli.github.io

Spandana Gella
@spandanagella

Sr Mgr & Research Scientist @ServiceNowRSRCH, Montreal

Marco Ciccone
@mcicc

Postdoctoral Fellow Vector Institute Collaborative, Decentralized, Modular ML https://marcociccone.github.io/

Mitchell Wortsman
@mitchellnw

PrimeIntellect
@primeintellect

Mimansa Jaiswal
@mimansaj

Robustness, Data & Annotations, Evaluation & Interpretability in LLMs http://mimansajaiswal.github.io/

Arthur Douillard
@douillard

distributed (diloco) + modularity (dipaco) + llm @ deepmind | continual learning phd @ sorbonne

Alexandra Olteanu
@aolteanu

Ethical/Responsible AI. Rigor in AI. Opinions my own. Principal Researcher @ Microsoft Research. Grumpy eastern european in north america. Lovingly nitpicky.

Stephanie Hyland
@hylandsl

machine learning for health at microsoft research, based in cambridge UK 🌻 she/her

danah boyd
@zephoria

STS researcher who likes to look at things sideways. Founded Data & Society. Topics: Census | Youth | Data | Society Professor of Communication, Cornell https://made-not-found-by-danah-boyd.ghost.io

Jan Hermann
@jan.hermann.name

Computational chemistry & physics, electrons, deep learning 🚲☕️♟️ Microsoft Research AI for Science · https://jan.hermann.name

Javier Gonzalez
@javiergonzalezh

Senior Principal Researcher @MSRCambridge. Working on reasoning in language models.

Alex Lu
@alexijie

Senior Researcher at Microsoft Research New England. ML/AI for biological discovery: microscopy, proteins, single-cell.

Lorin Crawford
@lcrawford

Principal Researcher in BioML at Microsoft Research.