And on a fun note, 4 years after completing the computer vision holy trinity (CVPR, ICCV, ECCV), finally completed the machine learning conference trinity (NeurIPS, ICML, ICLR). n apparently = 6
22.01.2025 17:12
We will update the paper with the latest results, but the findings are identical to the current arXiv version: arxiv.org/abs/2410.17174
On a personal note, I have always wanted to visit Singapore, and this seems like the perfect way to do so. n/n, n=5
I would like to thank my co-authors: Prannay Kaul, who interned with me during the project and was its main driving force; my previous intern Chengcheng Ma, who ran so many experiments during the project and the rebuttal; and of course Jiankang Deng, who advised us. 4/n
22.01.2025 17:08
While providing some real-world utility in terms of Transformer quantization. For practical reasons, most of our experiments were on GPT-2 models, but our preliminary experiments show that everything holds for modern LLMs such as the Llama family. 3/n
22.01.2025 17:07
and provides some simple and practical solutions to the problems of channel outliers and first-token dominance in autoregressive Transformers (your LLMs). 2/n
22.01.2025 17:07
First accepted paper of the year: "From Attention to Activation: Unraveling the Enigmas of Large Language Models" has been accepted to ICLR 2025. The most educational paper I have co-written, it strengthens some claims known in the community, it opposes others, 1/n
22.01.2025 17:06
We offer long internships (6+ months), competitive salaries, an office in the center of London, and a very diverse group (very gender-balanced, researchers from 8 countries working on a wide range of topics).
09.12.2024 13:04
have a topic match (VLLMs, LLMs, multimodal learning, or diffusion) and are interested in doing an internship at Huawei Research Center in London, please write to me and let's have a chat at the conference.
09.12.2024 13:04
As always, I am even happier to chat during the conference with other researchers, especially junior ones. If you are presenting a paper at NeurIPS (or have first-author papers in equivalent conferences such as ICML, ICLR, CVPR, ICCV or ECCV),
09.12.2024 13:04
I am very happy to attend NeurIPS in Vancouver, where together with Roy Miles we will be presenting our VeLora paper on Thu 12 Dec, 4:30 p.m. to 7:30 p.m. PST.
09.12.2024 13:03
Hey Kostas, would love to be in this.
26.11.2024 15:20
Hey, I would love to join this.
26.11.2024 15:20
Hey, I would love to be added. :)
26.11.2024 15:17