Ramchalam K R

@ramchalamkr.bsky.social

ML Research Engineer working at the intersection of model training and efficient inference on NPUs

115 Followers  |  738 Following  |  15 Posts  |  Joined: 04.12.2024

Posts by Ramchalam K R (@ramchalamkr.bsky.social)

@qualcomm.bsky.social @machinelearning.bsky.social

19.09.2025 11:41 — 👍 0    🔁 0    💬 0    📌 0

Excited to announce that our paper from #Qualcomm Canada has been accepted at #NeurIPS2025: "OmniDraft: A Cross-vocabulary, Online Adaptive Drafter for On-device Speculative Decoding".
Looking forward to sharing our work at NeurIPS 2025. @neuripsconf.bsky.social
Preprint - arxiv.org/abs/2507.02659

19.09.2025 11:40 — 👍 1    🔁 0    💬 1    📌 0
Thanks everyone for the wonderful and engaging discussions during our poster sessions and live demo of our work - Stepping Forward On The Last Mile. Thank you 😊.
arxiv.org/html/2411.04...

@neuripsconf.bsky.social
@qualcomm.bsky.social

14.12.2024 00:09 — 👍 2    🔁 0    💬 0    📌 0

Gosh, thanks. I was confused about whether this was another pack. Apologies :)

14.12.2024 00:06 — 👍 1    🔁 0    💬 1    📌 0

I'd like to be added. Thanks

14.12.2024 00:02 — 👍 1    🔁 0    💬 1    📌 0

Alright, thank you for the clarification.

11.12.2024 22:19 — 👍 0    🔁 0    💬 0    📌 0

I worked on structured pruning of weights a few years ago, and because we were focused on deployment to edge devices, it was critical. This approach on the activations/attention heads is interesting, although the inference graph of the base model wouldn't really change. Would love to discuss further.

10.12.2024 16:11 — 👍 0    🔁 0    💬 0    📌 0

I do have a few points of clarification. Maybe I will drop by the poster session.
Stage 1 looks to me like structured pruning on the activations. What I am curious about is whether this approach helps improve inference. I presume we wouldn't need to do the compute for certain heads.

10.12.2024 16:08 — 👍 0    🔁 0    💬 2    📌 0
NeurIPS Poster: Stepping Forward on the Last Mile (NeurIPS 2024)

NeurIPS 2024 poster link
neurips.cc/virtual/2024...

@neuripsconf.bsky.social

10.12.2024 02:29 — 👍 2    🔁 0    💬 0    📌 0

Very interesting work on LoFiT!

08.12.2024 22:07 — 👍 1    🔁 0    💬 1    📌 0

I'd like to be added to the pack, as I will be at NeurIPS 2024 as well. Thanks

08.12.2024 21:06 — 👍 1    🔁 0    💬 1    📌 0

@neuripsconf.bsky.social @qualcomm.bsky.social

08.12.2024 16:15 — 👍 0    🔁 0    💬 1    📌 0
Qualcomm at NeurIPS 2024: Our groundbreaking innovations and cutting-edge advancements in AI | Qualcomm
2024 is notable for the remarkable advancements in generative artificial intelligence (GenAI), and Qualcomm Technologies is at the forefront of bringing these capabilities to edge devices. Our Qualcom...

Qualcomm at NeurIPS 2024: Our groundbreaking innovations and cutting-edge advancements in AI.
www.qualcomm.com/news/onq/202...

If you're attending NeurIPS 2024 in Vancouver, be sure to visit us at Qualcomm's booth (#533) and at our poster (#6102) on Wednesday: Stepping Forward on the Last Mile.

08.12.2024 16:14 — 👍 2    🔁 0    💬 1    📌 0

@neuripsconf.bsky.social

06.12.2024 15:57 — 👍 1    🔁 0    💬 0    📌 0
Stepping Forward on the Last Mile

Our work at Qualcomm AI Research, Qualcomm Canada, was accepted at NeurIPS 2024. This work presents on-device training for any network (transformer, convolutional, or RNN architecture) using fixed-point (quantized) forward gradients (no backprop).
Paper link: arxiv.org/html/2411.04...

#Neurips2024

06.12.2024 15:55 — 👍 1    🔁 0    💬 1    📌 0
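
For readers unfamiliar with the "forward gradients (no backprop)" idea mentioned in the post above, here is a minimal floating-point sketch of the general technique. It is not the paper's fixed-point method: the toy quadratic loss, the `TARGET` vector, the central-difference probe, and all function names are assumptions made for illustration only. The gradient is estimated from forward evaluations alone by measuring the loss slope along a random direction and scaling that direction by the slope.

```python
import random

# Hypothetical toy problem: fit w to TARGET under a squared-error loss.
TARGET = [1.0, -2.0, 0.5]

def loss(w):
    """Squared distance between the weights and the (assumed) target."""
    return sum((wi - ti) ** 2 for wi, ti in zip(w, TARGET))

def forward_gradient(f, w, rng, eps=1e-4):
    """One-sample gradient estimate using only forward evaluations of f.

    Samples a random direction v, estimates the directional derivative of f
    along v with a central difference (an assumption; forward-mode autodiff
    could be used instead), and returns that slope times v.
    """
    v = [rng.gauss(0.0, 1.0) for _ in w]
    w_plus = [wi + eps * vi for wi, vi in zip(w, v)]
    w_minus = [wi - eps * vi for wi, vi in zip(w, v)]
    slope = (f(w_plus) - f(w_minus)) / (2.0 * eps)
    return [slope * vi for vi in v]

def train(steps=2000, lr=0.01, seed=0):
    """Plain SGD driven by forward-gradient estimates; no backward pass."""
    rng = random.Random(seed)
    w = [0.0 for _ in TARGET]
    for _ in range(steps):
        g = forward_gradient(loss, w, rng)
        w = [wi - lr * gi for wi, gi in zip(w, g)]
    return w

if __name__ == "__main__":
    w = train()
    print("final loss:", loss(w))
```

Each step costs two forward evaluations and stores no activations for a backward pass, which is the property that makes this family of methods attractive for memory-constrained devices. A quantized variant would additionally need fixed-point arithmetic for the weights and perturbations, which this sketch does not attempt.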