Hugo Larcher's Avatar

Hugo Larcher

@hlarcher.bsky.social

ML Infra engineer @huggingface. HPC and ML infra.

1,536 Followers  |  115 Following  |  2 Posts  |  Joined: 22.09.2023
Posts Following

Posts by Hugo Larcher (@hlarcher.bsky.social)

This first step will very soon be followed by the integration of new backends (TRT-LLM, llama.cpp, vLLM, Neuron and TPU).

We are polishing the TensorRT-LLM backend which achieves impressive performances on NVIDIA GPUs, stay tuned 🀩!

16.01.2025 09:39 β€” πŸ‘ 0    πŸ” 0    πŸ’¬ 0    πŸ“Œ 0
Preview
Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference We’re on a journey to advance and democratize artificial intelligence through open source and open science.

We are introducing multi-backend support in Hugging Face πŸ€—Text Generation Inference!
With new TGI architecture we are now able to plug new modeling backends to get best performances according to selected model and available hardware.

huggingface.co/blog/tgi-mul...

16.01.2025 09:39 β€” πŸ‘ 5    πŸ” 0    πŸ’¬ 2    πŸ“Œ 0
Preview
From Files to Chunks: Improving HF Storage Efficiency We’re on a journey to advance and democratize artificial intelligence through open source and open science.

When XetHub joined Hugging Face, we brainstormed how to share our tech with the community.

The magic? Versioning chunks, not files, giving rise to:

🧠 Smarter storage
⏩ Faster uploads
πŸš€ Efficient downloads

Curious? Read the blog and let us know how it could help your workflows!

20.11.2024 18:51 β€” πŸ‘ 33    πŸ” 14    πŸ’¬ 1    πŸ“Œ 2