I had an amazing experience attending @fastcompany.com Most Innovative Companies Summit. Proud to represent Red Hat as one of the most innovative companies with my colleague @terrytangyuan.xyz
06.06.2025 05:17 β π 4 π 2 π¬ 0 π 0I had an amazing experience attending @fastcompany.com Most Innovative Companies Summit. Proud to represent Red Hat as one of the most innovative companies with my colleague @terrytangyuan.xyz
06.06.2025 05:17 β π 4 π 2 π¬ 0 π 0Check out the new episode Technically Speaking w/ Chris Wright - Scaling AI inference with open source ft. Brian Stevens red.ht/4dJiBLc
06.06.2025 01:10 β π 1 π 1 π¬ 0 π 0FP8-quantized version of Llama 4 Maverick can be downloaded from HuggingFace: huggingface.co/collections/...
05.04.2025 20:22 β π 0 π 0 π¬ 0 π 0The official release by Meta includes an FP8-quantized version of Llama 4 Maverick 128E supported by Red Hatβs LLM Compressor library, enabling the 128 expert model to fit on a single NVIDIA 8xH100 node, resulting in more performance with lower costs.
05.04.2025 20:20 β π 0 π 0 π¬ 0 π 0Thanks to the Meta AI team for close collaboration with the vLLM community, enabling developers to experiment with Llama 4 immediately. Our blog shares more details of the Llama 4 release, and how to get started with inferencing in vLLM today: developers.redhat.com/articles/202...
05.04.2025 20:19 β π 0 π 0 π¬ 2 π 0This is really nice! Thank you @stu.bsky.social
22.11.2024 05:47 β π 1 π 0 π¬ 0 π 0