Aksel Alpay

@illuhad.bsky.social

Working at Uni Heidelberg. I do AdaptiveCpp and HPC. Making things go fast!

62 Followers  |  112 Following  |  10 Posts  |  Joined: 23.11.2024

Latest posts by illuhad.bsky.social on Bluesky

#AdaptiveCpp 25.10 released! Introducing a deployment model (compile once, ship to NVIDIA/AMD/Intel GPUs and CPUs), performance improvements and more! github.com/AdaptiveCpp/...

05.11.2025 16:37 | 👍 2    🔁 0    💬 0    📌 0

Adaptivity in AdaptiveCpp: Optimizing Performance by Leveraging Runtime Information During JIT-Compilation | Proceedings of the 13th International Workshop on OpenCL and SYCL

My latest paper from @iwocl.bsky.social is now available. Here's one of #AdaptiveCpp's secrets revealed which explains why it often manages to outperform vendor compilers: dl.acm.org/doi/full/10....

10.07.2025 16:09 | 👍 1    🔁 0    💬 0    📌 0

#AdaptiveCpp 25.02 released! Introducing AdaptiveCpp portable CUDA (PCUDA): direct support for the CUDA and HIP languages in AdaptiveCpp's compiler. Compile CUDA code and obtain a single binary that runs on CPU, Intel GPU, NVIDIA GPU, and AMD GPU. And perhaps even faster than nvcc. github.com/AdaptiveCpp/...

09.05.2025 18:29 | 👍 2    🔁 0    💬 0    📌 0

So you want to support #SYCL? We're finding out on the last day of #IWOCL in the invited talk from Ben Ashbaugh (Intel) on his personal perspective for supporting SYCL with #OpenCL.

11.04.2025 07:05 | 👍 2    🔁 1    💬 0    📌 0

Next up at #IWOCL, Julian Oppermann (Codeplay Software) talks about fast compilation of #SYCL for code generated at runtime.

11.04.2025 07:37 | 👍 2    🔁 1    💬 0    📌 0

Virtual functions are important to C++ programs. Alexey Sachkov (Intel) proposes how #SYCL can be updated to remove the current restriction on them, making it work for those (like Kokkos) asking for it.
#IWOCL

11.04.2025 08:10 | 👍 3    🔁 1    💬 1    📌 0

Just running compute with #SYCL is great but you always want to *SEE* the output. Duncan Brawley (Codeplay Software) is presenting interoperability for images with the DirectX and Vulkan Graphics APIs.
#IWOCL

11.04.2025 09:21 | 👍 2    🔁 1    💬 0    📌 0

The #IWOCL 2025 Outstanding Full Paper Award Winner, Peter Thoman (UIBK) presents their work on optimising strided data movement across GPUs with Celerity and #SYCL

11.04.2025 09:44 | 👍 1    🔁 1    💬 0    📌 0

See you at #IWOCL 2026!
Keep an eye on iwocl.org for updates, and the archive for this and previous conferences.
#OpenCL #SYCL

11.04.2025 10:19 | 👍 2    🔁 1    💬 0    📌 0

More debugging talks at #IWOCL! This time for #OpenCL and #SYCL on CPUs where the code contains barriers. Presenting work from the team at Intel is Alexey Sachkov.

10.04.2025 13:21 | 👍 2    🔁 1    💬 0    📌 0

Tooling is a vital part of any programming ecosystem, and is particularly important for heterogeneous systems. Here at #IWOCL Rakesh Ganesh (Intel) shares work on debugging #SYCL on Intel GPUs in Visual Studio and VS Code.

10.04.2025 12:50 | 👍 1    🔁 1    💬 0    📌 0

Next up at #IWOCL, Aksel Alpay (Heidelberg University) talks about new Just-in-Time Optimisations in AdaptiveCpp for #SYCL. Ready to try in the latest compiler at github.com/AdaptiveCpp/...

10.04.2025 12:19 | 👍 1    🔁 1    💬 0    📌 0

Next up at #IWOCL, Moritz Heckmann (Saarland University) shows his approaches for implementing #SYCL sub-groups on CPUs.

10.04.2025 10:20 | 👍 1    🔁 1    💬 0    📌 0

The #SYCL backend to Kokkos was shared at #IWOCL last year, and Daniel Arndt (ORNL) is back this year to share experiences in targeting Frontier's AMD GPUs with SYCL through Kokkos.

10.04.2025 09:43 | 👍 1    🔁 1    💬 0    📌 0

Alexey Sachkov (Intel) shares how they are working on SPIR-V backend support in LLVM towards a future where #SYCL can be supported upstream in LLVM directly.
#IWOCL

10.04.2025 09:26 | 👍 2    🔁 1    💬 0    📌 0

Next, Alexey Sachkov (Intel) presents a vision for improving #SYCL compile times through changes to the SYCL header.
#IWOCL

10.04.2025 08:05 | 👍 2    🔁 1    💬 0    📌 0

Day 2 of #IWOCL begins with State of the Union updates from the #SYCL and #SYCL SC working groups. Here Lukas Sommer shares the update from SYCL SC, which brings standard heterogeneous programming to safety-critical systems.

10.04.2025 07:35 | 👍 1    🔁 1    💬 0    📌 0

Finishing today, Christoph Alt (Paderborn University) presents work evaluating #oneAPI I/O pipes in #SYCL for a Jacobi solver on an FPGA cluster.
#IWOCL

09.04.2025 15:24 | 👍 5    🔁 1    💬 1    📌 0

Xavier Hallade and Stefan Werner present Blender and the Cycles renderer implemented in #SYCL for cross-vendor support for 3D rendering, shipping in Blender today.
#IWOCL

09.04.2025 14:38 | 👍 3    🔁 1    💬 0    📌 0

GROMACS have been using #SYCL for a while now, and today, Andrey Alekseenko (KTH) and Ewan Crawford (Codeplay) present work using a SYCL-Graph extension (early versions of this have been presented at previous IWOCLs). This DPC++ extension is also used in Kokkos and Llama.cpp.
#IWOCL

09.04.2025 13:15 | 👍 5    🔁 3    💬 0    📌 0

Timothée David Cléris presents Shamrock: a Hydrodynamics (mainly Smooth Particle Hydro) Astrophysics code implemented using #SYCL
#IWOCL

09.04.2025 12:59 | 👍 5    🔁 3    💬 0    📌 0

Next up at #IWOCL, Siva Rama Krishna Reddy B and colleagues from Qualcomm share their work on Deep Learning Compilers on Adreno GPUs

09.04.2025 12:24 | 👍 4    🔁 2    💬 0    📌 0

Hongqiang Wang (Qualcomm) presenting work at #IWOCL on their optimised #OpenCL backend for the open-source llama.cpp large language model (LLM).

09.04.2025 10:25 | 👍 5    🔁 2    💬 0    📌 0

Jan Solanti (Tampere University) now shares his work on improving latency in PoCL-Remote at #IWOCL
#OpenCL

09.04.2025 09:56 | 👍 5    🔁 1    💬 0    📌 0

A key part of #IWOCL is the State of the Union updates on the programming models from the Khronos Working Groups. First up, #OpenCL, presented by Ben Ashbaugh (Intel).

09.04.2025 09:27 | 👍 7    🔁 3    💬 0    📌 0

117B grid cell simulation running using #OpenCL on a single system with two Xeon 6 with MRDRAM!!! #IWOCL

09.04.2025 08:07 | 👍 6    🔁 2    💬 0    📌 0

Next up at #IWOCL, we welcome our Keynote Speaker, Moritz Lehmann (Intel) to talk about his FluidX3D Lattice Boltzmann CFD software, written in #OpenCL. Sharing amazing optimisations to improve memory use and access.

09.04.2025 07:39 | 👍 7    🔁 1    💬 1    📌 1

The 13th #IWOCL continues today with the first of two and a half days of presentations and papers, starting with a warm welcome from local hosts, Prof. Vincent Heuveline and Aksel Alpay from the University of Heidelberg.

09.04.2025 07:08 | 👍 4    🔁 1    💬 0    📌 0

Big news: #AdaptiveCpp can now also compile CUDA or HIP code with its generic JIT compiler - and create a single binary that can offload kernels to CPU/Intel GPU/NVIDIA GPU/AMD GPU. Oh, and you can mix-and-match CUDA/HIP, SYCL and C++ standard parallelism as you like.

github.com/AdaptiveCpp/...

19.03.2025 00:57 | 👍 1    🔁 0    💬 0    📌 0

Thinking of submitting a talk, paper, or poster to @iwocl.bsky.social? Good news! You now have until Sunday January 19, AoE to do so!

10.01.2025 17:06 | 👍 0    🔁 2    💬 0    📌 0
