Luca Venerando Greco: Algorithms and optimizations for global non-linear hybrid fluid-kinetic finite element stellarator simulations https://arxiv.org/abs/2511.16412 https://arxiv.org/pdf/2511.16412 https://arxiv.org/html/2511.16412
21.11.2025 06:48 β π 1 π 5 π¬ 0 π 0
Wang, Pang, Wu, Jun, Romero, Taka, Marculescu, Nowatzki, Vasireddy, Melber, Chen, Cong: Can Asymmetric Tile Buffering Be Beneficial? https://arxiv.org/abs/2511.16041 https://arxiv.org/pdf/2511.16041 https://arxiv.org/html/2511.16041
21.11.2025 06:30 β π 0 π 2 π¬ 0 π 0
Montserrat, Verma, Barrab\'es, de la Vega, Bustamante, Ioannidis: Efficient Chromosome Parallelization for Precision Medicine Genomic Workflows https://arxiv.org/abs/2511.15977 https://arxiv.org/pdf/2511.15977 https://arxiv.org/html/2511.15977
21.11.2025 06:30 β π 0 π 4 π¬ 0 π 0
[2025-11-21 Fri (UTC), no new articles found for csPF Performance]
21.11.2025 06:33 β π 0 π 0 π¬ 0 π 0
Peiming Yang, Sankeerth Durvasula, Ivan Fernandez, Mohammad Sadrosadati, Onur Mutlu, Gennady Pekhimenko, Christina Giannoula: A Tensor Compiler for Processing-In-Memory Architectures https://arxiv.org/abs/2511.15503 https://arxiv.org/pdf/2511.15503 https://arxiv.org/html/2511.15503
20.11.2025 06:29 β π 0 π 3 π¬ 0 π 0
M. Sapkas, A. Triossi, M. Zanetti: A Latency-Constrained, Gated Recurrent Unit (GRU) Implementation in the Versal AI Engine https://arxiv.org/abs/2511.15626 https://arxiv.org/pdf/2511.15626 https://arxiv.org/html/2511.15626
20.11.2025 06:33 β π 0 π 0 π¬ 0 π 0
Kexin Chu, Dawei Xiang, Zixu Shen, Yiwei Yang, Zecheng Liu, Wei Zhang: Dynamic Expert Quantization for Scalable Mixture-of-Experts Inference https://arxiv.org/abs/2511.15015 https://arxiv.org/pdf/2511.15015 https://arxiv.org/html/2511.15015
20.11.2025 06:33 β π 0 π 2 π¬ 0 π 0
[2025-11-20 Thu (UTC), 2 new articles found for csPF Performance]
20.11.2025 06:33 β π 0 π 0 π¬ 0 π 0
I-Ting Lee, Bao-Kai Wang, Liang-Chi Chen, Wen Sheng Lim, Da-Wei Chang, Yu-Ming Chang, Chieng-Chung Ho: PIM or CXL-PIM? Understanding Architectural Trade-offs Through Large-Scale Benchmarking https://arxiv.org/abs/2511.14400 https://arxiv.org/pdf/2511.14400 https://arxiv.org/html/2511.14400
19.11.2025 06:31 β π 0 π 1 π¬ 0 π 0
Arun Thangamani, Md Asghar Ahmad Shahid, Adam Siemieniuk, Rolf Morel, Renato Golin, Alexander Heinecke: Library Liberation: Competitive Performance Matmul Through Compiler-composed Nanokernels https://arxiv.org/abs/2511.13764 https://arxiv.org/pdf/2511.13764 https://arxiv.org/html/2511.13764
19.11.2025 06:32 β π 1 π 3 π¬ 0 π 0
Maksymilian Graczyk, Vincent Desbiolles, Stefan Roiser, Andrea Guerrieri: Enabling Heterogeneous Performance Analysis for Scientific Workloads https://arxiv.org/abs/2511.13928 https://arxiv.org/pdf/2511.13928 https://arxiv.org/html/2511.13928
19.11.2025 06:33 β π 0 π 0 π¬ 0 π 0
[2025-11-19 Wed (UTC), 1 new article found for csPF Performance]
19.11.2025 06:33 β π 0 π 0 π¬ 0 π 0
Iulius Gherasim, Carlos Garc\'ia S\'anchez: Hardware optimization on Android for inference of AI models https://arxiv.org/abs/2511.13453 https://arxiv.org/pdf/2511.13453 https://arxiv.org/html/2511.13453
18.11.2025 06:37 β π 0 π 1 π¬ 0 π 0
Taras Sereda, Tom St. John, Burak Bartan, Natalie Serrino, Sachin Katti, Zain Asgar: KForge: Program Synthesis for Diverse AI Hardware Accelerators https://arxiv.org/abs/2511.13274 https://arxiv.org/pdf/2511.13274 https://arxiv.org/html/2511.13274
18.11.2025 06:36 β π 0 π 4 π¬ 0 π 0
Xiao, Luo, Huang, Yang, Sui, Phan, Zang, Ying, Tang, Anandkumar, Yuan: EcoSpa: Efficient Transformer Training with Coupled Sparsity https://arxiv.org/abs/2511.11641 https://arxiv.org/pdf/2511.11641 https://arxiv.org/html/2511.11641
18.11.2025 06:33 β π 0 π 2 π¬ 0 π 0
\'Alvaro Corrochano L\'opez, Carlos Garc\'ia S\'anchez: Evaluation of Domain-Specific Architectures for General-Purpose Applications in Apple Silicon https://arxiv.org/abs/2511.13450 https://arxiv.org/pdf/2511.13450 https://arxiv.org/html/2511.13450
18.11.2025 06:33 β π 0 π 1 π¬ 0 π 0
Fabian B\"ohm, Nils Kohl, Harald K\"ostler, Ulrich R\"ude: Large-scale Multigrid with Adaptive Galerkin Coarsening https://arxiv.org/abs/2511.13109 https://arxiv.org/pdf/2511.13109 https://arxiv.org/html/2511.13109
18.11.2025 06:33 β π 0 π 0 π¬ 0 π 0
[2025-11-18 Tue (UTC), 2 new articles found for csPF Performance]
18.11.2025 06:33 β π 0 π 0 π¬ 0 π 0
[2025-11-17 Mon (UTC), no new articles found for csPF Performance]
17.11.2025 06:38 β π 0 π 0 π¬ 0 π 0
Yi, Duan, Hu, Hua, Zhao, Qian, Yang, Cao, Tang, Yu, Liao, Wang, Zhang: EDGC: Entropy-driven Dynamic Gradient Compression for Efficient LLM Training https://arxiv.org/abs/2511.10333 https://arxiv.org/pdf/2511.10333 https://arxiv.org/html/2511.10333
14.11.2025 06:34 β π 0 π 1 π¬ 0 π 0
Mani Tofigh, Edward Guo, Weiwei Jia, Xiaoning Ding, Jianchen Shan: Optimizing CPU Cache Utilization in Cloud VMs with Accurate Cache Abstraction https://arxiv.org/abs/2511.09956 https://arxiv.org/pdf/2511.09956 https://arxiv.org/html/2511.09956
14.11.2025 06:30 β π 0 π 2 π¬ 0 π 0
Fr\'ed\'eric Berdoz, Peer Rheinboldt, Roger Wattenhofer: Steering Pretrained Drafters during Speculative Decoding https://arxiv.org/abs/2511.09844 https://arxiv.org/pdf/2511.09844 https://arxiv.org/html/2511.09844
14.11.2025 06:33 β π 0 π 1 π¬ 0 π 0
Van Delm, Lydike, Dumoulin, Crols, Yi, Antonio, Woodruff, Grosser, Verhelst: The Configuration Wall: Characterization and Elimination of Accelerator Configuration Overhead https://arxiv.org/abs/2511.10397 https://arxiv.org/pdf/2511.10397 https://arxiv.org/html/2511.10397
14.11.2025 06:33 β π 0 π 0 π¬ 0 π 0
[2025-11-14 Fri (UTC), 1 new article found for csPF Performance]
14.11.2025 06:33 β π 0 π 0 π¬ 0 π 0
Yusuf Motiwala: No Cords Attached: Coordination-Free Concurrent Lock-Free Queues https://arxiv.org/abs/2511.09410 https://arxiv.org/pdf/2511.09410 https://arxiv.org/html/2511.09410
13.11.2025 06:30 β π 0 π 2 π¬ 0 π 0
Jay Tharwani, Shobhit Aggarwal, Arnab A Purkayastha: Evaluating HPC-Style CPU Performance and Cost in Virtualized Cloud Infrastructures https://arxiv.org/abs/2511.08948 https://arxiv.org/pdf/2511.08948 https://arxiv.org/html/2511.08948
13.11.2025 06:30 β π 0 π 1 π¬ 0 π 0
Punit Kumar, Asif Imran, Tevfik Kosar: Energy Consumption of Dataframe Libraries for End-to-End Deep Learning Pipelines:A Comparative Analysis https://arxiv.org/abs/2511.08644 https://arxiv.org/pdf/2511.08644 https://arxiv.org/html/2511.08644
13.11.2025 06:34 β π 0 π 2 π¬ 0 π 0
Sixiang Zhou, Nan Deng, Krzysiek Rzadca, Charlie Y. Hu, Xiaojun Lin: PANDA: Noise-Resilient Antagonist Identification in Production Datacenters https://arxiv.org/abs/2511.08803 https://arxiv.org/pdf/2511.08803 https://arxiv.org/html/2511.08803
13.11.2025 06:33 β π 0 π 0 π¬ 0 π 0
[2025-11-13 Thu (UTC), 1 new article found for csPF Performance]
13.11.2025 06:33 β π 0 π 0 π¬ 0 π 0
Tianyu Fu, Yichen You, Zekai Chen, Guohao Dai, Huazhong Yang, Yu Wang: Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models https://arxiv.org/abs/2511.08577 https://arxiv.org/pdf/2511.08577 https://arxiv.org/html/2511.08577
12.11.2025 06:30 β π 0 π 3 π¬ 0 π 0