Oshayer Siddique, J. M Areeb Uzair Alam, Md Jobayer Rahman Rafy, Syed Rifat Raiyan, Hasan Mahmud, Md Kamrul Hasan
PhysicsEval: Inference-Time Techniques to Improve the Reasoning Proficiency of Large Language Models on Physics Problems
https://arxiv.org/abs/2508.00079
04.08.2025 07:17 — 👍 0 🔁 0 💬 0 📌 0
Kelly Kendro, Jeffrey Maloney, Scott Jarvis
Do LLMs produce texts with "human-like" lexical diversity?
https://arxiv.org/abs/2508.00086
04.08.2025 07:16 — 👍 0 🔁 0 💬 0 📌 0
Zachary K. Stine, James E. Deitrick
Semiotic Complexity and Its Epistemological Implications for Modeling Culture
https://arxiv.org/abs/2508.00095
04.08.2025 07:16 — 👍 0 🔁 0 💬 0 📌 0
Mingda Chen, Yang Li, Xilun Chen, Adina Williams, Gargi Ghosh, Scott Yih
FACTORY: A Challenging Human-Verified Prompt Set for Long-Form Factuality
https://arxiv.org/abs/2508.00109
04.08.2025 07:00 — 👍 0 🔁 0 💬 0 📌 0
Xiao Zhang, Johan bos
Is neural semantic parsing good at ellipsis resolution, or isn't it?
https://arxiv.org/abs/2508.00121
04.08.2025 07:00 — 👍 0 🔁 0 💬 0 📌 0
Alper Yaman, Jannik Schwab, Christof Nitsche, Abhirup Sinha, Marco Huber
Comparison of Large Language Models for Deployment Requirements
https://arxiv.org/abs/2508.00185
04.08.2025 06:59 — 👍 0 🔁 0 💬 0 📌 0
Xiaofeng Wu, Alan Ritter, Wei Xu
Tabular Data Understanding with LLMs: A Survey of Recent Advances and Challenges
https://arxiv.org/abs/2508.00217
04.08.2025 06:59 — 👍 0 🔁 0 💬 0 📌 0
Rana Aref Salama, Abdou Youssef, Mona Diab
Semantic Compression for Word and Sentence Embeddings using Discrete Wavelet Transform
https://arxiv.org/abs/2508.00220
04.08.2025 06:58 — 👍 0 🔁 0 💬 0 📌 0
Bryce Anderson, Riley Galpin, Tom S. Juzek
Model Misalignment and Language Change: Traces of AI-Associated Language in Unscripted Spoken English
https://arxiv.org/abs/2508.00238
04.08.2025 06:58 — 👍 0 🔁 0 💬 0 📌 0
Peixian Li, Yu Tian, Ruiqi Tu, Chengkai Wu, Jingjing Ren, Jingsong Li
Integrating clinical reasoning into large language model-based diagnosis through etiology-aware attention steering
https://arxiv.org/abs/2508.00285
04.08.2025 06:46 — 👍 0 🔁 0 💬 0 📌 0
Ammar Ahmed, Sheng Di, Franck Cappello, Zirui Liu, Jingoo Han, Ali Anwar
Systematic Evaluation of Optimization Techniques for Long-Context Language Models
https://arxiv.org/abs/2508.00305
04.08.2025 06:46 — 👍 0 🔁 0 💬 0 📌 0
Kaiyan Zhao, Zhongtao Miao, Yoshimasa Tsuruoka
Improving Multimodal Contrastive Learning of Sentence Embeddings with Object-Phrase Alignment
https://arxiv.org/abs/2508.00332
04.08.2025 06:45 — 👍 0 🔁 0 💬 0 📌 0
Keer Lu, Chong Chen, Bin Cui, Huang Leng, Wentao Zhang
PilotRL: Training Language Model Agents via Global Planning-Guided Progressive Reinforcement Learning
https://arxiv.org/abs/2508.00344
04.08.2025 06:45 — 👍 0 🔁 0 💬 0 📌 0
Alan Dao (Gia Tuan Dao), Dinh Bach Vu, Alex Nguyen, Norapat Buppodom
Lucy: edgerunning agentic web search on mobile with machine generated task vectors
https://arxiv.org/abs/2508.00360
04.08.2025 06:44 — 👍 0 🔁 0 💬 0 📌 0
Jiyu Chen, Poh Seng Lim, Shuang Peng, Daxiong Luo, JungHau Foo, Yap Deep, Timothy Lee Jun Jie, Kelvin Teh Kae Wen, Fan Yang, Danyu Feng, Hao-Yun Chen, ...
EdgeInfinite-Instruct: Bridging SFT-Based Optimization and NPU-Level Efficiency for Edge Devices
https://arxiv.org/abs/2508.00370
04.08.2025 06:44 — 👍 0 🔁 0 💬 0 📌 0
Dingzirui Wang, Xuangliang Zhang, Keyan Xu, Qingfu Zhu, Wanxiang Che, Yang Deng
Multi-Layer Attention is the Amplifier of Demonstration Effectiveness
https://arxiv.org/abs/2508.00385
04.08.2025 06:38 — 👍 0 🔁 0 💬 0 📌 0
Hengxing Cai, Jinhan Dong, Yijie Rao, Jingcheng Deng, Jingjun Tan, Qien Chen, Haidong Wang, Zhen Wang, Shiyu Huang, Agachai Sumalee, Renxin Zhong
SA-GCS: Semantic-Aware Gaussian Curriculum Scheduling for UAV Vision-Language Navigation
https://arxiv.org/abs/2508.00390
04.08.2025 06:38 — 👍 0 🔁 0 💬 0 📌 0
Rana Salama, Abdou Youssef, Mona Diab
Combining Discrete Wavelet and Cosine Transforms for Efficient Sentence Embedding
https://arxiv.org/abs/2508.00420
04.08.2025 06:37 — 👍 0 🔁 0 💬 0 📌 0
Minghao Guo, Xi Zhu, Jingyuan Huang, Kai Mei, Yongfeng Zhang
ReaGAN: Node-as-Agent-Reasoning Graph Agentic Network
https://arxiv.org/abs/2508.00429
04.08.2025 06:36 — 👍 0 🔁 0 💬 0 📌 0
Yuqi Tang, Kehua Feng, Yunfeng Wang, Zhiwen Chen, Chengfei Lv, Gang Yu, Qiang Zhang, Keyan Ding
Learning an Efficient Multi-Turn Dialogue Evaluator from Multiple Judges
https://arxiv.org/abs/2508.00454
04.08.2025 06:36 — 👍 0 🔁 0 💬 0 📌 0
Jeongwoo Kang, Markarit Vartampetian, Felix Herron, Yongxin Zhou, Diandra Fabre, Gabriela Gonzalez-Saez
GETALP@AutoMin 2025: Leveraging RAG to Answer Questions based on Meeting Transcripts
https://arxiv.org/abs/2508.00476
04.08.2025 06:35 — 👍 0 🔁 0 💬 0 📌 0
Yixuan Tang, Jincheng Wang, Anthony K. H. Tung
The Missing Parts: Augmenting Fact Verification with Half-Truth Detection
https://arxiv.org/abs/2508.00489
04.08.2025 06:11 — 👍 0 🔁 0 💬 0 📌 0
Jiaxin Deng, Qingcheng Zhu, Junbiao Pang, Linlin Yang, Zhongqian Fu, Baochang Zhang
EFlat-LoRA: Efficiently Seeking Flat Minima for Better Generalization in Fine-Tuning Large Language Models and Beyond
https://arxiv.org/abs/2508.00522
04.08.2025 06:11 — 👍 0 🔁 0 💬 0 📌 0
Giulio Zhou, Tsz Kin Lam, Alexandra Birch, Barry Haddow
The Prosody of Emojis
https://arxiv.org/abs/2508.00537
04.08.2025 06:10 — 👍 0 🔁 0 💬 0 📌 0
Joonas Tapaninaho, Mourad Oussala
PaPaformer: Language Model from Pre-trained Paraller Paths
https://arxiv.org/abs/2508.00544
04.08.2025 06:10 — 👍 0 🔁 0 💬 0 📌 0
Jianwei Wang, Ziming Wu, Fuming Lai, Shaobing Lian, Ziqian Zeng
SynAdapt: Learning Adaptive Reasoning in Large Language Models via Synthetic Continuous Chain-of-Thought
https://arxiv.org/abs/2508.00574
04.08.2025 06:09 — 👍 0 🔁 0 💬 0 📌 0
Mingruo Yuan, Shuyi Zhang, Ben Kao
A Context-Aware Dual-Metric Framework for Confidence Estimation in Large Language Models
https://arxiv.org/abs/2508.00600
04.08.2025 06:09 — 👍 0 🔁 0 💬 0 📌 0
Farhana Haque, Md. Abdur Rahman, Sumon Ahmed
GHTM: A Graph based Hybrid Topic Modeling Approach in Low-Resource Bengali Language
https://arxiv.org/abs/2508.00605
04.08.2025 05:57 — 👍 0 🔁 0 💬 0 📌 0
Lennart Meincke, Ethan Mollick, Lilach Mollick, Dan Shapiro
Prompting Science Report 3: I'll pay you or I'll kill you -- but will you care?
https://arxiv.org/abs/2508.00614
04.08.2025 05:57 — 👍 0 🔁 0 💬 0 📌 0
Shantanu Thorat, Andrew Caines
DACTYL: Diverse Adversarial Corpus of Texts Yielded from Large Language Models
https://arxiv.org/abs/2508.00619
04.08.2025 05:56 — 👍 0 🔁 0 💬 0 📌 0