Pranav Bhandari, Nicolas Fay, Sanjeevan Selvaganapathy, Amitava Datta, Usman Naseem, Mehwish Nasim
Activation-Space Personality Steering: Hybrid Layer Selection for Stable Trait Control in LLMs
https://arxiv.org/abs/2511.03738
@arxiv-cs-cl.bsky.social
Computer Science -- Computation and Language source: export.arxiv.org/rss/cs.CL maintainer: @tmaehara.bsky.social
Pranav Bhandari, Nicolas Fay, Sanjeevan Selvaganapathy, Amitava Datta, Usman Naseem, Mehwish Nasim
Activation-Space Personality Steering: Hybrid Layer Selection for Stable Trait Control in LLMs
https://arxiv.org/abs/2511.03738
Eugenius Mario Situmorang, Adila Alfa Krisnadhi, Ari Wibisono
TextualVerifier: Verify TextGrad Step-by-Step
https://arxiv.org/abs/2511.03739
Stergios Chatzikyriakidis, Dimitris Papadakis, Sevasti-Ioanna Papaioannou, Erofili Psaltaki
GRDD+: An Extended Greek Dialectal Dataset with Cross-Architecture Fine-tuning Evaluation
https://arxiv.org/abs/2511.03772
Jan Koco\'n, Maciej Piasecki, Arkadiusz Janz, Teddy Ferdinan, {\L}ukasz Radli\'nski, Bart{\l}omiej Koptyra, Marcin Oleksy, Stanis{\l}aw Wo\'zniak, Pawe{\l} Walkowiak, Konrad Wojtasik, Julia Moska, ...
PLLuM: A Family of Polish Large Language Models
https://arxiv.org/abs/2511.03823
Mohammad Atif Quamar, Mohammad Areeb, Mikhail Kuznetsov, Muslum Ozgur Ozmen, Z. Berkay Celik
STARS: Segment-level Token Alignment with Rejection Sampling in Large Language Models
https://arxiv.org/abs/2511.03827
Miko{\l}aj Langner, Jan Eliasz, Ewa Rudnicka, Jan Koco\'n
Divide, Cache, Conquer: Dichotomic Prompting for Efficient Multi-Label LLM-Based Classification
https://arxiv.org/abs/2511.03830
Hellina Hailu Nigatu, Bethelhem Yemane Mamo, Bontu Fufa Balcha, Debora Taye Tesfaye, Elbethel Daniel Zewdie, Ikram Behiru Nesiru, Jitu Ewnetu Hailu, Senait Mengesha Yayo
Evaluating Machine Translation Datasets for Low-Web Data Languages: A Gendered Lens
https://arxiv.org/abs/2511.03880
Manh Nguyen, Sunil Gupta, Dai Do, Hung Le
GRAD: Graph-Retrieved Adaptive Decoding for Hallucination Mitigation
https://arxiv.org/abs/2511.03900
Alvin Wei Ming Tan, Ben Prystawski, Veronica Boyce, Michael C. Frank
Context informs pragmatic interpretation in vision-language models
https://arxiv.org/abs/2511.03908
Stefano M. Iacus, Devika Jain, Andrea Nasuto, Giuseppe Porro, Marcello Carammia, Andrea Vezzulli
The Human Flourishing Geographic Index: A County-Level Dataset for the United States, 2013--2023
https://arxiv.org/abs/2511.03915
Fu-Chun Yang, Jason Eshraghian
Direct Semantic Communication Between Large Language Models via Vector Translation
https://arxiv.org/abs/2511.03945
Shiyin Lin
Abductive Inference in Retrieval-Augmented Language Models: Generating and Validating Missing Premises
https://arxiv.org/abs/2511.04020
Dongji Gao, Chenda Liao, Changliang Liu, Matthew Wiesner, Leibny Paola Garcia, Daniel Povey, Sanjeev Khudanpur, Jian Wu
WST: Weakly Supervised Transducer for Automatic Speech Recognition
https://arxiv.org/abs/2511.04035
Shreya Havaldar, Helen Jin, Chaehyeon Kim, Anton Xue, Weiqiu You, Marco Gatti, Bhuvnesh Jain, Helen Qu, Daniel A Hashimoto, Amin Madani, Rajat Deo, Sameed Ahmed M. Khatana, ...
T-FIX: Text-Based Explanations with Features Interpretable to eXperts
https://arxiv.org/abs/2511.04070
Xinying Qian, Ying Zhang, Yu Zhao, Baohang Zhou, Xuhui Sui, Xiaojie Yuan
Plan of Knowledge: Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering
https://arxiv.org/abs/2511.04072
\v{S}pela Vintar, Jan Jona Javor\v{s}ek
The truth is no diaper: Human and AI-generated associations to emotional words
https://arxiv.org/abs/2511.04077
Eva Prakash, Maayane Attias, Pierre Chambon, Justin Xu, Steven Truong, Jean-Benoit Delbrouck, Tessa Cook, Curtis Langlotz
Improving the Performance of Radiology Report De-identification with Large-Scale Training and Benchmarking Against Cloud Vendor Methods
https://arxiv.org/abs/2511.04079
Moses Charikar, Chirag Pabbaraju, Ambuj Tewari
A Characterization of List Language Identification in the Limit
https://arxiv.org/abs/2511.04103
Wenmo Qiu, Saurabh Srivastava
Batch Prompting Suppresses Overthinking Reasoning Under Constraint: How Batch Prompting Suppresses Overthinking in Reasoning Models
https://arxiv.org/abs/2511.04108
Xinyuan Li, Murong Xu, Wenbiao Tao, Hanlun Zhu, Yike Zhao, Jipeng Zhang, Yunshi Lan
RIDE: Difficulty Evolving Perturbation with Item Response Theory for Mathematical Reasoning
https://arxiv.org/abs/2511.04120
Dazhong Chen (May), Yi-Cheng Lin (May), Yuchen Huang (May), Ziwei Gong (May), Di Jiang (May), Zeying Xie (May), Yi R. (May), Fung
CantoASR: Prosody-Aware ASR-LALM Collaboration for Low-Resource Cantonese
https://arxiv.org/abs/2511.04139
Fahim Ahmed, Md Mubtasim Ahasan, Jahir Sadik Monon, Muntasir Wahed, M Ashraful Amin, A K M Mahbubur Rahman, Amin Ahsan Ali
BAPPA: Benchmarking Agents, Plans, and Pipelines for Automated Text-to-SQL Generation
https://arxiv.org/abs/2511.04153
Mohammed Musthafa Rafi, Adarsh Krishnamurthy, Aditya Balu
Trustworthy LLM-Mediated Communication: Evaluating Information Fidelity in LLM as a Communicator (LAAC) Framework in Multiple Application Domains
https://arxiv.org/abs/2511.04184
Nicol\`o Pagan, Petter T\"ornberg, Christopher A. Bail, Anik\'o Hann\'ak, Christopher Barrie
Computational Turing Test Reveals Systematic Differences Between Human and AI Language
https://arxiv.org/abs/2511.04195
Micha{\l} Karp, Anna Kubaszewska, Magdalena Kr\'ol, Robert Kr\'ol, Aleksander Smywi\'nski-Pohl, Mateusz Szyma\'nski, Witold Wydma\'nski
LLM-as-a-Judge is Bad, Based on AI Attempting the Exam Qualifying for the Member of the Polish National Board of Appeal
https://arxiv.org/abs/2511.04205
Liran Cohen, Yaniv Nemcovesky, Avi Mendelson
REMIND: Input Loss Landscapes Reveal Residual Memorization in Post-Unlearning LLMs
https://arxiv.org/abs/2511.04228
Alex Fang, Thomas Voice, Ruoming Pang, Ludwig Schmidt, Tom Gunter
Reusing Pre-Training Data at Test Time is a Compute Multiplier
https://arxiv.org/abs/2511.04234
Salma Mekaooui, Hiba Sofyan, Imane Amaaz, Imane Benchrif, Arsalane Zarghili, Ilham Chaker, Nikola S. Nikolov
Efficient Topic Extraction via Graph-Based Labeling: A Lightweight Alternative to Deep Models
https://arxiv.org/abs/2511.04248
Kun Yang, Zikang chen, Yanmeng Wang, Zhigen Li
SSPO: Subsentence-level Policy Optimization
https://arxiv.org/abs/2511.04256
Mohammad Amin Ghanizadeh, Mohammad Javad Dousti
Dynamic Jointly Batch Selection for Data Efficient Machine Translation Fine-Tuning
https://arxiv.org/abs/2511.04406