Paul Hueber, Luca Peres, Florian Pitters, Alejandro Gloriani, Oliver Rhodes
Neuromorphic Eye Tracking for Low-Latency Pupil Detection
https://arxiv.org/abs/2512.09969
@arxiv-cs-cv.bsky.social
Computer Science -- Computer Vision and Pattern Recognition (cs.CV) source: export.arxiv.org/rss/cs.CV maintainer: @tmaehara.bsky.social
Paul Hueber, Luca Peres, Florian Pitters, Alejandro Gloriani, Oliver Rhodes
Neuromorphic Eye Tracking for Low-Latency Pupil Detection
https://arxiv.org/abs/2512.09969
Woojin Lee, Hyugjae Chang, Jaeho Moon, Jaehyup Lee, Munchurl Kim
ABBSPO: Adaptive Bounding Box Scaling and Symmetric Prior based Orientation Prediction for Detecting Aerial Image Objects
https://arxiv.org/abs/2512.10031
Jia Cheng Hu, Roberto Cavicchioli, Alessandro Capotondi
Diffusion Is Your Friend in Show, Suggest and Tell
https://arxiv.org/abs/2512.10038
Yihao Liu, Chenyu Gao, Lianrui Zuo, Michael E. Kim, Brian D. Boyd, Lisa L. Barnes, Walter A. Kukull, Lori L. Beason-Held, Susan M. Resnick, Timothy J. Hohman, Warren D. Taylor, Bennett A. Landman
MetaVoxel: Joint Diffusion Modeling of Imaging and Clinical Metadata
https://arxiv.org/abs/2512.10041
Jiahao Liu
Independent Density Estimation
https://arxiv.org/abs/2512.10067
Jiachen Tao, Junyi Wu, Haoxuan Wang, Zongxin Yang, Dawen Cai, Yan Yan
TraceFlow: Dynamic 3D Reconstruction of Specular Scenes Driven by Ray Tracing
https://arxiv.org/abs/2512.10095
Neelima Prasad, Jarek Reynolds, Neel Karsanbhai, Tanusree Sharma, Lotus Zhang, Abigale Stangl, Yang Wang, Leah Findlater, Danna Gurari
Hierarchical Instance Tracking to Balance Privacy Preservation with Accessible Information
https://arxiv.org/abs/2512.10102
Charles Fanning, Mehmet Emin Aktas
Topological Conditioning for Mammography Models via a Stable Wavelet-Persistence Vectorization
https://arxiv.org/abs/2512.10151
Md Eimran Hossain Eimon, Juan Merlos, Ashan Perera, Hari Kalva, Velibor Adzic, Borko Furht
Feature Coding for Scalable Machine Vision
https://arxiv.org/abs/2512.10209
Shuhan Tan, Kashyap Chitta, Yuxiao Chen, Ran Tian, Yurong You, Yan Wang, Wenjie Luo, Yulong Cao, Philipp Krahenbuhl, Marco Pavone, Boris Ivanovic
Latent Chain-of-Thought World Modeling for End-to-End Driving
https://arxiv.org/abs/2512.10226
Md Eimran Hossain Eimon, Velibor Adzic, Hari Kalva, Borko Furht
Emerging Standards for Machine-to-Machine Video Coding
https://arxiv.org/abs/2512.10230
Jiho Jang, Jinyoung Kim, Kyungjune Baek, Nojun Kwak
Multi-dimensional Preference Alignment by Conditioning Reward Itself
https://arxiv.org/abs/2512.10237
Tian Liu, Anwesha Basu, James Caverlee, Shu Kong
Solving Semi-Supervised Few-Shot Learning from an Auto-Annotation Perspective
https://arxiv.org/abs/2512.10244
Zhuo Wang, Xiliang Liu, Ligang Sun
RobustSora: De-Watermarked Benchmark for Robust AI-Generated Video Detection
https://arxiv.org/abs/2512.10248
Eunho Lee, Chaehyeon Song, Seunghoon Jeong, Ayoung Kim
THE-Pose: Topological Prior with Hybrid Graph Fusion for Estimating Category-Level 6D Object Pose
https://arxiv.org/abs/2512.10251
Rui Wang, Yimu Sun, Jingxing Guo, Huisi Wu, Jing Qin
GDKVM: Echocardiography Video Segmentation via Spatiotemporal Key-Value Memory with Gated Delta Rule
https://arxiv.org/abs/2512.10252
Yuetong Su, Baoguo Wei, Xinyu Wang, Xu Li, Lixin Li
VLM-NCD:Novel Class Discovery with Vision-Based Large Language Models
https://arxiv.org/abs/2512.10262
Chen Ziwen, Hao Tan, Peng Wang, Zexiang Xu, Li Fuxin
Long-LRM++: Preserving Fine Details in Feed-Forward Wide-Coverage Reconstruction
https://arxiv.org/abs/2512.10267
Hongsin Lee, Hye Won Chung
Sample-wise Adaptive Weighting for Transfer Consistency in Adversarial Distillation
https://arxiv.org/abs/2512.10275
Yixin Wan, Lei Ke, Wenhao Yu, Kai-Wei Chang, Dong Yu
MotionEdit: Benchmarking and Learning Motion-Centric Image Editing
https://arxiv.org/abs/2512.10284
Xiaoxue Wu, Xinyuan Chen, Yaohui Wang, Yu Qiao
ShotDirector: Directorially Controllable Multi-Shot Video Generation with Cinematographic Transitions
https://arxiv.org/abs/2512.10286
Karthikeya KV, Narendra Bandaru
Physically Aware 360$^\circ$ View Generation from a Single Image using Disentangled Scene Embeddings
https://arxiv.org/abs/2512.10293
Duo Zheng, Shijia Huang, Yanyang Li, Liwei Wang
Efficient-VLN: A Training-Efficient Vision-Language Navigation Model
https://arxiv.org/abs/2512.10310
Anh M. Vu (equal contribution), Khang P. Le (equal contribution), Trang T. K. Vo (equal contribution), Ha Thach, ...
DualProtoSeg: Simple and Efficient Design with Text- and Image-Guided Prototype Learning for Weakly Supervised Histopathology Image Segmentation
https://arxiv.org/abs/2512.10314
Khang Le (equal contribution), Ha Thach (equal contribution), Anh M. Vu (equal contribution), Trang T. K. Vo, Han H. Huynh, David Yang, ...
ConStruct: Structural Distillation of Foundation Models for Prototype-Based Weakly Supervised Histopathology Segmentation
https://arxiv.org/abs/2512.10316
Hyunsoo Lee, Daeum Jeon, Hyeokjae Oh
Point2Pose: A Generative Framework for 3D Human Pose Estimation with Multi-View Point Cloud Dataset
https://arxiv.org/abs/2512.10321
Chao Gong, Depeng Wang, Zhipeng Wei, Ya Guo, Huijia Zhu, Jingjing Chen
EchoingPixels: Cross-Modal Adaptive Token Reduction for Efficient Audio-Visual LLMs
https://arxiv.org/abs/2512.10324
Jiawen Li, Jiali Hu, Xitong Ling, Yongqiang Lv, Yuxuan Chen, Yizhi Wang, Tian Guan, Yifei Liu, Yonghong He
StainNet: A Special Staining Self-Supervised Vision Transformer for Computational Pathology
https://arxiv.org/abs/2512.10326
Cai Xu, Jinlong Liu, Yilin Zhang, Ziyu Guan, Wei Zhao
Simple Yet Effective Selective Imputation for Incomplete Multi-view Clustering
https://arxiv.org/abs/2512.10327
Yi Liu, Yichi Zhang
A Conditional Generative Framework for Synthetic Data Augmentation in Segmenting Thin and Elongated Structures in Biological Images
https://arxiv.org/abs/2512.10334