Foundations of Interpretable Models
Read more: https://arxiv.org/html/2508.00545v1
@arxiv-cs-ai.bsky.social
Your daily dose of the latest in Artificial Intelligence! Discover new research from arXiv's cs.AI section, covering machine learning, NLP, robotics, and more. π #ArtificialIntelligence #AIResearch #MachineLearning #NLP #Robotics #DeepLearning #AITech
Foundations of Interpretable Models
Read more: https://arxiv.org/html/2508.00545v1
Robust Tracking with Particle Filtering for Fluorescent Cardiac Imaging
Read more: https://arxiv.org/html/2508.05262v1
From "Aha Moments" to Controllable Thinking: Toward Meta-Cognitive Reasoning in Large Reasoning Models via Decoupled Reasoning and Control
Read more: https://arxiv.org/html/2508.04460v1
OmniPlay: Benchmarking Omni-Modal Models on Omni-Modal Game Playing
Read more: https://arxiv.org/html/2508.04361v2
Controllable and Stealthy Shilling Attacks via Dispersive Latent Diffusion
Read more: https://arxiv.org/html/2508.01987v1
CRINN: Contrastive Reinforcement Learning for Approximate Nearest Neighbor Search
Read more: https://arxiv.org/html/2508.02091v1
AdvDINO: Domain-Adversarial Self-Supervised Representation Learning for Spatial Proteomics
Read more: https://arxiv.org/html/2508.04955v1
StepWrite: Adaptive Planning for Speech-Driven Text Generation
Read more: https://arxiv.org/html/2508.04011v1
SkeNa: Learning to Navigate Unseen Environments Based on Abstract Hand-Drawn Maps
Read more: https://arxiv.org/html/2508.03053v1
Self-Questioning Language Models
Read more: https://arxiv.org/html/2508.03682v2
Classification of Brain Tumors using Hybrid Deep Learning Models
Read more: https://arxiv.org/html/2508.01350v1
Reducing the gap between general purpose data and aerial images in concentrated solar power plants
Read more: https://arxiv.org/html/2508.00440v1
An Explainable Natural Language Framework for Identifying and Notifying Target Audiences In Enterprise Communication
Read more: https://arxiv.org/html/2508.05267v1
Constraint-Preserving Data Generation for Visuomotor Policy Learning
Read more: https://arxiv.org/html/2508.03944v1
LayerT2V: Interactive Multi-Object Trajectory Layering for Video Generation
Read more: https://arxiv.org/html/2508.04228v1
Resource-Limited Joint Multimodal Sentiment Reasoning and Classification via Chain-of-Thought Enhancement and Distillation
Read more: https://arxiv.org/html/2508.05234v1
MissDDIM: Deterministic and Efficient Conditional Diffusion for Tabular Data Imputation
Read more: https://arxiv.org/html/2508.03083v1
The Docking Game: Loop Self-Play for Fast, Dynamic, and Accurate Prediction of Flexible Protein--Ligand Binding
Read more: https://arxiv.org/html/2508.05006v1
CauKer: classification time series foundation models can be pretrained on synthetic data only
Read more: https://arxiv.org/html/2508.02879v2
Inference-time Scaling for Diffusion-based Audio Super-resolution
Read more: https://arxiv.org/html/2508.02391v1
TURA: Tool-Augmented Unified Retrieval Agent for AI Search
Read more: https://arxiv.org/html/2508.04604v1
T2UE: Generating Unlearnable Examples from Text Descriptions
Read more: https://arxiv.org/html/2508.03091v1
Cloud Model Characteristic Function Auto-Encoder: Integrating Cloud Model Theory with MMD Regularization for Enhanced Generative Modeling
Read more: https://arxiv.org/html/2508.04447v1
Tobler's First Law in GeoAI: A Spatially Explicit Deep Learning Model for Terrain Feature Detection Under Weak Supervision
Read more: https://arxiv.org/html/2508.03745v1
Active Learning and Transfer Learning for Anomaly Detection in Time-Series Data
Read more: https://arxiv.org/html/2508.03921v1
Multi-TW: Benchmarking Multimodal Models on Traditional Chinese Question Answering in Taiwan
Read more: https://arxiv.org/html/2508.01274v1
Universal Neurons in GPT-2: Emergence, Persistence, and Functional Impact
Read more: https://arxiv.org/html/2508.00903v1
Towards Bridging Review Sparsity in Recommendation with Textual Edge Graph Representation
Read more: https://arxiv.org/html/2508.01128v1
CLASP: Cross-modal Salient Anchor-based Semantic Propagation for Weakly-supervised Dense Audio-Visual Event Localization
Read more: https://arxiv.org/html/2508.04566v1
Long Story Generation via Knowledge Graph and Literary Theory
Read more: https://arxiv.org/html/2508.03137v1