Deep_In_Depth

@deep-in-depth.bsky.social

Just moved away from "Previously Twitter". It was about time! #DeepLearning #MachineLearning #AI #LLM #ComputerVision #NLP #NeuralNetwork curated news feed. So dip into the Depths! Run by: https://www.linkedin.com/in/eric-feuilleaubois-ph-d-43ab0925/

478 Followers  |  497 Following  |  1,830 Posts  |  Joined: 11.11.2024  |  1.5405

Latest posts by deep-in-depth.bsky.social on Bluesky

🚀 Exploring the future of AI?

High-precision 3D LiDAR annotation is key to better #AI, #AutonomousVehicles & #SmartCities.
From segmentation to tracking, label your data right.

Read: theomnibuzz.com/3d-lidar-poi...

#3DLiDAR #MachineLearning #DataAnnotation #Computervision #Machinelearning

06.08.2025 11:36 - 👍 2    🔁 2    💬 0    📌 0
How to Choose the Right AI Training Data Company | Zupyak

📌 How to Choose the Right AI Training Data Company
High-quality annotation is the backbone of model accuracy, scalability, and ethical AI. Learn how to pick the best partner for NLP, computer vision & more ⬇️
🔗 www.zupyak.com/p/4616219/t/...
#Ai #Machinelearning #Dataannotation #computervision

08.08.2025 12:55 - 👍 3    🔁 2    💬 0    📌 0

Sui, P., Rodriguez, J. D., Laban, P., Murphy, D., Dexter, J. P., So, R. J., ... & Chaudhuri, P. (2025). KRISTEVA: Close reading as a novel task for benchmarking interpretive reasoning. arXiv preprint arXiv:2505.09825.

20.06.2025 10:55 - 👍 1    🔁 1    💬 0    📌 0
Things that helped me get out of the AI 10x engineer imposter syndrome | Hacker News

There's a huge difference between what people want to believe and what is real.

"AI makes engineers 10x faster" ... it's attractive. It has a great mouth feel doesn't it?

news.ycombinator.com/item?id=4479...

05.08.2025 18:08 - 👍 1    🔁 1    💬 0    📌 0
Stereo-GS: Multi-View Stereo Vision Model for Generalizable 3D Gaussian Splatting Reconstruction Generalizable 3D Gaussian Splatting reconstruction showcases advanced Image-to-3D content creation but requires substantial computational resources and large datasets, posing challenges to training…

Stereo-GS: Multi-View Stereo Vision Model for Generalizable 3D Gaussian Splatting Reconstruction #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #LLM #VLM #LVLM
arxiv.org/html/2507.14...

03.08.2025 05:18 - 👍 1    🔁 1    💬 0    📌 0
Large Language Models for Crash Detection in Video: A Survey of Methods, Datasets, and Challenges Crash detection from video feeds is a critical problem in intelligent transportation systems. Recent developments in large language models (LLMs) and vision-language models (VLMs) have transformed…

Large Language Models for Crash Detection in Video: A Survey of Methods, Datasets, and Challenges #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #AutonomousVehicles #LLM #VLM #LVLM
arxiv.org/html/2507.02...

01.08.2025 16:02 - 👍 1    🔁 1    💬 0    📌 0
ScVLM: Enhancing Vision-Language Model for Safety-Critical Event Understanding Accurately identifying, understanding and describing traffic safety-critical events (SCEs), including crashes, tire strikes, and near-crashes, is crucial for advanced driver assistance systems,…

ScVLM: Enhancing Vision-Language Model for Safety-Critical Event Understanding #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #AutonomousVehicles #Robotics #LLM #VLM #LVLM
arxiv.org/html/2410.00...

01.08.2025 12:01 - 👍 1    🔁 2    💬 0    📌 0
Few-Shot Learning in Video and 3D Object Detection: A Survey Few-shot learning (FSL) enables object detection models to recognize novel classes given only a few annotated examples, thereby reducing expensive manual data labeling. This survey examines recent…

Few-Shot Learning in Video and 3D Object Detection: A Survey #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #AutonomousVehicles #Robotics #LLM #VLM #LVLM
arxiv.org/html/2507.17...

01.08.2025 08:00 - 👍 1    🔁 2    💬 0    📌 0
Allen Institute for AI-Ai2 Unveils AutoDS: A Bayesian Surprise-Driven Engine for Open-Ended Scientific Discovery The Allen Institute for Artificial Intelligence (AI2) has introduced AutoDS (Autonomous Discovery via Surprisal), a groundbreaking prototype engine for open-ended autonomous scientific discovery.…

Allen Institute for AI-Ai2 Unveils AutoDS: A Bayesian Surprise-Driven Engine for Open-Ended Scientific Discovery #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #LLM #VLM #LVLM
www.marktechpost.com/2025/07/21/a...

01.08.2025 04:00 - 👍 2    🔁 1    💬 0    📌 0
New AI architecture delivers 100x faster reasoning than LLMs with just 1,000 training examples Hierarchical Reasoning Models (HRM) tackle complex reasoning tasks while being smaller, faster, and more data-efficient than large AI models.

New AI architecture delivers 100x faster reasoning than LLMs with just 1,000 training examples #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #LLM #VLM #LVLM
venturebeat.com/ai/new-ai-ar...

01.08.2025 00:00 - 👍 2    🔁 1    💬 0    📌 0
Next-Gen Privacy: How AI Is Transforming Secure Browsing and VPN Technologies (2025 Data-Driven Deep Dive) Discover how AI and quantum tech are revolutionizing VPNs and secure browsing, ensuring cutting-edge privacy for 2025

Next-Gen Privacy: How AI Is Transforming Secure Browsing and VPN Technologies (2025 Data-Driven Deep Dive) #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #LLM #VLM #LVLM
www.marktechpost.com/2025/07/30/n...

31.07.2025 20:01 - 👍 1    🔁 1    💬 0    📌 0
Google's NotebookLM can now make narrated slideshows with AI See and hear NotebookLM walk through a slideshow.

Google's NotebookLM can now make narrated slideshows with AI #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #LLM #VLM #LVLM
www.theverge.com/news/715283/...

31.07.2025 16:03 - 👍 1    🔁 1    💬 0    📌 0
NVIDIA AI Dev Team Releases Llama Nemotron Super v1.5: Setting New Standards in Reasoning and Agentic AI How NVIDIA's Llama Nemotron Super v1.5 sets new standards in AI reasoning, throughput, and agentic performance

NVIDIA AI Dev Team Releases Llama Nemotron Super v1.5: Setting New Standards in Reasoning and Agentic AI #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #LLM #VLM #LVLM
www.marktechpost.com/2025/07/27/n...

31.07.2025 12:01 - 👍 1    🔁 1    💬 0    📌 0
Generalist Forecasting with Frozen Video Models via Latent Diffusion Forecasting what will happen next is a critical skill for general-purpose systems that plan or act in the world at different levels of abstraction. In this paper, we identify a strong correlation…

Generalist Forecasting with Frozen Video Models via Latent Diffusion #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #LLM #VLM #LVLM
arxiv.org/html/2507.13...

31.07.2025 04:00 - 👍 1    🔁 1    💬 0    📌 0
The Nexus supercomputer will compute faster than 8 billion humans combined - enough to revolutionize global science within a few years. In Georgia Tech's laboratories, a revolution could transform the way humanity tackles its most complex challenges.

The Nexus supercomputer will compute faster than 8 billion humans combined - enough to revolutionize global science within a few years #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #LLM #VLM #LVLM
sciencepost.fr/le-supercalc...

31.07.2025 03:41 - 👍 1    🔁 1    💬 0    📌 0

In a new study, researchers have developed a way to align and orient liquid crystal microlens arrays (LC-MLAs) to take ultra-high-resolution images, making this a promising advancement for next-generation light-field cameras!

Learn more: ieee-sensorsalert.org

10.07.2025 05:06 - 👍 2    🔁 2    💬 1    📌 0
LLMs No Longer Require Powerful Servers: Researchers from MIT, KAUST, ISTA, and Yandex Introduce a New AI Approach to Rapidly Compress Large Language Models without a Significant Loss of Quality

LLMs No Longer Require Powerful Servers: Researchers from MIT, KAUST, ISTA, and Yandex Introduce a New AI Approach to Rapidly Compress Large Language Models without a Significant Loss of Quality #DL #AI #ML #ArtificialIntelligence #ComputerVision #LLM #VLM
www.marktechpost.com/2025/04/11/l...

31.07.2025 00:00 - 👍 2    🔁 1    💬 0    📌 0
Datasets and Recipes for Video Temporal Grounding via Reinforcement Learning Video Temporal Grounding (VTG) aims to localize relevant temporal segments in videos given natural language queries. Despite recent progress with large vision-language models (LVLMs) and…

Datasets and Recipes for Video Temporal Grounding via Reinforcement Learning #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #LLM #VLM #LVLM
arxiv.org/html/2507.18...

30.07.2025 20:00 - 👍 1    🔁 1    💬 0    📌 0
Accurate Automatic 3D Annotation of Traffic Lights and Signs for Autonomous Driving 3D detection of traffic management objects, such as traffic lights and road signs, is vital for self-driving cars, particularly for address-to-address navigation where vehicles encounter numerous…

Accurate Automatic 3D Annotation of Traffic Lights and Signs for Autonomous Driving #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #AutonomousVehicles #Robotics #LLM #VLM #LVLM
arxiv.org/html/2409.12...

30.07.2025 16:02 - 👍 1    🔁 2    💬 0    📌 0
TwinLiteNet: An Efficient and Lightweight Model for Driveable Area and Lane Segmentation in Self-Driving Cars Semantic segmentation is a common task in autonomous driving to understand the surrounding environment. Driveable Area Segmentation and Lane Detection are particularly important for safe and…

TwinLiteNet: An Efficient and Lightweight Model for Driveable Area and Lane Segmentation in Self-Driving Cars #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #AutonomousVehicles #Robotics #LLM #VLM #LVLM
arxiv.org/html/2307.10...

30.07.2025 12:01 - 👍 2    🔁 2    💬 0    📌 0
CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting Vehicle-to-everything (V2X) communication plays a crucial role in autonomous driving, enabling cooperation between vehicles and infrastructure. While simulation has significantly contributed to…

CRUISE: Cooperative Reconstruction and Editing in V2X Scenarios using Gaussian Splatting #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #AutonomousVehicles #Robotics #LLM #VLM #LVLM
arxiv.org/html/2507.18...

30.07.2025 08:00 - 👍 1    🔁 2    💬 0    📌 0
High-fidelity 3D Gaussian Inpainting: preserving multi-view consistency and photorealistic details Recent advancements in multi-view 3D reconstruction and novel-view synthesis, particularly through Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS), have greatly enhanced the fidelity…

High-fidelity 3D Gaussian Inpainting: preserving multi-view consistency and photorealistic details #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #AutonomousVehicles #Robotics #LLM #VLM #LVLM
arxiv.org/html/2507.18...

30.07.2025 04:00 - 👍 1    🔁 2    💬 0    📌 0
STEAD: Spatio-Temporal Efficient Anomaly Detection for Time and Compute Sensitive Applications This paper presents a new method for anomaly detection in automated systems with time and compute sensitive requirements, such as autonomous driving, with unparalleled efficiency. As systems like…

STEAD: Spatio-Temporal Efficient Anomaly Detection for Time and Compute Sensitive Applications #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #AutonomousVehicles #Robotics #LLM #VLM #LVLM
arxiv.org/html/2503.07...

30.07.2025 00:00 - 👍 1    🔁 2    💬 0    📌 0
Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives Foundation models have indeed made a profound impact on various fields, emerging as pivotal components that significantly shape the capabilities of intelligent systems. In the context of intelligent…

Delving into Multi-modal Multi-task Foundation Models for Road Scene Understanding: From Learning Paradigm Perspectives #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #AutonomousVehicles #Robotics #LLM #VLM #LVLM
arxiv.org/html/2402.02...

29.07.2025 20:00 - 👍 1    🔁 2    💬 0    📌 0
Depth3DLane: Fusing Monocular 3D Lane Detection with Self-Supervised Monocular Depth Estimation Monocular 3D lane detection is essential for autonomous driving, but challenging due to the inherent lack of explicit spatial information. Multi-modal approaches rely on expensive depth sensors,…

Depth3DLane: Fusing Monocular 3D Lane Detection with Self-Supervised Monocular Depth Estimation #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #AutonomousVehicles #Robotics #LLM #VLM #LVLM
arxiv.org/html/2507.13...

29.07.2025 18:24 - 👍 1    🔁 2    💬 0    📌 0
DiSCO-3D : Discovering and segmenting Sub-Concepts from Open-vocabulary queries in NeRF 3D semantic segmentation provides high-level scene understanding for applications in robotics, autonomous systems, etc. Traditional methods adapt exclusively to either task-specific goals…

DiSCO-3D : Discovering and segmenting Sub-Concepts from Open-vocabulary queries in NeRF #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #LLM #VLM #LVLM
arxiv.org/html/2507.14...

26.07.2025 04:00 - 👍 1    🔁 1    💬 0    📌 0
PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations COLMAP-free 3D Gaussian Splatting (3D-GS) has recently attracted increasing attention due to its remarkable performance in reconstructing high-quality 3D scenes from unposed images or videos.…

PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #LLM #VLM #LVLM
arxiv.org/html/2507.13...

26.07.2025 00:00 - 👍 1    🔁 1    💬 0    📌 0
ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting 3D Gaussian Splatting is renowned for its high-fidelity reconstructions and real-time novel view synthesis, yet its lack of semantic understanding limits object-level perception. In this work, we…

ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #LLM #VLM #LVLM
arxiv.org/html/2507.15...

25.07.2025 20:01 - 👍 2    🔁 1    💬 0    📌 0
VLM-UDMC: VLM-Enhanced Unified Decision-Making and Motion Control for Urban Autonomous Driving Scene understanding and risk-aware attentions are crucial for human drivers to make safe and effective driving decisions. To imitate this cognitive ability in urban autonomous driving while ensuring…

VLM-UDMC: VLM-Enhanced Unified Decision-Making and Motion Control for Urban Autonomous Driving #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #AutonomousVehicles #Robotics #LLM #VLM #LVLM
arxiv.org/html/2507.15...

25.07.2025 16:01 - 👍 1    🔁 2    💬 0    📌 0
Adaptive 3D Gaussian Splatting Video Streaming The advent of 3D Gaussian splatting (3DGS) has significantly enhanced the quality of volumetric video representation. Meanwhile, in contrast to conventional volumetric video, 3DGS video poses…

Adaptive 3D Gaussian Splatting Video Streaming #DL #AI #ML #DeepLearning #ArtificialIntelligence #MachineLearning #ComputerVision #LLM #VLM #LVLM
arxiv.org/html/2507.14...

25.07.2025 12:01 - 👍 1    🔁 1    💬 0    📌 0

@deep-in-depth is following 20 prominent accounts