Held in conjunction with CVPR2026 (Denver), June 3, 2026
Main Theme: Energy-Efficient Embedded Vision
Place: Room 709
Organized by: Matteo Poggi, Tse-Wei Chen, Branislav Kisacanin, Ahmed Nabil Belbachir
Description
Embedded vision is an active field of research, bringing together efficient learning models with fast computer vision and pattern recognition algorithms, to tackle many areas of robotics and intelligent systems that are enjoying an impressive growth today. Such strong impact comes with many challenges that stem from the difficulty of understanding complex visual scenes under the tight computational constraints required by real-time solutions on embedded devices. The Embedded Vision Workshop will provide a venue for discussing these challenges by bringing together researchers and practitioners from the different fields outlined above.
Program
| Tim (GMT-6) | Session | Speaker | Topic |
| 9:00 | Welcome notes | EVW Committee | Welcome notes |
| 9:10 | Invited Talk #1 | Dr. Bowen Wen | Building Foundation Models for Robotic Perception |
| 9:35 | Paper ID #2 | Jaehyeon So | EventGuard: Sparsity-Aware In-Sensor Denoising for Frame-Based Event Vision Sensors |
| 9:45 | Paper ID #3 | Vinit Katariya | EdgeVTP: Exploration of Latency-efficient Trajectory Prediction for Edge-based Embedded Vision Applications |
| 9:55 | Paper ID #5 | Carlo Marra | BlankSkip: Early-exit Object Detection onboard Nano-drones |
| 10:05 | Paper ID #7 | Lorenzo Lamberti | TinyDEVO: Deep Event-based Visual Odometry on Ultra-low-power Multi-core Microcontrollers |
| 10:15 | Break and Poster Session | ||
| 11:00 | Invited Talk #2 | Dr. Kang Eun Jeon | One Model, Many Precisions: Toward Flexible and Adaptive Compression for Edge Vision |
| 11:25 | Paper ID #8 | Francesco Tosini | Hardware-Aware Neural Feature Extraction for Resource-Constrained Devices |
| 11:35 | Paper ID #9 | Calvin GALAGAIN | Token-Space Mask Prediction for Efficient Vision Transformer Segmentation |
| 11:45 | Paper ID #17 | Mohamed Mejri | Dynamic 3D CNN Pruning: Joint Frame, Channel, and Feature Adaptation for Energy Efficiency on the Edge |
| 11:55 | Invited Talk #3 | Prof. Enzo Tartaglione | Bringing Training to the Edge: Subspace and Sparsity for Efficient Deep Learning |
| 12:20 | Closing Remarks | EVW Committee |
Important Dates
Paper submission: March 11, 2026
Review assignment: March 12, 2026
Review deadline: March 22, 2026
Notification of acceptance: March 25, 2026
Camera-ready submission: April 10, 2026 (11:59 Pacific Time)
Please refer to Submission page for details.
The accepted papers will be published in the proceeding, along with the CVPR main conference, indexed in EI Compendex.
The submission system is open! (February 6, 2026)
CMT3 Site: https://cmt3.research.microsoft.com/EVW2026
The Microsoft CMT service was used for managing the peer-reviewing process for this conference. This service was provided for free by Microsoft and they bore all expenses, including costs for Azure cloud services as well as for software development and support.
Invited Talk #1

Speaker: Dr. Bowen Wen
Title: Building Foundation Models for Robotic Perception
Abstract: 3D spatial understanding is a critical skill for robotics which typically requires tedious manual design, expensive data collection and per-domain training. This presentation will focus on the development and application of foundation models to address several fundamental challenges in robotic perception. First, we introduce FoundationStereo, a novel architecture designed to maximize zero-shot performance. We discuss the creation of a large-scale (1M pairs) synthetic dataset with iterative self-curation to eliminate ambiguity. We detail how we mitigate the sim-to-real gap by integrating rich monocular priors into the stereo pipeline. Furthermore, we examine the Attentive Hybrid Cost Filtering (AHCF) module that enables long-range context reasoning. Second, we address the computational bottlenecks of foundation models with Fast-FoundationStereo. We propose a “divide-and-conquer” acceleration strategy that retains the teacher model’s robustness while achieving a 10x speedup. Finally, we demonstrate how an automatic pseudo-labeling pipeline on 1.4M in-the-wild images further closes the performance gap. The results in a new family of stereo matching models that deliver foundation-model-level accuracy with real-time inference capabilities.
Biography: Bowen is a Senior Research Scientist at NVIDIA Research, where his work focuses on large foundation models for 3D visual perception to advance embodied AI. He earned his PhD in Computer Science from Rutgers University in 2022 under the advisement of Prof. Kostas Bekris. During his PhD, he conducted several research internships at Google[X], Meta Reality Labs and Amazon Lab 126. As an impactful researcher in both computer vision and robotics, his leading projects received Best Paper Award nominations at both CVPR 2025 and RSS 2022.
Invited Talk #2

Speaker: Dr. Kang Eun Jeon 🎓
Title: One Model, Many Precisions: Toward Flexible and Adaptive Compression for Edge Vision
Abstract: The deployment of deep neural networks on edge devices is challenging due to heterogeneous hardware, limited resources, and varying task demands. Existing compression techniques typically produce a single operating point, and the resulting models are often brittle, suffering catastrophic accuracy degradation when perturbed away from their target operating point. To address this issue, this talk presents our recent work on flexible model compression, including techniques for efficient training of multi-bit quantization networks[1], memory-efficient training of mixed-precision models[2], and truncation-ready quantization that adapts a single trained model to lower bit-widths by discarding the LSBs[3]. Moving beyond adapting to different hardware constraints, these works lay the foundation for adaptive compression, where model precision can be dynamically adjusted at runtime based on resource availability and task complexity.
Biography: Kang Eun Jeon is a postdoctoral researcher at KAIST AI with Prof. Jaegul Choo. He previously held a postdoctoral position at SKKU with Prof. Jong Hwan Ko and received his PhD from HKUST under Prof. James She and Prof. Jun Zhang. During his PhD, he developed luXbeacon, an open-source low-power embedded platform powered entirely by energy harvesting, deployed by over ten industry and academic partners, including Hong Kong government agencies such as EMSD and OGCIO. His current research focuses on model compression and efficient AI for resource-constrained edge devices, with particular interest in analog in-memory computing architectures and adaptive models that dynamically adjust precision at runtime.
Invited Talk #3

Speaker: Prof. Enzo Tartaglione
Title: Bringing Training to the Edge: Subspace and Sparsity for Efficient Deep Learning
Abstract: Modern deep networks are increasingly difficult to train on embedded vision platforms due to strict memory, compute, and energy constraints. In this talk, we present some approaches that make on-device training practical by reducing the cost of backpropagation through low-rank subspaces and dynamic sparsity. Across CNNs and transformers, these methods drastically cut memory usage and FLOPs while preserving accuracy, enabling efficient training and inference on resource-limited devices. Together, they point toward a future where embedded vision systems can learn, adapt, and personalize directly at the edge.
Biography: Enzo Tartaglione is a Full Professor at Télécom Paris, where he is responsible for the equipe Multimedia and he is a Hi!Paris associate member. He is also a Member of the ELLIS Society, Senior IEEE Member, Associate Editor of IEEE Transactions on Neural Networks and Learning Systems and Action Editor for Transactions of Machine Learning Research. He received the MS degree in Electronic Engineering at Politecnico di Torino in 2015, cum laude. The same year, he also received a magna cum laude MS in electrical and computer engineering at University of Illinois at Chicago. In 2016 he was also awarded the MS in Electronics by Politecnico di Milano, cum laude. In 2019 he obtained a PhD in Physics at Politecnico di Torino, cum laude, with the thesis “From Statistical Physics to Algorithms in Deep Neural Systems”. His principal interests include compression and responsible (frugal) AI, privacy-aware learning, debiasing and regularization for deep learning.
Topics
- Agentic AI at the edge
- Embodied AI
- Lightweight and efficient computer vision algorithms for embedded systems
- Hardware dedicated to embedded vision systems (GPUs, FPGAs, DSPs, etc.)
- Software platforms for embedded vision systems
- Neuromorphic computing
- Applications of embedded vision systems in general domains: UAVs (industrial, mobile and consumer), Advanced assistance systems and autonomous navigation frameworks, Augmented and Virtual Reality, Robotics.
- New trends and challenges in embedded visual processing
- Analysis of vision problems specific to embedded systems
- Analysis of embedded systems issues specific to computer vision
- Biologically-inspired vision and embedded systems
- Hardware and software enhancements that impact vision applications
- Performance metrics for evaluating embedded systems
- Hybrid embedded systems combining vision and other sensor modalities
- Embedded vision systems applied to new domains
Committee

Program Chair:
Matteo Poggi, University of Bologna

Publication Chair:
Tse-Wei Chen, Canon Inc. (Japan)

Program Chair:
Branislav Kisacanin, Institute for AI R&D (Serbia)

General Chair:
Ahmed Nabil Belbachir, NORCE Norwegian Research Centre (Norway)
Sponsors
Steering Committee:
Marilyn Claire Wolf, University of Nebraska-Lincoln
Martin Humenberger, NAVER LABS Europe
Roland Brockers, Jet Propulsion Laboratory
Swarup Medasani, MathWorks
Stefano Mattoccia, University of Bologna
Jagadeesh Sankaran, Nvidia
Goksel Dedeoglu, Perceptonic
Margrit Gelautz, Vienna University of Technology
Branislav Kisacanin, Nvidia
Sek Chai, Latent AI
Zoran Nikolic, Nvidia
Ravi Satzoda, Nauto
Stephan Weiss, University of Klagenfurt
Program Committee:
Dragos Costea, University Politehnica of Bucharest
Assia Belbachir, NORCE Norwegian Research Centre
Fahim Hasan Khan, California Polytechnic State University, San Luis Obispo
Florin Condrea, Siemens Corporate Research
Mihai Masala, The Institute of Mathematics of the Romanian Academy (IMAR)
Dongchao Wen, IEIT SYSTEMS Co., Ltd.
Alina Marcu, The National University of Science and Technology Politehnica Bucharest
Faycal Bensaali, University of Qatar
Sam Lerouxm Ghent University
Omkar Prabhunem Purdue University
Luca Bompani, University of Bologna
Linda M. Wills, Georgia Institute of Technology
Burak Ozer, Pekosoft LLC
Cevahir Cigla, ASELSAN
Natalia Jurado, Latent AI
Branislav Kisacanin, NVIDIA
Wei Tao, Canon Innovative Solution (Beijing) Co., Ltd.