Jetson TX2 Research Articles

Sustaining real-time, high fidelity AI-based vision perception on edge devices is challenging due to both the high computational overhead of increasingly “deeper” Deep Neural Networks (DNNs) and the increasing resolution/quality of camera sensors. Such high-throughput vision perception is even more challenging in multi-tenancy systems, where video streams from multiple such high-quality cameras need to share the same GPU resource on a single edge device. Criticality-aware canvas-based processing is a promising paradigm that decomposes multiple concurrent video streams into Regions of Interest (RoI) and spatially channels the limited computational resources to selected RoI with higher “resolution”, thereby moderating the trade-off between computational load, task fidelity, and processing throughput. RA-MOSAIC (Resource Adaptive MOSAIC) employs such canvas-based processing, while further tuning the incoming video streams and available resources on-demand to allow the system to adapt to dynamic changes in workload (often arising from variations in the number or size of relevant objects observed by individual cameras). RA-MOSAIC utilizes two distinct and synergistic concepts. First, at the camera sensor, a bandwidth-adaptive and lightweight Bandwidth Aware Camera Transmission (BACT) method applies differential down-sampling to create mixed-resolution individual frames that preferentially preserve resolution for critical ROIs, before being transmitted to the edge node. Second, at the edge, BACT video streams received from multiple cameras are decomposed into multi-scale RoI tiles and spatially packed using a novel workload-adaptive bin-packing strategy into a single ‘canvas frame’. Notably, the canvas frame itself is dynamically sized such that the edge device can opportunistically provide higher processing throughput for selected high-priority tiles during periods of lower aggregate workloads. To demonstrate RA-MOSAIC’s gains in processing throughput and perception fidelity, we evaluate RA-MOSAIC on a single NVIDIA Jetson TX2 edge device for two benchmark tasks: Drone-based Pedestrian Detection and Automatic License Plate Recognition. In a bandwidth-constrained wireless environment, RA-MOSAIC employs a batch size of 1 to pack up to 6 concurrent video streams on a dynamically sized canvas frame to provide (i) 14.3% gain in object detection accuracy and (ii) 11.11% gain in throughput on average (up to 20 FPS per camera, cumulatively 120 FPS), over our previous work MOSAIC, a naïve canvas-based baseline. Compared to prior state of the art baselines such as batched inference over extracted RoI, RA-MOSAIC provides a very-significant, 29.6% gain in accuracy for a comparable throughput. Similarly, RA-MOSAIC dramatically outperforms bandwidth adaptive baselines, such as FCFS ( \(\leq 1\%\) accuracy gain but \(5.6\) x or 566.67% throughput gain) and uniform grid packing (17% accuracy improvement and 5% throughput gain).

Read full abstract

Ants are capable of learning long visually guided foraging routes with limited neural resources. The visual scene memory needed for this behaviour is mediated by the mushroom bodies; an insect brain region important for learning and memory. In a visual navigation context, the mushroom bodies are theorised to act as familiarity detectors, guiding ants to views that are similar to those previously learned when first travelling along a foraging route. Evidence from behavioural experiments, computational studies and brain lesions all support this idea. Here we further investigate the role of mushroom bodies in visual navigation with a spiking neural network model learning complex natural scenes. By implementing these networks in GeNN-a library for building GPU accelerated spiking neural networks-we were able to test these models offline on an image database representing navigation through a complex outdoor natural environment, and also online embodied on a robot. The mushroom body model successfully learnt a large series of visual scenes (400 scenes corresponding to a 27 m route) and used these memories to choose accurate heading directions during route recapitulation in both complex environments. Through analysing our model's Kenyon cell (KC) activity, we were able to demonstrate that KC activity is directly related to the respective novelty of input images. Through conducting a parameter search we found that there is a non-linear dependence between optimal KC to visual projection neuron (VPN) connection sparsity and the length of time the model is presented with an image stimulus. The parameter search also showed training the model on lower proportions of a route generally produced better accuracy when testing on the entire route. We embodied the mushroom body model and comparator visual navigation algorithms on a Quanser Q-car robot with all processing running on an Nvidia Jetson TX2. On a 6.5 m route, the mushroom body model had a mean distance to training route (error) of 0.144 ± 0.088 m over 5 trials, which was performance comparable to standard visual-only navigation algorithms. Thus, we have demonstrated that a biologically plausible model of the ant mushroom body can navigate complex environments both in simulation and the real world. Understanding the neural basis of this behaviour will provide insight into how neural circuits are tuned to rapidly learn behaviourally relevant information from complex environments and provide inspiration for creating bio-mimetic computer/robotic systems that can learn rapidly with low energy requirements.

Read full abstract

Jetson TX2 Research Articles

Articles published on Jetson TX2

RA-MOSAIC : Resource Adaptive Edge AI Optimization over Spatially Multiplexed Video Streams

Image Acquisition of Critical Bridge Components Using Vision-guided Autonomous Vehicle

Field Pest Detection via Pyramid Vision Transformer and Prime Sample Attention

Characterizing the Performance and Cost of Blockchains on the Cloud and at the Edge

Malaria Cell Image Classification Using Compact Deep Learning Architectures on Jetson TX2

Enhanced YOLOv8 Ship Detection Empower Unmanned Surface Vehicles for Advanced Maritime Surveillance.

An automatic rebar spacing measuring method based on the YOLOv8-GB model

Electric trucks pantograph-catenary interaction condition monitoring method based on semantic segmentation network and linear fitting

CUAHN-VIO: Content-and-uncertainty-aware homography network for visual-inertial odometry

Removing nonrigid refractive distortions for underwater images using an attention-based deep neural network

A Lightweight Insulator Defect Detection Model Based on Drone Images

An Intelligent Bait Delivery Control Method for Flight Vehicle Evasion Based on Reinforcement Learning

A lightweight and accurate recognition algorithm of pointer meter based on improved Deeplabv3+ for inspection robots

AI-Driven Computer Vision Detection of Cotton in Corn Fields Using UAS Remote Sensing Data and Spot-Spray Application

Design and Implementation of Intelligent Robot Control System Integrating Computer Vision and Mechanical Engineering

L1RR: Model Pruning Using Dynamic and Self-Adaptive Sparsity for Remote-Sensing Target Detection to Prevent Target Feature Loss

Clairvoyance: Vision-impaired friendly assistive mobile device

Investigating visual navigation using spiking neural network models of the insect mushroom bodies.

A photogrammetric approach for real‐time visual SLAM applied to an omnidirectional system

Convolution technique for focusing of ISAR images

Lead the way for us

Editage

Paperpal

R Discovery

Mind the Graph

Jetson TX2 Research Articles

Articles published on Jetson TX2

RA-MOSAIC : Resource Adaptive Edge AI Optimization over Spatially Multiplexed Video Streams

Image Acquisition of Critical Bridge Components Using Vision-guided Autonomous Vehicle

Field Pest Detection via Pyramid Vision Transformer and Prime Sample Attention

Characterizing the Performance and Cost of Blockchains on the Cloud and at the Edge

Malaria Cell Image Classification Using Compact Deep Learning Architectures on Jetson TX2

Enhanced YOLOv8 Ship Detection Empower Unmanned Surface Vehicles for Advanced Maritime Surveillance.

An automatic rebar spacing measuring method based on the YOLOv8-GB model

Electric trucks pantograph-catenary interaction condition monitoring method based on semantic segmentation network and linear fitting

CUAHN-VIO: Content-and-uncertainty-aware homography network for visual-inertial odometry

Removing nonrigid refractive distortions for underwater images using an attention-based deep neural network

A Lightweight Insulator Defect Detection Model Based on Drone Images

An Intelligent Bait Delivery Control Method for Flight Vehicle Evasion Based on Reinforcement Learning

A lightweight and accurate recognition algorithm of pointer meter based on improved Deeplabv3+ for inspection robots

AI-Driven Computer Vision Detection of Cotton in Corn Fields Using UAS Remote Sensing Data and Spot-Spray Application

Design and Implementation of Intelligent Robot Control System Integrating Computer Vision and Mechanical Engineering

L1RR: Model Pruning Using Dynamic and Self-Adaptive Sparsity for Remote-Sensing Target Detection to Prevent Target Feature Loss

Clairvoyance: Vision-impaired friendly assistive mobile device

Investigating visual navigation using spiking neural network models of the insect mushroom bodies.

A photogrammetric approach for real‐time visual SLAM applied to an omnidirectional system

Convolution technique for focusing of ISAR images