Accurate vehicle analysis from aerial imagery has become increasingly vital for emerging technologies and public-service applications such as intelligent traffic management, urban planning, autonomous navigation, and military surveillance. However, analyzing UAV-captured video poses several inherent challenges, such as the small size of target vehicles, occlusion, cluttered urban backgrounds, motion blur, and fluctuating lighting conditions, which hinder the accuracy and consistency of conventional perception systems. To address these complexities, our research proposes a fully end-to-end deep-learning perception pipeline specifically optimized for UAV-based traffic monitoring. The proposed framework integrates multiple advanced modules: RetinexNet for preprocessing, HRNet for segmentation that preserves high-resolution semantic information, and YOLOv11 for vehicle detection. Deep SORT is employed for efficient vehicle tracking, while CSRNet facilitates high-density vehicle counting. LSTM networks predict vehicle trajectories from temporal patterns, and a combination of DenseNet and SuperPoint is utilized for robust feature extraction. Finally, classification is performed using Vision Transformers (ViTs), leveraging attention mechanisms to ensure accurate recognition across diverse categories. The modular yet unified architecture is designed to handle spatiotemporal dynamics, making it suitable for real-time deployment on diverse UAV platforms. Each module is a state-of-the-art network chosen for a distinct subtask of aerial vehicle analysis. RetinexNet normalizes the illumination of each input frame during preprocessing. HRNet's semantic segmentation accurately separates vehicles from their surroundings. YOLOv11 provides fast, high-precision vehicle detection, and Deep SORT maintains reliable identities for individual vehicles across frames. CSRNet performs vehicle counting that remains robust to occlusion and congestion. LSTM models capture temporal motion patterns to forecast future vehicle positions. Feature extraction combines DenseNet and SuperPoint embeddings refined with an AutoEncoder. Finally, Vision Transformer-based models use attention to classify vehicles seen from above. Every component is developed and integrated to deliver improved performance in real-world UAV deployments. Our proposed framework significantly improves the accuracy, reliability, and efficiency of vehicle analysis from UAV imagery. The pipeline was rigorously evaluated on two widely used datasets, AU-AIR and Roundabout. On the AU-AIR dataset, the system achieved a detection accuracy of 97.8%, a tracking accuracy of 96.5%, and a classification accuracy of 98.4%. Similarly, on the Roundabout dataset, it reached 96.9% detection accuracy, 94.4% tracking accuracy, and 97.7% classification accuracy. These results surpass previous benchmarks, demonstrating the system's robust performance across diverse aerial traffic scenarios. The integration of advanced models (YOLOv11 for detection, HRNet for segmentation, Deep SORT for tracking, CSRNet for counting, LSTM for trajectory prediction, and Vision Transformers for classification) enables the framework to maintain high accuracy even under challenging conditions such as occlusion, variable lighting, and scale variations.
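As an illustration of the modular design, the sketch below wires per-frame stages in the order described. All stage callables (enhance, segment, detect, track, count, predict, classify) are hypothetical placeholders standing in for RetinexNet, HRNet, YOLOv11, Deep SORT, CSRNet, the LSTM predictor, and the ViT classifier; this is a minimal sketch, not the authors' implementation.

```python
from typing import Any, Callable

# Hypothetical stage type; each stage would wrap a trained model in practice.
Stage = Callable[..., Any]

def process_frame(frame: Any, enhance: Stage, segment: Stage, detect: Stage,
                  track: Stage, count: Stage, predict: Stage, classify: Stage):
    """Run one UAV video frame through the described perception pipeline."""
    frame = enhance(frame)                # RetinexNet: illumination normalization
    masks = segment(frame)                # HRNet: vehicle/background segmentation
    detections = detect(frame, masks)     # YOLOv11: vehicle detection
    tracks = track(detections)            # Deep SORT: identity-preserving tracking
    density = count(frame)                # CSRNet: density-based vehicle counting
    futures = predict(tracks)             # LSTM: short-horizon trajectory forecast
    labels = [classify(frame, d) for d in detections]  # ViT: per-vehicle class
    return tracks, density, futures, labels
```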
These outcomes show that the chosen deep learning system is powerful enough to handle the challenges of aerial vehicle analysis and delivers reliable, precise results across all of the aforementioned tasks. Combining several advanced models keeps the system effective even under problems such as occluded vehicles and large variations in scale.
The biological sensorimotor system is a source of inspiration for the design of neuromorphic ballistic control systems. A large portion of sensorimotor-inspired research focuses on the sensory encoding and information processing stages of the system. However, research on broader task-performance systems, involving actuator control on the output side, remains scarce. In this work, we develop and train a neuromuscular-inspired model to perform ballistic control. In the model, a spiking neural network's output spikes are used to generate twitch-like signals. These twitches are the basis for generating a continuous fluctuating output signal that is used to operate an actuator. We refer to this model as the Twitch Neural Network (TwNN). As a test case, the model is trained to control the paddle of an adapted version of the game of Pong. An adapted version of the Direct Feedback Alignment learning rule, specific to integrate-and-fire neurons, is introduced. The new rule avoids the update-locking problem of backpropagation, allowing network weight updates in parallel. The model output consists of one group of agonist-innervating motor neurons and one group of antagonist-innervating motor neurons. We find that it is possible to teach a neuromuscular-inspired system to control the paddle in the game of Pong with the adapted Direct Feedback Alignment learning rule. The best-performing baseline model achieved a hit rate of 96%. By applying logarithmic scaling to the output activity, a hit rate of 98% could be achieved. Finally, by replacing the neuromorphically unrealistic exact summation steps with leaky integrators in training, the range of well-performing learning parameters became narrower and more clearly delineated; the best-performing model reaches a hit rate of 99%. Threshold analysis during training has shown that learning is robust to a variety of neuron thresholds. Noise analysis has shown that the system is robust to membrane-potential noise during inference for uniform noise up to roughly 0.1-1% of the neuron threshold value per time step.
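A minimal numpy sketch of the learning rule's core idea, under stated assumptions: non-leaky integrate-and-fire units, rate-coded spike counts, and a count-based output error broadcast through a fixed random matrix (Direct Feedback Alignment), so both layers can update in parallel without backpropagated gradients. The paper's twitch generation and exact rule differ in detail.

```python
import numpy as np

rng = np.random.default_rng(0)
n_in, n_hid, n_out, T = 20, 100, 2, 50
W1 = rng.normal(0, 0.1, (n_hid, n_in))
W2 = rng.normal(0, 0.1, (n_out, n_hid))
B = rng.normal(0, 0.1, (n_hid, n_out))    # fixed random feedback matrix (DFA)
theta, lr = 1.0, 1e-3

def step_layer(W, spikes_in, v):
    """One step of non-leaky integrate-and-fire neurons with reset."""
    v += W @ spikes_in
    out = (v >= theta).astype(float)
    v[out > 0] = 0.0
    return out, v

x_rate = rng.uniform(0, 0.2, n_in)        # input spike probabilities
target = np.array([5.0, 15.0])            # desired output spike counts

v1, v2 = np.zeros(n_hid), np.zeros(n_out)
x_count, h_count, o_count = np.zeros(n_in), np.zeros(n_hid), np.zeros(n_out)
for t in range(T):
    x = (rng.uniform(size=n_in) < x_rate).astype(float)
    h, v1 = step_layer(W1, x, v1)
    o, v2 = step_layer(W2, h, v2)
    x_count += x
    h_count += h
    o_count += o

err = o_count - target
# DFA: both layers receive the broadcast error at once, so updates are not
# locked behind a sequential backward pass.
W2 -= lr * np.outer(err, h_count)
W1 -= lr * np.outer(B @ err, x_count)
```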
To enhance the obstacle avoidance performance and autonomous decision-making capabilities of robots in complex dynamic environments, this paper proposes an end-to-end intelligent obstacle avoidance method that integrates deep reinforcement learning, spatiotemporal attention mechanisms, and a Transformer-based architecture. Current mainstream robot obstacle avoidance methods often rely on system architectures with separated perception and decision-making modules, which suffer from issues such as fragmented feature transmission, insufficient environmental modeling, and weak policy generalization. To address these problems, this paper adopts Deep Q-Network (DQN) as the core of reinforcement learning, guiding the robot to autonomously learn optimal obstacle avoidance strategies through interaction with the environment, effectively handling continuous decision-making problems in dynamic and uncertain scenarios. To overcome the limitations of traditional perception mechanisms in modeling the temporal evolution of obstacles, a spatiotemporal attention mechanism is introduced, jointly modeling spatial positional relationships and historical motion trajectories to enhance the model's perception of critical obstacle areas and potential collision risks. Furthermore, an end-to-end Transformer-based perception-decision architecture is designed, utilizing multi-head self-attention to perform high-dimensional feature modeling on multi-modal input information (such as LiDAR and depth images), and generating action policies through a decoding module. This completely eliminates the need for manual feature engineering and intermediate state modeling, constructing an integrated learning process of perception and decision-making. Experiments conducted in several typical obstacle avoidance simulation environments demonstrate that the proposed method outperforms existing mainstream deep reinforcement learning approaches in terms of obstacle avoidance success rate, path optimization, and policy convergence speed. It exhibits good stability and generalization capabilities, showing broad application prospects for deployment in real-world complex environments.
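For reference, a minimal DQN temporal-difference update in PyTorch. The toy MLP stands in for the paper's Transformer-based perception-decision network; all sizes and hyperparameters are illustrative assumptions.

```python
import torch
import torch.nn as nn

obs_dim, n_actions, gamma = 32, 5, 0.99
q_net = nn.Sequential(nn.Linear(obs_dim, 64), nn.ReLU(), nn.Linear(64, n_actions))
target_net = nn.Sequential(nn.Linear(obs_dim, 64), nn.ReLU(), nn.Linear(64, n_actions))
target_net.load_state_dict(q_net.state_dict())
opt = torch.optim.Adam(q_net.parameters(), lr=1e-3)

def dqn_update(s, a, r, s_next, done):
    """One TD update on a batch of transitions (a: LongTensor of action ids)."""
    q_sa = q_net(s).gather(1, a.unsqueeze(1)).squeeze(1)
    with torch.no_grad():                 # frozen target network for stability
        target = r + gamma * (1 - done) * target_net(s_next).max(dim=1).values
    loss = nn.functional.mse_loss(q_sa, target)
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()
```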
Telepresence robots (TPRs) must co-navigate with humans in constrained hospital environments, where safety depends on anticipating rather than merely reacting to human motion. Existing approaches rarely integrate short-horizon human-motion forecasting with safety-constrained control, which reduces robustness in dense corridors and ward bays. This study addresses this gap by evaluating an anticipatory, safety-aware co-navigation framework for TPRs. We developed a modular framework that couples a lightweight transformer-based forecaster, which predicts multi-agent trajectories under occlusion, with a safe reinforcement learning (RL) controller. The forecaster produces short-term distributions over pedestrian states that are embedded into the RL policy state and cost as risk-aware occupancy features. Safety is enforced via constrained policy optimization augmented by a run-time control barrier function (CBF) shield that filters unsafe actions. We benchmarked the approach against a social-force or dynamic window approach (DWA) planner, an attention-based crowd-RL policy, and model predictive control (MPC) with CBF. Experiments were conducted across two hospital-like benchmarks (a crowded corridor and a four-bed ward), totaling 2,400 episodes. Outcomes included task success, collision count, minimum human-robot clearance, near-miss events (≤ 0.3 m), time-to-goal, CBF violations, and ablations removing forecasting and the CBF shield. Relative to the best-performing baseline, the proposed method improved task success by 21.6% and reduced collisions by 47.3%. Median minimum human-robot clearance increased by 0.19 m, and near-miss events decreased by 38.5%. Time-to-goal was maintained within +2.7% of MPC+CBF while incurring zero CBF violations under the shield. Ablation studies showed that removing forecasting degraded success by 14.2%, whereas removing the CBF shield increased constraint breaches from 0% to 6.1% of steps. Anticipatory perception combined with safe RL yields substantially safer and more reliable telepresence co-navigation in human-dense clinical layouts without sacrificing efficiency. The framework is modular, enabling alternative forecasters and safety shields. Limitations include sensitivity to forecast drift during abrupt changes in crowd flow. Future work will explore on-device adaptation, shared-autonomy overlays to incorporate operator intent, and prospective evaluations in live hospital workflows.
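A toy one-dimensional illustration of the run-time CBF shield: the RL action is clipped to the nearest action for which the barrier h = gap - d_min satisfies h_dot >= -alpha*h. The dynamics, gains, and numbers are assumptions for illustration, not the paper's controller.

```python
d_min, alpha = 0.5, 2.0   # clearance floor (m) and CBF decay rate (assumed)

def cbf_shield(u_rl: float, gap: float, ped_vel: float) -> float:
    """u_rl: robot speed toward the pedestrian; gap: current distance (m);
    ped_vel: pedestrian speed away from the robot (m/s)."""
    h = gap - d_min
    # With h_dot = ped_vel - u, safety (h_dot >= -alpha*h) gives an upper bound:
    u_max = ped_vel + alpha * h
    return min(u_rl, u_max)

# RL proposes 1.2 m/s toward a pedestrian 0.8 m away receding at 0.2 m/s;
# the shield caps the command at 0.2 + 2.0 * 0.3 = 0.8 m/s.
print(cbf_shield(1.2, gap=0.8, ped_vel=0.2))
```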
During turning maneuvers in the galloping gait of quadruped animals, a strong relationship exists between the turning direction and the sequence in which the forelimbs make ground contact: the outer forelimb acts as the "trailing limb" while the inner forelimb serves as the "leading limb." However, the control mechanisms underlying this behavior remain largely unclear. Understanding these mechanisms could deepen biological knowledge and assist in developing more agile robots. To address this issue, we hypothesized that a decentralized interlimb coordination mechanism and trunk movement are essential for the emergence of an inside leading limb in a galloping turn. To test the hypothesis, we developed a quasi-quadruped robot with simplified wheeled hind limbs and variable trunk roll and yaw angles. For forelimb coordination, we implemented a simple decentralized control based on local load-dependent sensory feedback, utilizing trunk roll inclination and yaw bending as turning methods. Our experimental results confirmed that, in addition to the decentralized control from previous studies, which reproduces straight-line animal locomotion, adjusting the trunk roll angle spontaneously generates a ground contact sequence similar to gallop turning in quadruped animals. Furthermore, roll inclination showed a greater influence than yaw bending on differentiating the leading and trailing limbs. This study suggests that physical interactions serve as a universal mechanism of locomotor control in both forward and turning movements of quadrupedal animals.
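One plausible form of such local load-dependent feedback is a Tegotae-style phase rule, sketched below; this is our assumption for illustration, not necessarily the paper's exact controller. Each forelimb oscillator slows while that limb bears load, so the leading/trailing contact sequence can emerge from body dynamics rather than a prescribed gait.

```python
import numpy as np

omega, sigma, dt = 2 * np.pi * 2.0, 1.0, 0.001  # intrinsic rate, gain, step

def step_phase(phi: float, load: float) -> float:
    """phi: limb oscillator phase (rad); load: measured ground-reaction force.
    The load term delays the phase during stance (Tegotae-style feedback)."""
    dphi = omega - sigma * load * np.cos(phi)
    return (phi + dphi * dt) % (2 * np.pi)
```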
Gait robots have the potential to analyze gait characteristics during gait training using mounted sensors, in addition to providing robotic assistance of the individual's movements. However, no systems have been proposed to analyze gait performance during robot-assisted gait training. Our newly developed gait robot, the Welwalk WW-2000 (WW-2000), is equipped with a gait analysis system to analyze abnormal gait patterns during robot-assisted gait training. We previously investigated the validity of the index values for nine abnormal gait patterns. Here, we proposed new index values for four abnormal gait patterns, namely anterior trunk tilt, excessive trunk shifts over the affected side, excessive knee joint flexion, and swing difficulty, and investigated the criterion validity of the WW-2000 gait analysis system in healthy adults for these new index values. Twelve healthy participants simulated the four abnormal gait patterns manifested in individuals with hemiparetic stroke while wearing the robot. Each participant was instructed to perform 16 gait trials, with four grades of severity for each of the four abnormal gait patterns. Twenty strides were recorded for each gait trial using the gait analysis system in the WW-2000 and video cameras. Abnormal gait patterns were assessed using two parameters: the index values calculated for each stride by the WW-2000 gait analysis system, and the assessor's severity scores for each stride. The correlation between the index values and the severity scores was evaluated using the Spearman rank correlation coefficient for each gait pattern in each participant. The median (minimum-maximum) Spearman rank correlation coefficients across the 12 participants between the index values calculated by the WW-2000 gait analysis system and the assessor's severity scores for anterior trunk tilt, excessive trunk shifts over the affected side, excessive knee joint flexion, and swing difficulty were 0.892 (0.749-0.969), 0.859 (0.439-0.923), 0.920 (0.738-0.969), and 0.681 (0.391-0.889), respectively. The WW-2000 gait analysis system captured the four new abnormal gait patterns observed in individuals with hemiparetic stroke with high validity, in addition to the nine previously validated abnormal gait patterns. Assessing abnormal gait patterns is important, as improving them contributes to stroke rehabilitation. Clinical trial registration: https://jrct.niph.go.jp, identifier jRCT 042190109.
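The per-participant validity analysis can be reproduced with a few lines of scipy; the sketch below uses random placeholder data, not study data, purely to show the computation.

```python
import numpy as np
from scipy.stats import spearmanr

rng = np.random.default_rng(0)
n_strides = 20                      # strides recorded per gait trial
index_values = rng.normal(size=n_strides)            # robot's per-stride index
severity = np.clip(np.round(index_values + rng.normal(0, 0.5, n_strides)),
                   0, 3)                             # assessor's 4-grade score

rho, p = spearmanr(index_values, severity)
print(f"Spearman rho = {rho:.3f} (p = {p:.3g})")
```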
In real-world sports scenarios, Human Action Recognition (HAR) is often hindered by data complexity, limited dynamic adaptability, and fragmented integration of physiological and kinematic information. To address these challenges, this study proposes a multimodal HAR framework for personalized sports health promotion by integrating wearable sensor streams with deep learning architectures. The proposed system employs a robust sensing layer to capture 12-dimensional multimodal data and synchronize physiological indicators with behavioral signals in real time. A novel Transformer-GCN hybrid model was developed to extract complex spatiotemporal dependencies for accurate action recognition and dynamic state analysis. In addition, a reinforcement learning module was incorporated to generate adaptive exercise prescriptions based on user progress. The framework was deployed through a responsive interface for real-time intervention and evaluated in a 12-week randomized controlled trial. The results demonstrated that the proposed framework achieved effective multimodal fusion and reliable action recognition in sports scenarios. After the 12-week intervention, participants in the intervention group showed a 20.1% increase in cardiorespiratory fitness (VO2max), a 99.3% improvement in muscular endurance, and a sports injury rate maintained below 15%. These findings indicate that the framework can support accurate motion analysis and safe, personalized intervention. The proposed multimodal fusion architecture effectively bridges the gap between action recognition and personalized sports health intervention. By combining wearable sensing, hybrid deep learning, and reinforcement learning, the framework provides a practical solution for AI-driven motion analysis and adaptive health promotion in land sports scenarios.
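A minimal sketch of one way to combine a GCN spatial layer with a Transformer temporal encoder, our illustrative reading of the hybrid design; the layer sizes, graph, and wiring are assumptions, not the paper's architecture.

```python
import torch
import torch.nn as nn

class GCNLayer(nn.Module):
    def __init__(self, dim: int, adj: torch.Tensor):
        super().__init__()
        self.adj = adj                      # (nodes, nodes) normalized adjacency
        self.lin = nn.Linear(dim, dim)

    def forward(self, x):                   # x: (batch, time, nodes, dim)
        return torch.relu(self.lin(torch.einsum("ij,btjd->btid", self.adj, x)))

class TransformerGCN(nn.Module):
    def __init__(self, dim: int = 12, nodes: int = 4, heads: int = 4):
        super().__init__()
        self.gcn = GCNLayer(dim, torch.eye(nodes))   # placeholder sensor graph
        self.temporal = nn.TransformerEncoderLayer(d_model=dim, nhead=heads,
                                                   batch_first=True)

    def forward(self, x):                   # x: (batch, time, nodes, dim)
        x = self.gcn(x).mean(dim=2)         # spatial aggregation per frame
        return self.temporal(x)             # temporal self-attention
```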
In advanced robot systems, monitoring the health of key components such as bearings in the transmission system is crucial for achieving reliable autonomous operation. However, accurately diagnosing bearing faults under dynamic and noisy conditions remains challenging. To address this issue, this paper proposes a brain-inspired computational framework that integrates an Improved Spider Monkey Optimization algorithm with a Probabilistic Neural Network (ISMO-PNN) for neurally-grounded bearing fault diagnosis in robotic systems. The main contributions include: (1) extracting a 22-dimensional mixed feature set from vibration signals, (2) using an intelligent PCA strategy to reduce the features to three dimensions while retaining more than 80% of the discriminative information, and (3) using the ISMO algorithm to automatically optimize the key smoothing parameters of the PNN. On the CWRU bearing dataset, the ISMO-PNN model achieves a fault classification accuracy of 97.14% and a macro-average F1 score of 97.32%, outperforming the other models compared in the article. In addition, the minimum difference between training and testing accuracy is 0.72%, indicating strong generalization ability. This brain-inspired framework, synergizing a neurally-grounded probabilistic classifier with a bio-inspired swarm optimizer, forms a robust and efficient embedded health-monitoring model and can provide a feasible solution for the development of advanced robot systems.
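The PNN core is a Parzen-window classifier whose single smoothing parameter sigma is what ISMO would tune; a minimal sketch (the value below is illustrative):

```python
import numpy as np

def pnn_predict(X_train, y_train, X_test, sigma: float = 0.5):
    """Classify each test point by the class with the largest average
    Gaussian-kernel density around it; sigma is the smoothing parameter."""
    preds = []
    for x in X_test:
        scores = {}
        for c in np.unique(y_train):
            Xc = X_train[y_train == c]
            d2 = np.sum((Xc - x) ** 2, axis=1)
            scores[c] = np.mean(np.exp(-d2 / (2 * sigma ** 2)))
        preds.append(max(scores, key=scores.get))
    return np.array(preds)
```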
Urban traffic congestion, environmental degradation, and road safety challenges necessitate intelligent aerial robotic systems capable of real-time adaptive decision-making. Unmanned Aerial Vehicles (UAVs), with their flexible deployment and high vantage point, offer a promising solution for large-scale traffic surveillance in complex urban environments. This study introduces a UAV-based neural framework that addresses challenges such as asymmetric vehicle motion, scale variations, and spatial inconsistencies in aerial imagery. The proposed system integrates a multi-stage pipeline encompassing contrast enhancement and region-based clustering to optimize segmentation while maintaining computational efficiency for resource-constrained UAV platforms. Vehicle detection is carried out using a Recurrent Neural Network (RNN), optimized via a hybrid loss function combining cross-entropy and mean squared error to improve localization and confidence estimation. Upon detection, the system branches into two neural submodules: (i) a classification stream utilizing SURF and BRISK descriptors integrated with a Swin Transformer backbone for precise vehicle categorization, and (ii) a multi-object tracking stream employing DeepSORT, which fuses motion and appearance features within an affinity matrix for robust trajectory association. Comprehensive evaluation on three benchmark UAV datasets (AU-AIR, UAVDT, and VAID) shows consistent and high performance. The model achieved detection precisions of 0.913, 0.930, and 0.920; tracking precisions of 0.901, 0.881, and 0.890; and classification accuracies of 92.14%, 92.75%, and 91.25%, respectively. These findings highlight the adaptability, robustness, and real-time viability of the proposed architecture in aerial traffic surveillance applications. By effectively integrating detection, classification, and tracking within a unified neural framework, the system contributes significant advancements to intelligent UAV-based traffic monitoring and supports future developments in smart city mobility and decision-making systems.
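A minimal sketch of the described hybrid detection loss, cross-entropy for class confidence plus mean squared error for localization; the weighting is an assumption.

```python
import torch
import torch.nn as nn

ce, mse = nn.CrossEntropyLoss(), nn.MSELoss()

def hybrid_loss(class_logits, class_targets, box_preds, box_targets, w=1.0):
    """Cross-entropy on class logits + w-weighted MSE on box coordinates."""
    return ce(class_logits, class_targets) + w * mse(box_preds, box_targets)
```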
This work addresses key challenges in deep unsupervised domain adaptation by proposing a subdomain adaptation framework driven by transferable semantic alignment and class correlation. First, source and target domains are divided into subdomains according to class labels, and a joint subdomain distribution alignment mechanism is introduced to reduce intra-class distribution divergence while enlarging inter-class disparities. Second, a domain-adaptive semantic consistency loss is employed to cluster semantically similar samples and separate dissimilar ones in a unified representation space, enabling precise cross-domain semantic alignment. Third, pseudo-label quality in the target domain is improved via temperature-based label smoothing, complemented by a class correlation matrix and a loss function capturing inter-class relationships to exploit intrinsic intra-class coherence and inter-class distinction. Extensive experiments on multiple public datasets demonstrate that the proposed method achieves superior average classification accuracy compared to existing approaches, validating the effectiveness of semantic alignment and class correlation modeling. By explicitly modeling intra-class coherence and inter-class distinction without additional architectural complexity, the framework effectively mitigates domain shift, enhances semantic alignment, and improves recognition performance on the target domain, offering a robust solution for deep unsupervised domain adaptation.
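A minimal sketch of temperature-based smoothing of target-domain pseudo-labels; the temperature value is illustrative and the paper's exact formulation may differ.

```python
import torch

def soft_pseudo_labels(logits: torch.Tensor, T: float = 2.0) -> torch.Tensor:
    """Higher T flattens the class distribution, damping the overconfident
    predictions that make hard pseudo-labels noisy."""
    return torch.softmax(logits / T, dim=1)
```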
Balancing exploration and exploitation remains a fundamental challenge in reliable mobile robot control, as conventional policies often converge on suboptimal behaviors. Inspired by the brain's division of labor for adaptive control, we propose SpikeAEC, a fully spiking, neuromodulated Actor-Explorer-Critic architecture designed to address this dilemma online within a closed-loop system. SpikeAEC comprises three specialized subnetworks operating in parallel: the Actor, inspired by the basal ganglia, proposes exploitative actions; the Explorer, modeled after the ACC-GPe-STN pathway, generates adaptive exploratory actions gated by a vigilance signal modulated by the accumulated global temporal-difference (TD) error; and the Critic, based on the ventral striatum, computes the TD error. The final action is selected by a separate, TAN-based Arbitrator, which probabilistically chooses between the Actor's and Explorer's action proposals according to recent performance and the TD error. These subnetworks are coupled through a unified three-factor learning framework that uses the TD signal and phasic neuromodulators (acetylcholine and dopamine) from the Arbitrator to drive pathway-specific synaptic plasticity. This online plasticity enhances the quality of action proposals and accelerates policy refinement. In simulation, SpikeAEC outperforms leading brain-inspired methods by converging 24% faster, reducing trajectory length by 18%, and increasing cumulative reward by over 5% against the top-performing baseline, all while maintaining consistency with established neurophysiological principles.
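A functional outline of the Critic's TD error and the Arbitrator's probabilistic choice; the paper realizes both with spiking subnetworks and neuromodulatory signals, so the closed forms below are simplifying assumptions.

```python
import numpy as np

gamma = 0.9

def td_error(r: float, v_s: float, v_s_next: float) -> float:
    """Standard temporal-difference error computed by the Critic."""
    return r + gamma * v_s_next - v_s

def arbitrate(a_actor, a_explorer, recent_perf: float, delta: float,
              rng=np.random.default_rng(0)):
    """Choose the Explorer's proposal more often when recent performance is
    poor or the TD-error magnitude is large (vigilance-like gating)."""
    p_explore = 1.0 / (1.0 + np.exp(5.0 * recent_perf - abs(delta)))
    return a_explorer if rng.uniform() < p_explore else a_actor
```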
Inspired by Masked Language Modeling (MLM), Masked Image Modeling (MIM) employs an attention mechanism to perform masked training on images. However, processing a single image requires numerous iterations and substantial computational resources to reconstruct the masked regions, resulting in high computational complexity and significant time costs. To address this issue, we propose an Effective and Efficient self-supervised Masked model based on Mixed feature training (EESMM). First, we stack two images for encoding and input the fused features into the network, which not only reduces computational complexity but also enables the learning of more features. Second, during decoding, we recover the decoding features corresponding to each original image from the decoding features of the two original inputs and the mixed image, and then construct a corresponding loss function to enhance feature representation. EESMM significantly reduces pre-training time without sacrificing accuracy, achieving 83% accuracy on ImageNet in just 363 hours on four V100 GPUs, only one-tenth of the training time required by SimMIM. This validates that the method can substantially accelerate the pre-training process without noticeable performance degradation.
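A simplified sketch of the mixed-input idea: two images are combined into one masked input so a single forward pass carries supervision from both. The paper stacks images and separates their decoding features; the blend and per-pixel MSE below are our simplified stand-ins.

```python
import torch

def mixed_mim_loss(model, img_a, img_b, vis_mask, lam: float = 0.5):
    """vis_mask is 1 where pixels stay visible and 0 where they are masked;
    the loss scores reconstruction only on the hidden regions."""
    mixed = lam * img_a + (1 - lam) * img_b
    recon = model(mixed * vis_mask)
    return torch.mean(((recon - mixed) * (1 - vis_mask)) ** 2)
```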
Physiotherapy robots offer a feasible and promising solution for achieving safe and efficient treatment. Among their functions, acupoint recognition is the core component that ensures the precision of physiotherapy robots. Although research on acupoint recognition for regions such as the hand and ear has been extensive, accurately locating acupoints on the human back remains challenging due to the lack of salient external features. This paper designs a two-stage acupoint recognition method achieved through the cooperation of two detection networks. First, a lightweight RTMDet network extracts the effective back region from the image, and the acupoint coordinates are then inferred from the extracted region, reducing inference cost spent on irrelevant image content. In addition, the RTMPose network, based on the SimCC framework, converts acupoint coordinate regression into a classification problem over sub-pixel bins along the X and Y axes by performing sub-pixel-level partitioning of the image, significantly improving detection speed and accuracy. Meanwhile, the multi-layer feature fusion of CSPNeXt enhances feature extraction capabilities. We also designed a physiotherapy interaction interface and, from the three-dimensional coordinates of the acupoints, autonomously planned the physiotherapy task path of the robot. We conducted performance tests on the acupoint recognition system and physiotherapy task planning within the physiotherapy robot system. The experiments demonstrate the method's effectiveness, achieving a recall of 90.17% on human datasets with a detection error of around 5.78 mm. The system can also accurately identify different back postures and achieves an inference speed of 30 FPS on a 4070Ti GPU. Finally, we conducted continuous physiotherapy tasks on multiple acupoints for the user. The experimental results demonstrate the significant advantages and broad application potential of this method in improving the accuracy and reliability of autonomous acupoint recognition by physiotherapy robots.
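A minimal sketch of the SimCC idea used by RTMPose: the (x, y) regression is recast as two classifications over sub-pixel bins, decoded by argmax. Sizes and the split ratio are illustrative, not RTMPose's actual configuration.

```python
import torch
import torch.nn as nn

W, H, split = 256, 256, 2.0            # input size; 'split' gives sub-pixel bins
n_bins_x, n_bins_y = int(W * split), int(H * split)

feat_dim = 128
head_x = nn.Linear(feat_dim, n_bins_x)  # logits over x-axis bins
head_y = nn.Linear(feat_dim, n_bins_y)  # logits over y-axis bins

def decode(feat: torch.Tensor) -> tuple:
    """feat: (feat_dim,) per-acupoint feature -> (x, y) in pixel units."""
    x_bin = head_x(feat).argmax().item()
    y_bin = head_y(feat).argmax().item()
    return x_bin / split, y_bin / split
```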
Prosthetic knee joints are essential assistive technologies designed to replicate natural gait and improve mobility for individuals with lower-limb loss. This study presents a comprehensive nonlinear dynamic model of a two-degree-of-freedom prosthetic knee joint and introduces three robust nonlinear control strategies: Integral Sliding Mode Control, Conditional Super-Twisting Sliding Mode Control, and Conditional Adaptive Positive Semidefinite Barrier Function-based (CoBA) Sliding Mode Control. These controllers are designed to address the challenges associated with nonlinear joint dynamics, external disturbances, and modeling uncertainties during locomotion. To optimize control performance, the gain parameters of each controller were fine-tuned using Red Fox Optimization, a metaheuristic algorithm inspired by the intelligent hunting behavior of red foxes. Stability analysis is conducted using Lyapunov theory, and control effectiveness is evaluated through simulations in MATLAB/Simulink and validated via hardware-in-the-loop testing using a C2000 Delfino F28379D microcontroller. Among the three controllers, the CoBA-based approach demonstrated the highest tracking accuracy, fastest convergence, and smoothest torque profile. The close agreement between simulation and experimental results confirms the practical applicability of the proposed control framework, offering a promising solution for intelligent and adaptive prosthetic knee systems.
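For the first controller, a minimal integral sliding mode law for one joint, with a smoothed switching term to limit chattering; the gains below are placeholders for the values Red Fox Optimization would tune.

```python
import numpy as np

lam, k_i, K, eps = 5.0, 2.0, 10.0, 0.05   # surface, integral, switching gains
integ = 0.0

def ismc_torque(e: float, e_dot: float, dt: float) -> float:
    """e = q_des - q (rad); returns the control torque for one knee joint."""
    global integ
    integ += e * dt
    s = e_dot + lam * e + k_i * integ      # integral sliding surface
    return K * np.tanh(s / eps)            # smooth approximation of sign(s)
```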
Neurodegenerative diseases (NDs) are a significant threat to human health. Numerous studies have demonstrated that patients with NDs may present with decreased balance, which is responsible for an increased risk of falling. As an emerging technology, wearable devices can detect falls while avoiding privacy breaches. This review assesses the evolution of trends and technology in wearable devices for detecting falls among patients with NDs. We screened PubMed and Web of Science (February 2023) to summarize the pathway of fall detection with any body-worn sensor. Included articles were required to be full-text and published in English. Documents were excluded if they (1) only used wearable devices for fall cueing, (2) did not offer sufficient information for data extraction, (3) did not study patients with NDs, or (4) only used non-wearable sensors or devices. The review identified 89 articles at the end of the data-extraction procedure. Wide variety existed in participant sample size (1-131), sensor types, placement, and algorithms. 97.75% of papers (n = 87) used patients with Parkinson's disease as experimental subjects. 21.45% of studies attached devices to the ankle (n = 19), with a clear preference for using multiple types of sensors (58.43% of studies, n = 52). The most commonly used inertial measurement unit (IMU) configuration combined accelerometers and gyroscopes, utilized in 21 articles to assess falls. 39.33% of studies (n = 35) chose existing datasets to verify the effectiveness of their algorithms. Machine learning algorithms have become prevalent since 2019, and the most commonly used algorithm was the support vector machine (SVM) (n = 17). These results show that an increasing number of researchers examine the validation performance of their systems in non-real-time settings. The ankle was the preferred sensor location among researchers, and there is a clear preference for using multiple types of sensors and machine learning algorithms to improve accuracy and immediacy. Future work should focus on other NDs rather than being limited to Parkinson's disease and should consider an adequately sized study population. A consensus on walking tasks and accuracy measurements is urgently needed. Performing studies in a simulated free-living environment for a specified time frame is advisable, with continuous real-time monitoring and assessment. PROSPERO, identifier CRD42023405952.
Three-way decision with neighborhood rough sets (3WDNRS) is effective in handling uncertain problems involving continuous data through adjustment of the neighborhood radius. However, it faces two main limitations. First, 3WDNRS relies on individual neighborhood granules as inputs, which can impair both decision efficiency and model generalizability. Second, the thresholds used in 3WDNRS often require predefinition based on prior knowledge, making the method difficult to apply in situations where such knowledge is lacking. To address these problems, this study introduces interval granulation (IG) into three-way decision to construct an effective three-way classifier. First, an interval granulation method based on DBSCAN is proposed. Then, an interval granulation neighborhood rough sets (IGNRS) model is presented, combining IG with quality indicators. Based on the IGNRS model, a three-way classifier called 3WD-IGNRS is proposed by considering the principle of minimum fuzzy loss. Finally, extensive comparative experiments are conducted against three state-of-the-art granular-ball (GB)-based classifiers and four classical machine learning classifiers on 12 public benchmark datasets. The results demonstrate that our models consistently outperform the compared methods, achieving an average accuracy improvement of 4.94% over the best-performing granular-ball classifier.
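A minimal sketch of DBSCAN-based interval granulation: each class's samples are clustered, and every cluster is summarized by per-feature [min, max] intervals, so the classifier reasons over interval granules instead of single neighborhood granules. Parameters are illustrative.

```python
import numpy as np
from sklearn.cluster import DBSCAN

def interval_granules(X: np.ndarray, eps: float = 0.3, min_samples: int = 5):
    """Return a list of (lower, upper) per-feature interval bounds, one pair
    per DBSCAN cluster; label -1 (noise) is discarded."""
    labels = DBSCAN(eps=eps, min_samples=min_samples).fit_predict(X)
    granules = []
    for c in set(labels) - {-1}:
        Xc = X[labels == c]
        granules.append((Xc.min(axis=0), Xc.max(axis=0)))
    return granules
```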
Robotic racket sports provide exceptional benchmarks for evaluating dynamic motion control capabilities in robots. Due to the highly non-linear dynamics of the shuttlecock, the stringent demands on robots' dynamic responses, and the convergence difficulties caused by sparse rewards in reinforcement learning, badminton strikes remain a formidable challenge for robot systems. To address these issues, this study proposes DTG-IRRL, a novel learning framework for badminton strikes that integrates imitation-relaxation reinforcement learning with dynamic trajectory generation. The framework demonstrates significantly improved training efficiency and performance, achieving faster convergence and twice the landing accuracy. Analysis of the reward function within a specific parameter-space hyperplane intuitively reveals the convergence difficulties arising from the inherent sparsity of rewards in racket sports and demonstrates the framework's effectiveness in mitigating local and slow convergence. Implemented on hardware with zero-shot transfer, the framework achieves a 90% hitting rate and a 70% landing accuracy, enabling sustained human-robot rallies. Cross-platform validation on the UR5 robot demonstrates the framework's generalizability while highlighting the requirement for high dynamic performance of robotic arms in racket sports.
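One common way to realize imitation-relaxation is to anneal a dense imitation term so the sparse task reward (hitting, landing) dominates later training; the linear schedule below is our assumption for illustration.

```python
def shaped_reward(task_r: float, imitation_r: float, step: int,
                  anneal_steps: int = 100_000) -> float:
    """Early on, imitation guides learning; its weight relaxes to zero so the
    sparse strike/landing reward takes over."""
    w = max(0.0, 1.0 - step / anneal_steps)
    return task_r + w * imitation_r
```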
In deep-sea areas, the hoisting operation of offshore wind turbines is seriously affected by waves, and secondary impacts between the turbine and the pile foundation are prone to occur. To address this issue, this study proposes an integrated wave compensation system for offshore wind turbines based on a neuromorphic vision (NeuroVI) camera. The system employs a NeuroVI camera to achieve non-contact, high-precision, and low-latency displacement detection of hydraulic cylinders, overcoming the limitations of traditional magnetostrictive displacement sensors, which exhibit slow response and susceptibility to interference in harsh marine conditions. A dynamic simulation model was developed using AMESim-Simulink co-simulation to analyze the compensation performance of the NeuroVI-based system under step and sinusoidal wave disturbances. Comparative results demonstrate that the NeuroVI feedback system achieves faster response times and superior stability compared with conventional sensors. Laboratory-scale model tests and real-world application in the installation of a 5.2 MW offshore wind turbine validated the system's feasibility and robustness, enabling real-time collaborative control of turbine and cylinder displacement to effectively mitigate multi-impact risks. This research provides an innovative approach for deploying neural perception technology in complex marine scenarios and advances the development of neuro-robotic systems in ocean engineering.
In nasal endoscopic surgery, the narrow nasal cavity restricts the surgical field of view and the manipulation of surgical instruments. Therefore, precise real-time intraoperative navigation, which can provide accurate 3D information, plays a crucial role in avoiding critical areas with dense blood vessels and nerves. Although significant progress has been made in endoscopic 3D reconstruction methods, their application in nasal scenarios still faces numerous challenges. On the one hand, there is a lack of high-quality, annotated nasal endoscopy datasets. On the other hand, issues such as motion blur and soft-tissue deformation complicate the nasal endoscopy reconstruction process. To tackle these challenges, a series of nasal endoscopy examination videos were collected, with the pose information recorded for each frame. Additionally, a novel model named Mip-EndoGS is proposed, which integrates 3D Gaussian Splatting for reconstruction and rendering with a diffusion module that reduces image blurring in endoscopic data. Meanwhile, by incorporating an adaptive low-pass filter into the rendering pipeline, the aliasing artifacts (jagged edges) that occur during rendering are mitigated. Extensive quantitative and visual experiments show that the proposed model is capable of reconstructing 3D scenes within the nasal cavity in real time, thereby offering surgeons more detailed and precise information about the surgical scene. Moreover, the proposed approach holds great potential for integration with AR-based surgical navigation systems to enhance intraoperative guidance.
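The anti-aliasing step can be illustrated by a screen-space low-pass filter that dilates each splat's projected 2-D covariance so no Gaussian shrinks below roughly a pixel; the 0.3 px^2 floor follows common 3D Gaussian Splatting practice and is an assumption about this paper's adaptive filter.

```python
import numpy as np

def low_pass_2d_cov(cov2d: np.ndarray, floor: float = 0.3) -> np.ndarray:
    """cov2d: (2, 2) projected splat covariance in pixel units; adding a
    diagonal floor bounds the minimum footprint and suppresses jagged edges."""
    return cov2d + floor * np.eye(2)
```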
Human teams intuitively and effectively collaborate to move large, heavy, or unwieldy objects. However, understanding of this interaction in the literature is limited. This is especially problematic given our goal of enabling human-robot teams to work together. Therefore, to better understand how human teams work together and eventually enable intuitive human-robot interaction, in this paper we examine four sub-components of collaborative manipulation (co-manipulation), using motion and haptics. We define co-manipulation as a group of two or more agents collaboratively moving an object. We present a study in which participants co-manipulate a large object as we vary the number of participants (two or three), the roles of the participants (leaders or followers), and the degrees of freedom necessary to complete the defined motion for the object. In analyzing the results, we focus on four key components related to motion and haptics. First, we define and examine a static or rest state and demonstrate a method of detecting transitions between the static state and an active state, in which one or more agents are moving toward an intended goal. Second, we analyze a variety of signals (e.g., force and acceleration) during movements in each of the six rigid-body degrees of freedom of the co-manipulated object. These data allow us to identify the signals that correlate best with the desired motion of the team. Third, we examine the completion percentage of each task, which can be used to determine which motion objectives can be communicated via haptic feedback. Finally, we define a metric to determine whether participants divide two-degree-of-freedom tasks into separate degrees of freedom or take the most direct path. These four components lay the necessary groundwork for advancing intuitive human-robot interaction.
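As an illustration of the first component, rest-to-active transitions can be detected by thresholding the interaction-force magnitude with hysteresis; the thresholds below are placeholders, not the study's fitted values.

```python
import numpy as np

def detect_active(force: np.ndarray, on_thresh: float = 5.0,
                  off_thresh: float = 2.0) -> np.ndarray:
    """force: (T, 3) interaction force in N; returns a boolean 'active' flag
    per sample, using hysteresis to avoid chattering at the boundary."""
    mag = np.linalg.norm(force, axis=1)
    active = np.zeros(len(force), dtype=bool)
    state = False
    for i, m in enumerate(mag):
        state = m > on_thresh if not state else m > off_thresh
        active[i] = state
    return active
```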