Back to the App List

Video Action Recognition

Deep learning solution that classifies human actions in video streams for workflow analysis and operational safety.

$0.00
More Information
Video Action Recognition
app details
Model Type

Action Recognition

Framework

ONNX

Algorithm

CNN+LSTM

Technical Details

This application implements a hybrid action-recognition architecture that combines a 2D Convolutional Neural Network (CNN) for spatial feature extraction with an LSTM-based recurrent module for temporal modeling. The model processes short sequences of consecutive RGB frames (e.g., 8–32), extracting per-frame visual descriptors through a lightweight CNN backbone. These descriptors are then fed into a bidirectional or unidirectional LSTM layer that captures motion dynamics and temporal dependencies, enabling robust classification of human actions in real-world environments. The pipeline includes temporal sampling, frame normalization, optional optical-flow augmentation, and sequence-level aggregation. The ONNX-exported model is optimized for real-time edge deployment, ensuring low-latency inference on SECO hardware for workflow monitoring, safety applications, and activity understanding.

Use Cases
Quality Control Industrial Safety Industrial Surveillance Surveillance Emotion Recognition
Supported SECO Devices
AI optimized hardware
Fanless embedded PCs
Modules
supported chipsets
Intel
Intel® Core™ i3
Qualcomm
Qualcomm® Dragonwing QCS6490
Qualcomm® Dragonwing QCS5430
NXP
NXP i.MX 8M Plus
NXP i.MX 95

Related models