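Asilla, Inc. | Activate all streams !!

This is the official website of Asilla, Inc., a company whose video analysis business specializes in behavior recognition AI and which aims for a world where crimes and accidents are prevented before they happen.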

AsillaPose V4 Lite: A Multifunctional Human Body Pose Estimation Model

Intelligent Video Analysis (IVA) is becoming increasingly popular in the video surveillance industry. Recently, Human Behavior Recognition (HBR) and human tracking across multiple cameras have become attractive research fields. The performance of both depends heavily on human body pose estimation. As an AI startup company, we have been developing our own state-of-the-art human body pose estimation model, named AsillaPose.

This article introduces the latest version of AsillaPose, AsillaPose V4 Lite, a small and fast version of AsillaPose V4, and its performance in comparison to OpenPose.

Nowadays, a number of human body pose estimation models are available. However, most of them are challenging to use in real-world applications, especially in the field of security and surveillance. Our mission is to build a fast and robust human body pose estimation model that is compatible with real-world applications, with a focus on the security and surveillance industry.

In this article, AsillaPose V4 Lite is compared to OpenPose on the following points.

  1. Accuracy.
  2. Inference speed.
  3. System requirements.

In addition, we would like to introduce the unique features of AsillaPose V4 Lite that address market requirements.
Let’s look at the performance of our pose estimation model.

1. Accuracy.

The COCO 2017 validation dataset, which contains 5,000 images, is used to evaluate the accuracy of AsillaPose and compare it with OpenPose. OKS (object keypoint similarity), precision, and recall are used as evaluation metrics.
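
For readers unfamiliar with these metrics, the following is a minimal sketch of how OKS is usually computed for one person under the COCO keypoint convention, together with precision and recall from match counts. The function names and the input layout are our own assumptions for illustration; this is not the evaluation code used for the results in this article.

```python
import math

# Per-keypoint sigmas from the COCO keypoint evaluation (17 keypoints:
# nose, eyes, ears, shoulders, elbows, wrists, hips, knees, ankles).
COCO_SIGMAS = [0.026, 0.025, 0.025, 0.035, 0.035, 0.079, 0.079, 0.072, 0.072,
               0.062, 0.062, 0.107, 0.107, 0.087, 0.087, 0.089, 0.089]


def oks(pred, gt, visibility, area, sigmas=COCO_SIGMAS):
    """Object keypoint similarity between one predicted and one labeled pose.

    pred, gt   : lists of (x, y) pixel coordinates, one entry per keypoint
    visibility : COCO visibility flags of the ground truth (0 = not labeled)
    area       : area of the ground-truth person in pixels squared
    """
    total = 0.0
    labeled = 0
    for (px, py), (gx, gy), v, sigma in zip(pred, gt, visibility, sigmas):
        if v <= 0:
            continue  # unlabeled keypoints do not contribute
        d2 = (px - gx) ** 2 + (py - gy) ** 2
        k = 2.0 * sigma  # COCO uses k_i = 2 * sigma_i
        total += math.exp(-d2 / (2.0 * area * k * k))
        labeled += 1
    return total / labeled if labeled else 0.0


def precision_recall(tp, fp, fn):
    """Precision and recall from true positive, false positive, false negative counts."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    return precision, recall


# Hypothetical example: a perfect prediction has OKS 1.0,
# and a prediction shifted by a few pixels scores lower.
gt_pose = [(10.0 * i, 5.0 * i) for i in range(17)]
shifted = [(x + 3.0, y) for x, y in gt_pose]
visible = [2] * 17
print(oks(gt_pose, gt_pose, visible, area=5000.0))
print(oks(shifted, gt_pose, visible, area=5000.0))
```

A detection is typically counted as a true positive when its OKS with a ground-truth person exceeds a chosen threshold, and precision and recall follow from those counts.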

The AsillaPose V4 Lite model is compatible with two input resolutions, 224x224 and 320x320. We conducted two experiments for each input resolution.

Results on the COCO validation dataset are shown below.

[Figure: accuracy results of AsillaPose V4 Lite and OpenPose on the COCO validation dataset]

The results above show that the accuracy of the AsillaPose V4 Lite pose estimation model is notably higher than that of the OpenPose pose estimation model.
Here are some pose overlay images from both the AsillaPose V4 Lite and OpenPose models for comparison.

[Figures: pose overlay images from AsillaPose V4 Lite and OpenPose]

  1. AsillaPose V4 Lite works well with the steep security camera angles encountered in real-life applications, especially in the security and surveillance industry.
  2. AsillaPose V4 Lite can also estimate the poses of people who are far away from the camera.
  3. AsillaPose V4 Lite detects a range of poses, such as sitting, walking, and running.

When it comes to real-time applications, speed is one of the biggest challenges to overcome. AsillaPose runs at very high speed even on edge devices such as the NVIDIA Jetson Nano.

2. Inference speed.

Both the AsillaPose and OpenPose models are executed on an NVIDIA Jetson Nano device to evaluate inference speed. The system configuration of the device is shown below.

[Figure: system configuration of the NVIDIA Jetson Nano device]

The inference speed comparison of AsillaPose V4 Lite versus OpenPose is shown below.

[Figure: inference speed comparison of AsillaPose V4 Lite and OpenPose]


The graph on the left shows that the AsillaPose V4 Lite model achieves high inference speed with GPU support, which makes it well suited to real-time applications. The graph on the right shows that the AsillaPose V4 Lite model is also much faster when running on CPU only (CPU configuration: Intel(R) Core(TM) i7-8700K CPU @ 3.70 GHz, 32 GB RAM).
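
Inference speed of this kind is commonly reported in frames per second (FPS). As a rough sketch of how such a number can be measured, the snippet below times a model call over a list of frames after a short warm-up. The `infer` callable and `dummy_infer` are hypothetical stand-ins, not part of AsillaPose or OpenPose, and this is not the benchmark script behind the graphs above.

```python
import time


def measure_fps(infer, frames, warmup=5):
    """Average frames per second of `infer` over `frames`.

    infer  : any callable that takes one frame (hypothetical model wrapper)
    frames : list of input frames
    warmup : number of untimed calls, so one-time setup cost is excluded
    """
    for frame in frames[:warmup]:
        infer(frame)

    start = time.perf_counter()
    for frame in frames:
        infer(frame)
    elapsed = time.perf_counter() - start

    return len(frames) / elapsed if elapsed > 0 else float("inf")


def dummy_infer(frame):
    """Stand-in for a pose estimation call; does a small amount of work."""
    return sum(frame)


# Hypothetical usage with synthetic "frames" (plain lists of numbers).
frames = [list(range(1000)) for _ in range(200)]
print(f"{measure_fps(dummy_infer, frames):.1f} FPS")
```

When comparing two models this way, both should be timed on the same frames, at the same input resolution, and on the same hardware.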

The following graph compares inference speed as the number of people increases.

[Figure: inference speed versus number of people]



3. System requirements.

[Figure]

As a responsible AI startup company, we have been developing our human body pose estimation model, AsillaPose, so that it can be applied in real-world applications, especially in the security and surveillance industry.
Initially, AsillaPose V4 Lite is targeted at the following applications:

  1. Intrusion detection.
  2. Human tracking across multiple cameras.
  3. Abnormal behavior detection.

[Figure]

AsillaPose V4 Lite can detect suspicious entries and exits by analysing video streams from existing security cameras, which eliminates the cost of accessories such as the infrared and microwave sensors currently used for intrusion detection.
Multiple cameras are needed to cover large premises, and tracking people across multiple cameras is challenging. AsillaPose V4 Lite supports Multiple Camera Tracking and Multiple Human Tracking, even at night.
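
This article does not describe how an intrusion is decided from the estimated poses. One common approach, shown here purely as an illustrative assumption, is to test whether a person's ankle keypoints fall inside a restricted zone drawn as a polygon in image coordinates. The function names and the zone are hypothetical.

```python
def point_in_polygon(point, polygon):
    """Ray-casting test: True if `point` lies inside `polygon`.

    point   : (x, y) in image coordinates
    polygon : list of (x, y) vertices in order
    """
    x, y = point
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the horizontal line through the point matter.
        if (y1 > y) != (y2 > y):
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def person_in_zone(ankles, zone):
    """True if any ankle keypoint of a person is inside the restricted zone."""
    return any(point_in_polygon(p, zone) for p in ankles)


# Hypothetical restricted zone (a square) and two people.
zone = [(0, 0), (10, 0), (10, 10), (0, 10)]
print(person_in_zone([(5, 5), (6, 5)], zone))    # both ankles inside the square
print(person_in_zone([(15, 5), (16, 5)], zone))  # both ankles outside the square
```

Using the ankles rather than the head or torso ties the decision to where the person is standing, which matters for cameras mounted at a steep angle.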

Last but not least, our pose estimation model also supports recognizing abnormal behavior such as staggering, fighting, and falling by analysing pose movements.
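
As a toy illustration of what analysing pose movements can mean, the sketch below computes the angle of a person's torso from the vertical using the shoulder and hip keypoints and flags a possible fall when the torso is close to horizontal. This rule and its threshold are our own simplifying assumptions for illustration; they are not the recognition method used in Asilla's products.

```python
import math


def midpoint(a, b):
    """Midpoint of two (x, y) points."""
    return ((a[0] + b[0]) / 2.0, (a[1] + b[1]) / 2.0)


def torso_angle_deg(l_shoulder, r_shoulder, l_hip, r_hip):
    """Angle between the torso and the vertical image axis, in degrees.

    0 means upright, 90 means the torso is horizontal.
    """
    sx, sy = midpoint(l_shoulder, r_shoulder)
    hx, hy = midpoint(l_hip, r_hip)
    return math.degrees(math.atan2(abs(sx - hx), abs(sy - hy)))


def looks_fallen(l_shoulder, r_shoulder, l_hip, r_hip, threshold_deg=60.0):
    """Hypothetical rule: torso tilted past `threshold_deg` suggests a fall."""
    return torso_angle_deg(l_shoulder, r_shoulder, l_hip, r_hip) > threshold_deg


# Hypothetical keypoints in image coordinates (y grows downward).
standing = dict(l_shoulder=(40, 50), r_shoulder=(60, 50), l_hip=(42, 110), r_hip=(58, 110))
lying = dict(l_shoulder=(40, 100), r_shoulder=(40, 120), l_hip=(110, 102), r_hip=(110, 118))
print(looks_fallen(**standing))
print(looks_fallen(**lying))
```

A single-frame rule like this cannot tell lying down on purpose from falling, which is why behavior recognition in practice looks at how poses change over a sequence of frames.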

Developing human body pose estimation models is not the only thing we do. With the ultimate goal of being a guardian of the world, we are developing a complete Software Development Kit (SDK) that bundles three main components: the human body pose estimation model AsillaPose, a Multiple Camera Tracking application, and an Abnormal Behavior Recognition application.
The SDK will officially be released in March 2021 at the Innovation Leaders Summit in Tokyo.