Machine learning for human behavior analysis

Nowadays, many fitness bloggers have popped up to upload teaching yoga videos for rookies to exercise. Generally, yoga poses are designed to stretch different parts of human bodies, and if wrong videos are followed, it would be a waste of time and effort. However, the present solution to select the...

Full description

Bibliographic Details
Main Author: Chen, Zien
Other Authors: Tan Yap Peng
Format: Thesis-Master by Coursework
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/158900
_version_ 1811681009228316672
author Chen, Zien
author2 Tan Yap Peng
author_facet Tan Yap Peng
Chen, Zien
author_sort Chen, Zien
collection NTU
description Nowadays, many fitness bloggers have popped up to upload teaching yoga videos for rookies to exercise. Generally, yoga poses are designed to stretch different parts of human bodies, and if wrong videos are followed, it would be a waste of time and effort. However, the present solution to select the right videos is by manual recognition, which is time-consuming and requires domain expertise. In addition, new yoga gestures are created constantly, which cannot be simply recognized by pose recognition or detection. This dissertation aims to design a system to classify yoga exercising videos. It adopts VGG16, short for Visual Geometry Group Network, as its classification model. In this dissertation, one-shot learning is used to find gestures of interest in video testing samples. After that, these gestures are compared with small datasets using an m-way k-shots few-shot learning method. Eventually, it would label each yoga video, classifying its exercising part of body for yoga learners. In addition, this dissertation provides a supervision function for learners. It allows users to input videos recording their gestures and judge if they do it right. The output score is an evaluation indicator similar to mAP. This part relies on supervised learning method, and this dissertation adopts Faster RCNN as its object detection model, whose accuracy is 90.90%, based on our experiments. Keywords: yoga, VGG16, classification, one-shot learning, few-shot learning, supervised learning, mAP, accuracy.
first_indexed 2024-10-01T03:34:08Z
format Thesis-Master by Coursework
id ntu-10356/158900
institution Nanyang Technological University
language English
last_indexed 2024-10-01T03:34:08Z
publishDate 2022
publisher Nanyang Technological University
record_format dspace
spelling ntu-10356/1589002023-07-04T17:49:18Z Machine learning for human behavior analysis Chen, Zien Tan Yap Peng School of Electrical and Electronic Engineering EYPTan@ntu.edu.sg Engineering::Electrical and electronic engineering Nowadays, many fitness bloggers have popped up to upload teaching yoga videos for rookies to exercise. Generally, yoga poses are designed to stretch different parts of human bodies, and if wrong videos are followed, it would be a waste of time and effort. However, the present solution to select the right videos is by manual recognition, which is time-consuming and requires domain expertise. In addition, new yoga gestures are created constantly, which cannot be simply recognized by pose recognition or detection. This dissertation aims to design a system to classify yoga exercising videos. It adopts VGG16, short for Visual Geometry Group Network, as its classification model. In this dissertation, one-shot learning is used to find gestures of interest in video testing samples. After that, these gestures are compared with small datasets using an m-way k-shots few-shot learning method. Eventually, it would label each yoga video, classifying its exercising part of body for yoga learners. In addition, this dissertation provides a supervision function for learners. It allows users to input videos recording their gestures and judge if they do it right. The output score is an evaluation indicator similar to mAP. This part relies on supervised learning method, and this dissertation adopts Faster RCNN as its object detection model, whose accuracy is 90.90%, based on our experiments. Keywords: yoga, VGG16, classification, one-shot learning, few-shot learning, supervised learning, mAP, accuracy. Master of Science (Computer Control and Automation) 2022-05-31T08:01:08Z 2022-05-31T08:01:08Z 2022 Thesis-Master by Coursework Chen, Z. (2022). Machine learning for human behavior analysis. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/158900 https://hdl.handle.net/10356/158900 en application/pdf Nanyang Technological University
spellingShingle Engineering::Electrical and electronic engineering
Chen, Zien
Machine learning for human behavior analysis
title Machine learning for human behavior analysis
title_full Machine learning for human behavior analysis
title_fullStr Machine learning for human behavior analysis
title_full_unstemmed Machine learning for human behavior analysis
title_short Machine learning for human behavior analysis
title_sort machine learning for human behavior analysis
topic Engineering::Electrical and electronic engineering
url https://hdl.handle.net/10356/158900
work_keys_str_mv AT chenzien machinelearningforhumanbehavioranalysis