Publication
Hybrid time-spatial video saliency detection method to enhance human action recognition systems
| Summary: | As digital media has grown in popularity, video processing has expanded in recent years. One of the field's main challenges is the high computational cost of video processing systems. Approaches suggested to address this problem include hardware upgrades, algorithmic optimizations, and the removal of unnecessary information. This study proposes a method based on video saliency maps that identifies the critical parts of a video and improves the system's overall performance. The proposed method first removes camera motion using an image registration algorithm. The color, edge, and gradient information of each video frame is then used to obtain a spatial saliency map. Combining this spatial saliency with motion information derived from optical flow and color-based segmentation produces a saliency map containing both motion and spatial data. A nonlinear function, optimized using a multi-objective genetic algorithm, is proposed to combine the temporal and spatial saliency maps. The proposed saliency map method was added as a preprocessing step to several deep-learning-based Human Action Recognition (HAR) systems, and its performance was evaluated. It was also compared with similar saliency-map-based methods, which confirmed its superiority. The results show that the proposed method can improve HAR efficiency by up to 6.5% relative to HAR methods with no preprocessing step and 3.9% compared to the HAR method containing a temporal saliency map. |
|---|---|
| Subject: | Technological sciences, Engineering and technology |
| Country: | Portugal |
| Document type: | journal article |
| Access type: | Open |
| Associated institution: | Repositório Aberto da Universidade do Porto |
| Language: | English |
| Origin: | Repositório Aberto da Universidade do Porto |
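
The summary describes fusing a spatial and a temporal saliency map with a nonlinear function and using the result as a preprocessing step, but it does not give the function itself. The sketch below is only an illustration under stated assumptions: the fusion form (a weighted sum plus a product term), the parameters `alpha` and `beta`, the threshold, and all function names are hypothetical and are not taken from the paper. In the paper the fusion parameters are tuned by a multi-objective genetic algorithm; here they are fixed by hand.

```python
from typing import List

Map = List[List[float]]


def normalize(m: Map) -> Map:
    """Rescale a 2-D map to [0, 1]; a constant map becomes all zeros."""
    lo = min(min(row) for row in m)
    hi = max(max(row) for row in m)
    span = hi - lo
    if span == 0:
        return [[0.0 for _ in row] for row in m]
    return [[(v - lo) / span for v in row] for row in m]


def fuse(spatial: Map, temporal: Map, alpha: float = 0.5, beta: float = 1.0) -> Map:
    """Hypothetical nonlinear fusion (assumption, not the paper's function).

    Each pixel gets alpha*s + (1 - alpha)*t + beta*s*t, so regions that are
    salient in both maps are boosted by the product term. The result is
    rescaled to [0, 1].
    """
    s_map, t_map = normalize(spatial), normalize(temporal)
    fused = [
        [alpha * s + (1.0 - alpha) * t + beta * s * t for s, t in zip(s_row, t_row)]
        for s_row, t_row in zip(s_map, t_map)
    ]
    return normalize(fused)


def apply_mask(frame: Map, saliency: Map, threshold: float = 0.5) -> Map:
    """Keep pixels whose saliency reaches the threshold; zero out the rest."""
    return [
        [pix if sal >= threshold else 0.0 for pix, sal in zip(f_row, s_row)]
        for f_row, s_row in zip(frame, saliency)
    ]


# Toy 2x2 example with hand-made saliency values.
spatial = [[0.0, 1.0], [0.5, 0.0]]
temporal = [[0.0, 1.0], [0.0, 0.5]]
frame = [[10.0, 20.0], [30.0, 40.0]]

fused = fuse(spatial, temporal)
masked = apply_mask(frame, fused)
```

In this toy case only the pixel that is salient in both maps survives the mask, which is the kind of reduction in input data the summary attributes to the preprocessing step. A real pipeline would compute the two input maps from frame features and optical flow rather than hard-coding them.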
| _version_ | 1850560655787032576 |
|---|---|
| conditionsOfAccess_str | open access |
| country_str | PT |
| documentTypeURL_str | http://purl.org/coar/resource_type/c_6501 |
| documentType_str | journal article |
| id | abcf558f-6733-44c0-9c59-f99fa52ae4f0 |
| identifierHandle_str | https://hdl.handle.net/10216/162823 |
| language | eng |
| relatedInstitutions_str_mv | Repositório Aberto da Universidade do Porto |
| resourceName_str | Repositório Aberto da Universidade do Porto |
A digital service from FCT