Neural Semantic Video Analysis
Abstract
Video is a rich form of data for capturing, storing, and communicating information. The availability of inexpensive video-capturing sensors in smartphones, handheld cameras, and consumer security cameras has driven exponential growth in the video footage generated worldwide over the past decade. Because video is so widely produced and consumed, automated systems are essential for analyzing this large body of material and identifying the relevant information it contains. This chapter demonstrates how the emergence of neural networks, including convolutional neural networks (CNNs) and transformers, has revolutionized semantic video analysis. Convolutional filters allow CNNs to capture spatial patterns at the pixel level; more recently, self-attention-based transformer models have surpassed the learning capability of CNN-based models. Both CNN-based and transformer-based semantic video analysis models rely on techniques such as transfer learning and self-supervised learning to compensate for the scarcity of large, supervised video datasets.
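The two mechanisms the abstract contrasts can be illustrated with a minimal NumPy sketch (not taken from the chapter; the toy frame, kernel, and token values below are illustrative assumptions). A convolutional filter responds only to local spatial patterns in a frame, whereas scaled dot-product self-attention lets every position attend to every other position:

```python
import numpy as np

def conv2d(image, kernel):
    """Valid 2D cross-correlation: slide the kernel over the image
    and sum elementwise products at each position (a local operation)."""
    kh, kw = kernel.shape
    oh, ow = image.shape[0] - kh + 1, image.shape[1] - kw + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

def self_attention(x):
    """Scaled dot-product self-attention with identity projections:
    every position is mixed with every other position (a global operation)."""
    d = x.shape[-1]
    scores = x @ x.T / np.sqrt(d)                   # pairwise similarities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax
    return weights @ x

# Toy 4x4 "frame" with a vertical edge: left half dark, right half bright.
frame = np.array([[0., 0., 1., 1.]] * 4)
# Hand-made vertical-edge filter (a local spatial-pattern detector).
edge_kernel = np.array([[-1., 1.], [-1., 1.]])
response = conv2d(frame, edge_kernel)
print(response)  # peaks in the middle column, where the edge sits

# Three 2-d "token" vectors; attention blends each with all the others.
tokens = np.array([[1., 0.], [0., 1.], [1., 1.]])
print(self_attention(tokens))
```

The convolution response is nonzero only where the kernel's window straddles the edge, while the attention output for each token depends on the whole sequence; this locality-versus-globality distinction underlies the CNN-versus-transformer comparison in the abstract.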