You are in:Home/Publications/Lamiaa A. Elrefaei, Tahani Q. Alhassan, Shefaa S. Omar, An Arabic Visual Dataset for Visual Speech Recognition, Procedia Computer Science, Volume 163, 2019, Pages 400-409, ISSN 1877-0509, https://doi.org/10.1016/j.procs.2019.12.122. | |
Prof. Lamiaa Abdallah Ahmed Elrefaei :: Publications: |
Title: | Lamiaa A. Elrefaei, Tahani Q. Alhassan, Shefaa S. Omar,
An Arabic Visual Dataset for Visual Speech Recognition, Procedia Computer Science, Volume 163, 2019, Pages 400-409,
ISSN 1877-0509, https://doi.org/10.1016/j.procs.2019.12.122. |
Authors: | Not Available |
Year: | 2020 |
Keywords: | Not Available |
Journal: | Not Available |
Volume: | Not Available |
Issue: | Not Available |
Pages: | Not Available |
Publisher: | Not Available |
Local/International: | International |
Paper Link: | |
Full paper | Not Available |
Supplementary materials | Not Available |
Abstract: |
Visual speech recognition (VSR) has received increasing attention in recent decades due to its potential uses in many applications. As for any recognition system, useful materials for training and testing are required. For VSR system development, the training and testing materials are videos representing the visual speech of the words. This paper presents the Arabic Visual Speech Dataset (AVSD) for visual speech recognition. The dataset contains 1100 videos for 10 daily communication words collected from 22 speakers and recorded using smartphones’ cameras in high-resolution and high-framerate. The process of building the dataset, including design, acquisition, post-processing phases are described in the paper. Finally, the results of evaluating AVSD using a VSR system are presented and discussed. |