Multimodal user interaction with in-car equipment in real conditions based on touch and speech modes in the Persian language

Nazari, Fateme, Tabibian, Shima and Homayounvala, Elaheh (2022) Multimodal user interaction with in-car equipment in real conditions based on touch and speech modes in the Persian language. Multimedia Tools and Applications. ISSN 1380-7501

E.Homayounvala_2022_Mutimedia-Tools-and-Applications-Journal.pdf - Accepted Version


Abstract / Description

Nowadays, communication with in-car equipment is performed via a large number of buttons or a touch screen. This increases the demand on the driver’s visual attention and reduces concentration while driving. Speech-based interaction has been introduced in recent years as a way to reduce driver distraction. This input mode faces several technical challenges, such as the need to memorize voice commands and the difficulty of canceling them. This paper presents a multimodal user interface design, based on touch and speech modes, for controlling five in-car devices (radio, CD player or music player, fan, heater, and driver-side window). The research collects in-car voice commands in the Persian language in real conditions (in a real car and in the presence of background noise), first to create a dataset of Persian voice commands (given the lack of research in this area in Persian-speaking countries) and second to address the challenges mentioned above. To evaluate the proposed user interface, 15 participants performed ten different tasks using the speech and touch modes, with and without driving simulation. The evaluation results indicated that, in comparison to the touch input mode, the speech input mode with and without driving simulation had, on average, a smaller number of clicks for performing tasks (0.2 and 0.6, respectively), a smaller task completion time (7.37 and 3.3 seconds), smaller time intervals between clicks (8.2 and 5 seconds), and a smaller driver distraction rate (25.08%). Moreover, using two different input modes in the design of the in-vehicle user interface increases accessibility.

Item Type: Article
Additional Information: This version of the article has been accepted for publication, after peer review and is subject to Springer Nature’s AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at:
Uncontrolled Keywords: in-car equipment; multimodal user interface; touch and speech modes; voice commands; Persian language
Subjects: 000 Computer science, information & general works
600 Technology
Department: School of Computing and Digital Media
Depositing User: Elaheh Homayounvala
Date Deposited: 21 Nov 2022 11:12
Last Modified: 19 Sep 2023 01:58

