Car noise verification and applications. International Journal of Speech Technology, Vol. 17, pp. 167-181, 2013
Nitish Krishnamurthy, John H. L. Hansen
This study presents audio-based vehicle verification as a new area of research. The task involves verifying the claim that an acoustic sample belongs to a vehicle. Audio-based vehicle verification has the potential to impact research in the areas of vehicle forensics and in-vehicle speech systems. For this task, a new corpus (UTD-CAR-NOISE) that consists of noise from 20 vehicles under 8 distinct ...
Security enhancement for AES encrypted speech in communications. International Journal of Speech Technology, Vol. 20, pp. 163-169, 2017
Emad Mossa
This paper introduces a secure speech communication approach based on encryption and authentication. The system uses the Advanced Encryption Standard (AES) for encryption and a private image database both to strengthen the encryption and for authentication. The idea of this cryptosystem is based on XORing one image from the image database with the clear speech before encryption and embedding t...
Multilingual rule-based approach to number expansion: Framework, extensions and application. International Journal of Speech Technology, Vol. 9, pp. 29-40, 2006
Marko Moberg, Kimmo Pärssinen
The language development of a multilingual text-to-speech system requires contributions from linguists and native speakers of a given language. Text normalization, including number expansion, is one of the language-specific processing steps. Most available solutions do not support inflections and are not simple enough to be practical for non-technical developers. This paper presents a novel solut...
Multi-coder vector quantizer for transparent coding of wideband speech ISF parameters. International Journal of Speech Technology, 2024
Merouane Bouzid, Nacèra Meziane, Salah-Eddine Cheraitia
Modern low bit-rate speech coders require efficient coding of the linear predictive coding (LPC) coefficients. Immittance Spectral Frequencies (ISF) and Line Spectral Frequencies (LSF) are currently the most efficient transmission parameters for LPC coefficients in wideband speech coding. In this paper, we propose a new hybrid coding scheme with multi-coder vector quantization for efficient coding...
Encrypted gray image transmission over OFDM channel for TV cloud computing. International Journal of Speech Technology, Vol. 20, pp. 431-442, 2017
Salwa M. Serag Eldin
One of the most important issues attracting researchers is providing a secure channel to transfer data between many points. A television cloud holds a large amount of content that needs to be transferred to authorized groups (AuthGs). The transfer rate is also an aspect to be considered. In this work, orthogonal frequency division multiplexing is investigated to determine whether it is acceptable for the clou...
Enhanced multiclass SVM with thresholding fusion for speech-based emotion classification. International Journal of Speech Technology, Vol. 20, pp. 27-41, 2016
Na Yang, Jianbo Yuan, Yun Zhou, Ilker Demirkol, Zhiyao Duan, Wendi Heinzelman, Melissa Sturge-Apple
As an essential approach to understanding human interactions, emotion classification is a vital component of behavioral studies as well as being important in the design of context-aware systems. Recent studies have shown that speech contains rich information about emotion, and numerous speech-based emotion classification methods have been proposed. However, the classification performance is still ...
Building a neural speech recognizer for quranic recitations. International Journal of Speech Technology, pp. 1-21, 2022
Suhad Al-Issa, Mahmoud Al-Ayyoub, Osama Al-Khaleel, Nouh Elmitwally
This work is an effort towards building a neural speech recognizer system for Quranic recitations that can be used effectively by anyone regardless of gender and age. Although many recitations are available online, most of them are recorded by professional adult male reciters, which means that an ASR system trained on such datasets would not work for female or child reciters. We address th...