Security enhancement for AES encrypted speech in communications
International Journal of Speech Technology, Vol. 20, pp. 163-169, 2017
Emad Mossa
This paper introduces a secure speech communication approach based on encryption and authentication. The system uses the Advanced Encryption Standard (AES) for encryption and a private image database to strengthen the encryption and provide authentication. The idea of this cryptosystem is based on XORing one image from the image database with the clear speech before encryption and embedding t...
An efficient iterative method for nearly perfect reconstruction non-uniform filter bank
International Journal of Speech Technology, Vol. 16, pp. 261-272, 2012
S. Anurag, A. Kumar, G. K. Singh
In this paper, a computationally efficient iterative algorithm is presented for the design of multi-channel nearly perfect reconstruction nonuniform filter banks using modified window functions such as the Kaiser, Cosh and Exponential windows. The algorithm exploits a new perfect reconstruction condition for nonuniform filter banks instead of using complex objective functions. The cutoff frequency is optimi...
Phonetic segmentation using multiple speech features
International Journal of Speech Technology, Vol. 11, pp. 73-85, 2009
Iosif Mporas, Todor Ganchev, Nikos Fakotakis
In this paper we propose a method for improving the segmentation of speech waveforms into phonetic units. The proposed method is based on the well-known Viterbi time-alignment algorithm and utilizes the phonetic boundary predictions from multiple speech parameterization techniques. Specifically, we utilize the most appropriate, with respect to boundary type, phone transition posit...
An evaluation of sentence selection methods on the different phone-sized units for constructing Indonesian speech corpus
International Journal of Speech Technology, Vol. 23, pp. 141-147, 2019
Muljono, Agus Harjoko, Nurul Anisa Sri Winarsih, Catur Supriyanto
Collecting a phonetically balanced text corpus is an important step in developing automatic speech recognition and text-to-speech systems. A corpus should have a small number of sentences but contain all phonetic units, such as monophone, triphone, and pentaphone units. A least-to-most greedy algorithm (LTM + Greedy) and its variant exist for selecting the minimum sentence set. The variant is on t...
Performance analysis of adaptive variational mode decomposition approach for speech enhancement
International Journal of Speech Technology, Vol. 21, pp. 369-381, 2018
Rashmirekha Ram, Mihir Narayan Mohanty
Speech enhancement is an important pre-processing task in speech processing research. Many techniques have been applied in this area over the past four to five decades. With progressive research it occupies a special position in various fields like engineering, medicine, society and security. Adaptive algorithms have been found effective in such cases and are applied to this problem. The work is based on...
An automatic speech recognition system for isolated Amazigh word using 1D & 2D CNN-LSTM architecture
International Journal of Speech Technology, Vol. 26, pp. 775-787, 2023
Mohamed Daouad, Fadoua Ataa Allah, El Wardani Dadi
The availability of automatic speech recognition systems is crucial in various domains such as communication, healthcare, security, and education. However, existing systems often favor dominant languages such as English, French, Arabic, or Asian languages, leaving under-resourced languages without the consideration they deserve. In this specific context, our work is focused on the ...
Automatic annotation method of VR speech corpus based on artificial intelligence
International Journal of Speech Technology, Vol. 25, pp. 399-407, 2022
Shanshan Yang, Ding Liu
With the rapid development of the Internet and artificial intelligence, the demand for data annotation is becoming more and more urgent. To meet this demand, an automatic annotation method for VR speech corpora based on artificial intelligence is designed. The existing annotation methods use Word, Excel and other text forms, or develop a special web page system to organize the...
Speaker recognition utilizing distributed DCT-II based Mel frequency cepstral coefficients and fuzzy vector quantization
International Journal of Speech Technology, Vol. 16, pp. 103-113, 2012
M. Afzal Hossan, Mark A. Gregory
In this paper, a novel Automatic Speaker Recognition (ASR) system is presented. The new ASR system includes novel feature extraction and vector classification steps utilizing distributed Discrete Cosine Transform (DCT-II) based Mel Frequency Cepstral Coefficients (MFCC) and Fuzzy Vector Quantization (FVQ). The ASR algorithm utilizes an approach based on MFCC to identify dynamic features th...