International Journal of Speech Technology

Công bố khoa học tiêu biểu

* Dữ liệu chỉ mang tính chất tham khảo

Sắp xếp:  
nameGist: a novel phonetic algorithm with bilingual support
International Journal of Speech Technology - Tập 22 - Trang 1135-1148 - 2019
Shahidul Islam Khan, Md. Mahmudul Hasan, Mohammad Imran Hossain, Abu Sayed Md. Latiful Hoque
Phonetic algorithm plays an essential role in many applications including name-matching, database record linkage, spelling correction, search recommendations, etc. Since 1918, many phonetic algorithms have been proposed by the researchers. Soundex, Match Rating Codex, NYSIIS, Metaphone, and Double Metaphone are among the frequently used phonetic algorithms. These algorithms were primarily develope...... hiện toàn bộ
Usefulness, localizability, humanness, and language-benefit: additional evaluation criteria for natural language dialogue systems
International Journal of Speech Technology - Tập 19 - Trang 373-383 - 2016
Bayan AbuShawar, Eric Atwell
Human–computer dialogue systems interact with human users using natural language. We used the ALICE/AIML chatbot architecture as a platform to develop a range of chatbots covering different languages, genres, text-types, and user-groups, to illustrate qualitative aspects of natural language dialogue system evaluation. We present some of the different evaluation techniques used in natural language ...... hiện toàn bộ
Gender and age-evolution detection based on audio forensic analysis using light deep neural network
International Journal of Speech Technology - Tập 26 - Trang 1091-1098 - 2023
Noor D. AL-Shakarchy, Huda Rageb, Mais Saad Safoq
Forensic audio analysis is a foundation stone of many crime investigations. In forensic evidence; the audio file of the human voice is analyzed to extract much information in addition to the content of the speech, such as the speaker’s identity, emotions, gender, origin, etc. The accurate determination of individuals into groups based on their age development stage and their gender are often used ...... hiện toàn bộ
Voice assessments for detecting patients with Parkinson’s diseases using PCA and NPCA
International Journal of Speech Technology - Tập 19 - Trang 743-754 - 2016
Achraf Benba, Abdelilah Jilbab, Ahmed Hammouch
In this study, we wanted to discriminate between two groups of people. The database used in this study contains 20 patients with Parkinson’s disease and 20 healthy people. Three types of sustained vowels (/a/, /o/ and /u/) were recorded from each participant and then the analyses were done on these voice samples. Firstly, an initial feature vector extracted from time, frequency and cepstral domain...... hiện toàn bộ
Combining evidences from excitation source and vocal tract system features for Indian language identification using deep neural networks
International Journal of Speech Technology - Tập 21 - Trang 501-508 - 2017
Mounika Kamsali Veera, Ravi Kumar Vuddagiri, Suryakanth V. Gangashetty, Anil Kumar Vuppala
In this paper, a combination of excitation source information and vocal tract system information is explored for the task of language identification (LID). The excitation source information is represented by features extracted from linear prediction (LP) residual signal called the residual cepstral coefficients (RCC). Vocal tract system information is represented by the mel frequency cepstral coef...... hiện toàn bộ
Dual estimation based vocal tract shape computation
International Journal of Speech Technology - Tập 22 - Trang 575-584 - 2018
Subhasmita Sahoo, Aurobinda Routray
This paper presents a new method for direct estimation of vocal tract shape from the speech signal. The method computes cross-sectional areas of uniform-length cylindrical tubes comprising the vocal tract. Cross-sectional areas are calculated from reflection coefficients at tube junctions whose values depend on the areas of adjoining tubes. A new state space representation of the speech production...... hiện toàn bộ
Clean speech/speech with background music classification using HNGD spectrum
International Journal of Speech Technology - Tập 20 Số 4 - Trang 1023-1036 - 2017
Banriskhem K. Khonglah, S. R. Mahadeva Prasanna
ILATalk: a new multilingual text-to-speech synthesizer with machine learning
International Journal of Speech Technology - Tập 19 - Trang 55-64 - 2015
Saleh M. Abu-Soud
In this paper, a new multilingual text-to-speech system based on inductive learning has been developed. This system is called ILATalk. It is composed of three phases: the analysis phase, learning phase, and synthesis phase. It can accept any language; all what is needed is to store the data set that contains the training examples that are generated from a representative and selected subset of word...... hiện toàn bộ
A statistical framework for EEG channel selection and seizure prediction on mobile
International Journal of Speech Technology - - 2019
Fatma E. Ibrahim, Saly Abd-Elateif El-Gindy, Sami A. El-Dolil, Adel S. El‐Fishawy, El-Sayed M. El-Rabaie, M. I. Dessouky, Ibrahim M. El-Dokany, Turky N. Alotaiby, Saleh A. Alshebeili, Fathi E. Abd El‐Samie
Maximum entropy PLDA for robust speaker recognition under speech coding distortion
International Journal of Speech Technology - Tập 22 - Trang 1115-1122 - 2019
Ahmed Krobba, Mohamed Debyeche, Sid. Ahmed Selouani
The system combining i-vector and probabilistic linear discriminant analysis (PLDA) has been applied with great success in the speaker recognition task. The i-vector space gives a low-dimensional representation of a speech segment and training data of a PLDA model, which offers greater robustness under different conditions. In this paper, we propose a new framework based on i-vector/PLDA and Maxim...... hiện toàn bộ
Tổng số: 851   
  • 1
  • 2
  • 3
  • 4
  • 5
  • 6
  • 10