Ing. Vaněk Jan, Ph.D.
Activities
signal processing, acoustic modelling, speech recognition, speaker recognition (verification, identification), GPGPU programming (CUDA, OpenCL)
My Research Gate profile.
Publications
+ / - Publications in year 2019
UWB-NTIS Speaker Diarization System for the DIHARD II 2019 Challenge .
Interspeech,
p. 993-997,
2019.
:
+ / - Publications in year 2015
Simultaneously Trained NN-based Acoustic Model and NN-based Feature Extractor .
Text, Speech, and Dialogue, 18th International Conference, TSD 2015,
p. 234-242,
2015.
:
Neural-Network-based Spectrum Processing for Speech Recognition and Speaker Verification .
Statistical Language and Speech Processing, Third International Conference, SLSP 2015,
p. 288-299,
Springer,
2015.
:
+ / - Publications in year 2014
Convolutional Neural Network for Refinement of Speaker Adaptation Transformation .
16th International Conference on Speech and Computer, SPECOM 2014,
Lecture Notes in Artificial Intelligence,
vol. 8773,
p. 161-168,
2014.
:
Audio-Video Speaker Diarization for Unsupervised Speaker and Face Model Creation .
Text, Speech and Dialogue, Proceedings of the 17th International Conference TSD 2014,
Lecture Notes in Artificial Intelligence,
2014.
:
An Open-Source GPU-Accelerated Feature Extraction Tool .
12th International Conference on Signal Processing - ICSP2014,
12th International Conference on Signal Processing - ICSP2014,
p. 450-454,
IEEE Print,
Danvers, USA,
2014.
:
Anti-Models: An Alternative Way to Discriminative Training .
Text Speech nad Dialoque - TSD 2014,
Text Speech nad Dialoque - TSD 2014,
p. 449-456,
Springer,
2014.
:
Sports Video Classification in Continuous TV Broadcasts .
The 12th IEEE International Conference on Signal Processing (ICSP'14),
HangZhou China,
2014.
:
+ / - Publications in year 2013
Estimation of Single-Gaussian and Gaussian Mixture Models for Pattern Recognition .
18th Iberoamerican Congress on Pattern Recognition,
Lecture Notes in Computer Science,
Springer,
2013.
:
A Direct Criterion Minimization based fMLLR via Gradient Descend .
Text, Speech, and Dialogue,
Lecture Notes in Computer Science,
vol. 8082,
p. 52-59,
Springer,
2013.
:
Covariance Matrix Enhancement Approach to Train Robust Gaussian Mixture Models of Speech Data .
Speech and Computer,
Lecture Notes in Computer Science,
vol. 8113,
p. 92-99,
Springer,
2013.
:
+ / - Publications in year 2012
Full Covariance Gaussian Mixture Models Evaluation on GPU .
IEEE International Symposium on Signal Processing and Information Technology,
Vietnam, Ho Chi Minh City,
2012.
:
Optimized Acoustic Likelihoods Computation for NVIDIA and ATI/AMD Graphics Processors .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING,
6,
vol. 20,
p. 1818-1828,
Institute of Electrical and Electronics Engineers ( IEEE ),
2012.
:
GPU Accelerated Real Time Rotation, Scale and Translation Invariant Image Registration Method .
International Conference on Image Analysis and Recognition,
Lecture Notes in Computer Science,
vol. 7324,
p. 224-233,
Springer,
2012.
:
+ / - Publications in year 2011
Fast Estimation of Gaussian Mixture Model Parameters on GPU using CUDA .
The 12th International Conference on Parallel and Distributed Computing, Applications and Technologies,
p. 167-172,
IEEE Computer Society Conference Publishing Services (CPS),
2011.
:
Speaker-clustered Acoustic Models Evaluated on GPU for on-line Subtitling of Parliament Meetings .
Text, Speech, and Dialogue,
Lecture Notes in Computer Science,
vol. 6836,
p. 284-290,
Springer,
2011.
:
Optimization of the Gaussian Mixture Model Evaluation on GPU .
12th Annual Conference of the International Speech Communication Association 2011 (INTERSPEECH 2011),
p. 1748-1751,
Firenze, Italy,
2011.
:
+ / - Publications in year 2010
Training of Speaker-Clustered Discriminative Acoustic Models for Use in Real-Time Recognizers .
Speech Processing,
vol. 2010,
p. 152-158,
Institute of Photonics and Electronics AS CR,
Prague,
2010.
:
Gender-dependent acoustic models fusion developed for automatic subtitling of Parliament meetings broadcasted by the Czech TV .
Lecture Notes in Computer Science,
vol. 2010,
p. 431-438,
Springer,
Berlin,
2010.
:
UWB system description for NIST SRE 2010 .
2010 NIST Speaker Recognition Evaluation Workshop,
p. 180-182,
NIST USA,
2010.
:
Fast Phonetic/Lexical Searching in the Archives of the Czech Holocaust Testimonies: Advancing Towards the MALACH Project Visions .
Lecture Notes in Computer Science,
vol. 2010,
p. 385-391,
Springer,
Heidelberg,
2010.
:
+ / - Publications in year 2009
UWB system description: EVALITA 2009 .
Conference of the Italian Association for Artificial Intelligence,
vol. XI.,
2009.
:
Diskriminativní trénování akustických modelů .
University of West Bohemia,
Pilsen, Czech Republic,
2009.
:
Discriminative training of gender-dependent acoustic models .
Text, Speech and Dialogue,
p. 331-338,
Springer,
Plzeň,
2009.
:
Training of Speaker-Clustered Acoustic Models for Use in Real-Time Recognizers .
Proceedings of the International Conference on Signal Processing and Multimedia Application,
p. 131-135,
INSTICC,
Miláno,
2009.
:
+ / - Publications in year 2008
An Expert System in Speaker Verification Task .
Proceedings of Interspeech 2008 incorporating SST 2008,
vol. 9,
p. 355-358,
International Speech Communication Association,
Brisbane, AU,
2008.
:
+ / - Publications in year 2007
A Cohort Methods for Score Normalization in Speaker Verification System, Acceleration of On-line Cohort Methods .
Specom 2007 Proceedings,
p. 367-372,
Moskow State Linguistic University,
Moskow,
2007.
:
+ / - Publications in year 2006
Independent components for acoustic modeling .
Interspeech,
vol. 1,
p. 2486-2489,
ISCA,
Bonn,
2006.
:
A structure of expert system for speaker verification .
Lecture Notes in Artificial Intelligence ,
Lecture notes in artificial intelligence. 0302-9743 ; 4188,
4188,
p. 493-500,
Springer,
Berlin,
2006.
:
Silence/speech detection method based on set of decision graphs .
Lecture Notes in Artificial Intelligence,
Lecture notes in artificial intelligence. 0302-9743 ; 4188,
4188,
p. 539-546,
Springer,
Berlin,
2006.
:
+ / - Publications in year 2005
Introduction of improved UWB speaker verification system .
Lecture Notes in Artificial Intelligence,
Lecture notes in artificial intelligence 3658,
3658,
p. 364-370,
Springer,
Berlin,
2005.
:
+ / - Publications in year 2004
Optimization of features for robust speaker recognition .
Speech processing,
p. 140-147,
Academy of Sciences of the Czech Republic,
Prague,
2004.
: