Ing. Hrúz Marek, Ph.D.
Courses
Course Lecturer
Supervising student projects
Publications
+ / - Publications in year 2020
An Automated Pipeline for Robust Image Processing and Optical Character Recognition of Historical Documents .
SPECOM: International Conference on Speech and Computer,
Lecture Notes in Computer Science ,
vol. 12335,
p. 166-175,
Springer, Cham,
2020.
:
+ / - Publications in year 2019
UWB-NTIS Speaker Diarization System for the DIHARD II 2019 Challenge .
Interspeech,
p. 993-997,
2019.
:
Combination of Positions and Angles for Hand Pose Estimation .
21st International Conference on Speech and Computer SPECOM 2019,
Proceedings of SPECOM 2019,
Lecture Notes in Artificial Intelligence, LNAI 11658 ,
Springer, Cham,
2019.
:
Detection of Overlapping Speech for the Purposes of Speaker Diarization .
Speech and Computer (SPECOM 2019),
p. 247-257,
Springer, Cham,
2019.
:
Semantic text segmentation from synthetic images of full-text documents .
SPIIRAS Proceedings,
vol. 18:6,
p. 1381-1406,
2019.
:
Hands 2019 - Challenge .
2019.
:
+ / - Publications in year 2018
ZCU-NTIS Speaker Diarization System for the DIHARD 2018 Challenge .
Interspeech,
p. 2788-2792,
2018.
:
Generation of Synthetic Images of Full-Text Documents .
20th International Conference on Speech and Computer, SPECOM 2018,
Lecture Notes in Artificial Intelligence, LNAI 11096,
p. 68-75,
Springer Nature Switzerland AG,
2018.
:
Recurrent Neural Network Based Speaker Change Detection from Text Transcription Applied in Telephone Speaker Diarization System .
Text, Speech, and Dialogue 21st International Conference, TSD 2018,
p. 342-350,
Cham: Springer Nature Switzerland AG,
2018.
:
Towards Processing of the Oral History Interviews and Related Printed Documents .
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018),
2104,
European Language Resources Association (ELRA),
2018.
:
+ / - Publications in year 2017
Convolutional Neural Network for speaker change detection in telephone speaker diarization system .
2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
p. 4945-4949,
IEEE,
2017.
:
Speaker Diarization Using Convolutional Neural Network for Statistics Accumulation Refinement .
Interspeech, 18th Annual Conference of the International Speech Communication Association,
p. 3562-3566,
2017.
:
+ / - Publications in year 2016
Fisher Vectors in PLDA Speaker Verification System .
The IEEE 13th International Conference on Signal Processing, ICSP 2016,
p. 1338-1341,
IEEE Press,
2016.
:
Convolutional Neural Network in the Task of Speaker Change Detection .
Speech and Computer, 18th International Conference, SPECOM 2016,
Lecture Notes in Computer Science,
vol. 9811,
p. 191-198,
Springer,
2016.
:
+ / - Publications in year 2014
+ / - Publications in year 2013
+ / - Publications in year 2012
Local Binary Pattern based features for sign language recognition .
Pattern Recognition and Image Analysis,
p. 519-526,
2012.
:
Automatic recognition fingerspelling gestures in multiple languages for a communication interface for the disabled .
Pattern Recognition and Image Analysis,
p. 527-536,
2012.
:
Particle Swarm Optimization For Automatic Hardness Measurement .
Chemické Listy,
vol. 106,
p. 434-437,
2012.
:
+ / - Publications in year 2011
Multi-modal dialogue system with sign language capabilities .
ASSETS'11: Proceedings of the 13th international ACM SIGACCESS Conference on Computers and Accessibility,
2011.
:
Towards Automatic Annotation of Sign Language Dictionary Corpora .
Lecture Notes in Computer Science,
p. 331-339,
2011.
:
Multi-lingual fingerspelling recognition for handicapped kiosk .
Pattern Recognit. Image Anal.,
vol. 21,
p. 402-406,
Springer-Verlag New York, Inc.,
Secaucus, NJ, USA,
2011.
:
Automatic fingersign-to-speech translation system .
Journal on Multimodal User Interfaces,
vol. 4,
p. 61-79,
Springer Berlin / Heidelberg,
2011.
:
Metodika pro automatizovanou tvorbu slovníku znakového jazyka .
INSPO 2011,
2011.
:
A Methodology for Automatic Sign Language Dictionary Creation .
Universal Learning Design,
Brno, Czech Republic,
2011.
:
Automatic sign categorization using visual data .
The proceedings of the 13th international ACM SIGACCESS conference on Computers and accessibility,
ASSETS '11,
p. 229-230,
ACM,
New York, NY, USA,
2011.
:
Local Binary Pattern Based Features for Sign Language Recognition .
Pattern Recognition and Image Analysis,
p. 398-401,
2011.
:
+ / - Publications in year 2010
Automatic Fingersign to Speech Translator .
eNTERFACE'10,
2010.
:
Evaluation of Feature Space Transforms for Czech Sign-Language Recognition .
MetaCentrum Yearbook 2010,
p. 145-150,
2010.
:
Towards Czech on-line sign language dictionary - technological overview and data collection .
LREC 2010, Seventh international conference on language resources and evaluation; 4th workshop on the representation and processing of sign languages: corpora and sign language technologies,
p. 41-44,
Valletta, Malta,
2010.
:
Correlation analysis of facial features and sign gestures .
2010 IEEE 10th International Conference on Signal Processing Proceedings,
vol. 2010,
p. 732-735,
Institute of Electrical and Electronics Engineers, Inc.,
Beijing,
2010.
:
Knoop hardness measurement using computer vision. .
Proceedings of the 21st International DAAAM Symposium "Intelligent Manufacturing & Automation: Focus on Interdisciplinary Solutions",
vol. 21,
p. 537-538,
Vienna,
Zadar, Croatia,
2010.
:
Robust image processing technique for Knoop hardness measurement .
Proceedings of IMEKO 17th TC-4, 3rd TC-19 Symposium and Workshop IWADC,
vol. 2010,
Technical University of Kosice,
Košice, Slovenská republika,
2010.
:
+ / - Publications in year 2009
Sign-language-enabled information kiosk .
eNTERFACE'08,
2009.
:
Input and output modalities used in a sign-language-enabled information kiosk .
proceedings of SPECOM'2009,
p. 113-116,
SPIIRAS,
2009.
:
+ / - Publications in year 2008
Design and Recording of Czech Audio-Visual Database with Impaired Conditions for Continuous Speech Recognition .
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08),
2008.
:
Semi-automatic Annotation of Sign Language Corpora .
LREC Workshop on the Representation and Processing of Sign Languages: Construction and Exploitation of Sign Language Corpora,
p. 78-81,
Marrakech, Morocco,
2008.
:
Collection and Preprocessing of Czech Sign Language Corpus for Sign Language Recognition .
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08),
ELRA,
Marrakech, Morocco,
2008.
:
Feature Space Transforms for Czech Sign-Language Recognition .
Proceedings of the 9th Annual Conference of the International Speech Communication Association (Interspeech 2008),
p. 2036-2039,
Causal Production Pty ltd.,
2008.
:
Speech and sliding text aided sign retrieval from hearing impaired sign news videos .
Journal on Multimodal User Interfaces,
2008.
:
+ / - Publications in year 2007
An Overview of Features for a Sign Language Recognition System from the Database UWB-06-SLR-A .
The 1st Young Researchers Conference on Applied Sciences (YRCAS 2007),
p. 186-191,
University of West Bohemi,
Pilsen, Czech Republic,
2007.
:
Design of a Multi-Modal Information Kiosk for Aurally Handicapped People .
SPECOM 2007 Proceedings,
p. 751-755,
Moscow State Linguistic University,
Moscow,
2007.
:
Speech and sliding text aided sign retrieval from hearing impaired sign news videos .
eNTERFACE'07,
p. 37-49,
TELE, Universite catholique de Louvain,
Louvain-la-Neuve,
2007.
:
Design and Recording of Signed Czech Language Corpus for Automatic Sign Language Recognition .
Interspeech 2007,
p. 678-681,
2007.
: