Sabato Marco Siniscalchi
About
Sabato Marco Siniscalchi (Senior Member, IEEE) is a Full Professor with the University of Palermo, Palermo, Italy, an Adjunct Professor with the Norwegian University of Science and Technology (NTNU), and an Affiliate Faculty member with the Georgia Institute of Technology. He received his doctorate degree in computer engineering from the University of Palermo, Palermo, Italy, in 2006. In 2006, he was a Postdoctoral Fellow with the Georgia Institute of Technology. From 2007 to 2010, he was with NTNU, Norway, as a Research Scientist. From 2010 to 2023, he was successively an Assistant Professor, an Associate Professor, and a Full Professor at Kore University. From 2017 to 2018, he was a Senior Speech Researcher with the Siri Speech Group, Apple Inc., Cupertino, CA, USA. He served as an Associate Editor of the IEEE/ACM Transactions on Audio, Speech, and Language Processing from 2015 to 2019. Prof. Siniscalchi was an Elected Member of the IEEE SLT Committee from 2019 to 2022 and was re-elected in 2024.
Publications
2024
-
Guo, Zilu;
Du, Jun;
Siniscalchi, Sabato Marco;
Pan, Jia;
Liu, Qingfeng.
(2024)
Controllable Conformer for Speech Enhancement and Recognition.
IEEE Signal Processing Letters
Academic article
-
La Quatra, Moreno;
Turco, Maria Francesca;
Svendsen, Torbjørn Karl;
Salvi, Giampiero;
Orozco-Arroyave, Juan Rafael;
Siniscalchi, Sabato Marco.
(2024)
Exploiting Foundation Models and Speech Enhancement for Parkinson’s Disease Detection from Speech in Real-World Operative Conditions.
Interspeech
Academic article
2023
-
Adiban, Mohammad;
Siniscalchi, Sabato Marco;
Salvi, Giampiero.
(2023)
A step-by-step training method for multi generator GANs with application to anomaly detection and cybersecurity.
Neurocomputing
Academic article
2021
-
Sabzi Shahrebabaki, Abdolreza;
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn Karl.
(2021)
Raw Speech-to-Articulatory Inversion by Temporal Filtering and Decimation.
Interspeech
Academic article
-
Sabzi Shahrebabaki, Abdolreza;
Salvi, Giampiero;
Svendsen, Torbjørn Karl;
Siniscalchi, Sabato Marco.
(2021)
Acoustic-to-Articulatory Mapping With Joint Optimization of Deep Speech Enhancement and Articulatory Inversion Models.
IEEE/ACM Transactions on Audio, Speech, and Language Processing (TASLP)
Academic article
-
Sabzi Shahrebabaki, Abdolreza;
Olfati, Negar;
Imran, Ali Shariq;
Johnsen, Magne Hallstein;
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn Karl.
(2021)
A Two-Stage Deep Modeling Approach to Articulatory Inversion.
IEEE (Institute of Electrical and Electronics Engineers)
Academic chapter/article/Conference paper
-
Sabzi Shahrebabaki, Abdolreza;
Siniscalchi, Sabato Marco;
Salvi, Giampiero;
Svendsen, Torbjørn Karl.
(2021)
A DNN Based Speech Enhancement Approach to Noise Robust Acoustic-to-Articulatory Inversion.
IEEE (Institute of Electrical and Electronics Engineers)
Academic chapter/article/Conference paper
2020
-
Sabzi Shahrebabaki, Abdolreza;
Olfati, Negar;
Siniscalchi, Sabato Marco;
Salvi, Giampiero;
Svendsen, Torbjørn.
(2020)
Transfer learning of articulatory information through phone information.
Interspeech (USB)
Academic article
-
Sabzi Shahrebabaki, Abdolreza;
Siniscalchi, Sabato Marco;
Salvi, Giampiero;
Svendsen, Torbjørn Karl.
(2020)
Sequence-to-sequence articulatory inversion through time convolution of sub-band frequency signals.
Interspeech (USB)
Academic article
2014
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2014)
An artificial neural network approach to automatic speech processing.
Neurocomputing
Academic article
2013
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2013)
A Bottom-Up Modular Search Approach to Large Vocabulary Continuous Speech Recognition.
IEEE Transactions on Audio, Speech, and Language Processing
Academic article
2012
-
Siniscalchi, Sabato Marco;
Lyu, DC;
Svendsen, Torbjørn;
Lee, CH.
(2012)
Experiments on Cross-Language Attribute Detection and Phone Recognition With Minimal Target-Specific Training Data.
IEEE Transactions on Audio, Speech, and Language Processing
Academic article
2011
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2011)
A Bottom-Up Stepwise Knowledge-Integration Approach to Large Vocabulary Continuous Speech Recognition Using Weighted Finite State Machines.
Interspeech
Academic article
2010
-
Birkenes, Øystein;
Matsui, Tomoko;
Tanabe, Kunio;
Siniscalchi, Sabato Marco;
Myrvoll, Tor Andre;
Johnsen, Magne Hallstein.
(2010)
Penalized Logistic Regression with HMM Log-Likelihood Regressors for Speech Recognition.
IEEE Transactions on Audio, Speech, and Language Processing
Academic article
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2010)
A Survey on Recent Progress in the ASAT/SIRKUS Paradigm.
IEEE conference proceedings
Other
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Sorbello, Filippo;
Lee, Chin-Hui.
(2010)
Experimental Studies on Continuous Speech Recognition Using Neural Architectures with ‘Adaptive’ Hidden Activation Functions.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
Academic article
-
Siniscalchi, Sabato Marco;
Reed, Jeremy;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2010)
Exploiting Context-Dependency and Acoustic Resolution of Universal Speech Attribute Models in Spoken Language Recognition.
Interspeech
Academic article
2009
-
Siniscalchi, Sabato Marco;
Reed, Jeremy;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2009)
Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition.
Interspeech
Academic article
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2009)
A Phonetic Feature Based Lattice Rescoring Approach to LVCSR.
Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing
Academic article
-
Siniscalchi, Sabato Marco;
Lee, Chin-Hui.
(2009)
A study on integrating acoustic-phonetic information into lattice rescoring for automatic speech recognition.
Speech Communication
Academic article
2008
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2008)
A Penalized Logistic Regression Approach to Detection Based Phone Classification.
Interspeech
Academic article
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2008)
Toward a Detector-Based Universal Phone Recognizer.
Other
-
Siniscalchi, Sabato Marco;
Birkenes, Øystein;
Johnsen, Magne Hallstein;
Svendsen, Torbjørn.
(2008)
Joint Optimization of Event Detectors and Evidence Merger for Continuous Speech Recognition.
Other
2007
-
Siniscalchi, Sabato Marco;
Svendsen, Torbjørn;
Lee, Chin-Hui.
(2007)
Towards Bottom-Up Continuous Phone Recognition.
IEEE Signal Processing Society
Academic chapter/article/Conference paper
Teaching
Courses
Supervision
- Adiban, Mohammad (co-supervised with Prof. G. Salvi). PhD topic: Video sequence prediction with hierarchical variational autoencoders.
- Sabzi Shahrebabaki, Abdolreza (co-supervised with Prof. T. Svendsen). PhD topic: Speech-to-Articulatory Inversion with deep models.
Outreach
2024
-
Academic lecture: La Quatra, Moreno; Turco, Maria Francesca; Svendsen, Torbjørn; Salvi, Giampiero; Orozco-Arroyave, Juan Rafael; Siniscalchi, Sabato Marco. (2024) Exploiting Foundation Models and Speech Enhancement for Parkinson’s Disease Detection from Speech in Real-World Operative Conditions. ISCA Interspeech, Kos, Greece, 2024-09-01 - 2024-09-05
2010
-
Academic lecture: Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Sorbello, Filippo; Lee, Chin-Hui. (2010) Experimental Studies on Continuous Speech Recognition Using Neural Architectures with ‘Adaptive’ Hidden Activation Functions. IEEE ICASSP 2010, Dallas, Texas, 2010-03-14 - 2010-03-19
-
Academic lecture: Siniscalchi, Sabato Marco; Reed, Jeremy; Svendsen, Torbjørn; Lee, Chin-Hui. (2010) Exploiting Context-Dependency and Acoustic Resolution of Universal Speech Attribute Models in Spoken Language Recognition. ISCA Interspeech 2010, Makuhari, 2010-09-27 - 2010-09-30
-
Academic lecture: Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2010) A Survey on Recent Progress in the ASAT/SIRKUS Paradigm. IEEE ISCSLP 2010, Tainan, 2010-11-21 - 2010-12-03
2009
-
Academic lecture: Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2009) A Phonetic Feature Based Lattice Rescoring Approach to LVCSR. IEEE International Conference on Acoustics, Speech and Signal Processing, Taipei, 2009-04-19 - 2009-04-24
-
Academic lecture: Siniscalchi, Sabato Marco; Reed, Jeremy; Svendsen, Torbjørn; Lee, Chin-Hui. (2009) Exploring Universal Attribute Characterization of Spoken Languages for Spoken Language Recognition. ISCA Interspeech, Brighton, 2009-09-06 - 2009-09-10
2008
-
Academic lecture: Amdal, Ingunn; Svendsen, Torbjørn; Johnsen, Magne Hallstein; Siniscalchi, Sabato Marco; Hamar, Jarle Bauck; Martinez, Del Hoyo Canterla A. (2008) SIRKUS - A new paradigm for speech recognition. Norges forskningsråd VERDIKT Conference 2008, Bergen, 2008-10-29 - 2008-10-30
-
Academic lecture: Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2008) A Penalized Logistic Regression Approach to Detection Based Phone Classification. ISCA Interspeech 2008, Brisbane, 2008-09-22 - 2008-09-26
-
Academic lecture: Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2008) Toward a Detector-Based Universal Phone Recognizer. IEEE International Conference on Acoustics, Speech and Signal Processing, Las Vegas, 2008-03-30 - 2008-04-04
-
Academic lecture: Siniscalchi, Sabato Marco; Birkenes, Øystein; Johnsen, Magne Hallstein; Svendsen, Torbjørn. (2008) Joint Optimization of Event Detectors and Evidence Merger for Continuous Speech Recognition. ISCA ITRW on Speech Analysis and Processing for Knowledge Discovery, Aalborg, 2008-06-04 - 2008-06-06
2007
-
Academic lecture: Siniscalchi, Sabato Marco; Svendsen, Torbjørn; Lee, Chin-Hui. (2007) Towards Bottom-Up Continuous Phone Recognition. 2007 IEEE Workshop on Automatic Speech Recognition and Understanding, Kyoto, 2007-12-09 - 2007-12-13