Vassilis Katsouros
Person information
- affiliation: Athena RIC, Institute for Language & Speech Processing, Athens, Greece
2020 – today
- 2024
- [j9] Georgios Paraskevopoulos, Theodoros Kouzelis, Georgios Rouvalis, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos:
Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems: A Case Study for Modern Greek. IEEE ACM Trans. Audio Speech Lang. Process. 32: 286-299 (2024)
- [c43] Vassilis Evangelidis, Helena G. Theodoropoulou, Vassilis Katsouros, Chairi Kiourt:
AI-Enabled Art Education: Unleashing Creative Potential and Exploring Co-Creation Frontiers. CSEDU (2) 2024: 294-301
- [c42] Manos Plitsis, Theodoros Kouzelis, Georgios Paraskevopoulos, Vassilis Katsouros, Yannis Panagakis:
Investigating Personalization Methods in Text to Music Generation. ICASSP 2024: 1081-1085
- [c41] Eleanna Kouletou, Vassilis Papavassiliou, Vassilis Katsouros:
Investigating Neural Networks and Transformer Models for Enhanced Comic Decoding. ICDAR (Workshops 1) 2024: 138-153
- [i10] Georgios Paraskevopoulos, Chara Tsoukala, Athanasios Katsamanis, Vassilis Katsouros:
The Greek podcast corpus: Competitive speech models for low-resourced languages with weakly supervised data. CoRR abs/2406.15284 (2024)
- [i9] Leon Voukoutis, Dimitris Roussis, Georgios Paraskevopoulos, Sokratis Sofianopoulos, Prokopis Prokopidis, Vassilis Papavasileiou, Athanasios Katsamanis, Stelios Piperidis, Vassilis Katsouros:
Meltemi: The first open Large Language Model for Greek. CoRR abs/2407.20743 (2024)
- 2023
- [c40] Panagiotis Kaddas, Konstantinos Palaiologos, Basilis Gatos, Vassilis Katsouros, Katerina Christopoulou:
A System for Processing and Recognition of Greek Byzantine and Post-Byzantine Documents. ICDAR (4) 2023: 366-376
- [c39] Theodoros Kouzelis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros:
Weakly-supervised forced alignment of disfluent speech using phoneme-level modeling. INTERSPEECH 2023: 1563-1567
- [i8] Georgios Paraskevopoulos, Theodoros Kouzelis, Georgios Rouvalis, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos:
Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems A case study for Modern Greek. CoRR abs/2301.00304 (2023)
- [i7] Theodoros Kouzelis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros:
Weakly-supervised forced alignment of disfluent speech using phoneme-level modeling. CoRR abs/2306.00996 (2023)
- [i6] Manos Plitsis, Theodoros Kouzelis, Georgios Paraskevopoulos, Vassilis Katsouros, Yannis Panagakis:
Investigating Personalization Methods in Text to Music Generation. CoRR abs/2309.11140 (2023)
- [i5] Theodoros Kouzelis, Vassilis Katsouros:
Weakly-supervised Automated Audio Captioning via text only training. CoRR abs/2309.12242 (2023)
- 2022
- [j8] Kosmas Kritsis, Aggelos Gkiokas, Aggelos Pikrakis, Vassilis Katsouros:
DanceConv: Dance Motion Generation With Convolutional Networks. IEEE Access 10: 44982-45000 (2022)
- [c38] Grigoris Bastas, Stefanos Koutoupis, Maximos A. Kaliakatsos-Papakostas, Vassilis Katsouros, Petros Maragos:
A Few-Sample Strategy for Guitar Tablature Transcription Based on Inharmonicity Analysis and Playability Constraints. ICASSP 2022: 771-775
- [c37] Alkiviadis Katsalis, Konstantinos Christantonis, Charalampos Tsioustas, Pantelis I. Kaplanoglou, Maximos A. Kaliakatsos-Papakostas, Athanasios Katsamanis, Konstantinos I. Diamantaras, Vassilis Katsouros, Evita F. Fotinea, Depy Panga, Dimitra Loupi:
NLP-Theatre: Employing Speech Recognition Technologies for Improving Accessibility and Augmenting the Theatrical Experience. IntelliSys (2) 2022: 507-526
- [c36] Gerasimos Chatzoudis, Manos Plitsis, Spyridoula Stamouli, Athanasia-Lida Dimou, Nassos Katsamanis, Vassilis Katsouros:
Zero-Shot Cross-lingual Aphasia Detection using Automatic Speech Recognition. INTERSPEECH 2022: 2178-2182
- [c35] Dimitrios Roussis, Vassilis Papavassiliou, Prokopis Prokopidis, Stelios Piperidis, Vassilis Katsouros:
SciPar: A Collection of Parallel Corpora from Scientific Abstracts. LREC 2022: 2652-2657
- [c34] Grigoris Bastas, Maximos A. Kaliakatsos-Papakostas, Georgios Paraskevopoulos, Pantelis I. Kaplanoglou, Konstantinos Christantonis, Charalampos Tsioustas, Dimitris Mastrogiannopoulos, Depy Panga, Evita F. Fotinea, Athanasios Katsamanis, Vassilis Katsouros, Konstantinos I. Diamantaras, Petros Maragos:
Towards a DHH Accessible Theater: Real-Time Synchronization of Subtitles and Sign Language Videos with ASR and NLP Solutions. PETRA 2022: 653-661
- [c33] Efthymios Georgiou, Kosmas Kritsis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos:
Regotron: Regularizing the Tacotron2 Architecture Via Monotonic Alignment Loss. SLT 2022: 977-983
- [i4] Gerasimos Chatzoudis, Manos Plitsis, Spyridoula Stamouli, Athanasia-Lida Dimou, Athanasios Katsamanis, Vassilis Katsouros:
Zero-Shot Cross-lingual Aphasia Detection using Automatic Speech Recognition. CoRR abs/2204.00448 (2022)
- [i3] Efthymios Georgiou, Kosmas Kritsis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos:
Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss. CoRR abs/2204.13437 (2022)
- 2021
- [c32] Kosmas Kritsis, Aggelos Gkiokas, Aggelos Pikrakis, Vassilis Katsouros:
Attention-based Multimodal Feature Fusion for Dance Motion Generation. ICMI 2021: 763-767
- 2020
- [j7] Kosmas Kritsis, Theatina Kylafi, Maximos A. Kaliakatsos-Papakostas, Aggelos Pikrakis, Vassilis Katsouros:
On the Adaptability of Recurrent Neural Networks for Real-Time Jazz Improvisation Accompaniment. Frontiers Artif. Intell. 3: 508727 (2020)
- [c31] Grigoris Bastas, Kosmas Kritsis, Vassilis Katsouros:
Air-Writing Recognition using Deep Convolutional and Recurrent Neural Network Architectures. ICFHR 2020: 7-12
- [i2] Maximos A. Kaliakatsos-Papakostas, Kosmas Kritsis, Vassilis Katsouros:
Music in Education through Technology. ERCIM News 2020(120) (2020)
- [i1] Vassilis Katsouros:
Educational Technology - Introduction to the Special Theme. ERCIM News 2020(120) (2020)
2010 – 2019
- 2019
- [j6] Christos Emmanouilidis, Petros Pistofidis, Luka Bertoncelj, Vassilis Katsouros, Apostolos P. Fournaris, Christos Koulamas, Cristobal Ruiz-Carcel:
Enabling the human in the loop: Linked data and knowledge in industrial cyber-physical systems. Annu. Rev. Control. 47: 249-265 (2019)
- [c30] Christos Garoufis, Athanasia Zlatintsi, Kosmas Kritsis, Panagiotis Paraskevas Filntisis, Vassilis Katsouros, Petros Maragos:
An Environment for Gestural Interaction with 3D Virtual Musical Instruments as an Educational Tool. EUSIPCO 2019: 1-5
- [c29] Kosmas Kritsis, Maximos A. Kaliakatsos-Papakostas, Vassilis Katsouros, Aggelos Pikrakis:
Deep Convolutional and LSTM Neural Network Architectures on Leap Motion Hand Tracking Data Sequences. EUSIPCO 2019: 1-5
- 2018
- [j5] Ioannis Karydis, Aggelos Gkiokas, Vassilis Katsouros, Lazaros S. Iliadis:
Musical track popularity mining dataset: Extension & experimentation. Neurocomputing 280: 76-85 (2018)
- [c28] Athanasia Zlatintsi, Panagiotis Paraskevas Filntisis, Christos Garoufis, Antigoni Tsiami, Kosmas Kritsis, Maximos A. Kaliakatsos-Papakostas, Aggelos Gkiokas, Vassilis Katsouros, Petros Maragos:
A Web-based Real-Time Kinect Application for Gestural Interaction with Virtual Musical Instruments. Audio Mostly Conference 2018: 12:1-12:6
- [c27] Maximos A. Kaliakatsos-Papakostas, Aggelos Gkiokas, Vassilis Katsouros:
Interactive Control of Explicit Musical Features in Generative LSTM-based Systems. Audio Mostly Conference 2018: 29:1-29:7
- [c26] Kosmas Kritsis, Aggelos Gkiokas, Carlos Árpád Acosta, Quentin Lamerand, Robert Piéchaud, Maximos A. Kaliakatsos-Papakostas, Vassilis Katsouros:
A web-based 3D environment for gestural interaction with virtual music instruments as a STEAM education tool. NIME 2018: 348-349
- [p1] Vassilis Katsouros, Evita F. Fotinea, Renaat Frans, Erica Andreotti, Petros Stergiopoulos, Manolis Chaniotakis, Thomas Fischer, Robert Piéchaud, Zoltan Karpati, Pierre Laborde, Daniel Martín-Albo, Fotini Simistira, Marcus Liwicki:
iMuSciCA: Interactive Music Science Collaborative Activities for STEAM Learning. Designing for the User Experience in Learning Systems 2018: 123-154
- 2017
- [c25] Aggelos Gkiokas, Vassilis Katsouros:
Convolutional Neural Networks for Real-Time Beat Tracking: A Dancing Robot Application. ISMIR 2017: 286-293
- 2016
- [j4] Aggelos Gkiokas, Vassilis Katsouros, George Carayannis:
Towards Multi-Purpose Spectral Rhythm Features: An Application to Dance Style, Meter and Tempo Estimation. IEEE ACM Trans. Audio Speech Lang. Process. 24(11): 1885-1896 (2016)
- [c24] Vassilis Katsouros, Vassilis Papavassiliou, Fotini Simistira, Basilis Gatos:
Recognition of Greek Polytonic on Historical Degraded Texts Using HMMs. DAS 2016: 346-351
- [c23] Ioannis Karydis, Aggelos Gkiokas, Vassilis Katsouros:
Musical Track Popularity Mining Dataset. AIAI 2016: 562-572
- 2015
- [j3] Fotini Simistira, Vassilis Katsouros, George Carayannis:
Recognition of online handwritten mathematical formulas using probabilistic SVMs and stochastic context free grammars. Pattern Recognit. Lett. 53: 85-92 (2015)
- [c22] Basilis Gatos, Nikolaos Stamatopoulos, Georgios Louloudis, Giorgos Sfikas, George Retsinas, Vassilis Papavassiliou, Fotini Sunistira, Vassilis Katsouros:
GRPOLY-DB: An old Greek polytonic document image database. ICDAR 2015: 646-650
- [c21] Fotini Simistira, Adnan Ul-Hassan, Vassilis Papavassiliou, Basilis Gatos, Vassilis Katsouros, Marcus Liwicki:
Recognition of historical Greek polytonic scripts using LSTM networks. ICDAR 2015: 766-770
- 2014
- [c20] Fotini Simistira, Vassilis Papavassiliou, Vassilis Katsouros, George Carayannis:
Recognition of Spatial Relations in Mathematical Formulas. ICFHR 2014: 164-168
- [c19] Aggelos Gkiokas, Vassilis Katsouros, George Carayannis:
Deploying Deep Belief Nets for content based audio music similarity. IISA 2014: 180-185
- 2013
- [c18] Fotini Simistira, Vassilis Papavassiliou, Vassilis Katsouros, George Carayannis:
Structural analysis of online handwritten mathematical symbols based on support vector machines. DRR 2013: 86580Z
- 2012
- [c17] Aggelos Gkiokas, Vassilis Katsouros, George Carayannis, Themos Stafylakis:
Music tempo estimation and beat tracking by applying source separation and metrical relations. ICASSP 2012: 421-424
- [c16] Fotini Simistira, Vassilis Papavassiliou, Vassilis Katsouros, George Carayannis:
A System for Recognition of On-Line Handwritten Mathematical Expressions. ICFHR 2012: 193-198
- [c15] Vassilis Papavassiliou, Fotini Simistira, Vassilis Katsouros, George Carayannis:
A Morphology Based Approach for Binarization of Handwritten Documents. ICFHR 2012: 577-581
- [c14] Aggelos Gkiokas, Vassilis Katsouros, George Carayannis:
Reducing Tempo Octave Errors by Periodicity Vector Coding And SVM Learning. ISMIR 2012: 301-306
- [c13] Themos Stafylakis, Vassilis Katsouros, Patrick Kenny, Pierre Dumouchel:
A mean shift algorithm for manifolds of exponential families. ISSPA 2012: 511-516
- [c12] Themos Stafylakis, Vassilis Katsouros, Patrick Kenny, Pierre Dumouchel:
Mean shift algorithm for exponential families with applications to speaker clustering. Odyssey 2012: 324-329
- 2011
- [c11] Themos Stafylakis, Xavier Anguera Miró, Vassilis Katsouros, George Carayannis:
Closed-form expressions vs. BIC: A comparison for speaker clustering. ICASSP 2011: 2228-2231
- [c10] Fotini Simistira, Vassilis Papavassiliou, Themos Stafylakis, Vassilis Katsouros:
Enhancing Handwritten Word Segmentation by Employing Local Spatial Features. ICDAR 2011: 1314-1318
- 2010
- [j2] Themos Stafylakis, Vassilis Katsouros, George Carayannis:
The Segmental Bayesian Information Criterion and Its Applications to Speaker Diarization. IEEE J. Sel. Top. Signal Process. 4(5): 857-866 (2010)
- [j1] Vassilis Papavassiliou, Themos Stafylakis, Vassilis Katsouros, George Carayannis:
Handwritten document image segmentation into text lines and words. Pattern Recognit. 43(1): 369-377 (2010)
- [c9] Themos Stafylakis, Georgios Tzimiropoulos, Vassilis Katsouros, George Carayannis:
A new penalty term for the BIC with respect to speaker diarization. ICASSP 2010: 4978-4981
- [c8] Vassilis Papavassiliou, Vassilis Katsouros, George Carayannis:
A Morphological Approach for Text-Line Segmentation in Handwritten Documents. ICFHR 2010: 19-24
- [c7] Aggelos Gkiokas, Vassilis Katsouros, George Carayannis:
Tempo Induction Using Filterbank Analysis and Tonal Features. ISMIR 2010: 555-558
- [c6] Themos Stafylakis, Vassilis Katsouros, George Carayannis:
Speaker clustering via the mean shift algorithm. Odyssey 2010: 33
2000 – 2009
- 2009
- [c5] Themos Stafylakis, Vassilis Katsouros, George Carayannis:
Redefining the Bayesian information criterion for speaker diarisation. INTERSPEECH 2009: 1051-1054
- 2008
- [c4] Iason Demiros, George Carayannis, Vassilios Antonopoulos, Georgios Kambourakis, Vassilios Katsouros, Panayotis Kolevris, Marios Nottas, Harris Papageorgiou, Vassilis Papavassiliou, Spyros Raptis, Fotini Simistira, Themos Stafylakis:
PANOPTIS: A System for Intelligent Monitoring of the Hellenic Broadcast Sector. DEXA Workshops 2008: 605-609
- [c3] Themos Stafylakis, Vassilis Papavassiliou, Vassilis Katsouros, George Carayannis:
Robust text-line and word segmentation for handwritten documents images. ICASSP 2008: 3393-3396
- 2007
- [c2] Themos Stafylakis, Vassilis Katsouros, George Carayannis:
Efficient combination of parametric spaces, models and metrics for speaker diarization. ASRU 2007: 256-261
- [c1] Vassilis Papavassiliou, Themos Stafylakis, Vassilis Katsouros, George Carayannis:
A Parametric Spectral-Based Method for Verification of Text in Videos. ICDAR 2007: 879-883
last updated on 2024-10-04 21:00 CEST by the dblp team
all metadata released as open data under CC0 1.0 license