default search action
Alexander Richard
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [i29]Yi-Chiao Wu, Dejan Markovic, Steven Krenn, Israel D. Gebru, Alexander Richard:
ComplexDec: A Domain-robust High-fidelity Neural Audio Codec with Complex Spectrum Modeling. CoRR abs/2502.02019 (2025) - [i28]Aggelina Chatziagapi, Louis-Philippe Morency, Hongyu Gong, Michael Zollhoefer, Dimitris Samaras, Alexander Richard:
AV-Flow: Transforming Text to Audio-Visual Human-like Interactions. CoRR abs/2502.13133 (2025) - [i27]Simon Welker, Matthew Le, Ricky T. Q. Chen, Wei-Ning Hsu, Timo Gerkmann, Alexander Richard, Yi-Chiao Wu:
FlowDec: A flow-based full-band general audio codec with high perceptual quality. CoRR abs/2503.01485 (2025) - 2024
- [c32]Evonne Ng, Javier Romero, Timur M. Bagautdinov, Shaojie Bai, Trevor Darrell, Angjoo Kanazawa, Alexander Richard:
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations. CVPR 2024: 1001-1010 - [c31]Ziyang Chen, Israel D. Gebru, Christian Richardt, Anurag Kumar, William Laney, Andrew Owens, Alexander Richard:
Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark. CVPR 2024: 21886-21896 - [c30]Chao Huang, Dejan Markovic, Chenliang Xu, Alexander Richard:
Modeling and Driving Human Body Soundfields Through Acoustic Primitives. ECCV (10) 2024: 1-17 - [c29]Yi-Chiao Wu, Dejan Markovic, Steven Krenn, Israel D. Gebru, Alexander Richard:
ScoreDec: A Phase-Preserving High-Fidelity Audio Codec with a Generalized Score-Based Diffusion Post-Filter. ICASSP 2024: 361-365 - [i26]Evonne Ng, Javier Romero, Timur M. Bagautdinov, Shaojie Bai, Trevor Darrell, Angjoo Kanazawa, Alexander Richard:
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations. CoRR abs/2401.01885 (2024) - [i25]Ziyang Chen, Israel D. Gebru, Christian Richardt, Anurag Kumar, William Laney, Andrew Owens, Alexander Richard:
Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark. CoRR abs/2403.18821 (2024) - [i24]Julius Richter, Yi-Chiao Wu, Steven Krenn, Simon Welker, Bunlong Lay, Shinji Watanabe, Alexander Richard, Timo Gerkmann:
EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation. CoRR abs/2406.06185 (2024) - [i23]Chao Huang, Dejan Markovic, Chenliang Xu, Alexander Richard:
Modeling and Driving Human Body Soundfields through Acoustic Primitives. CoRR abs/2407.13083 (2024) - 2023
- [c28]Changan Chen, Alexander Richard, Roman Shapovalov, Vamsi Krishna Ithapu, Natalia Neverova, Kristen Grauman, Andrea Vedaldi:
Novel-View Acoustic Synthesis. CVPR 2023: 6409-6419 - [c27]Pranay Manocha, Israel D. Gebru, Anurag Kumar, Dejan Markovic, Alexander Richard:
Nord: Non-Matching Reference Based Relative Depth Estimation from Binaural Speech. ICASSP 2023: 1-5 - [c26]Yi-Chiao Wu, Israel D. Gebru, Dejan Markovic, Alexander Richard:
Audiodec: An Open-Source Streaming High-Fidelity Neural Audio Codec. ICASSP 2023: 1-5 - [c25]Pranay Manocha, Israel Dejene Gebru, Anurag Kumar, Dejan Markovic, Alexander Richard:
Spatialization Quality Metric for Binaural Speech. INTERSPEECH 2023: 5426-5430 - [c24]Xudong Xu, Dejan Markovic, Jacob Sandakly, Todd Keebler, Steven Krenn, Alexander Richard:
Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio. NeurIPS 2023 - [i22]Changan Chen, Alexander Richard, Roman Shapovalov, Vamsi Krishna Ithapu, Natalia Neverova, Kristen Grauman, Andrea Vedaldi:
Novel-View Acoustic Synthesis. CoRR abs/2301.08730 (2023) - [i21]Xudong Xu, Dejan Markovic, Jacob Sandakly, Todd Keebler, Steven Krenn, Alexander Richard:
Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio. CoRR abs/2311.06285 (2023) - 2022
- [c23]Karren Yang, Dejan Markovic, Steven Krenn, Vasu Agrawal, Alexander Richard:
Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis. CVPR 2022: 8217-8227 - [c22]Emre Aksan, Shugao Ma, Akin Caliskan, Stanislav Pidhorskyi, Alexander Richard, Shih-En Wei, Jason M. Saragih, Otmar Hilliges:
LiP-Flow: Learning Inference-Time Priors for Codec Avatars via Normalizing Flows in Latent Space. ECCV (26) 2022: 92-110 - [c21]Alexander Richard, Peter Sheridan Dodds, Vamsi Krishna Ithapu:
Deep Impulse Responses: Estimating and Parameterizing Filters with Deep Networks. ICASSP 2022: 3209-3213 - [c20]Yen-Ju Lu, Zhong-Qiu Wang, Shinji Watanabe
, Alexander Richard, Cheng Yu, Yu Tsao:
Conditional Diffusion Probabilistic Model for Speech Enhancement. ICASSP 2022: 7402-7406 - [c19]Wen-Chin Huang, Dejan Markovic, Alexander Richard, Israel Dejene Gebru, Anjali Menon:
End-to-End Binaural Speech Synthesis. INTERSPEECH 2022: 1218-1222 - [c18]Dejan Markovic, Alexandre Défossez, Alexander Richard:
Implicit Neural Spatial Filtering for Multichannel Source Separation in the Waveform Domain. INTERSPEECH 2022: 1806-1810 - [i20]Alexander Richard, Peter Sheridan Dodds, Vamsi Krishna Ithapu:
Deep Impulse Responses: Estimating and Parameterizing Filters with Deep Networks. CoRR abs/2202.03416 (2022) - [i19]Yen-Ju Lu, Zhong-Qiu Wang, Shinji Watanabe, Alexander Richard, Cheng Yu, Yu Tsao:
Conditional Diffusion Probabilistic Model for Speech Enhancement. CoRR abs/2202.05256 (2022) - [i18]Emre Aksan, Shugao Ma, Akin Caliskan, Stanislav Pidhorskyi, Alexander Richard, Shih-En Wei, Jason M. Saragih, Otmar Hilliges:
LiP-Flow: Learning Inference-time Priors for Codec Avatars via Normalizing Flows in Latent Space. CoRR abs/2203.07881 (2022) - [i17]Karren Yang, Dejan Markovic, Steven Krenn, Vasu Agrawal, Alexander Richard:
Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis. CoRR abs/2203.17263 (2022) - [i16]Dejan Markovic, Alexandre Défossez, Alexander Richard:
Implicit Neural Spatial Filtering for Multichannel Source Separation in the Waveform Domain. CoRR abs/2206.15423 (2022) - [i15]Wen-Chin Huang, Dejan Markovic, Alexander Richard, Israel Dejene Gebru, Anjali Menon:
End-to-End Binaural Speech Synthesis. CoRR abs/2207.03697 (2022) - [i14]Cheng-hsin Wuu, Ningyuan Zheng, Scott Ardisson, Rohan Bali, Danielle Belko, Eric Brockmeyer, Lucas Evans, Timothy Godisart, Hyowon Ha, Alexander Hypes, Taylor Koska, Steven Krenn, Stephen Lombardi, Xiaomin Luo, Kevyn McPhail, Laura Millerschoen, Michal Perdoch, Mark Pitts, Alexander Richard, Jason M. Saragih, Junko Saragih, Takaaki Shiratori, Tomas Simon, Matthew Stewart, Autumn Trimble, Xinshuo Weng, David Whitewolf, Chenglei Wu, Shoou-I Yu, Yaser Sheikh:
Multiface: A Dataset for Neural Face Rendering. CoRR abs/2207.11243 (2022) - 2021
- [c17]Alexander Richard, Jesse Francis, Jalal Kawash:
Basic Distributed Algorithms Visual Simulations for Distributed Systems Students. EDUCON 2021: 199-205 - [c16]Israel D. Gebru, Dejan Markovic, Alexander Richard, Steven Krenn, Gladstone Alexander Butler, Fernando De la Torre, Yaser Sheikh:
Implicit HRTF Modeling Using Temporal Convolutional Networks. ICASSP 2021: 3385-3389 - [c15]Alexander Richard, Michael Zollhöfer, Yandong Wen, Fernando De la Torre, Yaser Sheikh:
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement. ICCV 2021: 1153-1162 - [c14]Alexander Richard, Dejan Markovic, Israel D. Gebru, Steven Krenn, Gladstone Alexander Butler, Fernando De la Torre, Yaser Sheikh:
Neural Synthesis of Binaural Speech From Mono Audio. ICLR 2021 - [c13]Alexander Richard, Colin Lea, Shugao Ma, Juergen Gall, Fernando De la Torre, Yaser Sheikh:
Audio- and Gaze-driven Facial Animation of Codec Avatars. WACV 2021: 41-50 - [i13]Alexander Richard, Michael Zollhöfer, Yandong Wen, Fernando De la Torre, Yaser Sheikh:
MeshTalk: 3D Face Animation from Speech using Cross-Modality Disentanglement. CoRR abs/2104.08223 (2021) - 2020
- [j3]Hilde Kuehne
, Alexander Richard, Juergen Gall
:
A Hybrid RNN-HMM Approach for Weakly Supervised Temporal Action Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 42(4): 765-779 (2020) - [i12]Yaser Souri, Alexander Richard, Luca Minciullo, Juergen Gall:
On Evaluating Weakly Supervised Action Segmentation Methods. CoRR abs/2005.09743 (2020) - [i11]Alexander Richard, Colin Lea, Shugao Ma, Juergen Gall, Fernando De la Torre, Yaser Sheikh:
Audio- and Gaze-driven Facial Animation of Codec Avatars. CoRR abs/2008.05023 (2020)
2010 – 2019
- 2019
- [b1]Alexander Richard:
Temporal Segmentation of Human Actions in Videos. University of Bonn, Germany, 2019 - [c12]Ahsan Iqbal, Alexander Richard, Juergen Gall:
Enhancing Temporal Action Localization with Transfer Learning from Action Recognition. ICCV Workshops 2019: 1533-1540 - [i10]Hilde Kuehne, Ahsan Iqbal, Alexander Richard, Juergen Gall:
Mining YouTube - A dataset for learning fine-grained action concepts from webly supervised video data. CoRR abs/1906.01012 (2019) - [i9]Hilde Kuehne, Alexander Richard, Juergen Gall:
A Hybrid RNN-HMM Approach for Weakly Supervised Temporal Action Segmentation. CoRR abs/1906.01028 (2019) - 2018
- [c11]Yazan Abu Farha
, Alexander Richard, Juergen Gall:
When Will You Do What? - Anticipating Temporal Occurrences of Activities. CVPR 2018: 5343-5352 - [c10]Alexander Richard, Hilde Kuehne
, Juergen Gall:
Action Sets: Weakly Supervised Action Segmentation Without Ordering Constraints. CVPR 2018: 5987-5996 - [c9]Alexander Richard, Hilde Kuehne
, Ahsan Iqbal, Juergen Gall:
NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning. CVPR 2018: 7386-7395 - [i8]Yazan Abu Farha, Alexander Richard, Juergen Gall:
When will you do what? - Anticipating Temporal Occurrences of Activities. CoRR abs/1804.00892 (2018) - [i7]Martin Garbade, Johann Sawatzky, Alexander Richard, Juergen Gall:
Two Stream 3D Semantic Scene Completion. CoRR abs/1804.03550 (2018) - [i6]Alexander Richard, Hilde Kuehne, Ahsan Iqbal, Juergen Gall:
NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning. CoRR abs/1805.06875 (2018) - 2017
- [j2]Alexander Richard, Juergen Gall:
A bag-of-words equivalent recurrent neural network for action recognition. Comput. Vis. Image Underst. 156: 79-91 (2017) - [j1]Hilde Kuehne
, Alexander Richard, Juergen Gall:
Weakly supervised learning of actions from transcripts. Comput. Vis. Image Underst. 163: 78-89 (2017) - [c8]Alexander Richard, Hilde Kuehne
, Juergen Gall:
Weakly Supervised Action Learning with RNN Based Fine-to-Coarse Modeling. CVPR 2017: 1273-1282 - [c7]Ahsan Iqbal, Alexander Richard, Hilde Kuehne
, Juergen Gall:
Recurrent Residual Learning for Action Recognition. GCPR 2017: 126-137 - [i5]Alexander Richard, Juergen Gall:
A Bag-of-Words Equivalent Recurrent Neural Network for Action Recognition. CoRR abs/1703.08089 (2017) - [i4]Alexander Richard, Hilde Kuehne, Juergen Gall:
Weakly Supervised Action Learning with RNN based Fine-to-coarse Modeling. CoRR abs/1703.08132 (2017) - [i3]Alexander Richard, Hilde Kuehne, Juergen Gall:
Temporal Action Labeling using Action Sets. CoRR abs/1706.00699 (2017) - [i2]Ahsan Iqbal, Alexander Richard, Hilde Kuehne, Juergen Gall:
Recurrent Residual Learning for Action Recognition. CoRR abs/1706.08807 (2017) - 2016
- [c6]Alexander Richard, Juergen Gall:
Temporal Action Detection Using a Statistical Language Model. CVPR 2016: 3131-3140 - [i1]Hilde Kuehne, Alexander Richard, Juergen Gall:
Weakly supervised learning of actions from transcripts. CoRR abs/1610.02237 (2016) - 2015
- [c5]Alexander Richard, Juergen Gall:
A BoW-equivalent Recurrent Neural Network for Action Recognition. BMVC 2015: 57.1-57.13 - 2014
- [c4]Simon Wiesler, Alexander Richard, Ralf Schlüter
, Hermann Ney:
Mean-normalized stochastic gradient for large-scale deep learning. ICASSP 2014: 180-184 - [c3]Simon Wiesler, Alexander Richard, Pavel Golik
, Ralf Schlüter
, Hermann Ney:
RASR/NN: The RWTH neural network toolkit for speech recognition. ICASSP 2014: 3281-3285 - 2013
- [c2]Simon Wiesler, Alexander Richard, Ralf Schlüter
, Hermann Ney:
A critical evaluation of stochastic algorithms for convex optimization. ICASSP 2013: 6955-6959 - 2011
- [c1]Simon Wiesler, Alexander Richard, Yotaro Kubo, Ralf Schlüter
, Hermann Ney:
Feature selection for log-linear acoustic models. ICASSP 2011: 5324-5327
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-04-09 21:26 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint