default search action
Shota Horiguchi
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c32]Shota Horiguchi, Kota Dohi, Yohei Kawaguchi:
Streaming Active Learning for Regression Problems Using Regression via Classification. ICASSP 2024: 4955-4959 - [i37]Hiroyuki Namba, Shota Horiguchi, Masaki Hamamoto, Masashi Egi:
Thresholding Data Shapley for Data Cleansing Using Multi-Armed Bandits. CoRR abs/2402.08209 (2024) - [i36]Atsushi Ando, Takafumi Moriya, Shota Horiguchi, Ryo Masumura:
Factor-Conditioned Speaking-Style Captioning. CoRR abs/2406.18910 (2024) - [i35]Hiroshi Sato, Takafumi Moriya, Masato Mimura, Shota Horiguchi, Tsubasa Ochiai, Takanori Ashihara, Atsushi Ando, Kentaro Shinayama, Marc Delcroix:
SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling. CoRR abs/2407.01857 (2024) - [i34]Shota Horiguchi, Atsushi Ando, Takafumi Moriya, Takanori Ashihara, Hiroshi Sato, Naohiro Tawara, Marc Delcroix:
Recursive Attentive Pooling for Extracting Speaker Embeddings from Multi-Speaker Recordings. CoRR abs/2408.17142 (2024) - [i33]Takafumi Moriya, Shota Horiguchi, Marc Delcroix, Ryo Masumura, Takanori Ashihara, Hiroshi Sato, Kohei Matsuura, Masato Mimura:
Alignment-Free Training for Transducer-based Multi-Talker ASR. CoRR abs/2409.20301 (2024) - [i32]Alexis Plaquet, Naohiro Tawara, Marc Delcroix, Shota Horiguchi, Atsushi Ando, Shoko Araki:
Mamba-based Segmentation Model for Speaker Diarization. CoRR abs/2410.06459 (2024) - [i31]Takanori Ashihara, Takafumi Moriya, Shota Horiguchi, Junyi Peng, Tsubasa Ochiai, Marc Delcroix, Kohei Matsuura, Hiroshi Sato:
Investigation of Speaker Representation for Target-Speaker Speech Processing. CoRR abs/2410.11243 (2024) - [i30]Shota Horiguchi, Takafumi Moriya, Atsushi Ando, Takanori Ashihara, Hiroshi Sato, Naohiro Tawara, Marc Delcroix:
Guided Speaker Embedding. CoRR abs/2410.12182 (2024) - 2023
- [j4]Shota Horiguchi, Shinji Watanabe, Paola García, Yuki Takashima, Yohei Kawaguchi:
Online Neural Diarization of Unlimited Numbers of Speakers Using Global and Local Attractors. IEEE ACM Trans. Audio Speech Lang. Process. 31: 706-720 (2023) - [c31]Tuan Vu Ho, Shota Horiguchi, Shinji Watanabe, Paola García, Takashi Sumiyoshi:
Synthetic Data Augmentation for ASR with Domain Filtering. APSIPA ASC 2023: 1760-1765 - [c30]Yuki Okamoto, Kanta Shimonishi, Keisuke Imoto, Kota Dohi, Shota Horiguchi, Yohei Kawaguchi:
CAPTDURE: Captioned Sound Dataset of Single Sources. INTERSPEECH 2023: 1683-1687 - [c29]Aoi Ito, Shota Horiguchi:
Spoofing Attacker Also Benefits from Self-Supervised Pretrained Model. INTERSPEECH 2023: 5346-5350 - [i29]Aoi Ito, Shota Horiguchi:
Spoofing Attacker Also Benefits from Self-Supervised Pretrained Model. CoRR abs/2305.15518 (2023) - [i28]Yuki Okamoto, Kanta Shimonishi, Keisuke Imoto, Kota Dohi, Shota Horiguchi, Yohei Kawaguchi:
CAPTDURE: Captioned Sound Dataset of Single Sources. CoRR abs/2305.17758 (2023) - [i27]Shota Horiguchi, Kota Dohi, Yohei Kawaguchi:
Streaming Active Learning for Regression Problems Using Regression via Classification. CoRR abs/2309.01013 (2023) - 2022
- [j3]Shota Horiguchi, Yusuke Fujita, Shinji Watanabe, Yawen Xue, Paola García:
Encoder-Decoder Based Attractors for End-to-End Neural Diarization. IEEE ACM Trans. Audio Speech Lang. Process. 30: 1493-1507 (2022) - [c28]Yuki Okamoto, Shota Horiguchi, Masaaki Yamamoto, Keisuke Imoto, Yohei Kawaguchi:
Environmental Sound Extraction Using Onomatopoeic Words. ICASSP 2022: 221-225 - [c27]Shota Horiguchi, Yuki Takashima, Paola García, Shinji Watanabe, Yohei Kawaguchi:
Multi-Channel End-To-End Neural Diarization with Distributed Microphones. ICASSP 2022: 7332-7336 - [c26]Terufumi Morishita, Gaku Morio, Shota Horiguchi, Hiroaki Ozaki, Nobuo Nukaga:
Rethinking Fano's Inequality in Ensemble Learning. ICML 2022: 15976-16016 - [c25]Yuki Takashima, Shota Horiguchi, Shinji Watanabe, Leibny Paola García-Perera, Yohei Kawaguchi:
Updating Only Encoders Prevents Catastrophic Forgetting of End-to-End ASR Models. INTERSPEECH 2022: 2218-2222 - [c24]Natsuo Yamashita, Shota Horiguchi, Takeshi Homma:
Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization. Odyssey 2022: 133-140 - [c23]Shota Horiguchi, Yuki Takashima, Shinji Watanabe, Paola García:
Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization. SLT 2022: 620-625 - [i26]Natsuo Yamashita, Shota Horiguchi, Takeshi Homma:
Improving the Naturalness of Simulated Conversations for End-to-End Neural Diarization. CoRR abs/2204.11232 (2022) - [i25]Terufumi Morishita, Gaku Morio, Shota Horiguchi, Hiroaki Ozaki, Nobuo Nukaga:
Rethinking Fano's Inequality in Ensemble Learning. CoRR abs/2205.12683 (2022) - [i24]Shota Horiguchi, Shinji Watanabe, Paola García, Yuki Takashima, Yohei Kawaguchi:
Online Neural Diarization of Unlimited Numbers of Speakers. CoRR abs/2206.02432 (2022) - [i23]Shota Horiguchi, Yuki Takashima, Shinji Watanabe, Paola García:
Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization. CoRR abs/2210.03459 (2022) - 2021
- [c22]Shota Horiguchi, Shinji Watanabe, Paola García, Yawen Xue, Yuki Takashima, Yohei Kawaguchi:
Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors. ASRU 2021: 98-105 - [c21]Shota Horiguchi, Paola García, Yusuke Fujita, Shinji Watanabe, Kenji Nagamatsu:
End-To-End Speaker Diarization as Post-Processing. ICASSP 2021: 7188-7192 - [c20]Yuki Takashima, Yusuke Fujita, Shota Horiguchi, Shinji Watanabe, Leibny Paola García-Perera, Kenji Nagamatsu:
Semi-Supervised Training with Pseudo-Labeling for End-To-End Neural Diarization. Interspeech 2021: 3096-3100 - [c19]Yawen Xue, Shota Horiguchi, Yusuke Fujita, Yuki Takashima, Shinji Watanabe, Leibny Paola García-Perera, Kenji Nagamatsu:
Online Streaming End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers. Interspeech 2021: 3116-3120 - [c18]Shota Horiguchi, Yusuke Fujita, Kenji Nagamatsu:
Block-Online Guided Source Separation. SLT 2021: 236-242 - [c17]Yawen Xue, Shota Horiguchi, Yusuke Fujita, Shinji Watanabe, Paola García, Kenji Nagamatsu:
Online End-To-End Neural Diarization with Speaker-Tracing Buffer. SLT 2021: 841-848 - [c16]Yuki Takashima, Yusuke Fujita, Shinji Watanabe, Shota Horiguchi, Paola García, Kenji Nagamatsu:
End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection. SLT 2021: 849-856 - [i22]Yawen Xue, Shota Horiguchi, Yusuke Fujita, Yuki Takashima, Shinji Watanabe, Paola García, Kenji Nagamatsu:
Online End-to-End Neural Diarization Handling Overlapping Speech and Flexible Numbers of Speakers. CoRR abs/2101.08473 (2021) - [i21]Shota Horiguchi, Nelson Yalta, Paola García, Yuki Takashima, Yawen Xue, Desh Raj, Zili Huang, Yusuke Fujita, Shinji Watanabe, Sanjeev Khudanpur:
The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap. CoRR abs/2102.01363 (2021) - [i20]Yuki Takashima, Yusuke Fujita, Shinji Watanabe, Shota Horiguchi, Paola García, Kenji Nagamatsu:
End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection. CoRR abs/2106.04078 (2021) - [i19]Yuki Takashima, Yusuke Fujita, Shota Horiguchi, Shinji Watanabe, Paola García, Kenji Nagamatsu:
Semi-Supervised Training with Pseudo-Labeling for End-to-End Neural Diarization. CoRR abs/2106.04764 (2021) - [i18]Shota Horiguchi, Yusuke Fujita, Shinji Watanabe, Yawen Xue, Paola García:
Encoder-Decoder Based Attractor Calculation for End-to-End Neural Diarization. CoRR abs/2106.10654 (2021) - [i17]Shota Horiguchi, Shinji Watanabe, Paola García, Yawen Xue, Yuki Takashima, Yohei Kawaguchi:
Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors. CoRR abs/2107.01545 (2021) - [i16]Shota Horiguchi, Yuki Takashima, Paola García, Shinji Watanabe, Yohei Kawaguchi:
Multi-Channel End-to-End Neural Diarization with Distributed Microphones. CoRR abs/2110.04694 (2021) - [i15]Yuki Okamoto, Shota Horiguchi, Masaaki Yamamoto, Keisuke Imoto, Yohei Kawaguchi:
Environmental Sound Extraction Using Onomatopoeia. CoRR abs/2112.00209 (2021) - 2020
- [j2]Shota Horiguchi, Daiki Ikami, Kiyoharu Aizawa:
Significance of Softmax-Based Features in Comparison to Distance Metric Learning-Based Features. IEEE Trans. Pattern Anal. Mach. Intell. 42(5): 1279-1285 (2020) - [c15]Koichiro Ito, Quan Kong, Shota Horiguchi, Takashi Sumiyoshi, Kenji Nagamatsu:
Anticipating the Start of User Interaction for Service Robot in the Wild. ICRA 2020: 9687-9693 - [c14]Shota Horiguchi, Yusuke Fujita, Shinji Watanabe, Yawen Xue, Kenji Nagamatsu:
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors. INTERSPEECH 2020: 269-273 - [c13]Shota Horiguchi, Yusuke Fujita, Kenji Nagamatsu:
Utterance-Wise Meeting Transcription System Using Asynchronous Distributed Microphones. INTERSPEECH 2020: 344-348 - [c12]Terufumi Morishita, Gaku Morio, Shota Horiguchi, Hiroaki Ozaki, Toshinori Miyoshi:
Hitachi at SemEval-2020 Task 8: Simple but Effective Modality Ensemble for Meme Emotion Recognition. SemEval@COLING 2020: 1126-1134 - [i14]Yusuke Fujita, Shinji Watanabe, Shota Horiguchi, Yawen Xue, Kenji Nagamatsu:
End-to-End Neural Diarization: Reformulating Speaker Diarization as Simple Multi-label Classification. CoRR abs/2003.02966 (2020) - [i13]Shota Horiguchi, Yusuke Fujita, Shinji Watanabe, Yawen Xue, Kenji Nagamatsu:
End-to-End Speaker Diarization for an Unknown Number of Speakers with Encoder-Decoder Based Attractors. CoRR abs/2005.09921 (2020) - [i12]Yusuke Fujita, Shinji Watanabe, Shota Horiguchi, Yawen Xue, Jing Shi, Kenji Nagamatsu:
Neural Speaker Diarization with Speaker-Wise Chain Rule. CoRR abs/2006.01796 (2020) - [i11]Yawen Xue, Shota Horiguchi, Yusuke Fujita, Shinji Watanabe, Kenji Nagamatsu:
Online End-to-End Neural Diarization with Speaker-Tracing Buffer. CoRR abs/2006.02616 (2020) - [i10]Shota Horiguchi, Yusuke Fujita, Kenji Nagamatsu:
Utterance-Wise Meeting Transcription System Using Asynchronous Distributed Microphones. CoRR abs/2007.15868 (2020) - [i9]Shota Horiguchi, Yusuke Fujita, Kenji Nagamatsu:
Block-Online Guided Source Separation. CoRR abs/2011.07791 (2020) - [i8]Shota Horiguchi, Paola García, Yusuke Fujita, Shinji Watanabe, Kenji Nagamatsu:
End-to-End Speaker Diarization as Post-Processing. CoRR abs/2012.10055 (2020)
2010 – 2019
- 2019
- [c11]Naoyuki Kanda, Shota Horiguchi, Yusuke Fujita, Yawen Xue, Kenji Nagamatsu, Shinji Watanabe:
Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models. ASRU 2019: 31-38 - [c10]Yusuke Fujita, Naoyuki Kanda, Shota Horiguchi, Yawen Xue, Kenji Nagamatsu, Shinji Watanabe:
End-to-End Neural Speaker Diarization with Self-Attention. ASRU 2019: 296-303 - [c9]Naoyuki Kanda, Yusuke Fujita, Shota Horiguchi, Rintaro Ikeshita, Kenji Nagamatsu, Shinji Watanabe:
Acoustic Modeling for Distant Multi-talker Speech Recognition with Single- and Multi-channel Branches. ICASSP 2019: 6630-6634 - [c8]Naoyuki Kanda, Shota Horiguchi, Ryoichi Takashima, Yusuke Fujita, Kenji Nagamatsu, Shinji Watanabe:
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition. INTERSPEECH 2019: 236-240 - [c7]Naoyuki Kanda, Christoph Böddeker, Jens Heitkaemper, Yusuke Fujita, Shota Horiguchi, Kenji Nagamatsu, Reinhold Haeb-Umbach:
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR. INTERSPEECH 2019: 1248-1252 - [c6]Shota Horiguchi, Naoyuki Kanda, Kenji Nagamatsu:
Multimodal Response Obligation Detection with Unsupervised Online Domain Adaptation. INTERSPEECH 2019: 4180-4184 - [c5]Yusuke Fujita, Naoyuki Kanda, Shota Horiguchi, Kenji Nagamatsu, Shinji Watanabe:
End-to-End Neural Speaker Diarization with Permutation-Free Objectives. INTERSPEECH 2019: 4300-4304 - [c4]Masato Tamura, Shota Horiguchi, Tomokazu Murakami:
Omnidirectional Pedestrian Detection by Rotation Invariant Training. WACV 2019: 1989-1998 - [i7]Naoyuki Kanda, Christoph Böddeker, Jens Heitkaemper, Yusuke Fujita, Shota Horiguchi, Kenji Nagamatsu, Reinhold Haeb-Umbach:
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR. CoRR abs/1905.12230 (2019) - [i6]Naoyuki Kanda, Shota Horiguchi, Ryoichi Takashima, Yusuke Fujita, Kenji Nagamatsu, Shinji Watanabe:
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition. CoRR abs/1906.10876 (2019) - [i5]Yusuke Fujita, Naoyuki Kanda, Shota Horiguchi, Kenji Nagamatsu, Shinji Watanabe:
End-to-End Neural Speaker Diarization with Permutation-Free Objectives. CoRR abs/1909.05952 (2019) - [i4]Yusuke Fujita, Naoyuki Kanda, Shota Horiguchi, Yawen Xue, Kenji Nagamatsu, Shinji Watanabe:
End-to-End Neural Speaker Diarization with Self-attention. CoRR abs/1909.06247 (2019) - [i3]Naoyuki Kanda, Shota Horiguchi, Yusuke Fujita, Yawen Xue, Kenji Nagamatsu, Shinji Watanabe:
Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models. CoRR abs/1909.08103 (2019) - 2018
- [j1]Shota Horiguchi, Sosuke Amano, Makoto Ogawa, Kiyoharu Aizawa:
Personalized Classifier for Food Image Recognition. IEEE Trans. Multim. 20(10): 2836-2848 (2018) - [c3]Shota Horiguchi, Naoyuki Kanda, Kenji Nagamatsu:
Face-Voice Matching using Cross-modal Embeddings. ACM Multimedia 2018: 1011-1019 - [i2]Shota Horiguchi, Sosuke Amano, Makoto Ogawa, Kiyoharu Aizawa:
Personalized Classifier for Food Image Recognition. CoRR abs/1804.04600 (2018) - 2017
- [i1]Shota Horiguchi, Daiki Ikami, Kiyoharu Aizawa:
Significance of Softmax-based Features in Comparison to Distance Metric Learning-based Features. CoRR abs/1712.10151 (2017) - 2016
- [c2]Shota Horiguchi, Kiyoharu Aizawa, Makoto Ogawa:
The log-normal distribution of the size of objects in daily meal images and its application to the efficient reduction of object proposals. ICIP 2016: 3668-3672 - [c1]Sosuke Amano, Shota Horiguchi, Kiyoharu Aizawa, Kazuki Maeda, Masanori Kubota, Makoto Ogawa:
Food Search Based on User Feedback to Assist Image-based Food Recording Systems. MADiMa @ ACM Multimedia 2016: 71-75
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-12-01 01:14 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint