Daiki Takeuchi
2020 – today
- 2024
- [j4] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  Masked Modeling Duo: Towards a Universal Audio Pre-Training Framework. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2391-2406 (2024)
- [c20] Bo He, Shiqi Zhang, Xianrui Wang, Zheng Qiu, Daiki Takeuchi, Daisuke Niizumi, Noboru Harada, Shoji Makino:
  Light Gated Multi Mini-Patch Extractor for Audio Classification. ICASSP Workshops 2024: 765-769
- [c19] Shiqi Zhang, Zheng Qiu, Daiki Takeuchi, Noboru Harada, Shoji Makino:
  Unrestricted Global Phase Bias-Aware Single-Channel Speech Enhancement with Conformer-Based Metric GAN. ICASSP 2024: 1026-1030
- [i23] Shiqi Zhang, Zheng Qiu, Daiki Takeuchi, Noboru Harada, Shoji Makino:
  Unrestricted Global Phase Bias-Aware Single-channel Speech Enhancement with Conformer-based Metric GAN. CoRR abs/2402.08252 (2024)
- [i22] Shunsuke Tsubaki, Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Keisuke Imoto:
  Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval. CoRR abs/2403.10756 (2024)
- [i21] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  Masked Modeling Duo: Towards a Universal Audio Pre-training Framework. CoRR abs/2404.06095 (2024)
- [i20] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  Exploring Pre-trained General-purpose Audio Representations for Heart Murmur Detection. CoRR abs/2404.17107 (2024)
- [i19] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Masahiro Yasuda, Shunsuke Tsubaki, Keisuke Imoto:
  M2D-CLAP: Masked Modeling Duo Meets CLAP for Learning General-purpose Audio-Language Representation. CoRR abs/2406.02032 (2024)
- 2023
- [j3] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  BYOL for Audio: Exploring Pre-Trained General-Purpose Audio Representations. IEEE ACM Trans. Audio Speech Lang. Process. 31: 137-151 (2023)
- [c18] Haoran Xing, Shiqi Zhang, Daiki Takeuchi, Daisuke Niizumi, Noboru Harada, Shoji Makino:
  Enhancing Spectrogram for Audio Classification Using Time-Frequency Enhancer. APSIPA ASC 2023: 1155-1160
- [c17] Ami Igarashi, Shunsuke Tsubaki, Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Keisuke Imoto:
  Joint Analysis of Acoustic Scenes and Sound Events Based on Semi-Supervised Approach. APSIPA ASC 2023: 2074-2080
- [c16] Noboru Harada, Daisuke Niizumi, Yasunori Ohishi, Daiki Takeuchi, Masahiro Yasuda:
  First-Shot Anomaly Sound Detection for Machine Condition Monitoring: A Domain Generalization Baseline. EUSIPCO 2023: 191-195
- [c15] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input. ICASSP 2023: 1-5
- [c14] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  Masked Modeling Duo for Speech: Specializing General-Purpose Audio Representation to Speech using Denoising Distillation. INTERSPEECH 2023: 1294-1298
- [i18] Noboru Harada, Daisuke Niizumi, Yasunori Ohishi, Daiki Takeuchi, Masahiro Yasuda:
  First-shot anomaly sound detection for machine condition monitoring: A domain generalization baseline. CoRR abs/2303.00455 (2023)
- [i17] Kenji Ishikawa, Daiki Takeuchi, Noboru Harada, Takehiro Moriya:
  Deep sound-field denoiser: optically-measured sound-field denoising using deep neural network. CoRR abs/2304.14923 (2023)
- [i16] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  Masked Modeling Duo for Speech: Specializing General-Purpose Audio Representation to Speech using Denoising Distillation. CoRR abs/2305.14079 (2023)
- [i15] Daiki Takeuchi, Yasunori Ohishi, Daisuke Niizumi, Noboru Harada, Kunio Kashino:
  Audio Difference Captioning Utilizing Similarity-Discrepancy Disentanglement. CoRR abs/2308.11923 (2023)
- 2022
- [c13] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model. EUSIPCO 2022: 200-204
- [c12] Daiki Takeuchi, Yasunori Ohishi, Daisuke Niizumi, Noboru Harada, Kunio Kashino:
  Introducing Auxiliary Text Query-modifier to Content-based Audio Retrieval. INTERSPEECH 2022: 4197-4201
- [c11] Yasunori Ohishi, Marc Delcroix, Tsubasa Ochiai, Shoko Araki, Daiki Takeuchi, Daisuke Niizumi, Akisato Kimura, Noboru Harada, Kunio Kashino:
  ConceptBeam: Concept Driven Target Speech Extraction. ACM Multimedia 2022: 4252-4260
- [i14] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  BYOL for Audio: Exploring Pre-trained General-purpose Audio Representations. CoRR abs/2204.07402 (2022)
- [i13] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation. CoRR abs/2204.12260 (2022)
- [i12] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model. CoRR abs/2205.08138 (2022)
- [i11] Daiki Takeuchi, Yasunori Ohishi, Daisuke Niizumi, Noboru Harada, Kunio Kashino:
  Introducing Auxiliary Text Query-modifier to Content-based Audio Retrieval. CoRR abs/2207.09732 (2022)
- [i10] Yasunori Ohishi, Marc Delcroix, Tsubasa Ochiai, Shoko Araki, Daiki Takeuchi, Daisuke Niizumi, Akisato Kimura, Noboru Harada, Kunio Kashino:
  ConceptBeam: Concept Driven Target Speech Extraction. CoRR abs/2207.11964 (2022)
- [i9] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input. CoRR abs/2210.14648 (2022)
- 2021
- [c10] Noboru Harada, Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Masahiro Yasuda, Shoichiro Saito:
  ToyADMOS2: Another Dataset of Miniature-Machine Operating Sounds for Anomalous Sound Detection under Domain Shift Conditions. DCASE 2021: 1-5
- [c9] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation. HEAR@NeurIPS 2021: 1-24
- [c8] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation. IJCNN 2021: 1-8
- [i8] Daisuke Niizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation. CoRR abs/2103.06695 (2021)
- 2020
- [c7] Daiki Takeuchi, Yuma Koizumi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  Effects of Word-Frequency Based Pre- and Post- Processings for Audio Captioning. DCASE 2020: 190-194
- [c6] Yuma Koizumi, Kohei Yatabe, Marc Delcroix, Yoshiki Masuyama, Daiki Takeuchi:
  Speech Enhancement Using Self-Adaptation and Multi-Head Self-Attention. ICASSP 2020: 181-185
- [c5] Daiki Takeuchi, Kohei Yatabe, Yuma Koizumi, Yasuhiro Oikawa, Noboru Harada:
  Real-Time Speech Enhancement Using Equilibriated RNN. ICASSP 2020: 851-855
- [c4] Daiki Takeuchi, Kohei Yatabe, Yuma Koizumi, Yasuhiro Oikawa, Noboru Harada:
  Invertible DNN-Based Nonlinear Time-Frequency Transform for Speech Enhancement. ICASSP 2020: 6644-6648
- [i7] Daiki Takeuchi, Kohei Yatabe, Yuma Koizumi, Yasuhiro Oikawa, Noboru Harada:
  Real-time speech enhancement using equilibriated RNN. CoRR abs/2002.05843 (2020)
- [i6] Yuma Koizumi, Kohei Yatabe, Marc Delcroix, Yoshiki Masuyama, Daiki Takeuchi:
  Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention. CoRR abs/2002.05873 (2020)
- [i5] Yuma Koizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation. CoRR abs/2007.00225 (2020)
- [i4] Daiki Takeuchi, Yuma Koizumi, Yasunori Ohishi, Noboru Harada, Kunio Kashino:
  Effects of Word-frequency based Pre- and Post- Processings for Audio Captioning. CoRR abs/2009.11436 (2020)
- [i3] Yuma Koizumi, Yasunori Ohishi, Daisuke Niizumi, Daiki Takeuchi, Masahiro Yasuda:
  Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval. CoRR abs/2012.07331 (2020)
2010 – 2019
- 2019
- [c3] Daiki Takeuchi, Kohei Yatabe, Yuma Koizumi, Yasuhiro Oikawa, Noboru Harada:
  Data-driven Design of Perfect Reconstruction Filterbank for DNN-based Sound Source Enhancement. ICASSP 2019: 596-600
- [i2] Daiki Takeuchi, Kohei Yatabe, Yuma Koizumi, Yasuhiro Oikawa, Noboru Harada:
  Data-driven design of perfect reconstruction filterbank for DNN-based sound source enhancement. CoRR abs/1903.08876 (2019)
- [i1] Daiki Takeuchi, Kohei Yatabe, Yuma Koizumi, Yasuhiro Oikawa, Noboru Harada:
  Invertible DNN-based nonlinear time-frequency transform for speech enhancement. CoRR abs/1911.10764 (2019)
- 2018
- [c2] Daiki Takeuchi, Kohei Yatabe, Yasuhiro Oikawa:
  Realizing Directional Sound Source in FDTD Method by Estimating Initial Value. ICASSP 2018: 461-465
- [c1] Kenji Kobayashi, Daiki Takeuchi, Mio Iwamoto, Kohei Yatabe, Yasuhiro Oikawa:
  Parametric Approximation of Piano Sound Based on Kautz Model with Sparse Linear Prediction. ICASSP 2018: 626-630
- 2014
- [j2] Daiki Takeuchi, Wataru Chujo, Shin-ichi Yamamoto, Yahei Koyamada:
  Coherent synthesis of two continuous microwave signals generated by two optical beats. IEICE Electron. Express 11(10): 20140209 (2014)
- 2011
- [j1] Daiki Takeuchi, Wataru Chujo, Shin-ichi Yamamoto, Yahei Koyamada:
  Phase Control and Calibration Characteristics of Optically Controlled Phased Array Antenna Feed Using Multiple SMFs. IEICE Trans. Electron. 94-C(10): 1634-1640 (2011)
last updated on 2024-09-05 23:41 CEST by the dblp team
all metadata released as open data under CC0 1.0 license