default search action
Takashi Masuko
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [i1]Kento Nozawa, Takashi Masuko, Toru Taniguchi:
Enhancing Large Language Model-based Speech Recognition by Contextualization for Rare and Ambiguous Words. CoRR abs/2408.08027 (2024)
2010 – 2019
- 2018
- [c47]Daichi Hayakawa, Takashi Masuko, Hiroshi Fujimura:
Applying Complex-Valued Neural Networks to Acoustic Modeling for Speech Recognition. APSIPA 2018: 1725-1731 - [c46]Hiroshi Fujimura, Manabu Nagao, Takashi Masuko:
Simultaneous Speech Recognition and Acoustic Event Detection Using an LSTM-CTC Acoustic Model and a WFST Decoder. ICASSP 2018: 5834-5838 - 2017
- [c45]Takashi Masuko:
Computational cost reduction of long short-term memory based on simultaneous compression of input and hidden state. ASRU 2017: 126-133 - 2013
- [c44]Hiroshi Fujimura, Yusuke Shinohara, Takashi Masuko:
N-best rescoring by phoneme classifiers using subclass adaboost algorithm. INTERSPEECH 2013: 3327-3331 - 2011
- [c43]Hiroshi Fujimura, Masanobu Nakamura, Yusuke Shinohara, Takashi Masuko:
N-Best rescoring by adaboost phoneme classifiers for isolated word recognition. ASRU 2011: 83-88 - 2010
- [c42]Yusuke Shinohara, Takashi Masuko, Masami Akamine:
Covariance clustering on Riemannian manifolds for acoustic model compression. ICASSP 2010: 4326-4329 - [c41]Hiroshi Fujimura, Takashi Masuko, Mitsuyoshi Tachimori:
A duration modeling technique with incremental speech rate normalization. INTERSPEECH 2010: 2962-2965
2000 – 2009
- 2009
- [c40]Kei Hashimoto, Heiga Zen, Yoshihiko Nankaku, Takashi Masuko, Keiichi Tokuda:
A Bayesian approach to HMM-based speech synthesis. ICASSP 2009: 4029-4032 - [c39]Yusuke Kida, Masaru Sakai, Takashi Masuko, Akinori Kawamura:
Robust F0 estimation based on log-time scale autocorrelation and its application to Mandarin tone recognition. INTERSPEECH 2009: 2971-2974 - 2008
- [c38]Yusuke Shinohara, Takashi Masuko, Masami Akamine:
Feature enhancement by speaker-normalized splice for robust speech recognition. ICASSP 2008: 4881-4884 - 2007
- [j13]Heiga Zen, Takashi Masuko, Keiichi Tokuda, Takayoshi Yoshimura, Takao Kobayashi, Tadashi Kitamura:
State Duration Modeling for HMM-Based Speech Synthesis. IEICE Trans. Inf. Syst. 90-D(3): 692-693 (2007) - [j12]Heiga Zen, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:
A Hidden Semi-Markov Model-Based Speech Synthesis System. IEICE Trans. Inf. Syst. 90-D(5): 825-834 (2007) - [j11]Takashi Nose, Junichi Yamagishi, Takashi Masuko, Takao Kobayashi:
A Style Control Technique for HMM-Based Expressive Speech Synthesis. IEICE Trans. Inf. Syst. 90-D(9): 1406-1413 (2007) - [c37]Heiga Zen, Takashi Nose, Junichi Yamagishi, Shinji Sako, Takashi Masuko, Alan W. Black, Keiichi Tokuda:
The HMM-based speech synthesis system (HTS) version 2.0. SSW 2007: 294-299 - 2006
- [j10]Makoto Tachibana, Junichi Yamagishi, Takashi Masuko, Takao Kobayashi:
A Style Adaptation Technique for Speech Synthesis Using HSMM and Suprasegmental Features. IEICE Trans. Inf. Syst. 89-D(3): 1092-1099 (2006) - [j9]Takashi Masuko, Takao Kobayashi, Keiichi Tokuda:
Very low bit rate speech coding based on HMM with speaker adaptation. Syst. Comput. Jpn. 37(2): 67-78 (2006) - [c36]Masahide Ariu, Takashi Masuko, Shinichi Tanaka, Akinori Kawamura:
Speech Recognition Using Syllable Duration Ratio Model. ICASSP (1) 2006: 341-344 - 2005
- [j8]Junichi Yamagishi, Koji Onishi, Takashi Masuko, Takao Kobayashi:
Acoustic Modeling of Speaking Styles and Emotional Expressions in HMM-Based Speech Synthesis. IEICE Trans. Inf. Syst. 88-D(3): 502-509 (2005) - [j7]Makoto Tachibana, Junichi Yamagishi, Takashi Masuko, Takao Kobayashi:
Speech Synthesis with Various Emotional Expressions and Speaking Styles by Style Interpolation and Morphing. IEICE Trans. Inf. Syst. 88-D(11): 2484-2491 (2005) - [j6]Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:
Incorporating a mixed excitation model and postfilter into HMM-based text-to-speech synthesis. Syst. Comput. Jpn. 36(12): 43-50 (2005) - [c35]Makoto Tachibana, Junichi Yamagishi, Takashi Masuko, Takao Kobayashi:
Performance evaluation of style adaptation for hidden semi-Markov model based speech synthesis. INTERSPEECH 2005: 2805-2808 - 2004
- [j5]Dhany Arifianto, Tomohiro Tanaka, Takashi Masuko, Takao Kobayashi:
Robust F0 Estimation of Speech Signal Using Harmonicity Measure Based on Instantaneous Frequency. IEICE Trans. Inf. Syst. 87-D(12): 2812-2820 (2004) - [j4]Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi:
Speaker adaptation of pitch and spectrum for HMM-based speech synthesis. Syst. Comput. Jpn. 35(11): 59-68 (2004) - [c34]Junichi Yamagishi, Makoto Tachibana, Takashi Masuko, Takao Kobayashi:
Speaking style adaptation using context clustering decision tree for HMM-based speech synthesis. ICASSP (1) 2004: 5-8 - [c33]Junichi Yamagishi, Takashi Masuko, Takao Kobayashi:
MLLR adaptation for hidden semi-Markov model based speech synthesis. INTERSPEECH 2004: 1213-1216 - [c32]Heiga Zen, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:
Hidden semi-Markov model based speech synthesis. INTERSPEECH 2004: 1393-1396 - [c31]Takashi Masuko, Takao Kobayashi, Keisuke Miyanaga:
A style control technique for HMM-based speech synthesis. INTERSPEECH 2004: 1437-1440 - 2003
- [j3]Junichi Yamagishi, Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi:
A Training Method of Average Voice Model for HMM-Based Speech Synthesis. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 86-A(8): 1956-1963 (2003) - [c30]Junichi Yamagishi, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi:
A training method for average voice model based on shared decision tree context clustering and speaker adaptive training. ICASSP (1) 2003: 716-719 - [c29]Takahiro Hoshiya, Shinji Sako, Heiga Zen, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:
Improving the performance of HMM-based very low bit rate speech coding. ICASSP (1) 2003: 800-803 - [c28]Junichi Yamagishi, Koji Onishi, Takashi Masuko, Takao Kobayashi:
Modeling of various speaking styles and emotions for HMM-based speech synthesis. INTERSPEECH 2003: 2461-2464 - 2002
- [j2]Takashi Masuko, Keiichi Tokuda, Noboru Miyazaki, Takao Kobayashi:
Pitch pattern generation using multispace probability distribution HMM. Syst. Comput. Jpn. 33(6): 62-72 (2002) - [c27]Tomohiro Tanaka, Takao Kobayashi, Dhany Arifianto, Takashi Masuko:
Fundamental frequency estimation based on instantaneous frequency amplitude spectrum. ICASSP 2002: 329-332 - [c26]Junichi Yamagishi, Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi:
A context clustering technique for average voice model in HMM-based speech synthesis. INTERSPEECH 2002: 133-136 - [c25]Kengo Shichiri, Atsushi Sawabe, Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:
Eigenvoices for HMM-based speech synthesis. INTERSPEECH 2002: 1269-1272 - 2001
- [j1]Jun Hiroi, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:
Very low bit rate speech coding based on HMMs. Syst. Comput. Jpn. 32(12): 38-46 (2001) - [c24]Chiyomi Miyajima, Yosuke Hattori, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:
Speaker identification using Gaussian mixture models based on multi-space probability distribution. ICASSP 2001: 433-436 - [c23]Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi:
Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR. ICASSP 2001: 805-808 - [c22]Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi:
Text-to-speech synthesis with arbitrary speaker's voice from average voice. INTERSPEECH 2001: 345-348 - [c21]Takayuki Satoh, Takashi Masuko, Takao Kobayashi, Keiichi Tokuda:
A robust speaker verification system against imposture using an HMM-based speech synthesis system. INTERSPEECH 2001: 759-762 - [c20]Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:
Mixed excitation for HMM-based speech synthesis. INTERSPEECH 2001: 2263-2266 - 2000
- [c19]Keiichi Tokuda, Takayoshi Yoshimura, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:
Speech parameter generation algorithms for HMM-based speech synthesis. ICASSP 2000: 1315-1318 - [c18]Shinji Sako, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:
HMM-based text-to-audio-visual speech synthesis. INTERSPEECH 2000: 25-28 - [c17]Takashi Masuko, Keiichi Tokuda, Takao Kobayashi:
Imposture using synthetic speech against speaker verification based on spectrum and pitch. INTERSPEECH 2000: 302-305
1990 – 1999
- 1999
- [c16]Keiichi Tokuda, Takashi Masuko, Noboru Miyazaki, Takao Kobayashi:
Hidden Markov models based on multi-space probability distribution for pitch pattern modeling. ICASSP 1999: 229-232 - [c15]Masatsune Tamura, Shigekazu Kondo, Takashi Masuko, Takao Kobayashi:
Text-to-audio-visual speech synthesis based on parameter generation from HMM. EUROSPEECH 1999: 959-962 - [c14]Takashi Masuko, Takafumi Hitotsumatsu, Keiichi Tokuda, Takao Kobayashi:
On the security of HMM-based speaker verification systems against imposture using synthetic speech. EUROSPEECH 1999: 1223-1226 - [c13]Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:
Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis. EUROSPEECH 1999: 2347-2350 - 1998
- [c12]Masatsune Tamura, Takashi Masuko, Takao Kobayashi, Keiichi Tokuda:
Visual Speech Synthesis Based on Parameter Generation From HMM: Speech-Driven and Text-And-Speech-Driven Approaches. AVSP 1998: 221-224 - [c11]Keiichi Tokuda, Takashi Masuko, Jun Hiroi, Takao Kobayashi, Tadashi Kitamura:
A very low bit rate speech coder using HMM-based speech recognition/synthesis techniques. ICASSP 1998: 609-612 - [c10]Takashi Masuko, Takao Kobayashi, Masatsune Tamura, Jun Masubuchi, Keiichi Tokuda:
Text-to-visual speech synthesis based on parameter generation from HMM. ICASSP 1998: 3745-3748 - [c9]Takashi Masuko, Keiichi Tokuda, Takao Kobayashi:
A very low bit rate speech coder using HMM with speaker adaptation. ICSLP 1998 - [c8]Takayoshi Yoshimura, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:
Duration modeling for HMM-based speech synthesis. ICSLP 1998 - [c7]Masatsune Tamura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi:
Speaker adaptation for HMM-based speech synthesis system using MLLR. SSW 1998: 273-276 - 1997
- [c6]Takashi Masuko, Keiichi Tokuda, Takao Kobayashi, Satoshi Imai:
Voice characteristics conversion for HMM-based speech synthesis system. ICASSP 1997: 1611-1614 - [c5]Takao Kobayashi, Takashi Masuko, Keiichi Tokuda:
HMM compensation for noisy speech recognition based on cepstral parameter generation. EUROSPEECH 1997: 1583-1586 - [c4]Takayoshi Yoshimura, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi, Tadashi Kitamura:
Speaker interpolation in HMM-based speech synthesis system. EUROSPEECH 1997: 2523-2526 - 1996
- [c3]Takashi Masuko, Keiichi Tokuda, Takao Kobayashi, Satoshi Imai:
Speech synthesis using HMMs with dynamic features. ICASSP 1996: 389-392 - 1995
- [c2]Keiichi Tokuda, Takashi Masuko, Tetsuya Yamada, Takao Kobayashi, Satoshi Imai:
An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features. EUROSPEECH 1995: 757-760 - 1994
- [c1]Keiichi Tokuda, Takao Kobayashi, Takashi Masuko, Satoshi Imai:
Mel-generalized cepstral analysis - a unified approach to speech spectral estimation. ICSLP 1994: 1043-1046
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 21:19 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint