Shogo Seki
CyberAgent, Inc.
Verified email at cyberagent.co.jp
Title
Cited by
Year
iSTFTNet: Fast and lightweight mel-spectrogram vocoder incorporating inverse short-time Fourier transform
T Kaneko, K Tanaka, H Kameoka, S Seki
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 73; Year: 2022
Generalized multichannel variational autoencoder for underdetermined source separation
S Seki, H Kameoka, L Li, T Toda, K Takeda
2019 27th European Signal Processing Conference (EUSIPCO), 1-5, 2019
Cited by: 27; Year: 2019
Joint separation and dereverberation of reverberant mixtures with multichannel variational autoencoder
S Inoue, H Kameoka, L Li, S Seki, S Makino
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 27; Year: 2019
Underdetermined source separation based on generalized multichannel variational autoencoder
S Seki, H Kameoka, L Li, T Toda, K Takeda
IEEE Access 7, 168104-168115, 2019
Cited by: 25; Year: 2019
VoiceGrad: Non-parallel any-to-many voice conversion with annealed Langevin dynamics
H Kameoka, T Kaneko, K Tanaka, N Hojo, S Seki
arXiv preprint arXiv:2010.02977, 2020
Cited by: 16; Year: 2020
Wave-U-Net Discriminator: Fast and lightweight discriminator for generative adversarial network-based speech synthesis
T Kaneko, H Kameoka, K Tanaka, S Seki
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 8; Year: 2023
Distilling sequence-to-sequence voice conversion models for streaming conversion applications
K Tanaka, H Kameoka, T Kaneko, S Seki
2022 IEEE Spoken Language Technology Workshop (SLT), 1022-1028, 2023
Cited by: 6; Year: 2023
MISRNet: Lightweight Neural Vocoder Using Multi-Input Single Shared Residual Blocks
T Kaneko, H Kameoka, K Tanaka, S Seki
INTERSPEECH, 1631-1635, 2022
Cited by: 5; Year: 2022
Non-parallel whisper-to-normal speaking style conversion using auxiliary classifier variational autoencoder
S Seki, H Kameoka, T Kaneko, K Tanaka
IEEE Access 11, 44590-44599, 2023
Cited by: 4; Year: 2023
Self-produced speech enhancement and suppression method using air- and body-conductive microphones
M Takada, S Seki, T Toda
2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018
Cited by: 4; Year: 2018
VoiceGrad: Non-Parallel Any-to-Many Voice Conversion with Annealed Langevin Dynamics
H Kameoka, T Kaneko, K Tanaka, N Hojo, S Seki
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
Cited by: 3; Year: 2024
iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN
T Kaneko, H Kameoka, K Tanaka, S Seki
arXiv preprint arXiv:2308.07117, 2023
Cited by: 3; Year: 2023
Semi-Supervised Self-Produced Speech Enhancement and Suppression Based on Joint Source Modeling of Air- and Body-Conducted Signals Using Variational Autoencoder
S Seki, M Takada, T Toda
INTERSPEECH, 4039-4043, 2020
Cited by: 3; Year: 2020
Robust Example Search Using Bottleneck Features for Example-Based Speech Enhancement
A Ogawa, S Seki, K Kinoshita, M Delcroix, T Yoshioka, T Nakatani, ...
INTERSPEECH, 3733-3737, 2016
Cited by: 3; Year: 2016
Investigation and comparison of optimization methods for variational autoencoder-based underdetermined multichannel source separation
S Seki, H Kameoka, L Li
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 2; Year: 2022
CAUSE: Crossmodal Action Unit Sequence Estimation from Speech
H Kameoka, T Kaneko, S Seki, K Tanaka
Interspeech, 506-510, 2022
Cited by: 2; Year: 2022
Remixed2Remixed: Domain adaptation for speech enhancement by Noise2Noise learning with Remixing
L Li, S Seki
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
Cited by: 1; Year: 2024
JSV-VC: Jointly trained speaker verification and voice conversion models
S Seki, H Kameoka, K Tanaka, T Kaneko
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 1; Year: 2023
AttentionPIT: Soft Permutation Invariant Training for Audio Source Separation with Attention Mechanism
H Kameoka, S Seki, L Li, C Watanabe
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 1; Year: 2022
HBP: An efficient block permutation solver using Hungarian algorithm and spectrogram inpainting for multichannel audio source separation
L Li, H Kameoka, S Seki
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 1; Year: 2022