William Chan
Ideogram
Verified email at ideogram.ai
Title
Cited by
Year
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding
C Saharia, W Chan, S Saxena, L Li, J Whang, E Denton, ...
NeurIPS, 2022
Cited by: 5023; Year: 2022
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
DS Park, W Chan, Y Zhang, CC Chiu, B Zoph, ED Cubuk, QV Le
INTERSPEECH, 2019
Cited by: 4239; Year: 2019
Listen, Attend and Spell: A Neural Network for Large Vocabulary Conversational Speech Recognition
W Chan, N Jaitly, QV Le, O Vinyals
ICASSP, 2016
Cited by: 3377*; Year: 2016
Image Super-Resolution via Iterative Refinement
C Saharia, J Ho, W Chan, T Salimans, D Fleet, M Norouzi
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022
Cited by: 1663; Year: 2022
Palette: Image-to-Image Diffusion Models
C Saharia, W Chan, H Chang, CA Lee, J Ho, T Salimans, DJ Fleet, ...
SIGGRAPH, 2022
Cited by: 1328; Year: 2022
Video Diffusion Models
J Ho, T Salimans, A Gritsenko, W Chan, M Norouzi, D Fleet
arXiv:2204.03458, 2022
Cited by: 1220; Year: 2022
Imagen Video: High Definition Video Generation with Diffusion Models
J Ho, W Chan, C Saharia, J Whang, R Gao, A Gritsenko, D P. Kingma, ...
arXiv:2210.02303, 2022
Cited by: 1209; Year: 2022
Cascaded Diffusion Models for High Fidelity Image Generation
J Ho, C Saharia, W Chan, D Fleet, M Norouzi, T Salimans
Journal of Machine Learning Research 23 (47), 1-33, 2022
Cited by: 1053; Year: 2022
WaveGrad: Estimating Gradients for Waveform Generation
N Chen, Y Zhang, H Zen, R Weiss, M Norouzi, W Chan
ICLR, 2021
Cited by: 791; Year: 2021
Very Deep Convolutional Networks for End-to-End Speech Recognition
Y Zhang, W Chan, N Jaitly
ICASSP, 2017
Cited by: 567; Year: 2017
Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM
T Hori, S Watanabe, Y Zhang, W Chan
INTERSPEECH, 2017
Cited by: 363; Year: 2017
Insertion Transformer: Flexible Sequence Generation via Insertion Operations
M Stern, W Chan, J Kiros, J Uszkoreit
ICML, 2019
Cited by: 260; Year: 2019
Novel View Synthesis with Diffusion Models
D Watson, W Chan, R Martin-Brualla, J Ho, A Tagliasacchi, M Norouzi
ICLR, 2023
Cited by: 231; Year: 2023
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
Cited by: 214; Year: 2019
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition
Y Zhang, DS Park, W Han, J Qin, A Gulati, J Shor, A Jansen, Y Xu, ...
IEEE Journal of Selected Topics in Signal Processing, 2021
Cited by: 188; Year: 2021
Learning Fast Samplers for Diffusion Models by Differentiating Through Sample Quality
D Watson, W Chan, J Ho, M Norouzi
ICLR, 2022
Cited by: 170; Year: 2022
SpecAugment on Large Scale Datasets
D Park, Y Zhang, CC Chiu, Y Chen, B Li, W Chan, Q Le, Y Wu
ICASSP, 2020
Cited by: 164; Year: 2020
Noise2Music: Text-conditioned Music Generation with Diffusion Models
Q Huang, D S. Park, T Wang, T I. Denk, A Ly, N Chen, Z Zhang, Z Zhang, ...
arXiv:2302.03917, 2023
Cited by: 163; Year: 2023
Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes
B Li, Y Zhang, T Sainath, Y Wu, W Chan
ICASSP, 2019
Cited by: 157; Year: 2019
SpeechStew: Simply Mix All Available Speech Recognition Data to Train One Large Neural Network
W Chan, D Park, C Lee, Y Zhang, Q Le, M Norouzi
INTERSPEECH: Workshop on Machine Learning in Speech and Language Processing, 2021
Cited by: 154; Year: 2021