Yanzhang He, Rohit Prabhavalkar, Kanishka Rao, Wei Li, Anton Bakhtin, Ian McGraw
STREAMING SMALL-FOOTPRINT KEYWORD SPOTTING USING SEQUENCE-TO-SEQUENCE MODELS
Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
LISTENING WHILE SPEAKING: SPEECH CHAIN BY DEEP LEARNING
Yanzhang He, Rohit Prabhavalkar, Kanishka Rao, Wei Li, Anton Bakhtin, Ian McGraw
STREAMING SMALL-FOOTPRINT KEYWORD SPOTTING USING SEQUENCE-TO-SEQUENCE MODELS
Emiru Tsunoo, Ondrej Klejch, Peter Bell, Steve Renals
HIERARCHICAL RECURRENT NEURAL NETWORK FOR STORY SEGMENTATION USING FUSION OF LEXICAL AND ACOUSTIC FEATURES
Herman Kamper, Karen Livescu, Sharon Goldwater
AN EMBEDDED SEGMENTAL K-MEANS MODEL FOR UNSUPERVISED SEGMENTATION AND CLUSTERING OF SPEECH
Hayato Shibata, Taku Kato, Takahiro Shinozaki, Shinji Watanabe
COMPOSITE EMBEDDING SYSTEMS FOR ZEROSPEECH2017 TRACK1
Kanishka Rao, Hasim Sak, Rohit Prabhavalkar
EXPLORING ARCHITECTURES, DATA AND UNITS FOR STREAMING END-TO-END SPEECH RECOGNITION WITH RNN-TRANSDUCER
Eric Battenberg, Jitong Chen, Rewon Child, Adam Coates, Yashesh Gaur, Yi Li, Hairong Liu, Sanjeev Satheesh, Anuroop Sriram, Zhenyao Zhu
EXPLORING NEURAL TRANSDUCERS FOR END-TO-END SPEECH RECOGNITION
Zhong Meng, Zhuo Chen, Vadim Mazalov, Jinyu Li, Yifan Gong
UNSUPERVISED ADAPTATION WITH DOMAIN SEPARATION NETWORKS FOR ROBUST SPEECH RECOGNITION
Shinji Watanabe, Takaaki Hori, John Hershey
LANGUAGE INDEPENDENT END-TO-END ARCHITECTURE FOR JOINT LANGUAGE IDENTIFICATION AND SPEECH RECOGNITION
Takaaki Hori, Shinji Watanabe, John Hershey
MULTI-LEVEL LANGUAGE MODELING AND DECODING FOR OPEN VOCABULARY END-TO-END SPEECH RECOGNITION
Andros Tjandra, Sakriani Sakti, Satoshi Nakamura
LISTENING WHILE SPEAKING: SPEECH CHAIN BY DEEP LEARNING
Wei-Ning Hsu, Yu Zhang, James Glass
UNSUPERVISED DOMAIN ADAPTATION FOR ROBUST SPEECH RECOGNITION VIA VARIATIONAL AUTOENCODER-BASED DATA AUGMENTATION