Skip to content
View kaituoxu's full-sized avatar

Block or report kaituoxu

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. Speech-Transformer Speech-Transformer Public

    A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

    Python 768 196

  2. Conv-TasNet Conv-TasNet Public

    A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

    Python 668 149

  3. Listen-Attend-Spell Listen-Attend-Spell Public

    A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.

    Python 200 56

  4. TasNet TasNet Public

    A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.

    Python 109 31

  5. Tacotron2 Tacotron2 Public

    A PyTorch implementation of Tacotron2, an end-to-end text-to-speech(TTS) system described in "Natural TTS Synthesis By Conditioning Wavenet On Mel Spectrogram Predictions".

    Python 52 13

  6. X-Punctuator X-Punctuator Public

    A PyTorch implementation of a punctuation prediction system using (B)LSTM, which automatically adds suitable punctuation into text without punctuation.

    Python 62 21