fangjun's picture

fangjun

csukuangfj

·

https://github.com/csukuangfj

csukuangfj

AI & ML interests

None yet

Recent Activity

updated a model 5 days ago

csukuangfj/sherpa-onnx-rknn-models

updated a Space 11 days ago

k2-fsa/text-to-speech

updated a model 14 days ago

csukuangfj/sherpa-onnx-tts-samples

View all activity

Organizations

authored 11 papers 6 months ago

Zipformer: A faster and better encoder for automatic speech recognition

Paper • 2310.11230 • Published Oct 17, 2023 • 1

PromptASR for contextualized ASR with controllable style

Paper • 2309.07414 • Published Sep 14, 2023

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

Paper • 2309.08105 • Published Sep 15, 2023 • 1

Pruned RNN-T for fast, memory-efficient ASR training

Paper • 2206.13236 • Published Jun 23, 2022

Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation

Paper • 2211.00508 • Published Oct 31, 2022

Blank-regularized CTC for Frame Skipping in Neural Transducer

Paper • 2305.11558 • Published May 19, 2023

LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization

Paper • 2409.00819 • Published Sep 1, 2024

Delay-penalized CTC implemented based on Finite State Transducer

Paper • 2305.11539 • Published May 19, 2023

k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning

Paper • 2411.17100 • Published Nov 26, 2024

ZipVoice: Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching

Paper • 2506.13053 • Published Jun 16, 2025

ZipVoice-Dialog: Non-Autoregressive Spoken Dialogue Generation with Flow Matching

Paper • 2507.09318 • Published Jul 12, 2025 • 2