Hi! Interesting multilingual ASR model.
Wanted to suggest potential interop or comparison with FunASR and SenseVoice:
FunASR ecosystem
- FunASR — Industrial-grade ASR toolkit with streaming, VAD, punctuation, diarization
- SenseVoice — Multi-task speech model (ASR + emotion + audio events), 50+ languages
- Fun-ASR-Nano — Encoder-LLM architecture, HuggingFace Transformers native
Potential collaboration areas
- Benchmark comparison — Could be valuable to compare Dolphin with Paraformer/SenseVoice on multilingual benchmarks
- Pipeline integration — FunASR's VAD (FSMN-VAD) and punctuation models could complement Dolphin
- Streaming — FunASR's streaming architecture could pair with Dolphin's recognition
References
Hi! Interesting multilingual ASR model.
Wanted to suggest potential interop or comparison with FunASR and SenseVoice:
FunASR ecosystem
Potential collaboration areas
References