Comparison / interop with FunASR and SenseVoice #108

Hi! Dolphin looks like an interesting multilingual ASR model.

I'd like to suggest comparing it, or exploring interop, with [FunASR](https://github.com/modelscope/FunASR) and [SenseVoice](https://github.com/FunAudioLLM/SenseVoice):

## FunASR ecosystem

- **FunASR** — Industrial-grade ASR toolkit with streaming, VAD, punctuation, diarization
- **SenseVoice** — Multi-task speech model (ASR + emotion + audio events), 50+ languages
- **Fun-ASR-Nano** — Encoder-LLM architecture, HuggingFace Transformers native

## Potential collaboration areas

1. **Benchmark comparison** — Could be valuable to compare Dolphin with Paraformer/SenseVoice on multilingual benchmarks
2. **Pipeline integration** — FunASR's VAD (FSMN-VAD) and punctuation models could complement Dolphin
3. **Streaming** — FunASR's streaming architecture could pair with Dolphin's recognition
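
To make the pipeline-integration idea (item 2) a bit more concrete, here is a rough sketch of the glue code it would need. It assumes the VAD step yields speech segments as `[start_ms, end_ms]` pairs, which is my understanding of what FSMN-VAD returns in FunASR, and it treats the recognizer as an opaque callable because I don't want to guess at Dolphin's API. The names `slice_by_segments`, `transcribe_segments`, and `recognize` are made up for illustration.

```python
from typing import Callable, List, Sequence, Tuple

# Assumed VAD output shape: (start_ms, end_ms) per detected speech segment.
Segment = Tuple[int, int]


def slice_by_segments(
    samples: Sequence[float],
    segments: List[Segment],
    sample_rate: int = 16000,
) -> List[Tuple[Segment, Sequence[float]]]:
    """Cut a mono waveform into the speech chunks named by the VAD segments."""
    chunks = []
    for start_ms, end_ms in segments:
        # Convert milliseconds to sample indices and clamp to the waveform.
        start = max(0, start_ms * sample_rate // 1000)
        end = min(len(samples), end_ms * sample_rate // 1000)
        if end > start:  # skip empty or out-of-range segments
            chunks.append(((start_ms, end_ms), samples[start:end]))
    return chunks


def transcribe_segments(
    samples: Sequence[float],
    segments: List[Segment],
    recognize: Callable[[Sequence[float]], str],
    sample_rate: int = 16000,
) -> List[Tuple[Segment, str]]:
    """Run a recognizer (hypothetical stand-in for Dolphin) on each speech chunk."""
    return [
        (segment, recognize(chunk))
        for segment, chunk in slice_by_segments(samples, segments, sample_rate)
    ]


if __name__ == "__main__":
    one_second = [0.0] * 16000
    vad_segments = [(0, 500), (750, 1000)]
    # Dummy recognizer that only reports the chunk length.
    print(transcribe_segments(one_second, vad_segments, lambda c: f"{len(c)} samples"))
```

In a real setup the segment list would come from the VAD model and `recognize` would wrap a Dolphin inference call; a punctuation model could then run over the joined text.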

## References

- FunASR: https://github.com/modelscope/FunASR (16K+ stars)
- SenseVoice: https://github.com/FunAudioLLM/SenseVoice (8K+ stars)
- Benchmarks: https://funasr.com
