Comparison / interop with FunASR and SenseVoice #108

Description

@LauraGPT

Hi! Interesting multilingual ASR model.

I wanted to suggest possible interop or a comparison with FunASR and SenseVoice:

FunASR ecosystem

  • FunASR — Industrial-grade ASR toolkit with streaming, VAD, punctuation, diarization
  • SenseVoice — Multi-task speech model (ASR + emotion + audio events), 50+ languages
  • Fun-ASR-Nano — Encoder-LLM architecture, HuggingFace Transformers native

Potential collaboration areas

  1. Benchmark comparison — Could be valuable to compare Dolphin with Paraformer/SenseVoice on multilingual benchmarks
  2. Pipeline integration — FunASR's VAD (FSMN-VAD) and punctuation models could complement Dolphin
  3. Streaming — FunASR's streaming architecture could pair with Dolphin's recognition
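
To make the pipeline-integration idea (item 2) a bit more concrete, here is a minimal sketch of how a VAD stage, a recognizer, and a punctuation stage could be chained. Everything in it is an assumption for illustration: `transcribe_with_vad`, `Utterance`, and the three callables (`detect_speech`, `recognize`, `punctuate`) are hypothetical names, and the sketch does not call any real FunASR or Dolphin API. In practice `detect_speech` might wrap FSMN-VAD, `recognize` might wrap Dolphin, and `punctuate` might wrap a FunASR punctuation model, each adapted to these signatures.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple

# Hypothetical segment type: (start_ms, end_ms) of a detected speech region.
Segment = Tuple[int, int]


@dataclass
class Utterance:
    start_ms: int
    end_ms: int
    text: str


def transcribe_with_vad(
    samples: Sequence[float],
    sample_rate: int,
    detect_speech: Callable[[Sequence[float], int], List[Segment]],
    recognize: Callable[[Sequence[float], int], str],
    punctuate: Callable[[str], str],
) -> List[Utterance]:
    """Chain VAD -> ASR -> punctuation over one mono waveform.

    All three callables are placeholders supplied by the caller; this
    function only handles slicing and bookkeeping.
    """
    utterances: List[Utterance] = []
    for start_ms, end_ms in detect_speech(samples, sample_rate):
        # Convert millisecond boundaries to sample indices.
        lo = start_ms * sample_rate // 1000
        hi = end_ms * sample_rate // 1000
        raw = recognize(samples[lo:hi], sample_rate).strip()
        if raw:  # skip segments where the recognizer returned no text
            utterances.append(Utterance(start_ms, end_ms, punctuate(raw)))
    return utterances
```

Passing the stages in as plain callables keeps the sketch independent of either project's model-loading code, so the same function could be used to swap recognizers when comparing outputs.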
