
URL: https://huggingface.co/funasr/ct-punc



⭐ Powered by FunASR — please give us a GitHub Star!

This model is part of the FunASR ecosystem — one industrial-grade open-source toolkit for ASR · VAD · punctuation · speaker diarization · emotion / event · LLM-ASR. A Star really helps the project (and keeps you updated):

🌟 FunASR · 🌟 SenseVoice · 🌟 Fun-ASR · 🌟 FunClip

CT-Punc

Punctuation Restoration — automatically add punctuation to ASR output text.

CT-Punc (Controllable Time-delay Punctuation) restores punctuation marks in unpunctuated text. It is commonly used as a post-processing step after speech recognition.

Quick Start

from funasr import AutoModel

# Standalone punctuation restoration
model = AutoModel(model="funasr/ct-punc", hub="hf", device="cuda")
result = model.generate(input="我们今天讨论三个议题首先是产品发布其次是市场策略最后是团队建设")
print(result[0]["text"])
# → 我们今天讨论三个议题,首先是产品发布,其次是市场策略,最后是团队建设。
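The Quick Start hardcodes `device="cuda"`, which will not work on a machine without a GPU. Below is a minimal sketch of a fallback, assuming FunASR accepts `device="cpu"` and that PyTorch may or may not be installed. The helper names `pick_device` and `load_punc_model` are hypothetical and not part of FunASR.

```python
from typing import Optional


def pick_device() -> str:
    """Return "cuda" when PyTorch reports a usable GPU, otherwise "cpu"."""
    try:
        import torch
    except ImportError:
        # Without PyTorch we cannot probe for a GPU, so assume CPU.
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


def load_punc_model(device: Optional[str] = None):
    """Load funasr/ct-punc with the same arguments as the Quick Start.

    Assumption: device="cpu" is accepted by AutoModel.
    """
    # Imported lazily so pick_device() can be used without funasr installed.
    from funasr import AutoModel

    return AutoModel(model="funasr/ct-punc", hub="hf", device=device or pick_device())
```

Calling `load_punc_model()` with no argument picks the device automatically; passing `device="cpu"` forces CPU inference.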

Use as Part of ASR Pipeline

from funasr import AutoModel

model = AutoModel(
    model="funasr/paraformer-zh",
    hub="hf",
    vad_model="funasr/fsmn-vad",
    punc_model="funasr/ct-punc",
    device="cuda",
)
result = model.generate(input="audio.wav")
# The returned text already includes punctuation
print(result[0]["text"])
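One practical reason to restore punctuation is that the transcript can then be split into sentences for subtitles or downstream NLP. The sketch below is an illustration only and is not part of FunASR: `split_sentences` is a hypothetical helper that splits on sentence-final marks. It deliberately ignores the ASCII period, so English sentences ending in "." would need extra handling.

```python
import re
from typing import List

# Sentence-final marks: full-width Chinese marks plus ASCII "!" and "?".
# The ASCII period is left out on purpose so decimals like "3.5" are not split.
_SENTENCE = re.compile(r"[^。！？!?]+[。！？!?]?")


def split_sentences(text: str) -> List[str]:
    """Split punctuated text into sentences, keeping each closing mark."""
    return [s.strip() for s in _SENTENCE.findall(text) if s.strip()]


print(split_sentences("今天开会。明天休息吗？好的"))
# → ['今天开会。', '明天休息吗？', '好的']
```

In a pipeline, the input would be `result[0]["text"]` from the `generate` call.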

Features

  • Chinese and English punctuation restoration
  • Low latency, suitable for streaming pipelines
  • Integrates seamlessly with FunASR ASR models
