spoken-dialogue-models

Here are 3 public repositories matching this topic...

jishengpeng / WavChat

A Survey of Spoken Dialogue Models (60 pages)

streaming duplex speech moshi speech-representation encodec gpt-4o speech-language-model spoken-dialogue-models modal-alignment intreaction mini-omni llama-omni wavtokenizer

Updated

Ruiqi-Yan / Awesome-Full-Duplex-SDM

Star

A curated list of full-duplex spoken dialogue models & benchmarks

awesome-list full-duplex spoken-dialogue-systems spoken-dialogue-models

Updated

ictnlp / FastLongSpeech

Star

FastLongSpeech is a novel framework designed to extend the capabilities of Large Speech-Language Models for efficient long-speech processing without necessitating dedicated long-speech training data.

speech speech-recognition speech-to-text multi-modal speech-processing spoken-language-understanding speech-emotion-recognition large-language-models llms llm-training qwen speech-llms large-speech-models multi-modal-llms qwen2-5 spoken-dialogue-models

Updated
Python

Improve this page

Add a description, image, and links to the spoken-dialogue-models topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the spoken-dialogue-models topic, visit your repo's landing page and select "manage topics."

Learn more

URL: https://github.com/topics/spoken-dialogue-models