Skip to content
You signed in with another tab or window. to refresh your session.
You signed out in another tab or window. to refresh your session.
You switched accounts on another tab or window. to refresh your session.
Pinned
Loading
-
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
Python
3.1k
223
-
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Python
1.3k
101
-
“百聆”是一个基于LLaMA的语言对齐增强的英语/中文大语言模型,具有优越的英语/中文能力,在多语言和通用任务等多项测试中取得ChatGPT 90%的性能。BayLing is an English/Chinese LLM equipped with advanced language alignment, showing superior capability in English/Ch…
Python
318
18
-
LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.
Python
566
31
-
This is the official repository for Auto-RAG.
Python
234
20
-
FlexRAG: A RAG Framework for Information Retrieval and Generation.
Python
234
23
Repositories
Showing 10 of 87 repositories
-
FlexRAG
Public
FlexRAG: A RAG Framework for Information Retrieval and Generation.
-
-
XBridge
Public
Code for paper "Language on Demand, Knowledge at Core: Composing LLMs with Encoder-Decoder Translation Models for Extensible Multilinguality".
-
-
AlignX
Public
Code for paper "AlignX: Advancing Multilingual Large Language Models with Multilingual Representation Alignment".
-
IG-Pruning
Public
Official repository for EMNLP2025 paper "IG-Pruning: Input-Guided Block Pruning for Large Language Models"
-
PSO-Merging
Public
PSO-Merging is an innovative deep model fusion method that uses particle swarm optimization algorithm to automatically find optimal model fusion weights.
Python
11
MIT
1
0
0
Updated
-
FastLongSpeech
Public
FastLongSpeech is a novel framework designed to extend the capabilities of Large Speech-Language Models for efficient long-speech processing without necessitating dedicated long-speech training data.
-
Auto-RAG
Public
This is the official repository for Auto-RAG.
Python
234
Apache-2.0
20
4
0
Updated
-
StreamUni
Public
StreamUni is a framework that efficiently enables unified Large Speech-Language Models to accomplish streaming speech translation in a cohesive manner.
People
This organization has no public members. You must be a member to see who’s a part of this organization.
You can’t perform that action at this time.