Galgame-Llasa-3B-v3

Overview

This is version 3 of Galgame-Llasa-3B, a Text-to-Speech (TTS) model fine-tuned for Japanese. It is based on HKUSTAudio/Llasa-3B.

The primary improvement in v3 is the modification of the text normalization process during training.

This update leads to more consistent and accurate speech synthesis, further improving upon the advances made in v2.
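The exact normalization rules used in training are not described here. As a rough illustration of what text normalization for Japanese TTS input can involve, the sketch below applies Unicode NFKC folding and whitespace cleanup. The function name `normalize_text` and the choice of rules are assumptions made for this example, not the model's actual pipeline.

```python
import re
import unicodedata


def normalize_text(text: str) -> str:
    """Hypothetical normalizer for illustration only.

    This is NOT the normalization used to train Galgame-Llasa-3B-v3;
    the real rules are not documented in this card.
    """
    # NFKC folds full-width ASCII letters/digits and half-width katakana
    # into their canonical forms, and maps the ideographic space to U+0020.
    text = unicodedata.normalize("NFKC", text)
    # Collapse any run of whitespace into a single space and trim the ends.
    text = re.sub(r"\s+", " ", text).strip()
    return text
```

Applying the same normalization to input text at inference time as was used during training generally keeps the model from seeing character variants it was never trained on, which is one plausible reason a normalization change can improve consistency.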

Version 2 was trained on a larger and more diverse dataset, including the original Galgame dataset and other sources.

As a result, v2 offered several key improvements over the original version:

Improved Kanji Reading: The model handled the reading of Kanji characters more accurately.
Enhanced Prosody: The generated speech had more natural intonation and expressiveness.
Greater Voice Diversity: The model could produce a wider range of voice styles than the previous version.

This model is licensed under CC-BY-NC-4.0.

Format: Safetensors
Model size: 3B params
Tensor type: BF16
