VOOZH about

URL: https://huggingface.co/OmniAICreator/Galgame-Llasa-3B-v3

⇱ OmniAICreator/Galgame-Llasa-3B-v3 · Hugging Face


Configuration Parsing Warning:Config file tokenizer_config.json cannot be fetched (too big)

Galgame-Llasa-3B-v3

Overview

This is the version 3 of the Galgame-Llasa-3B, a Text-to-Speech (TTS) model fine-tuned for Japanese. This model is based on HKUSTAudio/Llasa-3B.

What's New in v3?

The primary improvement in v3 is the modification of the text normalization process during training.

This update leads to more consistent and accurate speech synthesis, further improving upon the advances made in v2.

What's New in v2 (from v1)?

Version 2 was trained on a larger and more diverse dataset, including the original Galgame dataset and other sources.

As a result, v2 offered several key improvements over the original version:

  • Improved Kanji Reading: The model handled the reading of Kanji characters more accurately.
  • Enhanced Prosody: The generated speech had more natural intonation and expressiveness.
  • Greater Voice Diversity: The model could produce a wider range of voice styles than the previous version.

License

This model is licensed under the CC-BY-NC-4.0.

Downloads last month
47
Safetensors
Model size
3B params
Tensor type
BF16
·

Model tree for OmniAICreator/Galgame-Llasa-3B-v3

Finetuned
(5)
this model
Quantizations
1 model

Datasets used to train OmniAICreator/Galgame-Llasa-3B-v3

Space using OmniAICreator/Galgame-Llasa-3B-v3 1