Skip to content
You signed in with another tab or window. to refresh your session.
You signed out in another tab or window. to refresh your session.
You switched accounts on another tab or window. to refresh your session.
Here are
14 public repositories
matching this topic...
👁 Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
A webui for different audio related Neural Networks
OpenMusic: SOTA Text-to-music (TTM) Generation
👁 NeuroSandboxWebUI
(Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on 3 languages
Text prompt steered synthetic audio generators
👁 DreamSound
[ICASSP'24] Investigating Personalization Methods in Text to Music Generation
A comprehensive, click to install, fully open-source, Video + Audio Generation AIO Toolkit using advanced prompt engineering plus the power of CogVideox + AudioLDM2 + Python!
AudioLDM text to audio colab
Generative AI version of the GeoGuesser game.
👁 AudioLDM-with-LoRA
Enhancing Diffusion-Based Music Generation Performance with LoRA.
Simple web UI for AudioLDM 2.
Workshop for Multimodale media generator
Inference-time optimization for diffusion-based text-to-audio generation using CLAP-guided multi-sample selection, with quantitative cost–quality analysis on AudioLDM.
In this game, your given an image for so many seconds to view. Then you have to guess just by clicking on any point in the world that the photo was taken. NOTICE: This game is INCOMPLETE
Improve this page
Add a description, image, and links to the
audioldm
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
audioldm
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.