VOOZH about

URL: https://en.wikipedia.org/wiki/DD-Shooter

⇱ DD-Shooter - Wikipedia


Jump to content
From Wikipedia, the free encyclopedia
AI-based drum synthesis software
DD-Shooter
👁 Image
DevelopersJoao Carreira (Instituto de Telecomunicações, PT); Christopher Hahne (University of Bern, CH)
Release2024; 2 years ago (2024)[1]
Stable release
0.1.2.0
Preview release
0.0.8.0
Operating systemCross-platform: Windows, macOS
TypeGenerative artificial intelligence, Audio synthesis
Websitesonolisk.com

DD-Shooter is a generative artificial intelligence audio synthesis tool developed by Sonolisk for creating and transforming drum sounds. It uses diffusion model-based machine learning models to generate percussive audio from text descriptions, audio examples, or combinations of conditioning inputs.

Unlike conventional drum sample libraries and drum machines, DD-Shooter employs learned latent representations to synthesize new drum sounds and variations rather than relying solely on prerecorded samples or manually designed synthesis parameters.

History

[edit]

The development of DD-Shooter began early 2024 as an online drum synthesis application in the cloud customizing the waveform diffusion model.[2] on GitHub[3] The software was initially introduced under audiostein as part of a broader trend toward generative AI systems for music and audio production. Today, DD-Shooter is a standalone and VST/AU plugin from Sonolisk running locally on CPUs.

Technology

[edit]
👁 Image
Waveform synthesis of drum shots through probabilistic denoising diffusion by DD-Shooter

DD-Shooter utilizes diffusion models for waveform synthesis and transformation. The model generates drum sounds through an iterative denoising process conditioned on user inputs.

Supported conditioning methods include:

  • Text prompts describing sonic characteristics
  • Audio-guided generation
  • Latent-space interpolation between sounds
  • Drum sound variation and augmentation
  • Audio inpainting and localized editing

Features

[edit]

DD-Shooter provides:

  • Text-to-drum synthesis
  • Generation of drum sound variations
  • Timbre transfer and transformation
  • Prompt-based sound exploration
  • Latent interpolation between drum sounds
  • Inpainting of selected audio regions

See also

[edit]

References

[edit]
  1. ^ DD-Shooter Manual
  2. ^ Schneider, Flavio; Kamal, Ojasv; Jin, Zhijing; Schölkopf, Bernhard (2024). "Moûsai: Efficient Text-to-Music Diffusion Models". Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics. pp. 8050–8068. doi:10.18653/v1/2024.acl-long.437.
  3. ^ "audio-diffusion-pytorch: Audio generation using diffusion models in PyTorch". GitHub. Archinetai. Retrieved 2026-06-09.