Abstract
Diffusion models have firmly established their excellence in image generation, as evidenced by the success of renowned models such as DALL-E, Midjourney, and Stable Diffusion. This rapid progress in diffusion models for visual content raises an interesting question: can diffusion models be adapted to audio generation tasks? In this study, we introduce a novel diffusion model architecture designed to generate mel spectrograms, visual representations of sound that can subsequently be converted into audible music. Given the exceptional capability of diffusion models to produce high-quality images, their application to mel spectrogram generation is particularly promising. Our proposed model deviates minimally from the conventional architectures employed for visual content, making this research especially useful for examining the potential for cross-domain transfer between image and audio generation. The model was trained on a dataset of over 186 hours of Lofi audio, providing diverse samples for generalized learning. However, a combination of research limitations led to subpar results, paving the way for further studies to build on this work.
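For readers unfamiliar with the pipeline the abstract describes, the sketch below illustrates the final step: converting a mel spectrogram back into a waveform. This is not the authors' code; it is a minimal example assuming librosa's Griffin-Lim-based mel inversion, with common default parameter values (sample rate, FFT size, hop length, mel bands) standing in for whatever settings the paper actually used, and a random array standing in for a model output.

```python
# Hypothetical sketch: inverting a mel spectrogram to audio with librosa.
# Parameter values are common defaults, not the paper's settings.
import numpy as np
import librosa
import soundfile as sf

SR = 22050          # assumed sample rate
N_FFT = 2048        # assumed FFT window size
HOP_LENGTH = 512    # assumed hop length
N_MELS = 128        # assumed number of mel bands

# Stand-in for a diffusion model's output: a (n_mels, frames) power
# mel spectrogram. A real model would produce this array instead.
mel_power = np.abs(np.random.randn(N_MELS, 256)).astype(np.float32)

# Reconstruct a waveform from the mel spectrogram; librosa uses the
# Griffin-Lim algorithm internally to estimate the missing phase.
audio = librosa.feature.inverse.mel_to_audio(
    mel_power, sr=SR, n_fft=N_FFT, hop_length=HOP_LENGTH
)

sf.write("generated.wav", audio, SR)
```

Griffin-Lim is a simple, model-free choice for phase reconstruction; neural vocoders are a common alternative when higher fidelity is needed.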