Make-An-Audio 2 is a diffusion-based text-to-audio generation method jointly developed by researchers from Zhejiang University, ByteDance, and the Chinese University of Hong Kong. It uses pre-trained large language models (LLMs) to parse the input text, which improves semantic alignment and temporal consistency in the generated audio. It also introduces a feed-forward Transformer-based diffusion denoiser that improves variable-length audio generation and strengthens the extraction of temporal information. Finally, it mitigates the scarcity of data with temporal information by using LLMs to convert large amounts of audio-label data into audio-text datasets.