Found 2 AI tools
Click any tool to view details
Emilia is an open source multilingual wild speech dataset designed for large-scale speech generation research. It contains more than 101,000 hours of high-quality speech data and corresponding text transcriptions in six languages, covering a variety of speaking styles and content types such as talk shows, interviews, debates, sports commentary and audiobooks.
DL3DV-10K is a large-scale real-scene data set containing more than 10,000 high-quality videos. Each video is manually annotated with scene key points and complexity, and provides camera poses, NeRF estimated depth, point clouds, and 3D meshes. This data set can be used for computer vision research such as general NeRF research, scene consistency tracking, and visual language models.
Explore other subcategories under other Other Categories
195 tools
178 tools
113 tools
102 tools
62 tools
61 tools
49 tools
45 tools
AI data mining Hot other is a popular subcategory under 2 quality AI tools