Found 2 AI tools
Click any tool to view details
Persona Hub is a large-scale synthetic data set released by Tencent AI Lab, aiming to promote persona-driven data synthesis research. This dataset contains millions of synthetic data samples of different personas and can be used to simulate the diverse input of real-world users for testing and research of large language models (LLM).
MNBVC (Massive Never-ending BT Vast Chinese corpus) is a project aimed at providing rich Chinese corpus for AI. It includes not only mainstream cultural content, but also niche culture and Internet slang. The data set includes news, compositions, novels, books, magazines, papers, lines, posts, wikis, ancient poems, lyrics, product introductions, jokes, embarrassments, chat records and other forms of plain text Chinese data.
Explore other subcategories under Open source Other Categories
9 tools
3 tools
2 tools
AI data mining Hot Open source is a popular subcategory under 2 quality AI tools