MINT-1T is a multimodal dataset open-sourced by Salesforce AI. It contains one trillion text tokens and 3.4 billion images, making it about 10 times larger than previously existing open-source datasets of its kind. In addition to HTML documents, it includes PDF documents and ArXiv papers, which broadens the diversity of the dataset. The dataset was built by collecting, processing, and filtering data from multiple sources, with the aim of keeping the data both high-quality and diverse.