
CelebV-Text

Large-scale face text-video dataset

#video

#video creation

#Dataset

#text

#Statistical analysis

#Generation tasks


Product Details

CelebV-Text is a large-scale, high-quality, and diverse face text-video dataset designed to support research on face text-to-video generation. It contains 70,000 in-the-wild face video clips, each paired with 20 text descriptions. The annotations cover 40 general appearances, 5 detailed appearances, 6 lighting conditions, 37 actions, 8 emotions, and 6 light directions. Comprehensive statistical analysis of the videos, the texts, and their text-video correlation is presented to demonstrate the dataset's superiority, and an accompanying benchmark standardizes the evaluation of face text-to-video generation tasks.
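The figures above imply 70,000 × 20 = 1,400,000 text-video pairs. The following minimal sketch only makes that arithmetic and the attribute counts concrete. The record layout (`FaceClipSample`, `clip_id`, `texts`) and the dictionary keys are hypothetical names chosen for illustration and do not reflect the dataset's actual file format.

```python
from dataclasses import dataclass, field

# Figures taken from the description above.
NUM_CLIPS = 70_000
TEXTS_PER_CLIP = 20

# Number of categories per attribute group, as stated in the description.
# The key names are illustrative, not the dataset's own labels.
ATTRIBUTE_CATEGORIES = {
    "general_appearance": 40,
    "detailed_appearance": 5,
    "light_condition": 6,
    "action": 37,
    "emotion": 8,
    "light_direction": 6,
}


@dataclass
class FaceClipSample:
    """Hypothetical record: one video clip and its text descriptions."""
    clip_id: str
    texts: list[str] = field(default_factory=list)

    def is_complete(self) -> bool:
        # A clip is complete when it carries the stated 20 descriptions.
        return len(self.texts) == TEXTS_PER_CLIP


def total_text_video_pairs(num_clips: int = NUM_CLIPS,
                           texts_per_clip: int = TEXTS_PER_CLIP) -> int:
    """Each (clip, text) combination counts as one text-video pair."""
    return num_clips * texts_per_clip


if __name__ == "__main__":
    print(total_text_video_pairs())            # 1400000
    print(sum(ATTRIBUTE_CATEGORIES.values()))  # 102
```

Summing the six attribute groups gives 102 annotated categories in total.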

Main Features

1. Large-scale face text-video dataset

2. 70,000 video clips of faces in the wild

3. Each video clip is paired with 20 text descriptions

4. Covers 40 general appearances, 5 detailed appearances, 6 lighting conditions, 37 actions, 8 emotions, and 6 light directions

5. Comprehensive statistical analysis is used to validate the dataset's superiority

6. A benchmark is constructed to standardize the evaluation of face text-to-video generation tasks

Target Users

Researchers working on face text-video generation tasks

Examples

✓ Research on face text-video generation tasks using the CelebV-Text dataset

✓ Face text-video correlation analysis using the CelebV-Text dataset

✓ Building a benchmark for face text-video generation tasks using the CelebV-Text dataset

