Found 43 AI tools
Quark·Zangdian AI is a platform that uses advanced AI technology to generate images and videos from simple user input. Its main advantages are speed and efficiency, making it suitable for designers, artists, and content creators. It provides flexible creative tools that help users realize their ideas quickly, and a flexible pricing model that gives users more choices.
Image to Video AI Generator utilizes advanced AI models to convert static images into eye-catching videos, suitable for social media creators and anyone who wants to experience AI video generation. The product is positioned to simplify the video production process and improve efficiency.
AI Animate Image uses advanced AI technology to transform static images into vivid animations, providing professional-level animation quality and smooth dynamic effects.
Grok Imagine is an AI image and video generation platform powered by the Aurora engine that can generate multi-domain realistic images and dynamic video content. Its core technology is based on the Aurora engine's autoregressive image model, providing users with high-quality and diverse visual creation experiences.
WAN 2.1 LoRA T2V is a tool that generates videos from text prompts. Through custom training of LoRA modules, users can tailor the generated videos, making it suitable for brand narratives, fan content, and stylized animations. The product offers a highly customizable video generation experience.
Openjourney is a high-fidelity open source project designed to simulate MidJourney's interface and utilize Google's Gemini SDK for AI image and video generation. This project supports high-quality image generation using Imagen 4, as well as text-to-video and image-to-video conversion using Veo 2 and Veo 3. It is suitable for developers and creators who need to perform image generation and video production. It provides a user-friendly interface and real-time generation experience to assist creative work and project development.
a2e.ai is an AI tool that provides AI avatars, lip synchronization, voice cloning, text-to-video generation, and other functions. It offers high definition, strong consistency, and fast generation, and suits a wide variety of scenarios as a complete avatar AI toolkit.
FlyAgt is an AI image and video generation platform that provides advanced AI tools from creation to editing to image enhancement. Its main advantages are its affordability, wide range of professional tools, and protection of user privacy.
iMyFone DreamVid is a powerful AI image-to-video conversion tool. Users upload photos, and the AI converts the static images into vivid videos, including effects such as hugs, kisses, and face swaps. The tool is affordable and targets individual users and small businesses.
Everlyn AI positions itself as the world's leading AI video generator and free AI image generator, using advanced AI technology to transform your ideas into stunning visuals. It claims disruptive performance figures, including 15-second generation speed, a 25-fold cost reduction, and 8-fold higher efficiency.
The Describe Anything Model (DAM) can process specific regions of an image or video and generate detailed descriptions. Its main advantage is that it produces high-quality localized descriptions from simple markup (points, boxes, scribbles, or masks), greatly improving localized image understanding in computer vision. Developed jointly by NVIDIA and multiple universities, the model is suitable for research, development, and real-world applications.
vivago.ai is a free AI generation tool and community that provides text-to-image, image-to-video, and other functions, making creation easier and more efficient. Users can generate high-quality images and videos for free, with a variety of AI editing tools that make it easy to create and share. The platform is positioned to provide creators with easy-to-use AI tools that meet their visual creation needs.
Stable Virtual Camera is a 1.3B-parameter general-purpose diffusion model developed by Stability AI — a transformer-based image-to-video model. Its importance lies in providing technical support for novel view synthesis (NVS): it can generate 3D-consistent new scene views from input views and a target camera. Its main advantages are the freedom to specify target camera trajectories, the ability to generate samples with large viewpoint changes and temporal smoothness, high consistency without additional Neural Radiance Field (NeRF) distillation, and the ability to generate high-quality, seamlessly looping videos of up to half a minute. The model is free for research and non-commercial use only, and is positioned as an innovative image-to-video solution for researchers and non-commercial creators.
Pippo is a generative model developed by Meta Reality Labs in cooperation with multiple universities. It can generate high-resolution multi-view videos from a single ordinary photo. The core benefit of this technology is the ability to generate high-quality 1K-resolution video without additional inputs such as parametric models or camera parameters. It is based on a multi-view diffusion transformer architecture and has broad application prospects in areas such as virtual reality and film and television production. Pippo's code is open source but does not include pre-trained weights; users need to train the model themselves.
Animate Anyone 2 is a character image animation technology based on the diffusion model, which can generate animations that are highly adapted to the environment. It solves the problem of lack of reasonable correlation between characters and environment in traditional methods by extracting environment representation as conditional input. The main advantages of this technology include high fidelity, strong adaptability to the environment, and excellent dynamic motion processing capabilities. It is suitable for scenes that require high-quality animation generation, such as film and television production, game development and other fields. It can help creators quickly generate character animations with environmental interaction, saving time and costs.
X-Dyna is an innovative zero-shot human image animation technology that generates realistic and expressive dynamics by transferring facial expressions and body movements from a driving video to a single human image. The technology is based on a diffusion model: its Dynamics-Adapter module integrates the reference appearance context into the diffusion model's spatial attention while preserving the motion module's ability to synthesize smooth, complex dynamic details. Beyond body-pose control, a local control module captures identity-independent facial expressions for precise expression transfer. X-Dyna is trained on a mixture of human and scene videos, learning physical human motion and natural scene dynamics to generate highly realistic and expressive animations.
Hallo3 is a technology for portrait image animation that utilizes pre-trained transformer-based video generation models to generate highly dynamic and realistic videos, effectively solving challenges such as non-frontal perspectives, dynamic object rendering, and immersive background generation. This technology, jointly developed by researchers from Fudan University and Baidu, has strong generalization capabilities and brings new breakthroughs to the field of portrait animation.
DisPose is a method for controlling human image animation that improves the quality of video generation through motion field guidance and keypoint correspondence. This technology is able to generate videos from reference images and driving videos while maintaining consistency of motion alignment and identity information. DisPose provides region-level dense guidance by generating dense motion fields from sparse motion fields and reference images while maintaining the generalization ability of sparse pose control. Furthermore, it extracts diffusion features corresponding to pose key points from the reference image and transfers these point features to the target pose to provide unique identity information. Key benefits of DisPose include the ability to extract more versatile and efficient control signals without the need for additional dense inputs, as well as improved quality and consistency of generated videos via plug-and-play hybrid ControlNet without freezing existing model parameters.
Ruyi-Models is an image-to-video model capable of generating cinematic videos at up to 768 resolution and 24 frames per second, with support for lens control and motion-range control. On a single RTX 3090 or RTX 4090 graphics card, it can generate 512-resolution, 120-frame videos without quality loss. The model has attracted attention for its high-quality video generation and precise control of details, especially in areas that need high-quality video content, such as film production, game production, and virtual reality experiences.
ComfyUI-IF_MemoAvatar is a memory-guided, diffusion-based model for generating expressive videos. The technology allows users to create expressive talking-avatar videos from a single image and audio input. Its importance lies in its ability to convert static images into dynamic videos while retaining the facial features and emotional expression of the person in the image, opening new possibilities for video content creation. The model was developed by Longtao Zheng and others, with related papers published on arXiv.
HelloMeme is a diffusion model integrated with Spatial Knitting Attentions for embedding high-level and detail-rich conditions. This model supports the generation of images and videos, and has the advantages of improving expression consistency between generated videos and driven videos, reducing VRAM usage, and optimizing algorithms. HelloMeme, developed by the HelloVision team and owned by HelloGroup Inc., is a cutting-edge image and video generation technology with important commercial and educational value.
The Qwen2-VL-72B is the latest iteration of the Qwen-VL model and represents nearly a year of innovation. The model achieves state-of-the-art performance in visual understanding benchmarks, including MathVista, DocVQA, RealWorldQA, MTVQA, and more. It can understand videos of more than 20 minutes and can be integrated into mobile phones, robots and other devices to perform automatic operations based on the visual environment and text instructions. In addition to English and Chinese, Qwen2-VL now supports the understanding of text in images in different languages, including most European languages, Japanese, Korean, Arabic, Vietnamese, and more. Model architecture updates include Naive Dynamic Resolution and Multimodal Rotary Position Embedding (M-ROPE), which enhance its multi-modal processing capabilities.
The Qwen2-VL-7B is the latest iteration of the Qwen-VL model and represents nearly a year of innovation. The model achieves state-of-the-art performance on visual understanding benchmarks, including MathVista, DocVQA, RealWorldQA, MTVQA, and others. It can understand videos longer than 20 minutes and provides high-quality support for video-based question answering, dialogue, content creation, and more. In addition to English and Chinese, Qwen2-VL also supports most European languages, Japanese, Korean, Arabic, Vietnamese, and others. Model architecture updates include Naive Dynamic Resolution and Multimodal Rotary Position Embedding (M-ROPE), which enhance its multi-modal processing capabilities.
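Both Qwen2-VL entries above cite M-ROPE. As background, here is a minimal sketch of the standard 1-D rotary position embedding (RoPE) that M-ROPE generalizes — per the model descriptions, M-ROPE splits the channels into groups and applies the same rotation per axis (temporal, height, width). The function name and pure-Python layout are illustrative, not Qwen's implementation:

```python
import math

def rope_rotate(vec, pos, base=10000.0):
    """Rotate each channel pair (2i, 2i+1) of `vec` by angle pos * base**(-2i/d).

    This is standard 1-D rotary position embedding (RoPE). M-ROPE, as
    described for Qwen2-VL, applies this same rotation separately per
    position axis on separate channel groups; this sketch covers only
    the single-axis case.
    """
    d = len(vec)
    out = [0.0] * d
    for i in range(d // 2):
        theta = pos * base ** (-2.0 * i / d)   # per-pair rotation angle
        c, s = math.cos(theta), math.sin(theta)
        x1, x2 = vec[2 * i], vec[2 * i + 1]
        out[2 * i] = x1 * c - x2 * s
        out[2 * i + 1] = x1 * s + x2 * c
    return out
```

The useful property is that rotation preserves vector norms, and the dot product between a rotated query and a rotated key depends only on the relative offset between their positions — which is why attention scores become position-relative.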
FLOAT is an audio-driven portrait video generation method based on a flow matching generative model. It shifts generative modeling from a pixel-based latent space to a learned motion latent space, achieving temporally consistent motion design. The technique introduces a transformer-based vector field predictor with a simple yet effective frame-wise conditioning mechanism. In addition, FLOAT supports speech-driven emotion enhancement and can naturally incorporate expressive movements. Extensive experiments show that FLOAT outperforms existing audio-driven talking-portrait methods in visual quality, motion fidelity, and efficiency.
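As background on the flow-matching objective FLOAT builds on: a vector field is trained to transport noise samples toward data samples, then integrated at generation time. A minimal sketch of linear-path conditional flow matching and Euler sampling — the function names are illustrative, and this is not FLOAT's actual code (which operates in a learned motion latent space):

```python
def flow_matching_pair(x0, x1, t):
    """Linear-path conditional flow matching.

    Given a noise sample x0 and a data sample x1, the network sees the
    interpolant x_t = (1 - t) * x0 + t * x1 and regresses toward the
    constant velocity u = x1 - x0 along that path.
    """
    xt = [(1.0 - t) * a + t * b for a, b in zip(x0, x1)]
    u = [b - a for a, b in zip(x0, x1)]
    return xt, u

def euler_sample(x0, vfield, steps=10):
    """Generate by integrating dx/dt = vfield(x, t) from t = 0 to t = 1."""
    x, dt = list(x0), 1.0 / steps
    for i in range(steps):
        t = i * dt
        v = vfield(x, t)
        x = [a + dt * b for a, b in zip(x, v)]
    return x
```

With a perfectly learned field, Euler integration transports the noise sample exactly onto the data sample; in practice the predictor (in FLOAT, a transformer) approximates this field from audio conditioning.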
CAT4D is a technology that uses multi-view video diffusion models to generate 4D scenes from monocular videos. It can convert input monocular video into multi-view video and reconstruct dynamic 3D scenes. The importance of this technology lies in its ability to extract and reconstruct complete information of three-dimensional space and time from single-view video data, providing powerful technical support for fields such as virtual reality, augmented reality, and three-dimensional modeling. Product background information shows that CAT4D was jointly developed by researchers from Google DeepMind, Columbia University and UC San Diego. It is a case in which cutting-edge scientific research results are transformed into practical applications.
Fashion-VDM is a video diffusion model (VDM) for generating virtual try-on videos. The model takes a clothing image and a video of a person as input, and generates a high-quality try-on video of the person wearing the given garment while preserving the person's identity and movements. Compared with traditional image-based virtual try-on, Fashion-VDM performs well on clothing detail and temporal consistency. Its main advantages include a diffusion architecture, enhanced control via classifier-free guidance, a progressive temporal training strategy for single-pass generation of 64-frame 512px videos, and the effectiveness of joint image-video training. Fashion-VDM sets a new industry standard in video virtual try-on.
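As background, classifier-free guidance — the control mechanism Fashion-VDM cites — combines the model's conditional and unconditional predictions at sampling time, extrapolating toward the conditional one to strengthen adherence to the conditioning signal (here, the garment image). A generic sketch, not Fashion-VDM's code:

```python
def cfg_combine(eps_uncond, eps_cond, scale):
    """Classifier-free guidance for a diffusion denoiser.

    eps_guided = eps_uncond + scale * (eps_cond - eps_uncond)

    scale = 1.0 reproduces the plain conditional prediction;
    scale > 1.0 amplifies the conditioning direction.
    """
    return [u + scale * (c - u) for u, c in zip(eps_uncond, eps_cond)]
```

At training time the model is occasionally shown an empty condition so both predictions come from the same network; at sampling time each denoising step calls the network twice and blends the outputs as above.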
MimicTalk is a personalized 3D talking-face generation technology based on Neural Radiance Fields (NeRF) that can imitate the static appearance and dynamic speaking style of a specific identity within minutes. Its main advantages include high efficiency, high-quality video generation, and precise imitation of the target person's speaking style. MimicTalk builds on a general 3D face generation model and learns personalized static appearance and facial dynamics through a static-dynamic hybrid adaptation process. It also proposes an in-context stylized audio-to-motion (ICS-A2M) model to generate facial movements that match the target person's speaking style. MimicTalk's technical background draws on the latest advances in deep learning and computer vision, especially face synthesis and animation generation. The technology is currently freely available to the research and development community.
LivePortrait is an AI-driven animation production tool, open sourced by Kuaishou Technology, that can quickly transform static photos into realistic dynamic videos. It supports a variety of styles such as photorealistic, animated and artistic portraits, and provides precise motion control such as natural movement of eyes and lips. LivePortrait also features diverse style support, custom animation modes, enhanced image processing capabilities, and a fast creative process.
HiDream.ai is a website focused on image and video creation, using artificial intelligence to provide a variety of functions. Its importance lies in helping users create high-quality image and video content more easily. The product combines rich functionality with simple operation and suits all kinds of users who need to create images and videos. Some features may require payment or a free trial.
Animate Old Photos is a website that uses Kling AI technology to transform old photos into vivid videos. It uses AI technology to rejuvenate old memories and bring users a more vivid and dynamic experience. The product is currently in beta testing and is available for free, but paid plans may be rolled out in the future as premium features are added.
Haiper AI is on a mission to build the best perceptual foundation model for the next generation of content creation. It provides the following main functions: text-to-video, image animation, video repainting, and director-style controls. Haiper AI seamlessly transforms text and static images into dynamic videos, bringing images to life simply by dragging and dropping them. With Haiper AI's repaint tool, you can easily modify the colors, textures, and elements of your video to enhance its visual quality. Advanced control tools let you adjust camera angles, lighting effects, character poses, and object movement like a director. Haiper AI is suitable for various scenarios, such as content creation, design, and marketing. Please refer to the official website for pricing.
BasedLabs.ai is your go-to source for AI videos and tools. We provide powerful AI video tools and an active community that allows you to interact and share your work with other creators. Our tools include video generation and cloning capabilities to help you quickly generate stunning AI video creations.
MagicAnimate is an advanced diffusion-based framework for human image animation. It can generate temporally consistent animated videos from a single image and a motion video, maintain the characteristics of the reference image, and significantly improve animation fidelity. MagicAnimate supports animating images with motion sequences from a variety of sources, including cross-identity animation and unseen domains such as paintings and movie characters. It also integrates seamlessly with T2I diffusion models such as DALL·E 3, giving dynamic motion to images generated from text. MagicAnimate is jointly developed by the National University of Singapore's Show Lab and ByteDance.
Pollinations is a team of data scientists, machine learning experts, artists, and futurists deeply involved in the AI ecosystem. Now, Pollinations is focusing on AI music video creation and will soon launch an exciting real-time immersive AI product called Dreamachine.
VideoMyListing is an AI-assisted video generation tool that helps Airbnb hosts market their listings by automatically generating videos. Users only need to paste the listing link, and VideoMyListing will use AI technology to automatically generate attractive videos that can be used for promotion on social media platforms. The tool provides commercially licensed content and the generated video format is MP4, suitable for social video services such as Instagram, LinkedIn and Snapchat.
LensGo is a free AI-powered image and video creation tool best suited for customized video production. It helps users create personalized AI videos.
8 Arc is a powerful text-to-movie generator. Users can input a movie script or a short story concept, and AI will generate a movie script that meets the requirements. Users can also describe characters and scenes to generate complete movie stories. 8 Arc can help users easily create their own movie scripts, providing creators with inspiration and creative tools.
THE FABLE STUDIO can turn your ideas into captivating stories, harnessing the power of AI. You can transform simple text into engaging stories with style and originality. Our cutting-edge technology transforms your text into a unique video, expressed in a style you choose. You can recast your favorite characters, change the course of the story, or even change the ending of your favorite movie.
Olm is an optical language model-based product that helps users generate brand new videos from scratch in minutes. It creates, reimagines and understands multimedia and generates content that matches user requirements. Olm has the following main functions: 1. Generate brand new video content; 2. Reimagine existing video content; 3. Understand and analyze multimedia. Olm is suitable for various scenarios, including creation, education, entertainment and other fields. Please visit the official website for specific pricing information.
Gencraft is a powerful AI image and video art generation engine that turns your ideas into stunning AI generated art, whether it's photos or videos. You can use keywords to spark your imagination and pair them with styles for more personalized results. Suitable for productivity, design, writing, business and more, Gencraft can help you easily create a creative brand that truly represents you.
Talking Head by Vidnoz is an online tool that allows you to create realistic talking avatars in minutes. It uses artificial intelligence technology to generate avatar videos with mouth movements and voices, which can be used in a variety of scenarios such as sales, marketing, communication and support. Talking Head is free to use, but also offers paid plans to enjoy more advanced features.
Genmokey is a creative tool that uses AI to generate videos from text. It transforms your input text into unique video works that go beyond traditional 2D effects. Whether you want to create personal videos, marketing ads, or other creative projects, Genmokey can help you push the limits of your imagination. It is a comprehensive video generation tool with rich features and customization options, and flexible pricing plans suitable for both personal and business use. Whether you are a designer, marketer, creative professional, or video enthusiast, Genmokey can be your right-hand assistant.
Luma AI is a technology company focusing on AI. Through its innovative technology, users can use their mobile phones to quickly generate the 3D models they need. The company was founded by a team with extensive 3D computer vision experience. Its technology is based on Neural Radiance Fields, which can model 3D scenes based on a small number of 2D images. Dream Machine is an AI model that quickly generates high-quality, photorealistic videos directly from text and images. It is a highly scalable and efficient transformer model specifically trained on video, capable of generating physically accurate, consistent, and event-filled footage. Dream Machine is the first step in building a universal imagination engine, now available to everyone.
Video generation is a popular subcategory under Image, with 43 quality AI tools.