Learn AI Video & Image Creation

AI creative tools move fast, and the vocabulary moves with them. This page collects two things in one place: a glossary of the terms you will meet everywhere (text-to-video, seeds, inpainting, lip sync) and direct answers to the questions creators actually ask — what things cost, how long generation takes, what happened to Sora, and what you can legally do with the output.

Every entry leads with a short, self-contained answer, then goes deeper. Where a concept maps to a real tool, we link the VdoBloom tool that does it — the platform bundles 65+ video tools plus image, audio, and design generation, with models including Google VEO 3.1, Runway, Kling, Wan, Seedance, Flux, and Nano Banana.

AI Video

Text-to-Video

Text-to-video is an AI technique that turns a written description into a finished video clip.

Read more Glossary

Image-to-Video

Image-to-video is AI video generation that starts from a still picture instead of (or in addition to) a text prompt.

Read more Glossary

AI Video Generator

An AI video generator is software that creates video clips from text prompts, images, or both, using generative models instead of cameras or manual editing.

Read more Glossary

Video Upscaling

Video upscaling is the process of increasing a video’s resolution — for example from 720p to HD or 4K — using AI that reconstructs detail rather than simply stretching pixels.

Read more Glossary

AI Avatar Video

An AI avatar video features a digital presenter — generated from a photo or built from scratch — that speaks your script with synchronized lip movement.

Read more Glossary

Lip Sync (AI)

AI lip sync is technology that matches a person’s mouth movements in a video to a given audio track, so the subject appears to genuinely speak the words.

Read more Glossary

UGC Ad

A UGC ad (user-generated-content ad) is a paid advertisement styled to look like an authentic customer video — casual framing, direct-to-camera talking, phone-quality aesthetics — rather than a polished studio commercial.

Read more Glossary

Faceless YouTube Channel

A faceless YouTube channel publishes videos without the creator ever appearing on camera — instead using AI-generated visuals, stock footage, voiceover, and text.

Read more Glossary

Motion & Photo Effects

Motion effects (also called photo effects) are one-click AI templates that animate a still photo with a predefined action — a hug, a hair flip, falling rain, a fashion-walk turn, a superhero transformation.

Read more Glossary

Google VEO 3.1

VEO 3.

Read more Glossary

Kling

Kling is an AI video model from Kuaishou known for fluid, realistic human motion and strong image animation.

Read more Glossary

Wan

Wan is Alibaba’s family of AI video generation models, supporting both text-to-video and image-to-video.

Read more Glossary

Seedance

Seedance is ByteDance’s AI video generation model, notable for smooth multi-shot motion, strong instruction following, and stylistic range from realistic to anime-flavored output.

Can AI make a video from one photo?

Yes.

How long does AI video generation take?

Typically a few minutes per clip, though it varies by model, clip length, and resolution.

What happened to Sora?

OpenAI discontinued the Sora app on April 26, 2026, and the Sora API sunsets on September 24, 2026 — after that date the model will no longer be available anywhere.

What resolution can AI video reach?

Most AI video models natively generate between 720p and 1080p, with some supporting higher outputs on premium settings.

Do AI video generators add watermarks?

It varies by platform and plan.

What is the best AI video generator?

There is no single best — it depends on what you make.

What is the difference between text-to-video and image-to-video?

Text-to-video creates a clip entirely from a written description — the model invents the visuals.

How do I make an AI video for TikTok?

Generate in vertical 9:16 from the start, hook the viewer inside the first two seconds, and keep clips short and caption-heavy since much of TikTok plays muted.

AI Images

Logo Animation

Logo animation turns a static logo into a short motion clip — a reveal, spin, particle build, or subtle loop — typically used as a video intro, social sting, or website accent.

Read more Glossary

GIF Generator (AI)

An AI GIF generator creates short looping animations from a text prompt or an uploaded image.

Read more Glossary

AI Image Generator

An AI image generator creates original pictures from text descriptions using models trained on vast image datasets.

Read more Glossary

Image Upscaling

Image upscaling increases a picture’s resolution using AI that reconstructs plausible detail — sharper edges, cleaner textures, restored faces — instead of just enlarging pixels.

Read more Glossary

Inpainting (AI Photo Editing)

Inpainting is AI photo editing that regenerates a selected part of an image while leaving the rest untouched — removing an object, replacing a background, changing an outfit, or fixing a flaw.

Read more Glossary

AI Headshot

An AI headshot is a professional-looking portrait generated from one or more casual photos of you.

Read more Glossary

Virtual Try-On

Virtual try-on is AI technology that shows how clothing would look on a specific person by combining a photo of them with a photo of the garment.

Read more Glossary

Face Swap

Face swap is an AI technique that transfers one person’s face onto another person’s photo while keeping the original pose, outfit, lighting, and background.

Read more Glossary

Manga & Comic Generation

Manga and comic generation is the use of AI image models to create sequential art — multi-panel pages with consistent characters, speech-bubble-ready compositions, and manga or western comic styling.

Read more Glossary

Flux

Flux is a family of image generation models from Black Forest Labs, the team founded by Stable Diffusion’s creators.

Read more Glossary

Nano Banana

Nano Banana is the popular nickname for Google’s Gemini image generation and editing model.

Read more Glossary

Seedream

Seedream is ByteDance’s image generation model, known for high-resolution output, vivid aesthetics, and strong text rendering inside images — useful for posters, social graphics, and designs where words must be legible.

AI Audio

Text to Speech (TTS)

Text to speech (TTS) is technology that converts written text into spoken audio using synthetic voices.

Read more Glossary

AI Voice

An AI voice is a synthetic voice generated by a neural network rather than recorded from a person.

Read more Glossary

Transcription (AI)

AI transcription is the automatic conversion of speech in audio or video into written text.

Can AI clone a voice?

Yes — modern voice AI can reproduce a specific person’s voice from a short audio sample, capturing tone, accent, and speaking style.

Pricing & Credits

How much does AI video generation cost?

Most AI video platforms charge through credit-based subscriptions, usually between $10 and $100+ per month depending on volume and model quality.

Is there a free AI video generator?

Yes — most AI video platforms, VdoBloom included, offer a free tier, but expect limits: a small credit allowance, restricted model access, and sometimes watermarks or lower resolution.

How do credits work on AI platforms?

Credits are a platform currency that gets spent per generation, with costs scaled to the compute behind each task — a premium video model consumes more credits than a quick image generation.

Getting Started

Prompt

A prompt is the text instruction you give an AI model to tell it what to create.

Read more Glossary

Negative Prompt

A negative prompt is a list of things you tell an AI model to avoid — such as "blurry, extra fingers, watermark, text.

Read more Glossary

Seed

A seed is the starting random number an AI model uses for a generation.

Read more Glossary

Aspect Ratio

Aspect ratio is the width-to-height proportion of a video or image.

Can I use AI videos commercially?

Generally yes, but it depends on the platform’s terms of service — usage rights are granted by each platform, not by the technology itself.

Is AI-generated content detectable?

Sometimes, but not reliably in every case.

What makes a good AI video prompt?

A good AI video prompt reads like a shot description: subject, action, setting, lighting, style, and camera movement, in concrete visual language.