AI creative tools move fast, and the vocabulary moves with them. This page collects two things in one place: a glossary of the terms you will meet everywhere (text-to-video, seeds, inpainting, lip sync) and direct answers to the questions creators actually ask — what things cost, how long generation takes, what happened to Sora, and what you can legally do with the output.
Every entry leads with a short, self-contained answer, then goes deeper. Where a concept maps to a real tool, we link the VdoBloom tool that does it — the platform bundles 65+ video tools plus image, audio, and design generation, with models including Google VEO 3.1, Runway, Kling, Wan, Seedance, Flux, and Nano Banana.
Text-to-video is an AI technique that turns a written description into a finished video clip.
Glossary: Image-to-video is AI video generation that starts from a still picture instead of (or in addition to) a text prompt.
Glossary: An AI video generator is software that creates video clips from text prompts, images, or both, using generative models instead of cameras or manual editing.
Glossary: Video upscaling is the process of increasing a video’s resolution (for example, from 720p to HD or 4K) using AI that reconstructs detail rather than simply stretching pixels.
Glossary: An AI avatar video features a digital presenter (generated from a photo or built from scratch) that speaks your script with synchronized lip movement.
Glossary: AI lip sync is technology that matches a person’s mouth movements in a video to a given audio track, so the subject appears to genuinely speak the words.
Glossary: A UGC ad (user-generated-content ad) is a paid advertisement styled to look like an authentic customer video, with casual framing, direct-to-camera talking, and phone-quality aesthetics, rather than a polished studio commercial.
Glossary: A faceless YouTube channel publishes videos without the creator ever appearing on camera, instead using AI-generated visuals, stock footage, voiceover, and text.
Glossary: Motion effects (also called photo effects) are one-click AI templates that animate a still photo with a predefined action: a hug, a hair flip, falling rain, a fashion-walk turn, a superhero transformation.
Glossary: VEO 3.
Glossary: Kling is an AI video model from Kuaishou known for fluid, realistic human motion and strong image animation.
Glossary: Wan is Alibaba’s family of AI video generation models, supporting both text-to-video and image-to-video.
Glossary: Seedance is ByteDance’s AI video generation model, notable for smooth multi-shot motion, strong instruction following, and stylistic range from realistic to anime-flavored output.
Q&A: Yes.
Q&A: Typically a few minutes per clip, though it varies by model, clip length, and resolution.
Q&A: OpenAI discontinued the Sora app on April 26, 2026, and the Sora API sunsets on September 24, 2026.
Q&A: Most AI video models natively generate between 720p and 1080p, with some supporting higher outputs on premium settings.
Q&A: It varies by platform and plan.
Q&A: There is no single best; it depends on what you make.
Q&A: Text-to-video creates a clip entirely from a written description; the model invents the visuals.
Q&A: Generate in vertical 9:16 from the start, hook the viewer inside the first two seconds, and keep clips short and caption-heavy since much of TikTok plays muted.
Logo animation turns a static logo into a short motion clip (a reveal, spin, particle build, or subtle loop), typically used as a video intro, social sting, or website accent.
Glossary: An AI GIF generator creates short looping animations from a text prompt or an uploaded image.
Glossary: An AI image generator creates original pictures from text descriptions using models trained on vast image datasets.
Glossary: Image upscaling increases a picture’s resolution using AI that reconstructs plausible detail (sharper edges, cleaner textures, restored faces) instead of just enlarging pixels.
Glossary: Inpainting is AI photo editing that regenerates a selected part of an image while leaving the rest untouched: removing an object, replacing a background, changing an outfit, or fixing a flaw.
Glossary: An AI headshot is a professional-looking portrait generated from one or more casual photos of you.
Glossary: Virtual try-on is AI technology that shows how clothing would look on a specific person by combining a photo of them with a photo of the garment.
Glossary: Face swap is an AI technique that transfers one person’s face onto another person’s photo while keeping the original pose, outfit, lighting, and background.
Glossary: Manga and comic generation is the use of AI image models to create sequential art: multi-panel pages with consistent characters, speech-bubble-ready compositions, and manga or western comic styling.
Glossary: Flux is a family of image generation models from Black Forest Labs, the team founded by Stable Diffusion’s creators.
Glossary: Nano Banana is the popular nickname for Google’s Gemini image generation and editing model.
Glossary: Seedream is ByteDance’s image generation model, known for high-resolution output, vivid aesthetics, and strong text rendering inside images, which is useful for posters, social graphics, and designs where words must be legible.
Text to speech (TTS) is technology that converts written text into spoken audio using synthetic voices.
Glossary: An AI voice is a synthetic voice generated by a neural network rather than recorded from a person.
Glossary: AI transcription is the automatic conversion of speech in audio or video into written text.
Q&A: Yes. Modern voice AI can reproduce a specific person’s voice from a short audio sample, capturing tone, accent, and speaking style.
Most AI video platforms charge through credit-based subscriptions, usually between $10 and $100+ per month depending on volume and model quality.
Q&A: Yes. Most AI video platforms, VdoBloom included, offer a free tier, but expect limits: a small credit allowance, restricted model access, and sometimes watermarks or lower resolution.
Q&A: Credits are a platform currency that gets spent per generation, with costs scaled to the compute behind each task: a premium video model consumes more credits than a quick image generation.
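To make the credit arithmetic concrete, here is a minimal sketch of how a credit balance might be budgeted. The task names and per-generation costs are hypothetical placeholders chosen for illustration, not the rates of VdoBloom or any other platform.

```python
# Hypothetical per-generation credit costs (illustrative only;
# real platforms publish their own rates).
CREDIT_COSTS = {
    "quick_image": 1,
    "standard_video": 10,
    "premium_video": 40,
}


def credits_needed(jobs: dict[str, int]) -> int:
    """Total credits for a batch given {task_name: number_of_generations}."""
    return sum(CREDIT_COSTS[task] * count for task, count in jobs.items())


def generations_affordable(balance: int, task: str) -> int:
    """How many generations of one task type a balance covers."""
    return balance // CREDIT_COSTS[task]


if __name__ == "__main__":
    batch = {"quick_image": 20, "premium_video": 3}
    print(credits_needed(batch))
    print(generations_affordable(100, "premium_video"))
```

Under these assumed rates, three premium video clips cost far more than twenty quick images, which is the pattern the entry above describes.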
A prompt is the text instruction you give an AI model to tell it what to create.
Glossary: A negative prompt is a list of things you tell an AI model to avoid, such as "blurry, extra fingers, watermark, text."
Glossary: A seed is the starting random number an AI model uses for a generation.
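The practical effect of a seed is repeatability: the same seed produces the same starting randomness. The sketch below uses Python's standard `random` module as a stand-in for a model's noise sampler. The function `sample_noise` is a hypothetical helper, not part of any generation API.

```python
import random


def sample_noise(seed: int, n: int = 4) -> list[float]:
    """Draw n pseudo-random values from a generator initialized with `seed`.

    A stand-in for the starting noise of a generation (illustrative only).
    """
    rng = random.Random(seed)  # independent generator, seeded explicitly
    return [round(rng.random(), 4) for _ in range(n)]


if __name__ == "__main__":
    first = sample_noise(42)
    second = sample_noise(42)
    other = sample_noise(43)
    print(first == second)  # same seed, same values
    print(first == other)   # different seed, different values
```

Real generators add other sources of variation (model version, settings, hardware), so a fixed seed is best treated as a way to reduce variation rather than a guarantee of identical output.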
Glossary: Aspect ratio is the width-to-height proportion of a video or image.
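As a small worked example, the ratio can be derived from pixel dimensions by dividing both by their greatest common divisor, and a missing dimension can be computed from the ratio. The helper names `aspect_ratio` and `height_for` are illustrative assumptions.

```python
from math import gcd


def aspect_ratio(width: int, height: int) -> str:
    """Reduce pixel dimensions to the simplest width:height ratio."""
    d = gcd(width, height)
    return f"{width // d}:{height // d}"


def height_for(width: int, ratio: str) -> int:
    """Height in pixels that matches `width` at a ratio such as '9:16'."""
    w, h = (int(part) for part in ratio.split(":"))
    return round(width * h / w)


if __name__ == "__main__":
    print(aspect_ratio(1920, 1080))   # landscape frame
    print(aspect_ratio(1080, 1920))   # vertical frame
    print(height_for(1080, "9:16"))   # height of a 1080-wide vertical clip
```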
Q&A: Generally yes, but it depends on the platform’s terms of service: usage rights are granted by each platform, not by the technology itself.
Q&A: Sometimes, but not reliably in every case.
Q&A: A good AI video prompt reads like a shot description: subject, action, setting, lighting, style, and camera movement, in concrete visual language.
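One way to keep prompts complete is to fill in the six parts of a shot description separately and then join them. The sketch below assumes a hypothetical `ShotPrompt` structure. The field names mirror the list above, and the joining order is one reasonable choice, not a format any particular model requires.

```python
from dataclasses import dataclass


@dataclass
class ShotPrompt:
    """Hypothetical container for the six parts of a shot description."""
    subject: str
    action: str
    setting: str
    lighting: str
    style: str
    camera: str

    def render(self) -> str:
        """Join the parts into a single prompt sentence."""
        return (
            f"{self.subject} {self.action} {self.setting}, "
            f"{self.lighting}, {self.style}, {self.camera}."
        )


if __name__ == "__main__":
    prompt = ShotPrompt(
        subject="A red fox",
        action="trots",
        setting="through a snowy birch forest",
        lighting="low golden-hour light",
        style="cinematic 35mm look",
        camera="slow tracking shot from the side",
    )
    print(prompt.render())
```

Requiring every field makes a gap, such as a missing camera movement, visible before a generation is spent on it.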