Sora 2 提示(Prompting)指南 | OpenAI Cookbook-扣子工作流大全、coze零基础教程、提示词

Sora 2 提示(Prompting)指南 | OpenAI Cookbook

成功的视频提示撰写

在提示之前：

把写 Prompt 想像成你在向一个从未见过你分镜稿的摄影师下达指令。如果你没提供细节，它就会自由发挥 —— 结果可能与你设想的有所差距。通过清晰地表达“镜头”想要达成什么效果，你可以给模型更多的控制与一致性。

但有时故意保留一些开放性也很有价值。给模型更多的创作自由可能带来意料之外的变化与美感。这两种方式都有效：详细 Prompt 给你控制与一致性，简洁 Prompt 则可能开启惊喜式变体。哪种方式好，取决于你的目标和你希望的结果。把 Prompt 看作「创意愿望清单」而不是「合同」。跟 ChatGPT 一样，即便你多次使用相同的 Prompt，也可能得到不一样的结果——这是一个特性，不是 bug。每次生成都是一个新的解读，有时第二、第三种版本反而更合适。

最重要的是：准备好反复试验（iterate）。对摄像机、光线、动作的小调整，可能会极大地改变结果。把自己看作与模型的协作者：你提供方向，模型带来创意变体。

以下的指导并不是精确公式，而是我们在与 Sora 2 模型合作时总结出的一些有用建议。

API 参数

Prompt（文本内容）控制的是视频里的内容，但有些属性只能通过 API 参数来设定，不能在 prose（叙述）里“写出来”：

model：sora-2 或 sora-2-pro
size：格式为 {宽度}x{高度}，支持的分辨率取决于模型：
- sora-2：1280×720、720×1280
- sora-2-pro：1280×720、720×1280，以及 1024×1792、1792×1024
seconds：视频时长，支持 “4”、“8”、“12”，默认是 “4”

这些参数是视频的“容器”——分辨率、时长、画质。它们不会因为你在 prompt 里写“让它更长”之类的话而自动改变。你的 prompt 控制内容（主体、动作、光线、风格等）。

视频分辨率

视频的分辨率直接影响画面细节和运动一致性。更高分辨率能生成更多纹理、光影过渡也更准确；较低分辨率可能带来模糊、压缩痕迹或伪影。

视频时长

模型在较短的片段里通常更容易遵守指令。为获得更佳效果，倾向于用较短镜头。如果你的项目允许，不妨用两个 4 秒片段拼接，而不是直接生成一个 8 秒片段。

有效的提示结构（Prompt anatomy）

一个清晰的 Prompt 会像你在画分镜：说明镜头取景方式，指出景深深浅，描述动作节奏，设定光线与色彩风格。在主体中保留少量具有辨识度的细节以保持一致性，然后让一个合理的动作成为镜头的焦点。

如果你要写多个镜头（shots）在同一个 prompt 里，也可以。但要把每个镜头块 (shot block) 写得清楚：一个摄像机设定、一个主体动作、一个光线方案。这样你可以灵活地把它们单独生成，或者让它们连成一个连续的片段播放。把每个镜头当成一个创作单元，当你剪辑或一次连拍时都方便。

简短的 Prompt 给予模型更多创意空间，可能出现出人意料的结果。
冗长、过度细化的 Prompt 会限制模型的自由，它会尽量按你的指令来但可能执行得不够灵活。

下面是一个短 Prompt 的例子：

In a 90s documentary-style interview, an old Swedish man sits in a study and says, “I still remember when I was young.”

解释：

90s documentary设置视频的风格。模型将相应地选择相机镜头、灯光和色彩等级等变量。
an old Swedish man sits in a study详细描述主题和设置，让模型自由发挥创意，决定人物和设置的外观。
and says, "I still remember when I was young."描述对话。Sora 应该能够完全理解。

这个 prompt 可较可靠地产生符合这些要求的视频。但它可能无法完全符合你的设想，因为有许多细节（时间、天气、服装、镜头角度、布景、光线等）没有被指定。若你不描述，Sora 会自由填充。

更超细致的写法（Going Ultra-Detailed）

要拍复杂或电影感强的镜头，你可以在标准结构之外加入更多：镜头语言、滤镜、色彩分级、音景 (soundscape)，甚至镜头意图（shot rationale）等。就像导演给摄影团队或 VFX 团队下的 brief。这样可以让模型更准确地贴近你想要的美学风格。

你可以描述：观众首先注意到什么、摄像机平台和镜头、光线方向、色彩基调、材质质感、配乐或环境音、镜头节奏等。这个方法在你希望模仿真实电影拍摄风格（如 IMAX 航拍、35mm 手持镜头、16mm 复古纪录片风）或保持镜头间连续性时特别有效。

示例（超详细）：

Format & Look Duration 4s; 180° shutter; digital capture emulating 65 mm photochemical contrast; fine grain; subtle halation on speculars; no gate weave. Lenses & Filtration 32 mm / 50 mm spherical primes; Black Pro-Mist 1/4; slight CPL rotation to manage glass reflections on train windows. Grade / Palette Highlights: clean morning sunlight with amber lift. Mids: balanced neutrals with slight teal cast in shadows. Blacks: soft, neutral with mild lift for haze retention. Lighting & Atmosphere Natural sunlight from camera left, low angle (07:30 AM). Bounce: 4×4 ultrabounce silver from trackside. Negative fill from opposite wall. Practical: sodium platform lights on dim fade. Atmos: gentle mist; train exhaust drift through light beam. Location & Framing Urban commuter platform, dawn. Foreground: yellow safety line, coffee cup on bench. Midground: waiting passengers silhouetted in haze. Background: arriving train braking to a stop. Avoid signage or corporate branding. Wardrobe / Props / Extras Main subject: mid-30s traveler, navy coat, backpack slung on one shoulder, holding phone loosely at side. Extras: commuters in muted tones; one cyclist pushing bike. Props: paper coffee cup, rolling luggage, LED departure board (generic destinations). Sound Diegetic only: faint rail screech, train brakes hiss, distant announcement muffled (-20 LUFS), low ambient hum. Footsteps and paper rustle; no score or added foley. Optimized Shot List (2 shots / 4 s total) 0.00–2.40 — “Arrival Drift” (32 mm, shoulder-mounted slow dolly left) Camera slides past platform signage edge; shallow focus reveals traveler mid-frame looking down tracks. Morning light blooms across lens; train headlights flare softly through mist. Purpose: establish setting and tone, hint anticipation. 2.40–4.00 — “Turn and Pause” (50 mm, slow arc in) Cut to tighter over-shoulder arc as train halts; traveler turns slightly toward camera, catching sunlight rim across cheek and phone screen reflection. Eyes flick up toward something unseen. Purpose: create human focal moment with minimal motion. Camera Notes (Why It Reads) Keep eyeline low and close to lens axis for intimacy. Allow micro flares from train glass as aesthetic texture. Preserve subtle handheld imperfection for realism. Do not break silhouette clarity with overexposed flare; retain skin highlight roll-off. Finishing Fine-grain overlay with mild chroma noise for realism; restrained halation on practicals; warm-cool LUT for morning split tone. Mix: prioritize train and ambient detail over footstep transients. Poster frame: traveler mid-turn, golden rim light, arriving train soft-focus in background haze.

中文翻译：

格式与画面风格（Format & Look） 时长：4 秒；快门角度：180°；数字拍摄，模拟 65 mm 胶片的光化学对比度；细腻颗粒；高光处有轻微晕光（halation）；无画幅漂移（no gate weave）。 镜头与滤镜（Lenses & Filtration） 镜头：32 mm / 50 mm 球面定焦；滤镜：Black Pro-Mist 1/4；偏振镜（CPL）轻微旋转，以控制列车窗玻璃反光。 调色与色调（Grade / Palette） 高光（Highlights）：清晨阳光，略带琥珀色提亮。中间调（Mids）：中性色平衡，阴影带轻微青色倾向。黑位（Blacks）：柔和、中性，略微提升以保留雾气层次。 光线与氛围（Lighting & Atmosphere） 自然光来自镜头左侧，低角度（上午 07:30）。反光板：轨道边 4×4 银面 ultrabounce。对侧设置负补光（Negative fill）。实景光源：站台钠灯，设置为微弱渐暗模式。氛围：轻薄晨雾；列车尾气在光束中缓缓飘散。 取景与构图（Location & Framing） 场景：城市通勤列车站台，黎明时分。前景：黄色安全线、长凳上的咖啡杯。中景：等待的乘客，剪影在雾气中。背景：列车进站、减速停车。避免出现标志或商业品牌。 服装／道具／群众演员（Wardrobe / Props / Extras） 主角：三十多岁的旅人，深蓝外套，单肩背包，右手松握手机。群演：穿着低饱和色衣物的通勤者；一位推着自行车的骑行者。道具：纸质咖啡杯、滚轮行李箱、LED 出发显示屏（泛化目的地）。 声音设计（Sound） 仅使用现场声（Diegetic only）：微弱的铁轨摩擦声、列车制动声、远处模糊的广播（-20 LUFS）、低频环境嗡鸣。包含脚步声与纸张摩擦声；无配乐，无额外拟音。 优化镜头清单（Optimized Shot List – 共 2 个镜头 / 4 秒总长） 0.00 – 2.40 s — “Arrival Drift”（32 mm，肩扛慢速左移） 摄像机从站台标牌边缘滑过，浅景深中逐渐显露旅人居中注视铁轨。晨光透镜而入，列车前灯在雾中柔和闪光。 目的： 建立场景氛围，传递等待与预感。 2.40 – 4.00 s — “Turn and Pause”（50 mm，缓慢弧线推进） 切换至更近的越肩弧形镜头，列车停下时旅人微微转向镜头，阳光勾勒面颊与手机屏反光。眼神微抬，望向未知方向。 目的： 以极少动作制造人性化焦点瞬间。 摄影注释（Camera Notes – Why It Reads） 保持视线高度低且靠近镜头轴线，以增强亲密感。允许列车玻璃产生轻微炫光，作为画面质感。保留轻微手持晃动，增强真实感。避免因过曝炫光破坏剪影轮廓；保留皮肤高光的柔和衰减（roll-off）。 成片处理（Finishing） 添加细微胶片颗粒与轻度色度噪点以增强真实感。对实景灯光保留克制的晕光（halation）。使用暖-冷对比 LUT 营造晨间分色调。混音：突出列车与环境声，弱化脚步瞬态。海报帧（Poster Frame）：旅人半转身，金色轮廓光勾边，背景雾中列车虚焦。

引导视觉风格的元素（Visual cues that steer the look）

在 Prompt 里写风格是最强有力的杠杆之一 —— 比如 “1970s 电影质感”、“史诗级 IMAX 场景”或 “16mm 黑白胶片”——它为其他选择设定基调。先设定整体美学，让模型贯通整体风格。

同样一句话句，如果配上“好莱坞剧情”与“手持手机片段”语境，会得到天差地别的效果。一旦语气调好，再在其上层叠细节（镜头、动作、光线、色彩）。

清晰优于模糊。与其写 “一条漂亮的街道夜景”，不如写 “湿润的柏油路面、斑马线、霓虹灯在水洼中倒影”；与其写 “快速移动”，不如写 “骑士用三脚踩踏，刹车停在斑马线上”。具体、指向可见结果的动词和名词总比泛泛而谈更能得到一致输出。

下面是弱 prompt 与强 prompt 对比：

弱提示	强提示
“夜晚的街道很美” “A beautiful street at night”	“湿漉漉的沥青路面、斑马线、水坑里倒映的霓虹灯”“Wet asphalt, zebra crosswalk, neon signs reflecting in puddles”
“人动作很快”“Person moves quickly”	“骑车人踩了三次踏板，刹车，然后停在人行横道上”“Cyclist pedals three times, brakes, and stops at crosswalk”
“电影般的观感”“Cinematic look”	“变形 2.0x 镜头，浅景深，体积光” “Anamorphic 2.0x lens, shallow DOF, volumetric light”

镜头取向与构图也决定一镜头的感受：俯拍宽镜头强调空间、环境；贴脸特写成就情绪表达。景深（Depth of Field）也起作用：浅景深让主体突出、背景虚化；深景深则让前后都清晰。光线设定语气：柔和暖光给人温暖感，硬光加冷调则更具张力。

引入角色时，模型可能在身份、姿态、焦点等方面出现不可控变化 —— 小措辞差异可能导致结果有区别。为保持镜头连贯性，应注意措辞统一，避免混用相互竞争的特征描述。

错误示例（Weak）

Camera shot: cinematic look

问题：过于模糊，“cinematic” 只是氛围词，不包含具体的画面信息。

优秀示例（Strong）

Camera shot: wide shot, low angle Depth of field: shallow (sharp on subject, blurred background) Lighting + palette: warm backlight with soft rim

说明：明确了镜头角度、景深与光线配色，模型可据此保持视觉一致性。

✅ 良好的构图（Framing）指令示例：

wide establishing shot, eye level → 宽幅开场镜头，视角与人物视线齐平
wide shot, tracking left to right with the charge → 宽景镜头，随奔跑方向从左向右平移拍摄
aerial wide shot, slight downward angle → 航拍广角镜头，略带俯视角度
medium close-up shot, slight angle from behind → 中近景镜头，从人物背后略倾角度拍摄

🎥 良好的摄像机运动（Camera Motion）指令示例：

slowly tilting camera → 摄像机缓慢仰俯移动（上仰或下俯）
handheld ENG camera → 手持新闻纪录片风格拍摄，带自然抖动感

控制运动和节奏（Control motion and timing）

运动 (movement) 通常是最难控制的部分，所以保持简单。每个镜头里最好有一个清晰的摄像机运动和一个清晰的主体动作。把动作按“拍子”或“节拍”来写 —— 小步、手势、暂停 —— 让节奏更可控。

比如 “演员在房间里走” 就太模糊了；“演员走四步到窗户，停顿，然后在最后一秒拉开窗帘”就很具体、节奏明确。

弱（Weak）：

Actor walks across the room. → 演员走过房间。（描述过于笼统，缺乏节奏与画面感）

强（Strong）：

Actor takes four steps to the window, pauses, and pulls the curtain in the final second. → 演员走向窗户，迈出四步，停顿片刻，在最后一秒拉开窗帘。（动作有节奏、时间明确，能精准控制镜头节奏与情绪）

光线与色彩一致性（Lighting and Color Consistency）

光线对情绪的塑造与动作或场景同样重要。柔和、漫射的光线能营造平静中性的氛围；而单一、强烈的光源则会制造鲜明的对比与紧张感。

当你需要将多个片段剪辑在一起时，保持光线逻辑一致是让画面衔接自然的关键。

你应同时描述光线的质感与色彩锚点（color anchors）。不要只写笼统的提示，例如 “brightly lit room（明亮的房间）”，而要明确光源的类型与色调混合，比如：

“soft window light with a warm lamp fill and a cool edge from the hallway”

（窗边柔光 + 暖色台灯补光 + 走廊投来的冷边光）

此外，列出三到五种主色有助于在不同镜头间维持稳定的色调。

弱（Weak）示例：

Lighting + palette: brightly lit room 光线与色调：明亮的房间

强（Strong）示例：

Lighting + palette: soft window light with warm lamp fill, cool rim from hallway 光线与色调：窗边柔光，暖色台灯补光，走廊冷边光 Palette anchors: amber, cream, walnut brown 色彩锚点：琥珀色、奶油色、胡桃木棕

✅ 说明：明确了光线来源与色彩基调，画面更具层次与连贯性。

使用图像输入以增强控制（Use Image Input for More Control）

如果你希望对镜头的构图与风格进行更精细的控制，可以使用图像输入（image input）作为视觉参考。

你可以上传照片、数字艺术作品或 AI 生成的图片。

这样可以固定关键视觉元素，例如角色设计、服装造型、布景细节或整体美术风格。

模型会把这张图像当作视频的第一帧锚点（anchor），

然后根据你的文字提示（prompt）生成后续的动作与场景变化。

📘 使用方法（How to Use It）

在你的 POST /videos 请求中，将图像文件作为 input_reference 参数上传。

注意事项：

图像的分辨率必须与目标视频一致（size 参数需匹配）。
支持的文件格式包括：: image/jpeg, image/png, and image/webp.

Input image generated with OpenAI GPT Image 由 OpenAI GPT Image 生成的输入图像	Generated video using Sora 2 (converted to GIF) 使用 Sora 2 生成的视频（已转换为 GIF）
Download this image	Prompt: “She turns around and smiles, then slowly walks out of the frame.” 提示词：她转过身，微笑着，慢慢走出画面。
Download this image	Prompt: “The fridge door opens. A cute, chubby purple monster comes out of it.” 提示词：冰箱门缓缓打开，一个可爱、圆滚滚的紫色小怪兽从中走了出来。

💡 实验技巧（Experimentation Tip）

如果你暂时没有合适的视觉参考图像，可以使用 OpenAI 的图像生成模型来快速创建。

你可以迅速生成环境场景、空间布局或人物造型设计，然后将这些图像作为参考输入到 Sora 中。

这是一种极高效的创作方式：既能测试画面美学风格，又能为视频制作提供一个视觉上的理想起点。

对话与音频（Dialogue and Audio）

对话必须在你的 Prompt 中直接写出。建议将对话单独放在正文描述之下的独立区块，这样模型就能清楚地区分视觉描述与台词内容。

保持台词简洁、自然，并尽量控制在几句话之内，这样视频时长与对白节奏才能匹配。

对于多角色场景，要一致地标注角色名称，并让发言交替出现，这样模型才能正确地将每句台词与角色的表情、肢体动作对应起来。

💡 节奏与时长建议：

4 秒镜头：适合 1–2 句简短对白
8 秒镜头：可容纳稍多的句子
冗长、复杂的演讲通常难以对齐口型与节奏，可能会破坏视频的流畅性与观感。

若镜头是静默的，你依然可以用一个微弱的声音提示节奏，

例如： “远处传来车流声（distant traffic hiss）” “清脆的啪声（a crisp snap）”

把它当作节奏线索（rhythm cue），而非完整的配乐。

💬 含对白的 Prompt 示例（Example Prompt with Dialogue）

A cramped, windowless room with walls the color of old ash. A single bare bulb dangles from the ceiling, its light pooling onto the scarred metal table at the center. Two chairs face each other across it. On one side sits the Detective, trench coat draped across the back of his chair, eyes sharp and unblinking. Across from him, the Suspect slouches, cigarette smoke curling lazily toward the ceiling. The silence presses in, broken only by the faint hum of the overhead light. Dialogue: – Detective: “You’re lying. I can hear it in your silence.” – Suspect: “Or maybe I’m just tired of talking.” – Detective: “Either way, you’ll talk before the night’s over.”

中文翻译：

一个狭小、没有窗户的房间，墙壁呈旧灰色。天花板上吊着一盏裸露的灯泡，光线洒在中央那张布满划痕的金属桌上。桌子的两侧摆着两把椅子。一侧是侦探，风衣搭在椅背上，眼神锐利而冷静；另一侧是嫌疑人，懒散地靠着椅背，烟雾缓缓升起。空气里弥漫着压抑的沉默，只能听到灯泡发出的微弱嗡嗡声。 Dialogue对白： – 侦探：“你在撒谎，我能从你的沉默里听出来。” – 嫌疑人：“也许我只是，不想再说话了。” – 侦探：“无论如何，今晚你都会开口的。”

使用 Remix 功能进行迭代（Iterate with the remix functionality）

Remix 是用来“微调（nudging）”，而不是“赌运气（gambling）”的。

它的作用是让你在已有结果的基础上做受控修改 —— 一次只改动一个变量，并明确说明你要改什么。

例如：

“same shot, switch to 85 mm”

（同一镜头，改用 85 mm 镜头）

“same lighting, new palette: teal, sand, rust”

（同样的光线，新的色调方案：青色、沙色、铁锈色）

当你得到一个接近理想的结果时，可以将它固定为参考（pin），

之后在提示中只描述“想要微调的部分”。

这样，已经稳定、效果良好的部分就不会被破坏。

如果一个镜头总是生成得不理想（misfiring），

那就先还原到最基础版本：

固定摄像机位置（freeze the camera）
简化动作（simplify the action）
清空背景（clear the background）

等到这一基础版本稳定、可用后，

再逐步叠加复杂元素（layer additional complexity step by step），

让画面在稳定基础上不断完善。

Original Video	Remix Generated Video
Original Video 原始视频	Prompt: “Change the color of the monster to orange” “把怪物的颜色改为橙色”
Original Video	Prompt: “A second monster comes out right after” “紧接着，又出现第二个怪物”

Remix 的最佳用法是逐步叠加变化：一次只改一个要素（颜色 / 动作 / 灯光 / 构图），

以确保整体逻辑和视觉风格不被破坏。

提示模板与示例（Prompt Templates and Examples）

🧩 提示结构（Prompt Structure）

一种有效的写提示方法，是将你希望模型利用的不同类型信息分层描述。这不是一套固定公式，但能为你提供清晰框架，让结果更一致。并非每个细节都要写——如果某个部分对镜头结果无关紧要，你可以留空。

实际上，保留一定开放空间能激发模型的创造力：你越少去约束每个视觉细节，模型就越有机会以出人意料、却常常惊艳的方式进行诠释。详细的提示会带来更一致、可控的结果，而简洁的提示则可能解锁更多灵动、多样且富有想象力的变化。

🎬 描述型提示模板（Descriptive Prompt Template）

[Prose scene description in plain language. Describe characters, costumes, scenery, weather and other details. Be as descriptive to generate a video that matches your vision.] Cinematography: Camera shot: [framing and angle, e.g. wide establishing shot, eye level] Mood: [overall tone, e.g. cinematic and tense, playful and suspenseful, luxurious anticipation] Actions: – [Action 1: a clear, specific beat or gesture] – [Action 2: another distinct beat within the clip] – [Action 3: another action or dialogue line] Dialogue: [If the shot has dialogue, add short natural lines here or as part of the actions list. Keep them brief so they match the clip length.] [用自然语言描写场景。说明人物、服装、环境、天气等细节。尽量具体，以生成符合你愿景的视频。] Cinematography（摄影）: Camera shot: [镜头类型与角度，如 wide establishing shot, eye level（宽幅建立镜头，平视角度）] Mood: [整体氛围，如 cinematic and tense（电影感且紧张）、playful and suspenseful（轻快而悬疑）、luxurious anticipation（奢华的期待）] Actions（动作）: – [动作1：清晰、具体的节奏或手势] – [动作2：镜头内另一个明确的节拍] – [动作3：额外动作或台词] Dialogue（对白）: [如有对白，在此写简短自然的句子，或放在动作列表中。对白应简洁，以匹配片段长度。]

🎨 提示示例（Prompt Examples）

示例一（Example 1）

Style: Hand-painted 2D/3D hybrid animation with soft brush textures, warm tungsten lighting, and a tactile, stop-motion feel. The aesthetic evokes mid-2000s storybook animation — cozy, imperfect, full of mechanical charm. Subtle watercolor wash and painterly textures; warm–cool balance in grade; filmic motion blur for animated realism. Inside a cluttered workshop, shelves overflow with gears, bolts, and yellowing blueprints. At the center, a small round robot sits on a wooden bench, its dented body patched with mismatched plates and old paint layers. Its large glowing eyes flicker pale blue as it fiddles nervously with a humming light bulb. The air hums with quiet mechanical whirs, rain patters on the window, and the clock ticks steadily in the background. Cinematography: Camera: medium close-up, slow push-in with gentle parallax from hanging tools Lens: 35 mm virtual lens; shallow depth of field to soften background clutter Lighting: warm key from overhead practical; cool spill from window for contrast Mood: gentle, whimsical, a touch of suspense Actions: – The robot taps the bulb; sparks crackle. – It flinches, dropping the bulb, eyes widening. – The bulb tumbles in slow motion; it catches it just in time. – A puff of steam escapes its chest — relief and pride. – Robot says quietly: “Almost lost it… but I got it!” Background Sound: Rain, ticking clock, soft mechanical hum, faint bulb sizzle.

中文翻译：

Style: 手绘 2D/3D 混合动画，使用柔和笔触质感与温暖钨丝灯光，带有可触感的定格动画风格。整体美学呼应 2000 年代中期的童话绘本动画——温馨、不完美，却充满机械质感的魅力。细腻的水彩晕染与绘画质感；色调平衡在暖与冷之间；带轻微运动模糊以营造动画的真实感。场景：一个杂乱的工坊里，架子上堆满齿轮、螺栓和泛黄的蓝图。中央，一只圆形小机器人坐在木制长凳上，它的身体上布满不同金属片的补丁和旧漆斑。它那双发着淡蓝光的眼睛一闪一闪，正紧张地摆弄着一只嗡嗡作响的灯泡。空气中充满微弱的机械声，窗外下着雨，钟表在背景中稳定地滴答作响。 Cinematography: Camera: 中近景镜头，缓慢推近，画面中工具的轻微视差制造空间感 Lens: 虚拟 35mm 镜头；浅景深以柔化背景杂乱 Lighting: 头顶实景灯的暖色主光；窗外洒入冷光作对比 Mood: 温柔、奇趣，带一丝悬念 Actions: – 小机器人轻轻敲击灯泡，火花闪烁。 – 它吓了一跳，手一松，灯泡掉落，眼睛骤然睁大。 – 灯泡慢动作下坠，它及时接住。 – 胸口喷出一缕蒸汽——是释然，也是小小的骄傲。 – 机器人轻声说：“差点丢了……不过我接住了！” Background Sound: 雨声、钟表滴答声、轻柔的机械嗡鸣与细微电流声。

示例二（Example 2）

Style: 1970s romantic drama, shot on 35 mm film with natural flares, soft focus, and warm halation. Slight gate weave and handheld micro-shake evoke vintage intimacy. Warm Kodak-inspired grade; light halation on bulbs; film grain and soft vignette for period authenticity. At golden hour, a brick tenement rooftop transforms into a small stage. Laundry lines strung with white sheets sway in the wind, catching the last rays of sunlight. Strings of mismatched fairy bulbs hum faintly overhead. A young woman in a flowing red silk dress dances barefoot, curls glowing in the fading light. Her partner — sleeves rolled, suspenders loose — claps along, his smile wide and unguarded. Below, the city hums with car horns, subway tremors, and distant laughter. Cinematography: Camera: medium-wide shot, slow dolly-in from eye level Lens: 40 mm spherical; shallow focus to isolate the couple from skyline Lighting: golden natural key with tungsten bounce; edge from fairy bulbs Mood: nostalgic, tender, cinematic Actions: – She spins; her dress flares, catching sunlight. – Woman (laughing): “See? Even the city dances with us tonight.” – He steps in, catches her hand, and dips her into shadow. – Man (smiling): “Only because you lead.” – Sheets drift across frame, briefly veiling the skyline before parting again. Background Sound: Natural ambience only: faint wind, fabric flutter, street noise, muffled music. No added score.

中文翻译：

Style: 1970 年代浪漫剧情片风格，用 35mm 胶片拍摄，伴随自然炫光、柔焦与温暖晕光。轻微的画面漂移与手持抖动营造复古亲密感。采用 Kodak 风格的暖色调分级；灯泡处有轻柔晕影；颗粒感与暗角增强年代质感。场景：黄昏时分，一栋砖砌公寓的屋顶变成了小舞台。晾衣绳上挂着白色床单，随风摇曳，反射着最后的夕阳。几串不规则的彩灯在头顶轻轻嗡鸣。一位身穿红色丝绸长裙的年轻女子赤脚起舞，卷发在余晖中闪烁。她的舞伴——卷起袖子、吊带松垮——笑着拍手相和，笑容真挚无防备。下方的城市充满汽车喇叭声、地铁震动与远处的笑声。 Cinematography: Camera: 中广景镜头，从平视缓慢推近（dolly-in） Lens: 40mm 球面镜头；浅景深突出情侣，模糊城市天际线 Lighting: 金色自然主光 + 钨丝灯反光；彩灯提供柔边光 Mood: 怀旧、温柔、电影感 Actions: – 她旋转，裙摆扬起，捕捉到阳光。 – 女子（笑着）：“看吧？连城市都跟着我们跳舞。” – 他走上前，握住她的手，将她轻轻带入阴影。 – 男子（微笑）：“那是因为你在引领。” – 床单掠过镜头，短暂遮蔽城市天际线，又随风分开。 Background Sound: 仅保留自然环境音：轻风、布料飘动、街道声、遥远的音乐，无额外配乐。

✅ 总结：

结构清晰： 用模板划分“视觉描述、摄影参数、情绪、动作、对白”。
细节具体： 让模型能抓住关键视觉元素（光线、镜头、动作节奏）。
开放度可调： 想稳定就写细，想出彩就留白。
专业表达： 模板既适合 AI 视频生成，也能直接用于分镜脚本或创意简报。

声明：本站所有文章，如无特殊说明或标注，均为本站原创发布。任何个人或组织，在未征得本站同意时，禁止复制、盗用、采集、发布本站内容到任何网站、书籍等各类媒体平台。如若本站内容侵犯了原著者的合法权益，可联系我们进行处理。

Sora 2 提示(Prompting)指南 | OpenAI Cookbook

成功的视频提示撰写

在提示之前：

API 参数

视频分辨率

视频时长

有效的提示结构（Prompt anatomy）

示例（超详细）：

引导视觉风格的元素（Visual cues that steer the look）

控制运动和节奏（Control motion and timing）

光线与色彩一致性（Lighting and Color Consistency）

使用图像输入以增强控制（Use Image Input for More Control）

📘 使用方法（How to Use It）

对话与音频（Dialogue and Audio）

💬 含对白的 Prompt 示例（Example Prompt with Dialogue）

使用 Remix 功能进行迭代（Iterate with the remix functionality）

提示模板与示例（Prompt Templates and Examples）

🧩 提示结构（Prompt Structure）

🎨 提示示例（Prompt Examples）

评论(0)

提示：请文明发言取消回复

文章展示

【Coze工作流】每天听懂一首歌工作流搭建指南

【Coze工作流】王阳明心学视频工作流搭建教程

【Coze工作流教程】每日学中药工作流搭建保姆级教程

【Coze工作流教程】名人图片生成工作流搭建指南

【Coze工作流教程】国学解签短视频制作工作流搭建保姆级教程

彩色心理火柴人工作流搭建教程

Sora 2 提示(Prompting)指南 | OpenAI Cookbook

成功的视频提示撰写

在提示之前：

API 参数

视频分辨率

视频时长

有效的提示结构（Prompt anatomy）

示例（超详细）：

引导视觉风格的元素（Visual cues that steer the look）

控制运动和节奏（Control motion and timing）

光线与色彩一致性（Lighting and Color Consistency）

使用图像输入以增强控制（Use Image Input for More Control）

📘 使用方法（How to Use It）

对话与音频（Dialogue and Audio）

💬 含对白的 Prompt 示例（Example Prompt with Dialogue）

使用 Remix 功能进行迭代（Iterate with the remix functionality）

提示模板与示例（Prompt Templates and Examples）

🧩 提示结构（Prompt Structure）

🎨 提示示例（Prompt Examples）

评论(0)

提示：请文明发言 取消回复

相关文章

文章展示

提示：请文明发言取消回复