DALL·E 3 in Practice: Drawing the Picture in Your Head with AI

📅 2026-01-20 ⏱ about 7 min read ✍️ AI探索笔记

How DALL·E 3 Differs from Midjourney

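Among AI image tools, Midjourney is known for its strong artistic feel, while DALL·E 3's core strength is how well it understands prompts. Describe a complex scene in natural language and DALL·E 3 will usually reproduce your intent quite accurately, rather than giving you something that is "close in spirit but off in the details".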

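In addition, DALL·E 3's handling of text was a major step forward for AI image generation: it can render clear, readable text inside an image, which was long a pain point in Midjourney.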

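💡 DALL·E 3 is built into ChatGPT Plus and is also available through the OpenAI API. ChatGPT automatically rewrites your prompt before passing it to DALL·E; to keep your original wording, ask it in the conversation not to modify your prompt.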

Basic Usage

Clear scene descriptions

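DALL·E 3 is good at understanding natural language, so you don't need to stack keywords the way you would in Midjourney. Describing the scene in complete sentences works better: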

❌ Keyword-stacking style (suits Midjourney, unnecessary for DALL·E):
cozy cafe, warm light, book, woman, autumn, bokeh

✅ Natural-language description (what DALL·E understands best):
A young woman with short hair reading a novel in a cozy corner cafe,
warm afternoon sunlight streaming through the window,
a half-finished latte on the wooden table,
autumn leaves visible outside the window.
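
As a rough illustration of the sentence-style approach, the sketch below joins a subject and a few supporting details into one flowing description. The helper name `describe_scene` is made up for this example and is not part of any library.

```python
def describe_scene(subject: str, details: list[str]) -> str:
    """Join a subject and its supporting details into one descriptive sentence."""
    # Strip stray punctuation so the pieces join cleanly with commas.
    parts = [subject.strip().rstrip(".,")]
    parts += [d.strip().rstrip(".,") for d in details if d.strip()]
    return ", ".join(parts) + "."


prompt = describe_scene(
    "A young woman with short hair reading a novel in a cozy corner cafe",
    [
        "warm afternoon sunlight streaming through the window",
        "a half-finished latte on the wooden table",
        "autumn leaves visible outside the window",
    ],
)
print(prompt)
```

Keeping the details as separate strings makes it easy to swap one element (the drink, the season) without rewriting the whole prompt.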

Style keywords

DALL·E 3 responds well to art-style terms. Commonly used styles:

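photorealistic               looks like a real photograph
oil painting                 visible brushwork, rich color
watercolor                   soft washes, translucent color
pencil sketch                monochrome graphite line drawing
digital art                  clean, digitally painted look
anime style                  Japanese animation look
vintage illustration         retro, print-era illustration
minimalist                   few elements, lots of empty space
Studio Ghibli style          in the manner of Studio Ghibli films
storybook illustration       children's picture-book look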
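
One way to reuse these keywords is to keep them in a small list and append the chosen one to a scene description. This is only a sketch: `STYLE_KEYWORDS` and `apply_style` are hypothetical names, and the check against the list simply guards against typos.

```python
STYLE_KEYWORDS = (
    "photorealistic",
    "oil painting",
    "watercolor",
    "pencil sketch",
    "digital art",
    "anime style",
    "vintage illustration",
    "minimalist",
    "Studio Ghibli style",
    "storybook illustration",
)


def apply_style(prompt: str, style: str) -> str:
    """Append one of the listed style keywords to a scene description."""
    if style not in STYLE_KEYWORDS:
        raise ValueError(f"unknown style: {style!r}")
    # Drop trailing punctuation so the style reads as part of the same sentence.
    return f"{prompt.rstrip(' .,')}, {style}"


print(apply_style("A koi pond with lotus flowers.", "watercolor"))
```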

Generating text inside images

This is one of DALL·E 3's standout strengths. Just put the text you want rendered inside quotation marks:

A warm wooden sign hanging on a cafe door,
with the text "Welcome, We're Open" written in elegant chalk lettering,
soft morning light, cozy atmosphere

A retro poster for a music festival,
with the headline "Summer Sound Festival 2025" in bold vintage typography,
hand-drawn illustration style, warm color palette
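
The quoting pattern used in the two prompts above can be wrapped in a small helper. This is a minimal sketch; `with_text` is an illustrative name, and rejecting double quotes inside the text is an assumption made here to keep the quoted span unambiguous.

```python
def with_text(scene: str, text: str, lettering: str = "") -> str:
    """Add a quoted piece of text to a scene description."""
    if '"' in text:
        raise ValueError("text must not contain double quotes")
    # Optional description of how the lettering should look.
    suffix = f" in {lettering}" if lettering else ""
    return f'{scene.rstrip(" ,.")}, with the text "{text}"{suffix}'


sign_prompt = with_text(
    "A warm wooden sign hanging on a cafe door",
    "Welcome, We're Open",
    "elegant chalk lettering",
)
print(sign_prompt)
```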

Composition control

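# Aspect ratio (in ChatGPT you can simply ask)
"Please generate a landscape / portrait / square image"

# Camera angle
bird's eye view              looking down from above
worm's eye view              looking up from below
close-up portrait            tight close-up of the subject
wide establishing shot       wide shot of the whole scene
isometric view               isometric projection

# Composition technique
rule of thirds               subject placed on the one-third lines
centered composition         subject centered, symmetric framing
leading lines                lines that guide the eye to the subject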
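
In ChatGPT you can ask for an orientation in words; through the API the same choice is made with the `size` parameter. The sketch below assumes the three sizes that dall-e-3 accepts. `SIZES`, `size_for`, and `framed_prompt` are hypothetical helpers, not part of the OpenAI SDK.

```python
# Orientation names mapped to the size strings dall-e-3 accepts.
SIZES = {
    "square": "1024x1024",
    "landscape": "1792x1024",
    "portrait": "1024x1792",
}


def size_for(orientation: str) -> str:
    """Translate an orientation word into an API size string."""
    key = orientation.strip().lower()
    if key not in SIZES:
        raise ValueError(f"expected one of {sorted(SIZES)}, got {orientation!r}")
    return SIZES[key]


def framed_prompt(scene: str, view: str = "", composition: str = "") -> str:
    """Append optional camera-angle and composition keywords to a scene."""
    extras = [part for part in (view, composition) if part]
    return ", ".join([scene.rstrip(" ,.")] + extras)


print(size_for("landscape"))
print(framed_prompt("A koi pond in morning mist.", "bird's eye view", "rule of thirds"))
```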

Iterating: Talking with DALL·E

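A major advantage of DALL·E 3 inside ChatGPT is conversational iteration. After an image is generated, you can simply say:

"Change the background to the orange-red of a sunset"
"Move the person closer to the center of the frame"
"Add rain to make the mood sadder"
"Keep the composition and change the style to watercolor"
"Make the lighting darker and add a moonlight effect"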

This kind of conversational editing is more intuitive than Midjourney's variation feature, and there are no parameters to memorize.

Calling DALL·E 3 Through the API

from openai import OpenAI

# Reads the API key from the OPENAI_API_KEY environment variable
client = OpenAI()

response = client.images.generate(
    model="dall-e-3",
    prompt="A serene Japanese garden in early morning mist, koi pond with lotus flowers, traditional stone lanterns, soft pink light, photorealistic",
    size="1792x1024",    # options: 1024x1024, 1024x1792, 1792x1024
    quality="hd",        # "standard" or "hd" (more detail)
    n=1,                 # dall-e-3 generates one image per request
    style="natural",     # "natural" (realistic) or "vivid" (saturated, dramatic)
)

# Image URL (valid for 1 hour)
image_url = response.data[0].url
print(image_url)

# The prompt as automatically rewritten by DALL·E
revised_prompt = response.data[0].revised_prompt
print("Revised prompt:", revised_prompt)
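
Because the returned URL expires, you may prefer to receive the image bytes directly and save them yourself. The sketch below assumes `response_format="b64_json"`, which the Images API accepts for dall-e-3, and takes an already constructed `OpenAI` client as an argument. `save_b64_image` and `generate_to_file` are illustrative names, not SDK functions.

```python
import base64
from pathlib import Path


def save_b64_image(b64_data: str, path: str) -> int:
    """Decode a base64 image payload, write it to disk, return bytes written."""
    raw = base64.b64decode(b64_data)
    Path(path).write_bytes(raw)
    return len(raw)


def generate_to_file(client, prompt: str, path: str) -> str:
    """Generate one image, save it locally, and return the revised prompt.

    `client` is assumed to be an `openai.OpenAI` instance.
    """
    response = client.images.generate(
        model="dall-e-3",
        prompt=prompt,
        size="1024x1024",
        response_format="b64_json",  # return image data instead of a URL
    )
    image = response.data[0]
    save_b64_image(image.b64_json, path)
    return image.revised_prompt
```

A call would look like `generate_to_file(OpenAI(), "A serene Japanese garden", "garden.png")`. Passing the client in keeps the function easy to test with a stand-in object.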

Caveats

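No real people: DALL·E 3 refuses to generate images of real public figures
Images carry provenance metadata: images generated through the API embed invisible C2PA metadata that marks them as AI-generated
One image per request: the API does not support batch generation (n must be 1), so multiple images require multiple calls
API pricing: standard quality $0.04 per image at 1024x1024, HD $0.08 per image at the same size; the larger sizes cost more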
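
Since each request returns a single image, producing several means looping, and the cost scales with the number of calls. A minimal sketch, assuming the 1024x1024 prices quoted above (which may change) and an `OpenAI` client passed in by the caller; `PRICE_PER_IMAGE`, `estimate_cost`, and `generate_many` are hypothetical names.

```python
# USD per 1024x1024 image, as quoted in this article; check current pricing.
PRICE_PER_IMAGE = {"standard": 0.04, "hd": 0.08}


def estimate_cost(count: int, quality: str = "standard") -> float:
    """Estimate the total cost in USD of `count` 1024x1024 images."""
    return round(count * PRICE_PER_IMAGE[quality], 2)


def generate_many(client, prompt: str, count: int, quality: str = "standard") -> list:
    """Call the API once per image, because dall-e-3 accepts only n=1.

    `client` is assumed to be an `openai.OpenAI` instance.
    """
    urls = []
    for _ in range(count):
        response = client.images.generate(
            model="dall-e-3",
            prompt=prompt,
            size="1024x1024",
            quality=quality,
            n=1,
        )
        urls.append(response.data[0].url)
    return urls


print(estimate_cost(5))
print(estimate_cost(3, "hd"))
```

Each call may rewrite the prompt differently, so the images from one loop can vary more than Midjourney-style variations of a single result.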

Choosing Between DALL·E 3 and Midjourney

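You need a complex scene description reproduced accurately, or text in the image → DALL·E 3
You want a strong artistic feel and a distinctive style, or need images in bulk → Midjourney
You want to iterate by conversation → DALL·E 3 (built into ChatGPT)
You are on a tight budget and want more free usage → DALL·E 3 (the free ChatGPT tier includes a limited quota)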