技能 gpt-image-2

📦

gpt-image-2

Name: gpt-image-2
Author: runcomfy-com

低风险 ⚙️ 外部命令🌐 网络访问📁 文件系统访问

在 RunComfy 上使用 GPT Image 2 生成图像

也可从以下获取: doany-ai,agentspace-so

通过 RunComfy 云平台使用 OpenAI GPT Image 2 创建和编辑图像。此技能提供专业的提示词模式、尺寸限制和模型选择指导，帮助您在每次生成时获得最佳效果。

支持: Claude Codex Code(CC)

🥉 72 青铜

下载技能 ZIP

在 Claude 中上传

前往设置 → 功能 → 技能 → 上传技能

开启并开始使用

测试它

正在使用“gpt-image-2”。 Generate a product photo of a ceramic water bottle on warm linen with the text 'AQUA+' on the label

预期结果:

High-resolution product image with accurate label text reading AQUA+, soft window lighting, subtle rim highlights, and neutral e-commerce background

正在使用“gpt-image-2”。 Edit this cafe photo: turn the background into a bright white studio sweep and add the headline 'OPEN STUDIO' in bold sans-serif, centered

预期结果:

Edited image with clean white-to-soft-gray studio background, centered bold OPEN STUDIO headline text, preserved subject identity and composition

正在使用“gpt-image-2”。 Create a Tokyo cafe storefront at dusk with the sign reading the Japanese characters for coffee in kana

预期结果:

Cinematic storefront image with warm interior glow, accurate Japanese text on wooden plaque, shallow depth of field, rule of thirds composition

安全审计

低风险

v1 • 5/28/2026

Static analysis flagged 77 patterns across SKILL.md (58 shell backticks, 10 hardcoded URLs, 5 filesystem paths, 3 weak-crypto indicators, 1 reconnaissance). All findings evaluated as false positives: backticks contain markdown CLI documentation examples, URLs are legitimate RunComfy service endpoints, filesystem references document CLI config paths (~/.config/runcomfy/token.json), and crypto/reconnaissance patterns are misidentified text ("Exit codes" heading, jq pipe examples). No executable code, no prompt injection, no data exfiltration detected. Minor concerns: user prompts transmitted to RunComfy's external API, third-party CLI dependency, local token file storage.

已扫描文件

212

分析行数

发现项

审计总数

低风险问题 (3)

SKILL.md:195

Third-Party Service Data Transmission

User prompts and image URLs are transmitted to RunComfy's model API (model-api.runcomfy.net) for processing. This is the intended function but users should be aware their data leaves the local environment.

SKILL.md:45

Third-Party CLI Dependency

Skill requires installation of @runcomfy/cli via npm. This introduces a supply chain dependency on an external package not controlled by the skill author.

SKILL.md:207

Local Token Storage

RunComfy CLI stores authentication token in ~/.config/runcomfy/token.json. Documented as using mode 0600 permissions which provides adequate local protection.

风险因素

⚙️ 外部命令 (13)

SKILL.md:9-10 SKILL.md:24-26 SKILL.md:45-47 SKILL.md:53-58 SKILL.md:60-68 SKILL.md:74-78 SKILL.md:82-89 SKILL.md:93-100 SKILL.md:102-109 SKILL.md:147-168 SKILL.md:172-185 SKILL.md:194-198 SKILL.md:207-210

🌐 网络访问 (6)

SKILL.md:14 SKILL.md:20 SKILL.md:86 SKILL.md:97 SKILL.md:190 SKILL.md:195

📁 文件系统访问 (2)

SKILL.md:196-198 SKILL.md:207

检测到的模式

Shell Backtick Execution (58 instances — All False Positives)Hardcoded URLs (10 instances — All False Positives)Filesystem Path Patterns (5 instances — All False Positives)

审计者: claude

质量评分

架构

100

可维护性

内容

社区

安全

规范符合性

你能构建什么

电商产品摄影

使用 GPT Image 2 精确的文本渲染功能，生成具有准确标签文本、品牌安全布光和跨产品线一致风格的产品图像。

多语言品牌资产创建

使用编辑端点从单一源资产创建具有多语言准确文本渲染的标识、海报和包装模型。

迭代图像优化

逐步骤编辑图像，每次更改一个属性，同时保留构图、人脸和品牌元素，使其在多次生成中保持一致。

试试这些提示

简单文生图生成

Generate an image of [描述主体] in [描述场景] with [描述氛围或光线] using GPT Image 2 on RunComfy.

带嵌入式文本的产品图像

Create a product photo of [产品] on [表面或背景], the label reads "[精确文本]" in [字体样式], [光线描述], e-commerce ready, neutral background.

保留元素的图像编辑

Edit this image using GPT Image 2: [描述具体更改]. Keep [列出要保留的元素] unchanged. Use [尺寸] for output.

多参考图像合成

Compose a new image using GPT Image 2 edit: subject from image 1 and background from image 2. Match the lighting of image 2. Keep the pose and face identity from image 1 unchanged.

最佳实践

精确引用所有嵌入式文本，确保其在生成的图像中准确呈现
每次编辑迭代只更改一个属性（如光线、背景、姿势或文本），以保持构图稳定性
直接在提示词中使用构图技巧，如三分法、特写、航拍或浅景深

避免

不要组合相互冲突的指令，如要求无文本同时又指定产品标签上的文本
避免在单一提示词中堆叠多种艺术风格（如浮世绘加水彩加电影感），这会导致结果不一致
不要使用极端宽高比，因为仅支持三种固定尺寸，不支持的宽高比会被自动调整大小

常见问题

与其他图像模型相比，GPT Image 2 最擅长什么？

GPT Image 2 在嵌入式文本渲染、标志、多语言排版和精确指令遵循方面表现出色。当准确性比艺术风格化更重要时，它非常理想。

使用此技能是否需要 OpenAI API 密钥？

不需要。此技能使用 RunComfy 的托管模型 API。您只需拥有 RunComfy 账户并在本地安装 RunComfy CLI。

支持哪些图像尺寸？

支持三种固定尺寸：1024x1024（方形）、1024x1536（竖版）和 1536x1024（横版）。编辑端点还支持 auto 以保留输入的宽高比。

编辑端点如何保留原始图像的元素？

在提示词中使用保留声明，如 keep the person's pose and face identity unchanged。当您每次只更改一个属性时，模型会保持构图稳定。

我的提示词数据是否会被发送到外部服务器？

是的。提示词通过 HTTPS 发送到 RunComfy 的模型 API (model-api.runcomfy.net)。CLI 不会对提示词进行 shell 展开。请查看 RunComfy 的隐私政策以了解数据处理的详细信息。

如何取消正在运行的图像生成？

在 CLI 轮询结果时按 Ctrl-C。这会向 API 发送取消请求，这样您就不会为未使用的 GPU 时间付费。

开发者详情

作者

runcomfy-com

许可证

MIT

仓库

https://github.com/runcomfy-com/skills/tree/main/gpt-image-2/

引用

main

文件结构

📄 SKILL.md