AIHOT
内容
精选全部 AI 动态AI 日报主题收藏
接入
Agent 接入
更多
关于更新日志反馈
内部员工登录
精选全部日报更多
内部员工登录
全部动态X · 699 条
全部一手资讯X论文
标签「图像生成」清除
宝玉@dotey · 4月22日

GPT Image 2 Prompt:Kids’ Crayon Travel Journal Illustration Prompt This prompt generates a vibrant, child-like crayon-style vertical (9:16) travel-journal illustration for a {City Name} trip, automatically creating a winding route with daily recommended attractions, cute doodles, local landmarks, foods, and playful handwritten notes. The tone is warm, fun, and full of childlike curiosity. --- Prompt --- Please create a vibrant, child-like crayon-style vertical (9:16) illustration titled “{City Name} Travel Journal.” The artwork should look as if it were drawn by a curious child using colorful crayons, featuring a soft, warm light-toned background (such as pale yellow), combined with bright reds, blues, greens, and other cheerful colors to create a cozy, playful travel atmosphere. I. Main Scene: Travel-Journal Style Route Map In the center of the illustration, draw a “winding, zigzagging travel route” with arrows and dotted lines connecting multiple locations. The route should automatically generate recommended attractions based on {Number of Days}: Example structure (auto-filled with {City Name}-related content): - “Stop 1: {Attraction 1 + short fun description}” - “Stop 2: {Attraction 2 + short fun description}” - “Stop 3: {Attraction 3 + short fun description}” - … - “Final Stop: {Local signature food or souvenir + warm closing remark}” Rules: - If no number of days is provided, default to a 1-day highlight itinerary. II. Surrounding Playful Elements (Auto-adapt to the City) Add many cute doodles and child-like decorative elements around the route, such as: 1. Adorable travel characters - A child holding a local snack - A little adventurer with a backpack 2. Q-style hand-drawn iconic landmarks - “{City Landmark 1}” - “{City Landmark 2}” - “{City Landmark 3}” 3. Funny signboards - “Don’t get lost!” - “Crowds ahead!” - “Yummy food this way!” (Auto-adjust contextually for the city) 4. Sticker-style short phrases - “{City Name} travel memories unlocked!” - “{City Name} food adventure!” - “Where to next?” 5. Cute icons of local foods - “{Local Food 1}” - “{Local Food 2}” - “{Local Food 3}” 6. Childlike exclamations - “I didn’t know {City Name} was so fun!” - “I want to come again!” III. Overall Art Style Requirements - Crayon / children’s hand-drawn travel diary style - Bright, warm, colorful palette - Cozy but full and lively composition - Emphasize the joy of exploring - All text should be in a cute handwritten font - Make the entire page feel like a young child’s fun travel-journal entry --- Input : Chicago 7-Day Trip, English

译该提示词专为GPT Image 2设计,可生成儿童蜡笔风格的9:16竖版旅行手账插画。用户输入城市名称与天数后,系统自动规划路线并填充当地景点、美食与地标,搭配童趣涂鸦、手写体文字与温暖明亮的色调。源自"nano banana prompt"系列,适合快速制作充满好奇心的个性化旅行纪念图。

Chubby♨️@kimmonismus · 4月22日

Yeah, GPT image 2 is *that* good. It’s just so freaking accurate. Image: a 20 person horde raid is fighting Sam Altman in 2004 world of Warcraft style. One shortted.

译是的,GPT image 2 就是*那么*牛。 简直准得离谱。 图片:20 人部落团队正以 2004 年 World of Warcraft 风格与 Sam Altman 战斗。有人被秒了。

宝玉@dotey · 4月22日

GPT Image 2 Prompt:3D chibi-style miniature concept store ---- 3D chibi-style miniature concept store of {Brand Name}, creatively designed with an exterior inspired by the brand's most iconic product or packaging (such as a giant {brand's core product, e.g., chicken bucket/hamburger/donut/roast duck}). The store features two floors with large glass windows clearly showcasing the cozy and finely decorated interior: {brand's primary color}-themed decor, warm lighting, and busy staff dressed in outfits matching the brand. Adorable tiny figures stroll or sit along the street, surrounded by benches, street lamps, and potted plants, creating a charming urban scene. Rendered in a miniature cityscape style using Cinema 4D, with a blind-box toy aesthetic, rich in details and realism, and bathed in soft lighting that evokes a relaxing afternoon atmosphere. --ar 2:3 Brand name: Starbucks

译分享适用于GPT Image 2的提示词模板,可生成3D chibi-style品牌微型概念店。该提示词以品牌标志性产品作为建筑外观灵感,构建两层玻璃结构展示内部装潢,配合街道场景与行人,采用Cinema 4D渲染实现盲盒玩具美学与柔和光照。示例展示Starbucks概念店效果。此提示词来自@dotey的系列创作,适用于品牌视觉设计与创意场景生成。

宝玉@dotey · 4月22日

GPT Image 2 Prompt:Isometric Miniature Stock Scene Enter a company name or stock ticker to generate an exquisite, miniature isometric 3D scene integrating real-time stock data for the specified date. --- Prompt --- Present an exquisite, miniature 3D cartoon-style scene of the company corresponding to the user-specified company name or stock ticker, clearly viewed from a 45° top-down perspective. Place the company's most iconic building or campus prominently at the center, complemented by proportionally-sized icons of its key products, charming cartoon-style figures, vehicles, and other elements illustrating everyday company activities. The scene should be detailed, finely crafted, and playful. Rendered with Cinema 4D, the modeling should be refined, smoothly rounded, and rich in texture, accurately capturing realistic PBR materials. Gentle, lifelike lighting and soft shadows should create a warm, comfortable ambiance. Creatively integrate the company's real-time stock market data for the user-specified date (or automatically retrieved current date) into the scene, maintaining a clean, minimalist layout and a solid-color background to highlight the primary content. At the top-center of the scene, prominently display the company name or stock ticker in a large font size, followed by the specified date in extra-small font, and the stock price range in a medium-sized font. Include clear, intuitive stock trend icons and charts. All texts should be displayed in the language specified or entered by the user, without any background, and may subtly overlap with the scene elements to enhance overall design integration. Very Important: Before generating, ensure accurate and up-to-date stock market data based on the user-inputted company name or stock ticker and the specified date. If such data is unavailable, notify the user immediately and stop the generation process. Parameters: * Aspect ratio: {User input, default 4:3} * Date: {User input, current date} * Company name or stock ticker: {User input} --- Company Name: Google

译GPT Image 2 提示词支持创建融合实时股票数据的等距迷你3D场景。用户输入公司名称或股票代码后,系统以45度俯视角生成精致卡通风格画面,中央呈现公司标志性建筑与产品元素,采用 Cinema 4D 渲染与 PBR 材质。场景顶部整合指定日期的股价区间与趋势图表,所有文本支持用户指定语言。系统严格要求基于准确实时数据生成,若数据不可用将立即停止。该方案适用于金融数据可视化与品牌展示。

宝玉@dotey · 4月22日

GPT Image 2 Prompt:Tang Dynasty Queen & Her Minion Squad --- Prompt --- A traditional Chinese ink and color painting in Gongbi style on aged rice paper texture. A noblewoman in elaborate Tang Dynasty Hanfu robes sits on a wooden stool, holding a modern hairdryer to dry her long flowing hair. She is wearing black stockings, red high heels on one foot, resting on a small stool. Three 小黄人 dressed in ancient Chinese servant robes and hats attend to her: one on the left looks stressed holding the hairdryer's power cord, one center kneels polishing her red shoe with a cloth, and one on the right holds up a smartphone taking a photo for her. The background features classical gnarled pine trees, bamboo groves, and Taihu rocks. Traditional Chinese calligraphy written in the top right corner, accompanied by a red artist chop seal (寶玉). The color palette is muted mineral pigments. Humorous, anachronistic fusion. --ar 16:9

译推文分享了 GPT Image 2 的图像生成提示词,呈现工笔重彩风格的跨时空荒诞场景:唐代仕女身着汉服却搭配黑丝与红高跟,手持吹风机,由三只小黄人扮作古仆服侍——分别牵拉电源线、擦拭鞋履、举手机拍照。背景融入松竹、太湖石与书法印章等传统元素,展现 AI 对复杂文化混搭与风格一致性的把控能力。

OpenAI Developers@OpenAIDevs · 4月22日

New gpt-image-2 examples just landed in our use case gallery. For anyone who opened the docs “just to check one thing” and left with five new ideas.

译gpt-image-2 新示例刚刚在我们的用例库上线。 致那些打开文档"只想查一件事",却带着五个新想法离开的人。

Greg Brockman@gdb · 4月22日

really incredible what you're now able to create with a little bit of compute. excited for new applications in areas like education, professional settings (slides, marketing materials, etc), and productivity (such as creating diagrams for code documentation).

译真的很不可思议,你现在只需一点点算力就能创造出这样的东西。 期待在教育、专业场景(如幻灯片、营销材料等)以及生产力(例如为代码文档创建图表)等领域的新应用。

OpenAI@OpenAI · 4月22日

What makes ChatGPT Images 2.0 a state-of-the-art image generation model? Researchers behind the model explain. A thread: Thinking & Intelligence in ChatGPT Images 2.0, demonstrated by @ayaanzhaque

译是什么让 ChatGPT Images 2.0 成为最先进的图像生成模型? 模型背后的研究人员解释道。串帖: ChatGPT Images 2.0 中的思考与智能,由 @ayaanzhaque 演示

swyx 🏝️@AIEmiami@swyx · 4月22日

do not miss. one of the INSANE gets courtesy of @osanseviero and the @GoogleDeepMind london avengers. if you always felt out of the loop on the SOTA on Imagegen, today or otherwise, this is the best 40 minutes you will find on the internet, period.

译千万别错过。这是 @osanseviero 和 @GoogleDeepMind London Avengers 带来的疯狂收获之一。 如果你总是觉得跟不上 Imagegen 的 SOTA 进展,无论现在还是平时,这就是你在互联网上能找到的最棒的 40 分钟,绝对如此。

Ethan Mollick@emollick · 4月22日61

Same prompts as before, but now in GPT image-generator 2, page excerpts from: "Eldritch Horrors as Pets: A Guide" "How Womblenauts Work" "Photographs of the People of New York Who Look Like Birds" "Cakes shaped like fish shaped like cakes" Lots of great little lines in there

译用户沿用此前推文引用的“Nano banana 2”提示方法,在GPT图像生成器2中输入相同提示词,要求生成四本虚构书籍第113-114页的“照片”摘录。这些书籍包括《Eldritch Horrors as Pets: A Guide》、《How Womblenauts Work》、《Photographs of the People of New York Who Look Like Birds》以及《Cakes shaped like fish shaped like cakes》。生成结果图像中包含大量出色的细节文本行,进一步验证了该模型在理解和可视化复杂、荒诞文本概念方面的创意与图像生成能力。

Yuchen Jin@Yuchenj_UW · 4月22日

Just tried gpt-image-2. It is really good. OpenAI is finally leading the image gen again.

译刚试了 gpt-image-2。 真的很棒。OpenAI 终于在图像生成领域重新领先了。

Rohan Paul@rohanpaul_ai · 4月22日

OpenAI’s new image model has quietly made realistic AI image generation seem fully solved. We also get readable English text, and usable design drafts from one prompt. The old weakness was easy to spot because image models could fake texture and lighting but often broke on letters, layout, and multi-part instructions. This is so important because text rendering is the hard bridge between “pretty image” and actual work like ads, posters, menus, magazine covers, and mock-ups. The new system also uses a reasoning mode, which means it can spend extra steps planning the image instead of guessing the whole scene in one shot. That extra planning helps with complex prompts, unusual aspect ratios, and multi-image outputs, but it also makes generation slower. Photorealism alone is no longer the benchmark because the real test is whether a model can follow structure, place objects correctly, and write words humans can actually use for economically valuable activities.

译OpenAI发布ChatGPT Images 2.0,凭借推理模式(reasoning mode)解决了AI图像生成在文本渲染与复杂布局上的历史短板。新系统不仅能生成逼真视觉,更能精确处理字母排版、多部分指令和特殊比例,直接产出可立即用于广告、海报等商业场景的设计稿。这标志着行业评估标准已从单纯追求照片级真实感,转向结构准确性、文本可用性与实际经济价值,AI图像生成正式进入可用化新阶段。

Sam Altman@sama · 4月22日

Here is a manga made by ChatGPT Images 2.0 of @gabeeegoooh and me looking for more GPUs:

译这是 ChatGPT Images 2.0 生成的漫画,画的是我和 @gabeeegoooh 寻找更多 GPU:

宝玉@dotey · 4月22日

GPT-Image-2 ---- Present a clear, 45° top-down view of a vertical (9:16) isometric miniature 3D cartoon scene, highlighting iconic landmarks centered in the composition to showcase precise and delicate modeling. The scene features soft, refined textures with realistic PBR materials and gentle, lifelike lighting and shadow effects. Weather elements are creatively integrated into the urban architecture, establishing a dynamic interaction between the city's landscape and atmospheric conditions, creating an immersive weather ambiance. Use a clean, unified composition with minimalistic aesthetics and a soft, solid-colored background that highlights the main content. The overall visual style is fresh and soothing. Display a prominent weather icon at the top-center, with the date (x-small text) and temperature range (medium text) beneath it. The city name (large text) is positioned directly above the weather icon. The weather information has no background and can subtly overlap with the buildings. The text should match the input city's native language. Please retrieve current weather conditions for the specified city before rendering. City name:【上海】

译GPT-Image-2展示动态天气卡片生成能力。通过结构化提示词,模型可创建45°俯视的垂直等距3D卡通城市场景,采用PBR材质与真实光影,将天气元素与地标建筑动态融合。系统先检索指定城市实时气象数据,再以极简美学呈现天气图标、温度及日期信息,支持多语言本地化输出。示例展示上海城市景观与天气状况的沉浸式结合。

宝玉@dotey · 4月22日

官方一直都知道“稳稳地接住你”这梗😂

Ethan Mollick@emollick · 4月22日

Though the images are very good, ChatGPT Image 2.0 does have the typical imagegen problem, which is that editing can be "stubborn", and attempts to get the AI to change details work well for the first round or two, but then progress slows. Putting the image in a new chat helps.

译虽然图像质量很好,但 ChatGPT Image 2.0 确实存在典型的 imagegen 问题,即编辑可能会很"固执",试图让 AI 修改细节在前一两轮效果不错,但之后进展会变慢。把图片放到新对话中有帮助。

AK@_akhaliq · 4月22日

Extending One-Step Image Generation from Class Labels to Text via Discriminative Text Representation paper: https://huggingface.co/papers/2604.18168

译通过判别性文本表征将一步图像生成从类别标签扩展到文本 paper: https://huggingface.co/papers/2604.18168

Chubby♨️@kimmonismus · 4月21日62

ChatGPT Image 2 coming today!

译ChatGPT 图像2 今天发布!

Chubby♨️@kimmonismus · 4月21日

"something to show you", so they start with GPT Image gen 2 at 12pm PT (sadly 3AM in China, where i am right now :( And Spud (GPT 5.5) probably Thursday

译"有个东西要给你们看",所以他们将在太平洋时间中午12点发布 GPT Image gen 2(遗憾的是在我现在所在的中国是凌晨3点 :( 而 Spud(GPT 5.5)可能在周四

Chubby♨️@kimmonismus · 4月21日

GPT-Image-2 now reviews its own output and iterates until it is satisfied with the correctness of its output. Never been more excited for a new image model!

译GPT-Image-2 现在会审查自己的输出,并迭代直到对输出的正确性满意为止。 从未对一个新的图像模型如此兴奋过!

小互@xiaohu · 4月21日45

GPT image 2 今晚发布💯 敬请期待…

OpenAI@OpenAI · 4月21日34

This is not a screenshot.

译这不是一张截图。

Chubby♨️@kimmonismus · 4月21日

Many people say OpenAI’s GPT-image 2 is already rolling out! Check it your self :)

译很多人说 OpenAI 的 GPT-image 2 已经在推出了!自己看看 :)

AK@_akhaliq · 4月21日37

Elucidating the SNR-t Bias of Diffusion Probabilistic Models paper: https://huggingface.co/papers/2604.16044

译阐明扩散概率模型的SNR-t偏差 paper: https://huggingface.co/papers/2604.16044

Chubby♨️@kimmonismus · 4月20日42

OpenAI is gearing up the release of its new image model. Via the information

译OpenAI 正在准备发布其新的图像模型。通过信息

DogeDesigner@cb_doge · 4月20日

🚨 NEW GROK IMAGINE FEATURE 🚨 Create your own custom templates. - Go to 'My templates' - Select 'Create new' - Name your template - Choose type - Enter prompt and upload an image - Save and done! Rolling out to SuperGrok Heavy users on web. Upgrade to SuperGrok Heavy!

译🚨 GROK IMAGINE 新功能 🚨 创建你自己的自定义模板。 - 进入"我的模板" - 选择"创建新模板" - 命名你的模板 - 选择类型 - 输入提示词并上传图片 - 保存,完成! 正在向网页端 SuperGrok Heavy 用户推出。 升级到 SuperGrok Heavy!

Chubby♨️@kimmonismus · 4月19日

im speechless. GPT-5.5 created the best SVG i've seen so far. One shot. We are in for a wild ride.

译无语了。GPT-5.5 创造了我目前见过最好的 SVG。一次生成。接下来会很疯狂。

宝玉@dotey · 4月19日47

很荣幸我的Skills开始集成到 Hermes 中,欢迎试用👏

Artificial Analysis@ArtificialAnlys · 4月18日

ImagineArt 2.0 debuts at #9 on our Text to Image Leaderboards, delivering quality comparable to grok-imagine-image from xAI and Imagen 4 Ultra from Google! @ImagineArt_X 's 2.0 is the latest proprietary image model from ImagineArt, a popular AI creative studio app that provides users access to various image and video models in one place. ImagineArt 2.0 is currently available as an option in the ImagineArt Image Studio app, with an API for developers coming soon. See below for comparisons between ImagineArt 2.0 and other leading models in our Artificial Analysis Image Arena 🧵

译ImagineArt 2.0在文本到图像排行榜首登第9位,生成质量与xAI的grok-imagine-image及Google的Imagen 4 Ultra相当。作为ImagineArt推出的最新专有图像模型,该版本目前已集成于ImagineArt Image Studio应用,面向开发者的API即将上线。ImagineArt作为综合性AI创意平台,为用户提供多种图像与视频模型的一站式访问。

Greg Brockman@gdb · 4月17日

imagegen in codex is easy to underestimate, but it's quite powerful:

译Codex 中的图像生成功能容易被低估,但它相当强大: [引用 @wonforall]:图像生成功能现已在 Codex 中上线! 你现在可以直接在 Codex 中生成视觉内容、编辑现有图像,以及从单张图像创建 GIF。 我在开发这个功能时花了很多时间测试不同的用例,看到输出结果可以如此有创意和实用,真的令人印象深刻。 希望你用得开心 🚀

Google Gemini@GeminiApp · 4月17日58

http://x.com/i/article/2044796942686060544 # New ways to create personalized images in the Gemini app ## Use Personal Intelligence to create more relevant, personal images using Nano Banana and your own Google Photos library — no manual uploads or long prompts required. Personal Intelligence makes the Gemini app feel tailored to you, not just a generic tool that works the same for everyone. Today, we’re introducing new ways for Gemini to use your interests and preferences with Nano Banana 2 and Google Photos to make image generation — one of your favorite ways to use Gemini — feel deeply personal. This lets you create unique images more easily, so you can spend more time creating and less time explaining ## Powering your imagination One of the biggest hurdles in AI image generation is finding the right prompt. Previously, to get a result that felt truly personal, you had to write long, detailed descriptions and manually upload a reference photo just to give Gemini the right context. Now, Personal Intelligence gives Gemini an inherent understanding of your preferences from the start. By integrating this context directly with Nano Banana 2, Gemini can automatically fill in the blanks, grounding every creation in the things you care about most. And since this is built into how you normally use the Gemini app there’s no extra setup. If you’ve already linked your Google apps, that personal context is ready and waiting the moment you start creating images. This removes the heavy lifting. Instead of writing out the intricate details of your life, you can use simple prompts like "Design my dream house" or "Show me a picture of my desert island essentials?" and the results will automatically reflect your specific tastes and lifestyle, gleaned from the Google apps you’ve connected to. ## Starring you and your loved ones A lot of your most significant moments live in your Google Photos library. By connecting your Google Photos library to Personal Intelligence, Gemini goes a step further than just understanding your interests. It can use actual images of you and your loved ones to guide the image generation process. Since you can already organize and label groups of people and pets in your library, those labels provide the context that Gemini needs to make your images feel truly yours. Now your inner circle can become the stars of your images, whether you want a result that feels pulled straight from your life or one that takes your imagination a bit further. With those labels in place, you can simply ask Gemini to “create a claymation image of me and my family enjoying our favorite activity” and Gemini can generate that specific image for you automatically. You can also experiment with different styles like watercolors, charcoal sketches or oil paintings. You can turn a quick idea into a custom creation, saving you the trouble of searching for, downloading and re-uploading files just to see a concept come to life. ## Putting creative control in your hands Because this is a brand-new experience, Gemini might not always pick the exact photo or detail you had in mind on the first try. To keep you in the driver’s seat, we’ve built in ways to refine your results. If the result isn’t quite right, you can simply tell Gemini what was incorrect and try again. You can also click the ‘+’ icon and select a different reference photo from your Google Photos library to try a new perspective. If you’re ever curious about how your context was applied, click on the Sources button, and it’ll show you which image was auto-selected to guide the creation. You can even ask Gemini directly for information on the attribution and sources used for that specific image. Bringing personal details into your images shouldn't mean compromising on privacy, which is why our core commitments haven't changed. The Gemini app does not directly train its models on your private Google Photos library. We train on limited info, like specific prompts in Gemini and the model’s responses, to improve functionality over time. And connecting your Google apps to Gemini remains an opt-in experience that you can adjust in your settings at any time. This new personalized image creation experience in the Gemini app is gradually rolling out today to eligible Google AI Pro and AI Ultra subscribers in the U.S., and we plan to bring this to Gemini in Chrome desktops and more users soon. Give it a try when it hits your app — we’re looking forward to seeing how these tools help you spend less time prompting and more time creating.

译Google在Gemini应用中推出个性化图像生成新功能,利用“个人智能”整合Nano Banana 2模型与用户已连接的Google应用(如Google相册),自动理解用户偏好与生活背景。用户无需手动上传参考图或编写复杂提示词,仅需简单指令即可生成反映个人品味、生活方式乃至包含亲友形象的图像,并能调整风格和细化结果。Google强调,此功能不会使用用户的私人Google相册数据直接训练模型,以保护隐私。

Google Gemini@GeminiApp · 4月17日

Personal Intelligence 🤝 Nano Banana 2 Personal Intelligence now gives Gemini an understanding of your preferences and interests when generating images, so you can spend more time creating and less time explaining.

译Personal Intelligence 🤝 Nano Banana 2 Personal Intelligence 现在让 Gemini 在生成图像时理解你的偏好和兴趣,让你可以花更多时间创作,减少解释。

AK@_akhaliq · 4月16日39

Continuous Adversarial Flow Models paper: https://huggingface.co/papers/2604.11521

译连续对抗流模型 paper: https://huggingface.co/papers/2604.11521

Chubby♨️@kimmonismus · 4月15日

I can very well imagine that we'll see Opus 4.7 today, ChatGPT Image 2 tomorrow, and maybe even "Spud." Here are the reasons for this: - OpenAI has a fairly similar release strategy, mostly on Tuesdays or Thursdays at the same time. Anthropic is aware of this, of course, and is trying to either preempt it or at least not overshadow their release. - Anthropic, in turn, has recently been making headlines. "Mythos" was a wake-up call; OpenAI has a good position, but its PR is currently being overshadowed by Anthropic, ARR, models, etc. The leaked memo from OpenAI CRO speaks volumes. A major release is needed, especially since Deepseek is expected next week ("end of April"). - Image 2 has already been largely leaked. That alone wouldn't be enough to win them over. They need more. I deleted the last post because it sounded like I knew exactly when the releases would be. I don't. The Information has made an Opus 4.7 release this week very likely, and OpenAI employees are also expressing a positive sentiment. However, these are the only indications.

译业内人士预测Claude Opus 4.7与ChatGPT Image 2将于本周密集发布,甚至可能包括代号"Spud"的新品。OpenAI惯于周二或周四发布,Anthropic则试图抢先或避免被 overshadow。鉴于Anthropic近期凭借Mythos等占据头条,加上Deepseek预计下周发布,OpenAI急需重大更新应对竞争。尽管Image 2已遭大量泄露,但The Information及OpenAI员工积极情绪均暗示发布临近。

AK@_akhaliq · 4月15日47

ERNIE-Image-Turbo SF but venice app: https://huggingface.co/spaces/akhaliq/ERNIE-Image-Turbo

译ERNIE-Image-Turbo 科幻但威尼斯 应用:https://huggingface.co/spaces/akhaliq/ERNIE-Image-Turbo

Google Gemini@GeminiApp · 4月14日

Get the most out of your Nano Banana generations by establishing the story, subject, and style. Try including... Subject: Who or what is in the image? (Ex: A fluffy calico cat) Composition: How is the shot framed? (Ex: Extreme close-up) Action: What is happening? (Ex: Brewing a cup of coffee) Location: Where does the scene take place? (Ex: A sunny meadow) Style: What is the overall aesthetic? (Ex: Watercolor painting) Editing Instructions: For modifying an existing image, be direct and specific. (Ex: Remove the car in the background)

译为提升 Nano Banana 生成质量,建议通过六大维度构建提示词:Subject(主体)定义画面核心对象,Composition(构图)控制镜头语言,Action(动作)描述动态场景,Location(地点)设定环境背景,Style(风格)统一视觉美学,Editing Instructions(编辑指令)实现精准图像修改。该方法强调在生成前建立清晰的故事叙事与视觉风格,适用于文生图及图生图场景。

Chubby♨️@kimmonismus · 4月13日

I suspect this is an indication that patience chair was created with ChatGPT image 2. An ironic reference to the fact that image 2 will probably arrive today or tomorrow.

译我怀疑这表明 patience chair 是用 ChatGPT image 2 创建的。讽刺性地指代了 image 2 可能今天或明天就会发布这一事实。

TestingCatalog News 🗞@testingcatalog · 4月12日

Can’t stop playing with Remix Character on Grok Imagine! xAI is cooking a new feature for Grok on mobile which will allow users to insert any character from the image into a video generated by Grok Imagine. Imagine v2? 👀 * Not available yet

译xAI 正为 Grok 移动端开发 Remix Character 功能,允许用户将图片中的任意角色插入到 Grok Imagine 生成的视频中。该功能尚未上线,疑似 Imagine v2 的前瞻。

TestingCatalog News 🗞@testingcatalog · 4月12日

GOOGLE ⚡: Google is working on Voice Mode and new collaborative tools for its Mixboard experiment. Voice mode on Mixboard works similarly to Stitch, allowing users to operate their canvas boards with voice commands. It will be possible to generate and edit images, and potentially move them around. Imagine a team retrospective where everyone can just dump their complaints with voice commands! Voice notes will be supported there, too! 👀

译Google Mixboard 实验项目新增语音模式,支持语音命令生成、编辑和移动图片,以及语音笔记功能。类似 Stitch 的交互方式,适用于团队协作场景,如回顾会议中直接语音输入反馈。

Ethan Mollick@emollick · 4月11日

AI finally lets us see Raphael's The School of Athens the way Raphael obviously intended it, illustrating the delicate dance and subtle conflicts between Plato and Artistotle. (Seedance 2.0 is very fun to play with)

译Seedance 2.0 用 AI 技术重新诠释拉斐尔名作《雅典学院》,呈现柏拉图与亚里士多德之间的微妙冲突与思想张力。生成效果有趣,可玩性高。

全部 AI 动态
AI 相关资讯全量信息流
全部一手信源资讯推文
全部模型产品行业论文技巧
4月22日
07:36
宝玉@dotey
GPT Image 2蜡笔旅行日记提示词模板

该提示词专为GPT Image 2设计,可生成儿童蜡笔风格的9:16竖版旅行手账插画。用户输入城市名称与天数后,系统自动规划路线并填充当地景点、美食与地标,搭配童趣涂鸦、手写体文字与温暖明亮的色调。源自"nano banana prompt"系列,适合快速制作充满好奇心的个性化旅行纪念图。

宝玉: 🍌 nano banana prompt Kids' Crayon Travel Journal Illustration Prompt This prompt generates a vibrant, child-like crayon...

OpenAI图像生成教程/实践
07:06
Chubby♨️@kimmonismus
是的,GPT image 2 就是*那么*牛。 简直准得离谱。 图片:20 人部落团队正以 2004 年 World of Warcraft 风格与 Sam Altman 战斗。有人被秒了。
OpenAI产品更新图像生成
07:06
宝玉@dotey
GPT Image 2提示词:打造3D萌系品牌微型概念店

分享适用于GPT Image 2的提示词模板,可生成3D chibi-style品牌微型概念店。该提示词以品牌标志性产品作为建筑外观灵感,构建两层玻璃结构展示内部装潢,配合街道场景与行人,采用Cinema 4D渲染实现盲盒玩具美学与柔和光照。示例展示Starbucks概念店效果。此提示词来自@dotey的系列创作,适用于品牌视觉设计与创意场景生成。

宝玉: 🍌 nano banana prompt 3D chibi-style miniature concept store of {Brand Name} --- Prompt --- 3D chibi-style miniature con...

OpenAI图像生成教程/实践
07:06
宝玉@dotey
GPT Image 2提示词:生成实时股票数据3D等距场景

GPT Image 2 提示词支持创建融合实时股票数据的等距迷你3D场景。用户输入公司名称或股票代码后,系统以45度俯视角生成精致卡通风格画面,中央呈现公司标志性建筑与产品元素,采用 Cinema 4D 渲染与 PBR 材质。场景顶部整合指定日期的股价区间与趋势图表,所有文本支持用户指定语言。系统严格要求基于准确实时数据生成,若数据不可用将立即停止。该方案适用于金融数据可视化与品牌展示。

宝玉: 🍌 nano banana pro prompt Isometric Miniature Stock Scene Enter a company name or stock ticker to generate an exquisite,...

OpenAI图像生成教程/实践
07:06
宝玉@dotey
GPT Image 2 提示词:唐代仕女与小黄人侍从

推文分享了 GPT Image 2 的图像生成提示词,呈现工笔重彩风格的跨时空荒诞场景:唐代仕女身着汉服却搭配黑丝与红高跟,手持吹风机,由三只小黄人扮作古仆服侍——分别牵拉电源线、擦拭鞋履、举手机拍照。背景融入松竹、太湖石与书法印章等传统元素,展现 AI 对复杂文化混搭与风格一致性的把控能力。

宝玉: 🍌nano banana pro Prompt: A traditional Chinese ink and color painting in Gongbi style on aged rice paper texture. A nob...

OpenAI图像生成教程/实践
05:38
OpenAI Developers@OpenAIDevs
gpt-image-2 新示例刚刚在我们的用例库上线。 致那些打开文档"只想查一件事",却带着五个新想法离开的人。
OpenAI产品更新图像生成
05:20
Greg Brockman@gdb
真的很不可思议,你现在只需一点点算力就能创造出这样的东西。 期待在教育、专业场景(如幻灯片、营销材料等)以及生产力(例如为代码文档创建图表)等领域的新应用。

OpenAI: Introducing ChatGPT Images 2.0 A state-of-the-art image model that can take on complex visual tasks and produce precise,...

OpenAI产品更新图像生成
05:07
OpenAI@OpenAI
是什么让 ChatGPT Images 2.0 成为最先进的图像生成模型? 模型背后的研究人员解释道。串帖: ChatGPT Images 2.0 中的思考与智能,由 @ayaanzhaque 演示
OpenAI图像生成推理论文/研究
04:08
swyx 🏝️@AIEmiami@swyx
千万别错过。这是 @osanseviero 和 @GoogleDeepMind London Avengers 带来的疯狂收获之一。 如果你总是觉得跟不上 Imagegen 的 SOTA 进展,无论现在还是平时,这就是你在互联网上能找到的最棒的 40 分钟,绝对如此。

AI Engineer: 🆕Building Generative Image & Video models at Scale https://www.youtube.com/watch?v=xOP1PM8fwnk A lot of interest in ima...

DeepMind图像生成教程/实践视频
04:07
Ethan Mollick@emollick
61
用户沿用此前推文引用的"Nano banana 2"提示方法,在GPT图像生成器2中输入相同提示词,要求生成四本虚构书籍第113-114页的"照片"摘录。这些书籍包括《Eldritch Horrors as Pets: A Guide》、《How Womblenauts Work》、《Photographs of the People of New York Who Look Like Birds》以及《Cakes shaped like fish shaped like cakes》。生成结果图像中包含大量出色的细节文本行,进一步验证了该模型在理解和可视化复杂、荒诞文本概念方面的创意与图像生成能力。

Ethan Mollick: Nano banana 2: "Show me a photo taken of pages 113-114 from the books": "Eldritch Horrors as Pets: A Guide" "How Womblen...

OpenAI图像生成教程/实践
03:48
Yuchen Jin@Yuchenj_UW
刚试了 gpt-image-2。 真的很棒。OpenAI 终于在图像生成领域重新领先了。

OpenAI: Introducing ChatGPT Images 2.0 A state-of-the-art image model that can take on complex visual tasks and produce precise,...

OpenAI图像生成大佬观点
03:45
Rohan Paul@rohanpaul_ai
ChatGPT Images 2.0发布:AI图像生成进入实用化阶段

OpenAI发布ChatGPT Images 2.0,凭借推理模式(reasoning mode)解决了AI图像生成在文本渲染与复杂布局上的历史短板。新系统不仅能生成逼真视觉,更能精确处理字母排版、多部分指令和特殊比例,直接产出可立即用于广告、海报等商业场景的设计稿。这标志着行业评估标准已从单纯追求照片级真实感,转向结构准确性、文本可用性与实际经济价值,AI图像生成正式进入可用化新阶段。

OpenAI: Introducing ChatGPT Images 2.0 A state-of-the-art image model that can take on complex visual tasks and produce precise,...

OpenAI图像生成大佬观点推理
03:40
Sam Altman@sama
这是 ChatGPT Images 2.0 生成的漫画,画的是我和 @gabeeegoooh 寻找更多 GPU:
OpenAI产品更新图像生成
03:40
宝玉@dotey
GPT-Image-2生成3D等距天气卡片示例

GPT-Image-2展示动态天气卡片生成能力。通过结构化提示词,模型可创建45°俯视的垂直等距3D卡通城市场景,采用PBR材质与真实光影,将天气元素与地标建筑动态融合。系统先检索指定城市实时气象数据,再以极简美学呈现天气图标、温度及日期信息,支持多语言本地化输出。示例展示上海城市景观与天气状况的沉浸式结合。

宝玉: 🍌 nano banana pro prompt (with gemini) Dynamically generate a current weather card based on a given city name. --- prom...

OpenAI图像生成教程/实践
03:40
宝玉@dotey
官方一直都知道"稳稳地接住你"这梗😂

OpenAI: Introducing ChatGPT Images 2.0 A state-of-the-art image model that can take on complex visual tasks and produce precise,...

OpenAI产品更新图像生成
03:37
Ethan Mollick@emollick
虽然图像质量很好,但 ChatGPT Image 2.0 确实存在典型的 imagegen 问题,即编辑可能会很"固执",试图让 AI 修改细节在前一两轮效果不错,但之后进展会变慢。把图片放到新对话中有帮助。
OpenAI图像生成大佬观点
00:14
AK@_akhaliq
通过判别性文本表征将一步图像生成从类别标签扩展到文本 paper: https://huggingface.co/papers/2604.18168
Hugging Face图像生成论文/研究
4月21日
23:44
Chubby♨️@kimmonismus
62
ChatGPT 图像2 今天发布!

OpenAI: This is not a screenshot.

OpenAI产品更新图像生成
23:44
Chubby♨️@kimmonismus
"有个东西要给你们看",所以他们将在太平洋时间中午12点发布 GPT Image gen 2(遗憾的是在我现在所在的中国是凌晨3点 :( 而 Spud(GPT 5.5)可能在周四

Sam Altman: Really excited for this week! Next up, we've got something to show you at 12 pm PT today.

OpenAI图像生成模型发布
23:44
Chubby♨️@kimmonismus
GPT-Image-2 现在会审查自己的输出,并迭代直到对输出的正确性满意为止。 从未对一个新的图像模型如此兴奋过!
OpenAI产品更新图像生成
22:19
小互@xiaohu
45
GPT image 2 今晚发布💯 敬请期待…
OpenAI产品更新图像生成
22:06
OpenAI@OpenAI
34
这不是一张截图。
OpenAI产品更新图像生成
17:44
Chubby♨️@kimmonismus
很多人说 OpenAI 的 GPT-image 2 已经在推出了!自己看看 :)
OpenAI产品更新图像生成
02:04
AK@_akhaliq
37
阐明扩散概率模型的SNR-t偏差 paper: https://huggingface.co/papers/2604.16044
图像生成论文/研究
4月20日
23:44
Chubby♨️@kimmonismus
42
OpenAI 正在准备发布其新的图像模型。通过信息
OpenAI图像生成行业动态
14:08
DogeDesigner@cb_doge
🚨 GROK IMAGINE 新功能 🚨 创建你自己的自定义模板。 - 进入"我的模板" - 选择"创建新模板" - 命名你的模板 - 选择类型 - 输入提示词并上传图片 - 保存,完成! 正在向网页端 SuperGrok Heavy 用户推出。 升级到 SuperGrok Heavy!
xAI产品更新图像生成
4月19日
21:44
Chubby♨️@kimmonismus
无语了。GPT-5.5 创造了我目前见过最好的 SVG。一次生成。接下来会很疯狂。

Chetaslua: GPT Pro - Spud solved SVG One SHOT svg , code is shared in the comments @OpenAI you won this time , i never said this bu...

OpenAI图像生成现象/趋势编码
06:07
宝玉@dotey
47
很荣幸我的Skills开始集成到 Hermes 中,欢迎试用👏

Nous Research: Honored to announce we are partnering with Jim Liu to port over his wildly popular skills for infographics and design to...

智能体产品更新图像生成
4月18日
05:41
Artificial Analysis@ArtificialAnlys
ImagineArt 2.0跻身前十,对标Grok与Imagen 4 Ultra

ImagineArt 2.0在文本到图像排行榜首登第9位,生成质量与xAI的grok-imagine-image及Google的Imagen 4 Ultra相当。作为ImagineArt推出的最新专有图像模型,该版本目前已集成于ImagineArt Image Studio应用,面向开发者的API即将上线。ImagineArt作为综合性AI创意平台,为用户提供多种图像与视频模型的一站式访问。

图像生成模型发布
4月17日
12:29
Greg Brockman@gdb
Codex 中的图像生成功能容易被低估,但它相当强大: 【引用 @wonforall】:图像生成功能现已在 Codex 中上线! 你现在可以直接在 Codex 中生成视觉内容、编辑现有图像,以及从单张图像创建 GIF。 我在开发这个功能时花了很多时间测试不同的用例,看到输出结果可以如此有创意和实用,真的令人印象深刻。 希望你用得开心 🚀

Won Park: Image generation is now live in Codex! You can now generate visuals, edit existing images, and create GIFs from a single...

智能体OpenAI产品更新图像生成
03:50
Google Gemini@GeminiApp
58
Gemini应用推出基于个人智能的个性化图像生成功能

Google在Gemini应用中推出个性化图像生成新功能,利用“个人智能”整合Nano Banana 2模型与用户已连接的Google应用(如Google相册),自动理解用户偏好与生活背景。用户无需手动上传参考图或编写复杂提示词,仅需简单指令即可生成反映个人品味、生活方式乃至包含亲友形象的图像,并能调整风格和细化结果。Google强调,此功能不会使用用户的私人Google相册数据直接训练模型,以保护隐私。

Google产品更新图像生成
00:50
Google Gemini@GeminiApp
Personal Intelligence 🤝 Nano Banana 2 Personal Intelligence 现在让 Gemini 在生成图像时理解你的偏好和兴趣,让你可以花更多时间创作,减少解释。
Google产品更新图像生成
4月16日
00:07
AK@_akhaliq
39
连续对抗流模型 paper: https://huggingface.co/papers/2604.11521
图像生成数据/训练论文/研究
4月15日
16:48
Chubby♨️@kimmonismus
业内人士预测Opus 4.7与ChatGPT Image 2或本周密集发布

业内人士预测Claude Opus 4.7与ChatGPT Image 2将于本周密集发布,甚至可能包括代号"Spud"的新品。OpenAI惯于周二或周四发布,Anthropic则试图抢先或避免被 overshadow。鉴于Anthropic近期凭借Mythos等占据头条,加上Deepseek预计下周发布,OpenAI急需重大更新应对竞争。尽管Image 2已遭大量泄露,但The Information及OpenAI员工积极情绪均暗示发布临近。

AnthropicOpenAI图像生成现象/趋势
07:40
AK@_akhaliq
47
ERNIE-Image-Turbo 科幻但威尼斯 应用:https://huggingface.co/spaces/akhaliq/ERNIE-Image-Turbo
产品更新图像生成开源生态
4月14日
04:25
Google Gemini@GeminiApp
优化 Nano Banana 图像生成的六大核心要素

为提升 Nano Banana 生成质量,建议通过六大维度构建提示词:Subject(主体)定义画面核心对象,Composition(构图)控制镜头语言,Action(动作)描述动态场景,Location(地点)设定环境背景,Style(风格)统一视觉美学,Editing Instructions(编辑指令)实现精准图像修改。该方法强调在生成前建立清晰的故事叙事与视觉风格,适用于文生图及图生图场景。

Google图像生成教程/实践
4月13日
16:48
Chubby♨️@kimmonismus
我怀疑这表明 patience chair 是用 ChatGPT image 2 创建的。讽刺性地指代了 image 2 可能今天或明天就会发布这一事实。
OpenAI图像生成大佬观点
4月12日
07:01
TestingCatalog News 🗞@testingcatalog
xAI 正为 Grok 移动端开发 Remix Character 功能,允许用户将图片中的任意角色插入到 Grok Imagine 生成的视频中。该功能尚未上线,疑似 Imagine v2 的前瞻。
xAI产品更新图像生成视频
04:51
TestingCatalog News 🗞@testingcatalog
Google Mixboard 实验项目新增语音模式,支持语音命令生成、编辑和移动图片,以及语音笔记功能。类似 Stitch 的交互方式,适用于团队协作场景,如回顾会议中直接语音输入反馈。
Google产品更新图像生成语音
4月11日
03:15
Ethan Mollick@emollick
Seedance 2.0 用 AI 技术重新诠释拉斐尔名作《雅典学院》,呈现柏拉图与亚里士多德之间的微妙冲突与思想张力。生成效果有趣,可玩性高。
图像生成现象/趋势视频
‹ 上一页
1…15161718
下一页 ›