Ideogram v4 > a scan of a page from my high school A3 art pad, highly original niche pencil piece working on the aura of...
Rohan Paul 实测新推出的图像转 3D 模型 Rodin Gen-2.5,最大改进是控制力。提供五种生成模式,最快 4 秒生成百万多边形模型,支持最高 1000 万多边形输出。原生 3D PBR 材质,模型开箱即用。Hyper 3D 还支持并行批量生成、Break to Parts 部件分离和局部编辑,无需重新生成整个模型,覆盖 3D 创作全流程。
归藏宣布其 PPT Skills 项目将继续更新。得益于近期的赞助,计划开发第三套主题,且会把在小红书图文卡片部分积累的好经验用于新版中。
http://x.com/i/article/2053655813877870592
商汤SenseTime发布SenseNova U1,一个原生理解和生成文本与图像的统一模型。该模型已开源,用户可自行运行。被@gurru_tech称赞“令人印象深刻”。提供在线演示平台SenseNova Studio、HuggingFace模型、GitHub代码及Discord社区。
关联讨论 1 条X:商汤 SenseTime (@SenseTime_AI)商汤 SenseTime 推出 SenseNova U1 开源多模态模型,实现原生理解与生成文本和图像,可一键将提示词转化为专业信息图。该模型被开发者 @gurru_tech 评价为“非常令人印象深刻”。项目已开源,提供 SenseNova Studio 在线试用,并公开 HuggingFace 模型集合、GitHub 源码仓库及 Discord 社区入口。
同一事件,精选展示《商汤发布信息图生成模型升级,增强多项核心能力》Today, we're launching Reve 2.0, the best 4K image model in the world. We invented a new way to generate and edit any im...
wow this @reve 2.0 launch copy is supurb. "it is now clear that the key to both controllable image generation and editin...
Reve 2.0 图像模型支持原生4K输出,核心亮点在于类似 Photoshop 的图像分层编辑能力。用户点击图像中的任意部分即可选中该区域,无需复杂的中间处理步骤,直接进行针对性编辑。该功能大幅简化了图像局部修改的工作流。
Ideogram 发布首个开源 AI 图像模型 Ideogram 4.0,主推文字渲染与版面控制。模型引入 bounding box(边界框)控制,允许用坐标精确指定元素位置;支持结构化 JSON 提示词格式,不再仅限纯文本;英文 OCR 准确率达 0.97(X-Omni 基准),支持跨语言密集文字渲染,涵盖中日韩等非拉丁文字。
Grok Imagine Video 1.5 on AI Gateway. Image-to-video generation with synced audio in one pass. await generateVideo({ mod...
同一事件,精选展示《xAI 发布 Grok Imagine 1.5 预览版(图像转视频模型)》Our independent research lab ranks top 2 on @arena Text-to-Image, ahead of Nano Banana 2 and GPT-Image-1.5.
Ideogram v4 is really good, and open weights. Images are crisp and feel fresh.
Introducing Ideogram 4.0: the best open image model in the world. Think it. Make it. Own it. Download the weights, fine-...
Grok @Imagine 1.5 Preview is here Try it today in the API: http://x.ai/api/imagine
关联讨论 3 条xAI:News(网页)X:Elon Musk (@elonmusk, xAI)X:阿易 AI Notes (@AYi_AInotes)Introducing Ideogram 4.0: the best open image model in the world. Think it. Make it. Own it. Download the weights, fine-...
New open model Ideogram-4.0-Quality has landed at #8 in the Text-to-Image Arena. This makes the new model by @ideogram_a...
> Change the screen so it shows that she's on a facetime call
商汤(SenseTime)开源SenseNova U1模型,宣称实现“看、思考、创作”一体——从一张普通运动鞋图片直接生成营销视觉效果。该模型代表了架构上的范式转变。用户可通过SenseNova Studio、HuggingFace和GitHub尝试使用。
同一事件,精选展示《商汤发布信息图生成模型升级,增强多项核心能力》Exploring the possibilities GPT Image Gen V2 Vertical smartphone screenshot from a Chinese short-video app. Front phone ...