“最強文生圖開源 AI 模型”，Stable Diffusion 3 Medium 發(fā)布：可在消費級顯卡上運行

2024/6/13 11:55:59 來源：IT之家作者：故淵責(zé)編：故淵

評論：

IT之家 6 月 13 日消息，Stability AI 發(fā)布了 Stable Diffusion 3 Medium（下文簡稱 SD3 Medium），官方聲稱是“迄今為止最先進的開源模型”，其性能甚至超過了 Midjourney 6。

Stability AI 公司表示 SD3 Medium 可以根據(jù)用戶輸入的文本描述，重點克服了文生圖模型中手部和臉部的挑戰(zhàn)，生成足以亂真的的圖像。

SD3 Medium 還利用其底層的 Diffusion Transformer 架構(gòu)，高精度地整合了文字元素。

SD3 Medium 的另一個特點是易于使用。相比較一些資源密集型 AI 模型，SD3 Medium 可以在消費級顯卡上運行，可以加速普及適配。

Stability AI 在非商業(yè)許可下提供 SD3 Medium，供免費使用。對于商業(yè)應(yīng)用，可為藝術(shù)家、設(shè)計師和開發(fā)人員提供創(chuàng)作者許可證；對于大型商業(yè)用戶，可以直接聯(lián)系 Stability AI 了解授權(quán)詳情。

Stability AI 還表示計劃在未來將其產(chǎn)品擴展到視頻和音頻生成領(lǐng)域。提示詞如下：

A photograph of an 18-year-old Japanese woman hitchhiking, holding a cardboard sign that reads ' 東京駅まで ' (To Tokyo Station). She is standing by the roadside with a hopeful expression, wearing casual clothing and a backpack. The background shows a bustling urban street with cars passing by and city buildings. The scene is lively and vibrant, capturing the energy of Tokyo. Cinematic composition, trending on artstation.

“最強文生圖開源 AI 模型”，Stable Diffusion 3 Medium 發(fā)布：可在消費級顯卡上運行

IT之家附上生成的相關(guān)圖片如下：

“最強文生圖開源 AI 模型”，Stable Diffusion 3 Medium 發(fā)布：可在消費級顯卡上運行

以上圖源：Yas@BizDev

廣告聲明：文內(nèi)含有的對外跳轉(zhuǎn)鏈接（包括不限于超鏈接、二維碼、口令等形式），用于傳遞更多信息，節(jié)省甄選時間，結(jié)果僅供參考，IT之家所有文章均包含本聲明。

下載IT之家APP，簽到賺金幣兌豪禮

“最強文生圖開源 AI 模型”，Stable Diffusion 3 Medium 發(fā)布：可在消費級顯卡上運行

相關(guān)文章

“最強文生圖開源 AI 模型”，Stable Diffusion 3 Medium 發(fā)布：可在消費級顯卡上運行