亚洲精品福利第一区第二区第三区,亚洲色WWW成人永久网址

本文作者 /ML 谷歌開發(fā)者專家王玉成

介紹

在快速發(fā)展的生成式 AI 領(lǐng)域，結(jié)合不同模型的優(yōu)勢(shì)可以帶來(lái)顯著的成果。通過利用谷歌的 Gemini 模型來(lái)制作詳細(xì)且富有創(chuàng)意的提示，然后使用 Imagen 3 模型根據(jù)這些提示生成高質(zhì)量的圖像，您可以獲得卓越的視覺效果。這個(gè)過程并不止于此；一旦圖像生成，Imagen 2 可以進(jìn)一步優(yōu)化以滿足特定需求，從而創(chuàng)建一個(gè)強(qiáng)大的工作流程，用于制作頂級(jí)視覺內(nèi)容。

使用 Gemini 進(jìn)行提示生成

Gemini 是谷歌開發(fā)的強(qiáng)大語(yǔ)言模型，擅長(zhǎng)生成連貫且上下文準(zhǔn)確的文本。在這個(gè)工作流程中，Gemini 用于創(chuàng)建詳細(xì)且富有想象力的提示，這些提示將作為圖像生成的基礎(chǔ)。提示的質(zhì)量至關(guān)重要，因?yàn)樗苯佑绊?Imagen 2 模型的輸出。通過仔細(xì)制作或完善 Gemini 的 Prompt，您可以確保生成的圖像與您的創(chuàng)意愿景相一致。

使用 Imagen 3 生成圖像

一旦從 Gemini 獲得了精心制作的 Prompt，下一步就是使用谷歌的 Imagen 3 模型生成圖像。Imagen 3 是一個(gè)尖端的生成式 AI 模型，專門根據(jù)文本描述生成高分辨率、細(xì)節(jié)豐富的圖像。該模型以其能夠以驚人的準(zhǔn)確性渲染復(fù)雜場(chǎng)景、紋理和光照而脫穎而出。通過將 Gemini 生成的提示輸入到 Imagen 3 中，你可以創(chuàng)建不僅視覺上令人驚嘆，而且精確符合初始概念的圖像。

使用 Imagen 2 精調(diào)生成的圖像

該工作流程的最后一步是優(yōu)化由 Imagen 2 生成的圖像。根據(jù)需求，這可能涉及調(diào)整顏色、增強(qiáng)細(xì)節(jié)，甚至合并多張圖像。目標(biāo)是對(duì)來(lái)自 Imagen 3 的高質(zhì)量輸出進(jìn)行微調(diào)，以確保最終圖像完全符合所需的美學(xué)和功能標(biāo)準(zhǔn)。

關(guān)于 Imagen 模型的有用資源

在 Vertex AI 上查找主題 Imagen | AI 圖像生成器以獲取更多信息。此頁(yè)面指導(dǎo)如何請(qǐng)求訪問 Vertex AI 上的 Imagen。

工作流程

準(zhǔn)備 Prompt

Vertex AI 提供多個(gè)模塊供使用。首先打開 Gemini 以生成滿足我們需求的理想 Prompt。要全面了解 Prompt 創(chuàng)建的關(guān)鍵方面，您可以閱讀 Prompt 和圖像屬性指南 (需要有限訪問權(quán)限)。

我們選擇 gemini-1.5-flash 作為當(dāng)前的語(yǔ)言模塊，它為高容量、高質(zhì)量、具有成本效益的應(yīng)用提供了速度和效率。有關(guān)更多信息，請(qǐng)閱讀 Gemini 模塊。

點(diǎn)擊 Edit 輸入系統(tǒng)指令，然后點(diǎn)擊保存。

System instructions:

You are an experienced article illustrator, skilled in creating illustrations for articles with a deep cultural background, capable of understanding the deeper meaning behind the text.

Prompt:

I'm currently reading the novel `One Hundred Years of Solitude` and plan to write an article expressing my sympathy for the tragic fate of the family. To enrich the content, I intend to include some illustrations in the article. After the title, I need an image that visually represents the time period and setting of the novel. We will generate and modify this image to achieve the desired effect.

I want to generate a prompt to help me generate an image, and the prompt contains the basic rules:

1. Short keyword-based prompts separated by commas.

2. Prompt includes: "A painting of...", "A sketch of..."for example

3. General Modifiers:high-quality, beautiful, stylized for example

4. Photos: like 4K, HDR, Studio Photo for example

5. Art, Illustration:by a professional, detailed for example

輸入 Prompt 并點(diǎn)擊紅色標(biāo)記的部分以生成結(jié)果。

結(jié)果:

原始結(jié)果:

A painting of, a bustling town square in a South American village, early 20th century, with a sense of magic realism, high-quality, by a professional artist.

Here are some variations for your prompt based on specific aspects of "One Hundred Years of Solitude":

Focus on the Buendía family: "A sketch of, the Buendía family gathered at a table, in a traditional Colombian home, with a feeling of isolation, high-quality, detailed"

Highlight the cyclical nature of history: "A photorealistic image of, a crumbling, overgrown mansion in a jungle setting, with the remnants of a forgotten era, HDR, stylized"

Emphasize the magical elements: "A painting of, a vibrant, surreal landscape, with fantastical creatures and strange weather patterns, in a dreamlike style, high-quality, beautiful"

By combining different elements of these prompts, you can create a visually striking image that perfectly captures the essence of "One Hundred Years of Solitude." Remember to experiment with different keywords and art styles to find the ideal representation for your article.

修改結(jié)果:

A painting of, a bustling town square in a South American village, early 20th century, with a sense of magic realism, by a professional artist, Focus on the Buendía family, the Buendía family gathered at a table, in a traditional Colombian home, with a feeling of isolation, high-quality, detailed.

打開 Imagen 3，檢查高級(jí)選項(xiàng)，如下所示:

確保選項(xiàng) Person/face generation is Allow (Adults Only)

此選擇用于避免產(chǎn)生如下問題 "圖像生成失敗，出現(xiàn)以下錯(cuò)誤:項(xiàng)目或用戶未被允許關(guān)閉兒童檢測(cè)過濾器":

輸入 Prompt:

點(diǎn)擊 GENERATE，結(jié)果應(yīng)該是這樣的:

我喜歡第二個(gè)。我們可以點(diǎn)擊第二張圖片:

對(duì)話框顯示如下:

我們使用 UPSCALE/EXPORT 按鈕下載此圖像，選擇 Upscle images (如果需要):

點(diǎn)擊 EXPORT 按鈕以下載 PNG 格式的圖像。

我們想編輯這張圖片，所以我們?cè)俅螜z查了第二張圖片。

點(diǎn)擊 EDIT IMAGE 按鈕。

頂部有很多工具可以幫助我們編輯圖像。Imagen 3 現(xiàn)在不支持 Edit image，確保模型已更改為 imagen 2 (預(yù)計(jì) Imagen 3 將在未來(lái)支持 Edit image)。

我想把所有遠(yuǎn)離桌子的人都移走，只留下在桌子旁邊的人。所以我添加了一個(gè) Musk box (遮罩盒) 并生成了一張圖像。我們不需要任何提示來(lái)進(jìn)行此操作。

點(diǎn)擊 GENERATE 按鈕后的結(jié)果:

為什么？二樓的閣樓消失了，與一樓合并，并創(chuàng)建了 4 幅圖片。

原來(lái)，我在原始圖片上添加了三個(gè) Musk box，兩個(gè) Musk box 給人打了 Musk，一個(gè) Musk box 給二樓打了 Musk。Imagan 3 的編輯操作有多智能？我們可以持續(xù)編輯圖像。

結(jié)果是:

這是我想要的最終圖片。如果您有權(quán)限，請(qǐng)閱讀有關(guān)圖像編輯的更多信息。

結(jié)論

通過將谷歌的 Gemini 模型的創(chuàng)造力與 Imagen 3 的先進(jìn)圖像生成能力以及 Imagen 2 的編輯能力相結(jié)合，您可以開發(fā)出一個(gè)強(qiáng)大的工作流程，以生成高質(zhì)量、精致的圖像。這個(gè)過程允許從文本到視覺內(nèi)容的無(wú)縫過渡，提供對(duì)最終輸出的靈活性和控制。無(wú)論是用于廣告、內(nèi)容創(chuàng)作還是藝術(shù)創(chuàng)作，這種方法都提供了一個(gè)強(qiáng)大的工具集，以實(shí)現(xiàn)卓越的視覺效果。

聲明：本文內(nèi)容及配圖由入駐作者撰寫或者入駐合作網(wǎng)站授權(quán)轉(zhuǎn)載。文章觀點(diǎn)僅代表作者本人，不代表電子發(fā)燒友網(wǎng)立場(chǎng)。文章及其配圖僅供工程師學(xué)習(xí)之用，如有內(nèi)容侵權(quán)或者其他違規(guī)問題，請(qǐng)聯(lián)系本站處理。舉報(bào)投訴

谷歌

谷歌

+關(guān)注

關(guān)注
27

文章
6227

瀏覽量
107692
Gemini

Gemini

+關(guān)注

關(guān)注
0

文章
65

瀏覽量
7872
AI

AI

+關(guān)注

關(guān)注
88

文章
34496

瀏覽量
275978

原文標(biāo)題：【GDE 分享】利用谷歌的 Gemini 和 Imagen 模型進(jìn)行高質(zhì)量圖像生成和優(yōu)化

文章出處：【微信號(hào)：Google_Developers，微信公眾號(hào)：谷歌開發(fā)者】歡迎添加關(guān)注！文章轉(zhuǎn)載請(qǐng)注明出處。

搜索歷史

借助谷歌Gemini和Imagen模型生成高質(zhì)量圖像

評(píng)論

電子發(fā)燒友