The tiled NVIDIA PiD (Pixel Diffusion) method enables high-resolution latent image decoding up to 4K by splitting large latent images into smaller tiles, processing each tile through pixel diffusion, and then merging them with overlap adjustments to produce detailed images with enhanced textures and artifacts, overcoming the limitation that standard PiD decoding only works with square images.
Inmersión profunda
Prerrequisito
- No hay datos disponibles.
Próximos pasos
- No hay datos disponibles.
Inmersión profunda
Tiled NVIDIA PiD in ComfyUIAñadido:
Hello guys, I know you listen about Nvidia Ped. This is fast and higher resolution latent decoding with pixel diffusion.
This model can upscale latent images up to 4K but it has some problem. It can upscale only square images and uh but resolution is about uh 5,000 11 by thou 5,11 or larger we can fix it by using tile method of decoding. Latent images. All you need you need install comfy UI first and uh download custom nis. Custom nis can be downloaded uh on GitHub.
put it in your custom node directory on config UI and download from example workflow folder. Um uh tile p example Jon else you need download uh some models from confi or we need download p flux one uh 5,11 to 2,00 48 BF is not it's safe tensor Um after you need to go to your config UI uh then go download the JSON file.
Put it in your config UI.
Uh here uh you need to insert your prompt insert your resolution res any generate small image by set image turbo.
You need also download that image turbo model.
Then it split latent image into tails and uh go to uh p diff diffusion it generate multiple image and connect it in one. Let's try press run generation start.
We just look the result of Y standard decoding images and the PD decoded images.
Generation finished it. Tail upscale started.
Let's wait.
Generation and upscaling finished it.
Let's view result.
Uh let's zoom in.
You can see that image not changing but have a lot of details added.
Where is that is eyes? We can detail it. See that reflecting it.
Uh, it's very good. It's very nice.
Let's try another prompt.
Copy your prompt and paste into text ring multi-line generation has been finished. Let's see.
Let's see on face first.
Oh, that's cool.
Wow. Oh, hair is perfect.
Tourist is also perfect.
Wow. We can see threads on clothes and we can see grain on rocks.
That's not We can see small artifacts, but it's good.
Let's try another prompt.
Let's see how much generation took. It took,92 seconds.
That's enough.
Let's see the result. This is a woman portrait.
Let's zoom in.
I think that is good idealizations.
Hair visualizations also perfect.
Skin texture is perfect.
Um flowers on the rest is also good.
Aberations is added.
Let's try last and we uh see how generations is processed.
First of first step is text encoding. Text encoding use clip text encoder prompt. Negative prompt is not add any effect on this because um CVG is set to one.
It's just keeping car sampler use SD sampler and simple scheduleuler we generate with constant noise seed we that set seed field generations took eight steps it uh Uh finish it soon.
We preview small image low resolution. The resolution of full image will be uh will be seen here.
It's finished.
Generate preview of small image.
Uh next is text encoding prompt for bit model tail for this model is constant size.
This is for twice larger that source image source image is 5,1 output image will be 2,98.
We produce four steps for each tail.
We have about 11 tails.
uh tails after decoding will be smoothly connected between it has overlap small overlaps and adjust brightness each tail to hit the um artifacts.
Tails go from left to right and from up to down of the image on the image.
Each image will be simply decoded by V.
V is uh not loaded. It's just simply real RGB pixels model generates RGB format and just generate simple image as bitm or other formats.
Uh last time I depoding we wait for it images will be mer by this and we compare with the image comparator. Resolution of output image will be about 35 megapixels.
Decoding has finished.
Let's zoom in.
We can see that image more or contrast.
Wow, that's perfect.
Dilization is very good.
We can see a lot of detail of on the image in unfocused areas detail is also added noise edit I think That's very good.
Thanks for watching.
Subscribe to the channel. Download custom notice links provided in description.
If you like video, comment it. Ask for your questions.
Bye-bye.
Videos Relacionados
OpenHuman VS Hermes AI: Who Wins?
JulianGoldieSEO
285 views•2026-05-29
Long-Running Agents — Build an Agent That Never Forgets with Google ADK
suryakunju
142 views•2026-05-30
This computer is made from real human brain cells. And you can buy it.
Talktmsmedia
3K views•2026-05-28
BREAKING: Microsoft’s New Image Generating Model Beat Out GPT 1.5 and Nano Banana 2
aimmediahouse
122 views•2026-06-03
I Made the Same Anime Fight Scene in Every AI Video Generator
NobleGooseAnime
295 views•2026-05-30
Nvidia Bets Big On AI PCs | New Chip To Power Windows Laptops | Technology | AI Updates | N18S
cnnnews18
3K views•2026-06-01
I Tested NEW Opus 4.8 on Four Projects (Updated LLM Leaderboard)
AICodingDaily
298 views•2026-05-29
3D Platformer Update - NO CAPES
SolarLune
294 views•2026-05-30











