Search results with tag "Generative pretraining"
Taming Transformers for High-Resolution Image Synthesis
openaccess.thecvf.com
suitability of generative pretraining to learn image representations for downstream tasks. Since input resolutions of 32×32 pixels are still quite computationally expensive [8], a VQVAE is used to encode images up to a resolution of 192×192. In an effort to keep the learned discrete representation as spatially invariant as possible with ...