dalle2:hierarchical text-conditional image generation with clip

NoSuchKey