A simple noise-conditional score network (NCSN) that generates animal face images.
There are two model architectures in models:
- ncst, a DiT-like architecture.
- unet, a UNet-based architecture.
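
For readers new to noise-conditional score models, the sketch below shows how a score function conditioned on a noise level can be used to draw samples with annealed Langevin dynamics, the procedure usually paired with NCSN. It is not this repository's sampler. `toy_score` is a hypothetical stand-in for a trained ncst or unet model: it returns the exact score of a 1D Gaussian, so the example runs without any weights. The schedule values, step size `eps`, and all function names are assumptions chosen for illustration.

```python
import math
import random


def geometric_sigmas(sigma_max, sigma_min, n):
    """Geometric noise schedule running from sigma_max down to sigma_min."""
    ratio = (sigma_min / sigma_max) ** (1.0 / (n - 1))
    return [sigma_max * ratio ** i for i in range(n)]


def toy_score(x, sigma, data_mean=2.0, data_std=0.5):
    """Hypothetical stand-in for a trained network s(x, sigma).

    Assumes the data is N(data_mean, data_std^2). Adding Gaussian noise of
    scale sigma gives N(data_mean, data_std^2 + sigma^2), whose score
    (gradient of the log density) is known in closed form.
    """
    var = data_std ** 2 + sigma ** 2
    return (data_mean - x) / var


def annealed_langevin(score_fn, sigmas, steps_per_level=100, eps=2e-5, rng=None):
    """Draw one sample by running Langevin steps at each noise level in turn."""
    rng = rng or random.Random()
    # Start from broad noise matching the largest noise level.
    x = rng.gauss(0.0, sigmas[0])
    for sigma in sigmas:
        # Step size shrinks with the noise level.
        alpha = eps * (sigma / sigmas[-1]) ** 2
        for _ in range(steps_per_level):
            z = rng.gauss(0.0, 1.0)
            x = x + 0.5 * alpha * score_fn(x, sigma) + math.sqrt(alpha) * z
    return x


if __name__ == "__main__":
    sigmas = geometric_sigmas(10.0, 0.01, 10)
    rng = random.Random(0)
    print([round(annealed_langevin(toy_score, sigmas, rng=rng), 3) for _ in range(5)])
```

With a real model, `score_fn` would wrap a forward pass of the network on an image tensor and the noise level, and `x` would be an image-shaped array rather than a scalar.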
- More experiments
- Change to SDE-based NCSN
- Project description
- Add citation