There is a little more information in the Interactive Gallery description, but I used nshepperd's code, which can be found here, with a few tweaks:
https://github.com/nshepperd/jax-guided-diffusion
You are right, CLIP is not the entire process of creating the images; it is more of a focus word I selected for the project.