This is a minimal implementation of Boris Dayma's [DALL·E Mini](https://github.com/borisdayma/dalle-mini) in PyTorch. It has been stripped to the bare essentials necessary for doing inference. The only third party dependencies are numpy and torch.
It currently takes **7.4 seconds** to generate an image with DALL·E Mega on a standard GPU runtime in Colab.
It currently take**35 seconds** to generate a 3x3 grid with DALL·E Mega on a standard GPU runtime in Colab.
The flax model and code for converting it to torch can be found [here](https://github.com/kuprel/min-dalle-flax).
@ -41,7 +41,7 @@ display(image)
```python
text = "trail cam footage of gollum eating watermelon"