| 
							
							
								 Brett Kuprel | 6d1ac07442 | mega model in standard runtime | 2022-06-30 09:43:13 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | e9c01e32a5 | mega model works with standard colab runtime in expendable mode | 2022-06-30 09:39:14 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 41a44068d0 | keep params in expendable mode | 2022-06-30 09:36:32 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | df9aa6f915 | sort -> topk, prev_token_and_index -> prev_token, token_index | 2022-06-30 09:04:11 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | fb97ba5e20 | update readme, cleanup | 2022-06-30 07:41:31 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 1e18ba0ffa | is_expendable argument reduces memory usage for command line script | 2022-06-30 06:43:10 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 38377107da | Update setup.sh | 2022-06-29 21:45:05 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | c76226571e | Update cog.yaml | 2022-06-29 19:14:25 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 8b86152359 | Update cog.yaml | 2022-06-29 19:14:10 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 5f931f6899 | simplified colab setup | 2022-06-29 16:32:51 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | b7b6df23f7 | updated replicate predict.py file | 2022-06-29 15:53:25 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 005ee4938e | Update README.md | 2022-06-29 15:24:09 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 9fbfe0b76e | Delete predict.py | 2022-06-29 15:20:45 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 15b0c03485 | Merge pull request #38 from chenxwh/replicate Add Web Demo & Docker environment | 2022-06-29 15:20:20 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | d99828a239 | simplified flax attention and matched torch attention | 2022-06-29 14:56:28 -04:00 |  | 
			
				
					| 
							
							
								 Chenxi | fcc17c895d | Merge branch 'kuprel:main' into replicate | 2022-06-29 19:50:47 +01:00 |  | 
			
				
					| 
							
							
								 Chenxi | eb9f4c6b3b | replicate demo | 2022-06-29 19:50:10 +01:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 61cc99c13c | read tokenizer files with utf8 encoding | 2022-06-29 14:18:23 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 742d3609b0 | Merge branch 'main' of https://github.com/kuprel/min-dalle | 2022-06-29 13:56:39 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | a4df279fd2 | simplified attention and keys_values state resulted in decrease in inference time to 7.3 seconds (from ~10 seconds) | 2022-06-29 13:56:29 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 80558b8a82 | time the inference pass | 2022-06-29 13:55:23 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 661ec976ac | simplified attention for torch model | 2022-06-29 13:48:12 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 95afa18893 | add gitattributes file | 2022-06-29 12:45:41 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 764b5bbc0e | works with latest flax version 0.5.2, updated requirements.txt | 2022-06-29 11:46:19 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | c4f613c89f | readme wording | 2022-06-29 11:01:46 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 6046863805 | readme wording | 2022-06-29 11:00:38 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 2b552fe9db | readme wording | 2022-06-29 10:54:01 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | b7c2414c76 | readme wording | 2022-06-29 10:47:08 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 4e62f85ab9 | readme wording | 2022-06-29 10:45:46 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 53695e32f7 | updated readme with torch examples | 2022-06-29 10:43:46 -04:00 |  | 
			
				
					| 
							
							
								 kuprel | 9d5bb34df0 | default to torch+mega in colab | 2022-06-29 10:37:12 -04:00 |  | 
			
				
					| 
							
							
								 kuprel | d4af693133 | updated colab to load model once and generate multiple times | 2022-06-29 09:43:42 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | ed91ab4a30 | refactored to load models once and run multiple times | 2022-06-29 09:42:12 -04:00 |  | 
			
				
					| 
							
							
								 kuprel | 1ef9b0b929 | added mega to colab | 2022-06-29 06:44:07 -04:00 |  | 
			
				
					| 
							
							
								 kuprel | ae0411dbcc | Cleanup | 2022-06-29 06:25:52 -04:00 |  | 
			
				
					| 
							
							
								 kuprel | 0fe3e2d2b9 | Merge pull request #28 from interfect/torch-logical-cores Use all logical cores in Torch mode | 2022-06-28 22:57:02 -04:00 |  | 
			
				
					| 
							
							
								 Adam Novak | 28c812c832 | Use all logical cores in Torch mode | 2022-06-28 22:26:51 -04:00 |  | 
			
				
					| 
							
							
								 kuprel | 97a55f169c | Updated colab to use torch+cuda | 2022-06-28 22:12:36 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 1fbb209623 | fixed bug with cuda in detokenizer | 2022-06-28 22:02:35 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 764b0bc685 | cuda in detokenizer from previous commit broke colab flax model, fixed | 2022-06-28 21:36:48 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | d846cab1b6 | Merge branch 'main' of https://github.com/kuprel/min-dalle | 2022-06-28 21:28:50 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 17c96fe110 | works with cuda | 2022-06-28 21:28:36 -04:00 |  | 
			
				
					| 
							
							
								 kuprel | 0af6b1731f | Update setup.sh removed `git lfs` dependency | 2022-06-28 21:18:24 -04:00 |  | 
			
				
					| 
							
							
								 kuprel | 6f0be73579 | Merge pull request #23 from andrewginns/main Simplified requirements | 2022-06-28 21:16:00 -04:00 |  | 
			
				
					| 
							
							
								 kuprel | b8c4173181 | Merge pull request #24 from TheFutureGadgetsLab/main Fixed disabling of gradients in the torch code | 2022-06-28 21:10:35 -04:00 |  | 
			
				
					| 
							
							
								 Haydn Jones | a3a247e6ec | Fixed disabling of gradients in the torch code | 2022-06-28 17:57:44 -06:00 |  | 
			
				
					| 
							
							
								 Andrew Ginns | 3c0c897977 | Add flax model curl | 2022-06-28 22:01:26 +01:00 |  | 
			
				
					| 
							
							
								 Andrew Ginns | 3af89066e8 | Simplified requirements: * No wandb login
* wandb install as part of requirements.txt | 2022-06-28 21:34:57 +01:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 9d6b6dcc92 | previous commit broke flax model, fixed now | 2022-06-28 12:54:58 -04:00 |  | 
			
				
					| 
							
							
								 Brett Kuprel | 5aa6fe49bf | use cuda if available | 2022-06-28 12:47:11 -04:00 |  |