justheuristic
|
4e4769db70
kaggledebug
|
%!s(int64=3) %!d(string=hai) anos |
justheuristic
|
e5f04cac23
kaggledebug
|
%!s(int64=3) %!d(string=hai) anos |
Aleksandr Borzunov
|
2fed6ba68e
Use HF username as wandb run name
|
%!s(int64=3) %!d(string=hai) anos |
Alexander Borzunov
|
c365b2ec9f
Tweak settings for the upcoming demo (#2)
|
%!s(int64=3) %!d(string=hai) anos |
Aleksandr Borzunov
|
64dee420da
Upgrade to using hivemind.optim.experimental
|
%!s(int64=3) %!d(string=hai) anos |
Aleksandr Borzunov
|
e97e7b8811
Try removing OffloadOptimizer
|
%!s(int64=3) %!d(string=hai) anos |
Aleksandr Borzunov
|
3b184c57da
Don't download VQGAN weights
|
%!s(int64=3) %!d(string=hai) anos |
Aleksandr Borzunov
|
c61c61b20d
Use t5-small tokenizer
|
%!s(int64=3) %!d(string=hai) anos |
Aleksandr Borzunov
|
144d35ebce
Set share_input_output_emb=True
|
%!s(int64=3) %!d(string=hai) anos |
Aleksandr Borzunov
|
df54ab6da5
Log number of params
|
%!s(int64=3) %!d(string=hai) anos |
Aleksandr Borzunov
|
d0985de540
Enable rotary embeddings
|
%!s(int64=3) %!d(string=hai) anos |
Aleksandr Borzunov
|
20e2a3aab2
Use dalle-pytorch instead of LeanAlbert
|
%!s(int64=3) %!d(string=hai) anos |
Max Ryabinin
|
72fc0bcdb7
Initial commit (ru-max branch without private code)
|
%!s(int64=4) %!d(string=hai) anos |