Commit History

Autor SHA1 Mensaxe Data
  justheuristic e41d8933fb Merge branch 'main' into cache %!s(int64=2) %!d(string=hai) anos
  justheuristic 79a1475e62 Merge branch 'cache' of github.com:bigscience-workshop/petals into cache %!s(int64=2) %!d(string=hai) anos
  Aleksandr Borzunov a1ca926ca5 review %!s(int64=2) %!d(string=hai) anos
  justheuristic 8a0c056929 Fix calling rpc_info multiple times (#60) %!s(int64=2) %!d(string=hai) anos
  Aleksandr Borzunov d5d52cd7b2 review %!s(int64=2) %!d(string=hai) anos
  justheuristic 0f9d60badb Update src/server/cache.py %!s(int64=2) %!d(string=hai) anos
  justheuristic 356735e4aa Update src/server/cache.py %!s(int64=2) %!d(string=hai) anos
  Artem Chumachenko ada98a1b37 Add deep prompt inference (#66) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov 54ad745bed Warn that current instructions involve 6B model but we will replace them soon (#63) %!s(int64=3) %!d(string=hai) anos
  Alexander Borzunov 5f0c5329d4 Update readme with arxiv link and more discussions (#62) %!s(int64=3) %!d(string=hai) anos
  Alexander Borzunov 9bea7b9ea8 Update bullet points with feedback from Tim and other people (#61) %!s(int64=3) %!d(string=hai) anos
  Alexander Borzunov 7653562aa1 Use latest version of Petals scheme, shrink Petals logo (#59) %!s(int64=3) %!d(string=hai) anos
  Alexander Borzunov 2eb5843852 Update readme for the 1st public release (#57) %!s(int64=3) %!d(string=hai) anos
  Pavel Samygin 0be21775af remove transformer block, implement as sequential of size 1 (#54) %!s(int64=3) %!d(string=hai) anos
  Artem Chumachenko 77220c718c Add shallow prefix-tuned inference (#55) %!s(int64=3) %!d(string=hai) anos
  justheuristic d271b75dd4 Let users specify sequence length instead of assuming 2048 (#52) %!s(int64=3) %!d(string=hai) anos
  justheuristic c8d22f8bbb make attention cache wait until memory is freed %!s(int64=3) %!d(string=hai) anos
  Dmitry Baranchuk 948877149c Fix recovering for sequential_backward (#50) %!s(int64=3) %!d(string=hai) anos
  Dmitry Baranchuk 24ba3433e4 [Fix] make distributed seq cls to not create the full bloom model (#49) %!s(int64=3) %!d(string=hai) anos
  justheuristic f12d0deee9 [quickfix 1/n] remove expensive assertions in inference code (#48) %!s(int64=3) %!d(string=hai) anos
  Dmitry Baranchuk 0fd2caa4be Convert actual model weights (#46) %!s(int64=3) %!d(string=hai) anos
  justheuristic a2634001e9 Reduce vocabulary size in test model, fix bug in routing when overlapped (#45) %!s(int64=3) %!d(string=hai) anos
  Dmitry Baranchuk 5745882c67 fix rpc_forward_stream %!s(int64=3) %!d(string=hai) anos
  Dmitry Baranchuk 6095f58681 Deep distributed prompt tuning (#42) %!s(int64=3) %!d(string=hai) anos
  justheuristic 9460220a10 make pytest outputs more verbose (#44) %!s(int64=3) %!d(string=hai) anos
  Dmitry Baranchuk c4aa1f49df Rename 350m -> 560m (#43) %!s(int64=3) %!d(string=hai) anos
  Dmitry Baranchuk 11a424837f integrate mixed-8bit model (#39) %!s(int64=3) %!d(string=hai) anos
  Alexander Borzunov 7d39d46966 Use "PETALS" as the readme title (#40) %!s(int64=3) %!d(string=hai) anos
  Dmitry Baranchuk 04a2b6f5e3 Support various backend dtypes & async serialization (#38) %!s(int64=3) %!d(string=hai) anos
  Artem Chumachenko d989b94614 Pack of Inference Changes (#37) %!s(int64=3) %!d(string=hai) anos