Commit History

Autor SHA1 Mensaxe Data
  justheuristic 28971dcedd black-isort %!s(int64=3) %!d(string=hai) anos
  Artem Chumachenko 5af1c9e3b4 fix test %!s(int64=3) %!d(string=hai) anos
  justheuristic aa769ce846 fix bug when cache assignment surpasses max length %!s(int64=3) %!d(string=hai) anos
  justheuristic 93951779e1 fix bug when cache assignment surpasses max length %!s(int64=3) %!d(string=hai) anos
  justheuristic 01d6ba8a9c dedup %!s(int64=3) %!d(string=hai) anos
  Artem Chumachenko 1b28432533 wip %!s(int64=3) %!d(string=hai) anos
  justheuristic 33aa952d41 client-side prompts %!s(int64=3) %!d(string=hai) anos
  Artem Chumachenko 85c2ec7b06 actually merge %!s(int64=3) %!d(string=hai) anos
  justheuristic e0bb3762b4 Merge branch 'generation-inference' into deep_prompt_inference %!s(int64=3) %!d(string=hai) anos
  Artem Chumachenko 01c1e198b8 Fix merge conficts %!s(int64=3) %!d(string=hai) anos
  Artem Chumachenko 88e6a75996 Add part of deepprompts %!s(int64=3) %!d(string=hai) anos
  Artem Chumachenko c003830cc6 fix %!s(int64=3) %!d(string=hai) anos
  Artem Chumachenko 1afd59a071 introduce hypo_ids %!s(int64=3) %!d(string=hai) anos
  Artem Chumachenko f62c65ec23 fixes %!s(int64=3) %!d(string=hai) anos
  Artem Chumachenko 53e19de6e0 Add tests %!s(int64=3) %!d(string=hai) anos
  Artem Chumachenko ade986ca58 Return multibatch mode %!s(int64=3) %!d(string=hai) anos
  Artem Chumachenko fb142375cb WIP %!s(int64=3) %!d(string=hai) anos
  Alexander Borzunov 54ad745bed Warn that current instructions involve 6B model but we will replace them soon (#63) %!s(int64=3) %!d(string=hai) anos
  Alexander Borzunov 5f0c5329d4 Update readme with arxiv link and more discussions (#62) %!s(int64=3) %!d(string=hai) anos
  Alexander Borzunov 9bea7b9ea8 Update bullet points with feedback from Tim and other people (#61) %!s(int64=3) %!d(string=hai) anos
  Alexander Borzunov 7653562aa1 Use latest version of Petals scheme, shrink Petals logo (#59) %!s(int64=3) %!d(string=hai) anos
  Alexander Borzunov 2eb5843852 Update readme for the 1st public release (#57) %!s(int64=3) %!d(string=hai) anos
  Pavel Samygin 0be21775af remove transformer block, implement as sequential of size 1 (#54) %!s(int64=3) %!d(string=hai) anos
  Artem Chumachenko 77220c718c Add shallow prefix-tuned inference (#55) %!s(int64=3) %!d(string=hai) anos
  justheuristic d271b75dd4 Let users specify sequence length instead of assuming 2048 (#52) %!s(int64=3) %!d(string=hai) anos
  Dmitry Baranchuk 948877149c Fix recovering for sequential_backward (#50) %!s(int64=3) %!d(string=hai) anos
  Dmitry Baranchuk 24ba3433e4 [Fix] make distributed seq cls to not create the full bloom model (#49) %!s(int64=3) %!d(string=hai) anos
  justheuristic f12d0deee9 [quickfix 1/n] remove expensive assertions in inference code (#48) %!s(int64=3) %!d(string=hai) anos
  Dmitry Baranchuk 0fd2caa4be Convert actual model weights (#46) %!s(int64=3) %!d(string=hai) anos
  justheuristic a2634001e9 Reduce vocabulary size in test model, fix bug in routing when overlapped (#45) %!s(int64=3) %!d(string=hai) anos