Commit History

Autor SHA1 Mensaxe Data
  Artem Chumachenko 568f21dc3b Add customizable input tensors (#445) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov 8c546d988a Test Llama, rebalancing, throughput eval, and all CLI scripts (#452) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov 11f0d992d7 Report inference, forward, and network RPS separately (#358) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov 1a78638c02 Test that bitsandbytes is not imported when it's not used (#351) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov de930918a0 Support loading blocks in 4-bit (QLoRA NF4 format, disabled by default) (#333) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov cb3f018f9f Add LLaMA support (#323) %!s(int64=2) %!d(string=hai) anos
  Max Ryabinin 793726b041 Speed up loading blocks using init with meta weights (#285) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov 702bb5a2c2 CI: Update deprecated actions, don't measure network RPS (#215) %!s(int64=2) %!d(string=hai) anos
  justheuristic ae9e71fe8e Add local tensor-parallel fwd/bwd (#143) %!s(int64=2) %!d(string=hai) anos
  justheuristic 91898c3c90 Switch to speedtest-cli (#157) %!s(int64=2) %!d(string=hai) anos
  justheuristic b04982c1a2 Bump transformers to 4.25.1 (#151) %!s(int64=2) %!d(string=hai) anos