Commit History

Autor SHA1 Mensaxe Data
  Alexander Borzunov 955eae30b3 Mention 1 sec/token explicitly %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov 33c210b973 Update Colab notebook %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov f56edaa13f Fix inference and rpc_info() fault tolerance (#131) %!s(int64=2) %!d(string=hai) anos
  justheuristic 79a4308992 Clear trigger before engaging in update (#130) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov b8e1c1b7f5 Revert to hivemind==1.1.3 for stability (#129) %!s(int64=2) %!d(string=hai) anos
  justheuristic 68c85e7492 Avoid synchronous updates, ban peers based on request outcome (#127) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov 9dbf5e2e6f Set dht.num_workers = n_layer, update_period = 150, expiration = 300 (#125) %!s(int64=2) %!d(string=hai) anos
  Max Ryabinin 3ca8b4f082 Fix typos with codespell (#126) %!s(int64=2) %!d(string=hai) anos
  justheuristic 8491ed2bd3 Add checks for forward() inputs on the client side (#123) %!s(int64=2) %!d(string=hai) anos
  Max Ryabinin 055f85b83e Call block.load_state_dict only once (#124) %!s(int64=2) %!d(string=hai) anos
  Artem Chumachenko 0855aa7347 Update notebooks to use full BLOOM-176B (#104) %!s(int64=2) %!d(string=hai) anos
  Max Ryabinin 4ffb4d83c7 Remove "-r" when installing Petals in examples (#122) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov d29ef70c85 Update README.md %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov 1d9aa77697 Update README.md %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov da36470a4b Update README.md %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov 81b94df14b Rework readme, move code example to the top, link draft of Colab (#118) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov 893987ebf8 Require hivemind==1.1.4 with p2pd v0.3.13 (#121) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov fc6722576b Choose --num_blocks for bigscience/bloom-petals automatically (#119) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov f72c220404 Suppress quantization warning and fix dtype defaults in compute benchmark (#117) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov 643a054170 Make server use smart defaults (#115) %!s(int64=2) %!d(string=hai) anos
  justheuristic 9e11f73242 Fix tile size on ampere (#116) %!s(int64=2) %!d(string=hai) anos
  justheuristic 617d70f7dc Support --load_in_8bit on pre-Turing GPUs (#113) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov 1ea44b0d3c Measure throughput for different configs, devices, and dtypes separately (#114) %!s(int64=2) %!d(string=hai) anos
  justheuristic 01838f9a99 Fix Linear8bitlt state config, update tests (#112) %!s(int64=2) %!d(string=hai) anos
  Aleksandr Borzunov 96033de921 Fix script for running servers robustly %!s(int64=2) %!d(string=hai) anos
  Aleksandr Borzunov 85cf32d2a4 Add script to run servers robustly %!s(int64=2) %!d(string=hai) anos
  justheuristic 088713912d Patch Linear8bit to enable CxB backward (#111) %!s(int64=2) %!d(string=hai) anos
  justheuristic 8dc0f513ba Hotfix span selection (#110) %!s(int64=2) %!d(string=hai) anos
  justheuristic a2066a4096 Optimize RemoteSequenceManager (#106) %!s(int64=2) %!d(string=hai) anos
  Artem Chumachenko 7d859a947b Expose request_timeout to DistributedBloomConfig (#105) %!s(int64=2) %!d(string=hai) anos