Commit History

Author SHA1 Message Date
  Alexander Borzunov 158013a671 Implement direct server-to-server communication (#331) 2 years ago
  Alexander Borzunov 4d9c26fe5c Allow free_disk_space_for() remove arbitrary files from Petals cache (#339) 2 years ago
  Alexander Borzunov de930918a0 Support loading blocks in 4-bit (QLoRA NF4 format, disabled by default) (#333) 2 years ago
  Alexander Borzunov 66a47c763e Require pydantic < 2.0 (2.0 is incompatible with hivemind 1.1.8) (#337) 2 years ago
  Alexander Borzunov 10c72acdf4 Fix warmup steps and minor issues in benchmarks (#334) 2 years ago
  Alexander Borzunov d126ee3053 Add benchmark scripts (#319) 2 years ago
  Alexander Borzunov fecee8c4dc Show license links when loading models (#332) 2 years ago
  Alexander Borzunov 47a2b1ee65 Fix llama's lm_head.weight.requires_grad (#330) 2 years ago
  Alexander Borzunov 7a37513f77 Add AutoDistributed{Model, ModelForCausalLM, ModelForSequenceClassification} (#329) 2 years ago
  Alexander Borzunov cb3f018f9f Add LLaMA support (#323) 2 years ago
  Max Ryabinin 5c0733711a Use number of tokens for attn_cache_size (#286) 2 years ago
  Max Ryabinin c839173e57 Determine block dtype in a unified manner (#325) 2 years ago
  Max Ryabinin 3e7ae5116d Remove unused imports and attributes (#324) 2 years ago
  Alexander Borzunov 675bacb592 Bump version to 1.1.5 (#312) 2 years ago
  Alexander Borzunov e026952338 Abort speedtest if it runs too long (#316) 2 years ago
  Alexander Borzunov 6eb306a605 Raise error for unexpected .generate() kwargs (#315) 2 years ago
  Alexander Borzunov d9e7bfc949 Divide compute throughput by average no. of used blocks (#314) 2 years ago
  Alexander Borzunov 6137b1b4b0 Replace .make_sequence(..., mode="random") with mode="max_throughput" (#313) 2 years ago
  Alexander Borzunov 0a313bf6c5 Update hivemind to 1.1.8, enable efficient bfloat16 encoding (#311) 2 years ago
  Alexander Borzunov 8f6342a861 Refactor RemoteSequenceManager (#309) 2 years ago
  Alexander Borzunov 454c193863 Fix OOMs happening in case of accelerate >= 0.16.0 (#310) 2 years ago
  Alexander Borzunov 93c4eba5d1 Bump version to 1.1.4 (#306) 2 years ago
  Alexander Borzunov c0e0e1319d Force transformers to use config.torch_dtype by default (#307) 2 years ago
  Alexander Borzunov 98be9ffe4c Relax the rest of Hugging Face dependencies (#305) 2 years ago
  Alexander Borzunov 5c0b4286b2 Suggest commands for Docker first (#304) 2 years ago
  Alexander Borzunov 35662b4a16 Require bitsandbytes == 0.38.0.post2, hivemind == 1.1.7 (#302) 2 years ago
  Alexander Borzunov 21c3526ec1 Start SequenceManager's thread only after first .make_sequence() (#301) 2 years ago
  Alexander Borzunov 6c6150f684 Remove use_auto_relay=True in client (#300) 2 years ago
  Alexander Borzunov 892fa2386a Remove CustomLinear8bitLt (#297) 2 years ago
  Alexander Borzunov 74d8cda8c4 Add Python 3.10 to CI (#299) 2 years ago