Commit history

Author SHA1 Message Date
  Max Ryabinin fa464dfc99 WIP Triton+QKV merge 1 year ago
  Alexander Borzunov b4d822afb2 Force use_cache=True in config only (#497) 1 year ago
  Alexander Borzunov abd547735f Force use_cache=True (#496) 1 year ago
  Alexander Borzunov 6ef6bf5fa2 Create model index in DHT (#491) 2 years ago
  Alexander Borzunov 6bb3f54e39 Replace dots in repo names when building DHT prefixes (#489) 2 years ago
  Alexander Borzunov 02fc71eb25 Fix race condition in MemoryCache (#487) 2 years ago
  Alexander Borzunov dc0072fde1 Wait for DHT storing state OFFLINE on shutdown (#486) 2 years ago
  Alexander Borzunov a26559ff65 Fix `.generate(input_ids=...)` (#485) 2 years ago
  Alexander Borzunov 459933f846 Remove no-op process in PrioritizedTaskPool (#484) 2 years ago
  Alexander Borzunov 26ebbfe8f0 Support macOS (#477) 2 years ago
  Alexander Borzunov 75e516a8c1 Refactor readme (#482) 2 years ago
  justheuristic c08d09c4d3 Rewrite MemoryCache alloc_timeout logic (#434) 2 years ago
  Alexander Borzunov 90840dfea2 Fix requiring transformers>=4.32.0 (#480) 2 years ago
  Alexander Borzunov 915b357740 Require transformers>=4.32.0 (#479) 2 years ago
  Alexander Borzunov 18e93afc73 Don't install cpufeature on non-x86_64 machines (#478) 2 years ago
  Alexander Borzunov 6967904590 Bump version to 2.1.0 (#474) 2 years ago
  Alexander Borzunov df8ab09ca2 Hide excess key message (#476) 2 years ago
  Artem Chumachenko a14ae7334d Update peft to 0.5.0 version (#475) 2 years ago
  Alexander Borzunov a9b0e9ff1a Support loading weights from Safetensors on server (#473) 2 years ago
  justheuristic 4f850996bb Change transformers version assert (#472) 2 years ago
  justheuristic 9250025140 Support transformers 4.32.x (#471) 2 years ago
  justheuristic adda5f8c20 Temporarily require peft<0.5.0, transformers<4.32.0 (#470) 2 years ago
  Alexander Borzunov de2475f31c Make client compatible with transformers' GenerationMixin (#464) 2 years ago
  Alexander Borzunov 063e94b4c8 Move SequenceManagerConfig -> ClientConfig, petals.dht_utils -> petals.utils.dht (#463) 2 years ago
  Artem Chumachenko 568f21dc3b Add customizable input tensors (#445) 2 years ago
  Alexander Borzunov 329f7d31e8 Add `blocked_servers` argument (#462) 2 years ago
  Alexander Borzunov 722c4dc496 Bump version to 2.0.1.post2 (#459) 2 years ago
  Alexander Borzunov 056f22515a Prioritize short inference, unmerge pools for long inference (#458) 2 years ago
  justheuristic 55eb36ef48 Fix missing torch.cuda.synchronize for computing throughput (#456) 2 years ago
  Alexander Borzunov 0e7189b3ed benchmarks: Aggregate speed among workers, set default dtype torch32 (#454) 2 years ago