Historique des commits

Auteur SHA1 Message Date
  Alexander Borzunov a26559ff65 Fix `.generate(input_ids=...)` (#485) il y a 2 ans
  Alexander Borzunov 329f7d31e8 Add `blocked_servers` argument (#462) il y a 2 ans
  Alexander Borzunov 056f22515a Prioritize short inference, unmerge pools for long inference (#458) il y a 2 ans
  Alexander Borzunov 8c546d988a Test Llama, rebalancing, throughput eval, and all CLI scripts (#452) il y a 2 ans
  Alexander Borzunov de930918a0 Support loading blocks in 4-bit (QLoRA NF4 format, disabled by default) (#333) il y a 2 ans
  Alexander Borzunov cb3f018f9f Add LLaMA support (#323) il y a 2 ans
  Alexander Borzunov 8f6342a861 Refactor RemoteSequenceManager (#309) il y a 2 ans
  Alexander Borzunov 21c3526ec1 Start SequenceManager's thread only after first .make_sequence() (#301) il y a 2 ans
  Max Ryabinin 793726b041 Speed up loading blocks using init with meta weights (#285) il y a 2 ans
  Alexander Borzunov fee19e9b9b Use get_logger(__name__) instead of get_logger(__file__) (#265) il y a 2 ans
  justheuristic ae9e71fe8e Add local tensor-parallel fwd/bwd (#143) il y a 2 ans
  Alexander Borzunov 668b736031 Fix logging: do not duplicate lines, enable colors in Colab (#156) il y a 2 ans
  justheuristic a2066a4096 Optimize RemoteSequenceManager (#106) il y a 2 ans
  Alexander Borzunov 43ac6016ac Fix dtypes in backend schemas (#99) il y a 2 ans
  Alexander Borzunov 7bd5916744 Make Petals a pip-installable package (attempt 2) (#102) il y a 2 ans
  Dmitry Baranchuk 6095f58681 Deep distributed prompt tuning (#42) il y a 3 ans
  justheuristic f0c7383181 Implement RemoteSequential slicing and extra repr, add tests (#30) il y a 3 ans