Commit History

Autor SHA1 Mensaxe Data
  xtinkt b9d4d06859 test hai 1 ano
  xtinkt 5ca5822686 draft test hai 1 ano
  Anton Sinitsin c0a4d2e3d5 Add option to rollback inference for a certain number of steps (#588) hai 1 ano
  Anton Sinitsin 68585864ae Update transformers to 4.41.2 (#583) hai 1 ano
  Priyanshupareek e268c99a6b Restrict PyTorch version to <2.3.0 to resolve import error (#577) hai 1 ano
  Artem Chumachenko 30f522d1a0 Fix dummy cache allocation (#574) hai 1 ano
  Artem Chumachenko d6f4f80f3f Fix Mixtral-related issues (#570) hai 1 ano
  Artem Chumachenko d2fcbbc72e Add Mixtral models (#553) hai 1 ano
  justheuristic 2ad0b2b936 Fix p2p pushing in rpc_inference (by @miaoqijun ) , support transformers 4.38.2 (#563) hai 1 ano
  justheuristic efee5d1fa8 Clean disk space in push-docker-image.yaml (#558) hai 1 ano
  Denis Mazur 0d91bbdac3 Bump transformers and accelerate versions (#554) hai 1 ano
  justheuristic d59c15c578 Bump version for inference diagnostics (#543) hai 1 ano
  Max Ryabinin 03cbe90234 Optimize LLaMA for inference (#513) hai 1 ano
  justheuristic 25a0796b39 Hotfix: require peft version 0.5.0 (#539) hai 1 ano
  justheuristic dcce43670f Hotfix: set transformers version <=4.34 temporarily (#538) hai 1 ano
  Alexander Borzunov 82a97d6e9e Fix beam search in GPU clients (#531) hai 1 ano
  Alexander Borzunov 47d50e1e29 Improve default arguments for clients and servers (#530) hai 1 ano
  Max Ryabinin ae19b65095 Add position_ids argument to DistributedFalconModel (#525) hai 1 ano
  Alexander Borzunov 1d9401ddce Update README.md (#520) hai 1 ano
  FYY a2484b3053 Fix file locks in NFS-mounted directories (#517) hai 1 ano
  Alexander Borzunov 5ce4f1a159 Store (start_block, end_block) in each DHT record for reliability (#510) hai 1 ano
  Alexander Borzunov 158621677b Bump version to 2.2.0 (#502) hai 1 ano
  Max Ryabinin 1ebd88ae7b Optimize the Falcon block for inference (#500) hai 1 ano
  Alexander Borzunov d40eb6c701 Fix prompt tuning after #464 (#501) hai 1 ano
  Alexander Borzunov dd4a3230bc Add Falcon support (#499) hai 1 ano
  Alexander Borzunov b4d822afb2 Force use_cache=True in config only (#497) hai 1 ano
  Alexander Borzunov abd547735f Force use_cache=True (#496) hai 1 ano
  Alexander Borzunov 6ef6bf5fa2 Create model index in DHT (#491) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov 6bb3f54e39 Replace dots in repo names when building DHT prefixes (#489) %!s(int64=2) %!d(string=hai) anos
  Alexander Borzunov 02fc71eb25 Fix race condition in MemoryCache (#487) %!s(int64=2) %!d(string=hai) anos