Commit History

Author SHA1 Message Date
  Artem Chumachenko d6f4f80f3f Fix Mixtral-related issues (#570) 1 year ago
  Denis Mazur 0d91bbdac3 Bump transformers and accelerate versions (#554) 1 year ago
  Max Ryabinin 03cbe90234 Optimize LLaMA for inference (#513) 1 year ago
  Max Ryabinin 1ebd88ae7b Optimize the Falcon block for inference (#500) 1 year ago