Tim Dettmers 43fdcac6aa Loading a bloom block working. 3 years ago
..
__init__.py 05faa0b3c8 add quantization script for cpu 3 years ago
config.json a798ea04a6 add minimalistic benchmarks 3 years ago
convert_model.py 83cd4412a1 black-isort 3 years ago
inference_one_block.py 43fdcac6aa Loading a bloom block working. 3 years ago
run_server.py 1ab5fb1630 fetch a specific bloom block without downloading the entire model 3 years ago