Aleksandr Borzunov c10e505324 Measure and cache throughput by default 3 gadi atpakaļ
..
__init__.py 05faa0b3c8 add quantization script for cpu 3 gadi atpakaļ
config.json a798ea04a6 add minimalistic benchmarks 3 gadi atpakaļ
convert_model.py 5695897620 fix imports 3 gadi atpakaļ
deploy_server.sh f055135b08 rm prefix 3 gadi atpakaļ
inference_one_block.py 4695071ad2 WIP: make DistributedBloom compliant with HF interface 3 gadi atpakaļ
local_server_config_example.cfg f60a7dd183 deploy swarm on local & remote machines 3 gadi atpakaļ
remote_server_config_example.cfg f60a7dd183 deploy swarm on local & remote machines 3 gadi atpakaļ
run_local_servers.sh d969172208 set requires_grad=False, lm_layer -> h @ word_embeddings, rm lm_layer from comverted_model 3 gadi atpakaļ
run_remote_servers.sh f60a7dd183 deploy swarm on local & remote machines 3 gadi atpakaļ
run_server.py c10e505324 Measure and cache throughput by default 3 gadi atpakaļ
speed_test.py 94f5e366b9 Draft measuring throughput 3 gadi atpakaļ