.. |
__init__.py
|
05faa0b3c8
add quantization script for cpu
|
3 жил өмнө |
config.json
|
a798ea04a6
add minimalistic benchmarks
|
3 жил өмнө |
convert_model.py
|
a2634001e9
Reduce vocabulary size in test model, fix bug in routing when overlapped (#45)
|
3 жил өмнө |
deploy_server.sh
|
11a424837f
integrate mixed-8bit model (#39)
|
3 жил өмнө |
inference_one_block.py
|
4695071ad2
WIP: make DistributedBloom compliant with HF interface
|
3 жил өмнө |
local_server_config_example.cfg
|
f60a7dd183
deploy swarm on local & remote machines
|
3 жил өмнө |
remote_server_config_example.cfg
|
f60a7dd183
deploy swarm on local & remote machines
|
3 жил өмнө |
run_local_servers.sh
|
11a424837f
integrate mixed-8bit model (#39)
|
3 жил өмнө |
run_remote_servers.sh
|
6573076883
Sequential and parallel forward / backward (#36)
|
3 жил өмнө |
run_server.py
|
c6e1b5a8e5
Add various server timeouts, lower --max_batch_size and --inference_max_length defaults (#97)
|
2 жил өмнө |
speed_test.py
|
e2711a033b
Add automated tests (#23)
|
3 жил өмнө |