.. |
bootstrap.id
|
8c546d988a
Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)
|
2 rokov pred |
conftest.py
|
668b736031
Fix logging: do not duplicate lines, enable colors in Colab (#156)
|
2 rokov pred |
server2.id
|
8c546d988a
Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)
|
2 rokov pred |
test_aux_functions.py
|
568f21dc3b
Add customizable input tensors (#445)
|
2 rokov pred |
test_block_exact_match.py
|
056f22515a
Prioritize short inference, unmerge pools for long inference (#458)
|
2 rokov pred |
test_chained_calls.py
|
8c546d988a
Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)
|
2 rokov pred |
test_dtype.py
|
cb3f018f9f
Add LLaMA support (#323)
|
2 rokov pred |
test_full_model.py
|
8c546d988a
Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)
|
2 rokov pred |
test_peft.py
|
b9f0a5467f
Support peft LoRA adapters (#335)
|
2 rokov pred |
test_priority_pool.py
|
084d565845
priority pool
|
2 rokov pred |
test_remote_sequential.py
|
329f7d31e8
Add `blocked_servers` argument (#462)
|
2 rokov pred |
test_sequence_manager.py
|
8c546d988a
Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)
|
2 rokov pred |
test_server_stats.py
|
65e87395bc
wip (again)
|
2 rokov pred |
test_tensor_parallel.py
|
8c546d988a
Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)
|
2 rokov pred |
test_utils.py
|
b9f0a5467f
Support peft LoRA adapters (#335)
|
2 rokov pred |