Alexander Borzunov
|
8c546d988a
Test Llama, rebalancing, throughput eval, and all CLI scripts (#452)
|
%!s(int64=2) %!d(string=hai) anos |
Alexander Borzunov
|
cb3f018f9f
Add LLaMA support (#323)
|
%!s(int64=2) %!d(string=hai) anos |
Max Ryabinin
|
793726b041
Speed up loading blocks using init with meta weights (#285)
|
%!s(int64=2) %!d(string=hai) anos |
justheuristic
|
c2cb6d19ae
Increase tolerances in test_tp_block (#196)
|
%!s(int64=2) %!d(string=hai) anos |
justheuristic
|
ae9e71fe8e
Add local tensor-parallel fwd/bwd (#143)
|
%!s(int64=2) %!d(string=hai) anos |