.. |
bloom
|
24ba3433e4
[Fix] make distributed seq cls to not create the full bloom model (#49)
|
3 år sedan |
client
|
8b8d54abc5
typo kwargs
|
2 år sedan |
server
|
97dd3c874a
s/expert/block/g
|
2 år sedan |
utils
|
0fd2caa4be
Convert actual model weights (#46)
|
3 år sedan |
__init__.py
|
75856e4769
Measure and cache network & compute throughput (#21)
|
3 år sedan |
data_structures.py
|
f0c7383181
Implement RemoteSequential slicing and extra repr, add tests (#30)
|
3 år sedan |
dht_utils.py
|
0be21775af
remove transformer block, implement as sequential of size 1 (#54)
|
3 år sedan |