justheuristic 8b8d54abc5 typo kwargs 2 سال پیش
..
bloom 24ba3433e4 [Fix] make distributed seq cls to not create the full bloom model (#49) 3 سال پیش
client 8b8d54abc5 typo kwargs 2 سال پیش
server 97dd3c874a s/expert/block/g 2 سال پیش
utils 0fd2caa4be Convert actual model weights (#46) 3 سال پیش
__init__.py 75856e4769 Measure and cache network & compute throughput (#21) 3 سال پیش
data_structures.py f0c7383181 Implement RemoteSequential slicing and extra repr, add tests (#30) 3 سال پیش
dht_utils.py 0be21775af remove transformer block, implement as sequential of size 1 (#54) 3 سال پیش