justheuristic
|
65ad7a131e
refactor priority pool, copy-paste runtime from hivemind (but without bugs!)
|
há 3 anos atrás |
Pavel Samygin
|
f295ec9e84
WIP
|
há 3 anos atrás |
justheuristic
|
1c456f4f46
Merge branch 'main' into priority-tasks
|
há 3 anos atrás |
justheuristic
|
f3984b192a
Make attention cache wait until memory is freed (#53)
|
há 3 anos atrás |
justheuristic
|
8a0c056929
Fix calling rpc_info multiple times (#60)
|
há 3 anos atrás |
justheuristic
|
a86145bbc2
serialize points in inference session
|
há 3 anos atrás |
justheuristic
|
fb0aa13054
re-fix
|
há 3 anos atrás |
justheuristic
|
5c67465154
re-fix
|
há 3 anos atrás |
justheuristic
|
b7ed72c4d6
default to fifo
|
há 3 anos atrás |
justheuristic
|
09d5533326
Merge branch 'priority-tasks' of github.com:bigscience-workshop/petals into priority-tasks
|
há 3 anos atrás |
justheuristic
|
64a2a24911
default to fifo
|
há 3 anos atrás |
Pavel Samygin
|
74abb1299e
priortize task in handler before submit task
|
há 3 anos atrás |
Pavel Samygin
|
0253ea7c73
Merge branch 'main' into priority-tasks
|
há 3 anos atrás |
Pavel Samygin
|
0c6350da17
intermediate changes
|
há 3 anos atrás |
Artem Chumachenko
|
ada98a1b37
Add deep prompt inference (#66)
|
há 3 anos atrás |
Pavel Samygin
|
aea707088b
Merge branch 'main' into priority-tasks
|
há 3 anos atrás |
Alexander Borzunov
|
54ad745bed
Warn that current instructions involve 6B model but we will replace them soon (#63)
|
há 3 anos atrás |
Alexander Borzunov
|
5f0c5329d4
Update readme with arxiv link and more discussions (#62)
|
há 3 anos atrás |
Alexander Borzunov
|
9bea7b9ea8
Update bullet points with feedback from Tim and other people (#61)
|
há 3 anos atrás |
Alexander Borzunov
|
7653562aa1
Use latest version of Petals scheme, shrink Petals logo (#59)
|
há 3 anos atrás |
Alexander Borzunov
|
2eb5843852
Update readme for the 1st public release (#57)
|
há 3 anos atrás |
Pavel Samygin
|
0be21775af
remove transformer block, implement as sequential of size 1 (#54)
|
há 3 anos atrás |
Artem Chumachenko
|
77220c718c
Add shallow prefix-tuned inference (#55)
|
há 3 anos atrás |
justheuristic
|
db481f32bf
pass metadata
|
há 3 anos atrás |
justheuristic
|
a9f9133175
Merge branch 'main' into priority-tasks
|
há 3 anos atrás |
justheuristic
|
3d0b9b86a0
we're forked
|
há 3 anos atrás |
justheuristic
|
d271b75dd4
Let users specify sequence length instead of assuming 2048 (#52)
|
há 3 anos atrás |
Dmitry Baranchuk
|
948877149c
Fix recovering for sequential_backward (#50)
|
há 3 anos atrás |
Dmitry Baranchuk
|
24ba3433e4
[Fix] make distributed seq cls to not create the full bloom model (#49)
|
há 3 anos atrás |
justheuristic
|
f12d0deee9
[quickfix 1/n] remove expensive assertions in inference code (#48)
|
há 3 anos atrás |