Dmitry Baranchuk
|
6095f58681
Deep distributed prompt tuning (#42)
|
3 years ago |
Dmitry Baranchuk
|
11a424837f
integrate mixed-8bit model (#39)
|
3 years ago |
Dmitry Baranchuk
|
04a2b6f5e3
Support various backend dtypes & async serialization (#38)
|
3 years ago |
Artem Chumachenko
|
d989b94614
Pack of Inference Changes (#37)
|
3 years ago |
Dmitry Baranchuk
|
6573076883
Sequential and parallel forward / backward (#36)
|
3 years ago |
Artem Chumachenko
|
6ee942e915
Add GenerationMixin class (#29)
|
3 years ago |
justheuristic
|
e2711a033b
Add automated tests (#23)
|
3 years ago |
Dmitry Baranchuk
|
f5463812ad
Shallow prompt tuning (#22)
|
3 years ago |
dbaranchuk
|
21e1f42f04
mv set_requires_grad to remote_model
|
3 years ago |
dbaranchuk
|
79280c4371
refactoring
|
3 years ago |
dbaranchuk
|
6bffeff0a1
fix
|
3 years ago |
dbaranchuk
|
b3cc9e0d99
add LM head for DistributedBloomCausalLM
|
3 years ago |
dbaranchuk
|
df42822f26
LM head for CausalLM & chunked forward
|
3 years ago |
justheuristic
|
1c68670d06
it works
|
3 years ago |
justheuristic
|
88c1bf9896
black-isort
|
3 years ago |
justheuristic
|
4695071ad2
WIP: make DistributedBloom compliant with HF interface
|
3 years ago |
justheuristic
|
e32208c954
black-isort
|
3 years ago |
justheuristic
|
4ad845bce3
black-isort
|
3 years ago |
Dmitry Baranchuk
|
e66ab6f1f2
design interface & refactoring
|
3 years ago |
Dmitry Baranchuk
|
d969172208
set requires_grad=False, lm_layer -> h @ word_embeddings, rm lm_layer from converted_model
|
3 years ago |
justheuristic
|
331591c915
less intrusive warnings
|
3 years ago |
justheuristic
|
9c492bbe8c
Infer prefix by default
|
3 years ago |
justheuristic
|
19ae71e8fc
from_pretrained
|
3 years ago |
justheuristic
|
471e47c0f5
black-isort
|
3 years ago |
justheuristic
|
7d68f6b9a4
fix model creation
|
3 years ago |
justheuristic
|
5849cea28c
prototype remote sequential
|
3 years ago |