justheuristic
|
2ad0b2b936
Fix p2p pushing in rpc_inference (by @miaoqijun), support transformers 4.38.2 (#563)
|
1 year ago |
Denis Mazur
|
0d91bbdac3
Bump transformers and accelerate versions (#554)
|
1 year ago |
justheuristic
|
25a0796b39
Hotfix: require peft version 0.5.0 (#539)
|
1 year ago |
justheuristic
|
dcce43670f
Hotfix: set transformers version <=4.34 temporarily (#538)
|
1 year ago |
Alexander Borzunov
|
abd547735f
Force use_cache=True (#496)
|
1 year ago |
Alexander Borzunov
|
26ebbfe8f0
Support macOS (#477)
|
2 years ago |
Alexander Borzunov
|
915b357740
Require transformers>=4.32.0 (#479)
|
2 years ago |
Alexander Borzunov
|
18e93afc73
Don't install cpufeature on non-x86_64 machines (#478)
|
2 years ago |
Artem Chumachenko
|
a14ae7334d
Update peft to 0.5.0 version (#475)
|
2 years ago |
justheuristic
|
4f850996bb
Change transformers version assert (#472)
|
2 years ago |
justheuristic
|
9250025140
Support transformers 4.32.x (#471)
|
2 years ago |
justheuristic
|
adda5f8c20
Temporarily require peft<0.5.0, transformers<4.32.0 (#470)
|
2 years ago |
Alexander Borzunov
|
593d980ad8
Use bitsandbytes 0.41.1 (#442)
|
2 years ago |
Alexander Borzunov
|
f3fafd14a4
Bump version to 2.0.1 (#411)
|
2 years ago |
Alexander Borzunov
|
eb0664b993
Support Python 3.11 (#393)
|
2 years ago |
Alexander Borzunov
|
e9a20e7e53
Require accelerate>=0.20.3 as transformers do (#383)
|
2 years ago |
Alexander Borzunov
|
895327a0ae
Fix readme code example, require Python < 3.11 until supported (#374)
|
2 years ago |
Alexander Borzunov
|
c735dd7ba3
Update transformers to 4.31.0 and peft to 0.4.0 (#371)
|
2 years ago |
Alexander Borzunov
|
f97582fb5f
Require transformers < 4.31.0 until we're compatible (#369)
|
2 years ago |
Alexander Borzunov
|
62d9ed5ce7
Implement shortest-path routing for inference (#362)
|
2 years ago |
Alexander Borzunov
|
3f733a96e3
Use bitsandbytes 0.40.1.post1 (#357)
|
2 years ago |
Alexander Borzunov
|
2c8959e713
Share more info about a server in DHT (#355)
|
2 years ago |
Alexander Borzunov
|
1a78638c02
Test that bitsandbytes is not imported when it's not used (#351)
|
2 years ago |
Artem Chumachenko
|
b9f0a5467f
Support peft LoRA adapters (#335)
|
2 years ago |
Alexander Borzunov
|
dfc6578c8e
Use bitsandbytes 0.40.0.post4 with bias hotfix (#342)
|
2 years ago |
Alexander Borzunov
|
fa095f6461
Use 4-bit for llama by default, use bitsandbytes 0.40.0.post3 (#340)
|
2 years ago |
Alexander Borzunov
|
de930918a0
Support loading blocks in 4-bit (QLoRA NF4 format, disabled by default) (#333)
|
2 years ago |
Alexander Borzunov
|
66a47c763e
Require pydantic < 2.0 (2.0 is incompatible with hivemind 1.1.8) (#337)
|
2 years ago |
Alexander Borzunov
|
cb3f018f9f
Add LLaMA support (#323)
|
2 years ago |
Alexander Borzunov
|
0a313bf6c5
Update hivemind to 1.1.8, enable efficient bfloat16 encoding (#311)
|
2 years ago |