Your Name
|
24efbeb236
black
|
il y a 11 mois |
justheuristic
|
13111911a6
Merge branch 'main' into speculative_inference
|
il y a 11 mois |
Ink
|
22afba627a
Upgrade Pydantic to >= 2.0.0 (#607)
|
il y a 1 an |
Alexander Borzunov
|
c68c1c3b92
Allow torch>=2.3.0 (#603)
|
il y a 1 an |
Anton Sinitsin
|
02bbd85ed8
Added primitives for speculative decoding and tests (#598)
|
il y a 1 an |
Aleksandr Borzunov
|
a2d4b65ae0
Update README.md
|
il y a 1 an |
Aleksandr Borzunov
|
10fab97e2b
Fix year in citation
|
il y a 1 an |
Alexander Borzunov
|
8ad5513bea
Fix server warnings, update license links and readme (#602)
|
il y a 1 an |
Alexander Borzunov
|
67ca11a282
Update hivemind to support torch >= 2.3.0, pydantic >= 2.0 (#601)
|
il y a 1 an |
Alexander Borzunov
|
103ef760da
Materialize buffers in get_block_size() (#600)
|
il y a 1 an |
justheuristic
|
10f7525ce0
Fix typo in README
|
il y a 1 an |
justheuristic
|
19be29e89e
note about llama 3.1 RoPE support
|
il y a 1 an |
justheuristic
|
6477cb85e7
Bump transformers to 4.43.1 (#596)
|
il y a 1 an |
Artem Chumachenko
|
f1e1b051d0
Update peft dependency, fix initialization and inference with new peft (#557)
|
il y a 1 an |
justheuristic
|
a59e38a578
running inference session with position getter/setter (#594)
|
il y a 1 an |
Anton Sinitsin
|
c0a4d2e3d5
Add option to rollback inference for a certain number of steps (#588)
|
il y a 1 an |
xtinkt
|
9aecb3f39e
style
|
il y a 1 an |
xtinkt
|
269028d0e6
fix
|
il y a 1 an |
xtinkt
|
7565ddee81
fix
|
il y a 1 an |
xtinkt
|
10de34b72a
fix
|
il y a 1 an |
xtinkt
|
de2c38ab1b
fix
|
il y a 1 an |
xtinkt
|
b6801af79d
fix
|
il y a 1 an |
xtinkt
|
4285ddbd7b
fix
|
il y a 1 an |
Anton Sinitsin
|
68585864ae
Update transformers to 4.41.2 (#583)
|
il y a 1 an |
Priyanshupareek
|
e268c99a6b
Restrict PyTorch version to <2.3.0 to resolve import error (#577)
|
il y a 1 an |
Artem Chumachenko
|
30f522d1a0
Fix dummy cache allocation (#574)
|
il y a 1 an |
Artem Chumachenko
|
d6f4f80f3f
Fix Mixtral-related issues (#570)
|
il y a 1 an |
Artem Chumachenko
|
d2fcbbc72e
Add Mixtral models (#553)
|
il y a 1 an |
justheuristic
|
2ad0b2b936
Fix p2p pushing in rpc_inference (by @miaoqijun ) , support transformers 4.38.2 (#563)
|
il y a 1 an |
justheuristic
|
efee5d1fa8
Clean disk space in push-docker-image.yaml (#558)
|
il y a 1 an |