justheuristic | fe7e012561 | fix merging | 1 year ago
Anton Sinitsin | 04bd8f2b52 | fix typo | 1 year ago
justheuristic | 5e4d884fa2 | Merge branch 'main' into test_set_position | 1 year ago
Your Name | 5ee583e584 | add assertion | 1 year ago
Your Name | e45532793d | test running inference session with position getter/setter | 1 year ago
Anton Sinitsin | c0a4d2e3d5 | Add option to rollback inference for a certain number of steps (#588) | 1 year ago
xtinkt | 9aecb3f39e | style | 1 year ago
xtinkt | 269028d0e6 | fix | 1 year ago
xtinkt | 7565ddee81 | fix | 1 year ago
xtinkt | 10de34b72a | fix | 1 year ago
xtinkt | de2c38ab1b | fix | 1 year ago
xtinkt | b6801af79d | fix | 1 year ago
xtinkt | 4285ddbd7b | fix | 1 year ago
Anton Sinitsin | 68585864ae | Update transformers to 4.41.2 (#583) | 1 year ago
Priyanshupareek | e268c99a6b | Restrict PyTorch version to <2.3.0 to resolve import error (#577) | 1 year ago
Artem Chumachenko | 30f522d1a0 | Fix dummy cache allocation (#574) | 1 year ago
Artem Chumachenko | d6f4f80f3f | Fix Mixtral-related issues (#570) | 1 year ago
Artem Chumachenko | d2fcbbc72e | Add Mixtral models (#553) | 1 year ago
justheuristic | 2ad0b2b936 | Fix p2p pushing in rpc_inference (by @miaoqijun), support transformers 4.38.2 (#563) | 1 year ago
justheuristic | efee5d1fa8 | Clean disk space in push-docker-image.yaml (#558) | 1 year ago
Denis Mazur | 0d91bbdac3 | Bump transformers and accelerate versions (#554) | 1 year ago
justheuristic | d59c15c578 | Bump version for inference diagnostics (#543) | 1 year ago
Max Ryabinin | 03cbe90234 | Optimize LLaMA for inference (#513) | 1 year ago
justheuristic | 25a0796b39 | Hotfix: require peft version 0.5.0 (#539) | 1 year ago
justheuristic | dcce43670f | Hotfix: set transformers version <=4.34 temporarily (#538) | 1 year ago
Alexander Borzunov | 82a97d6e9e | Fix beam search in GPU clients (#531) | 1 year ago
Alexander Borzunov | 47d50e1e29 | Improve default arguments for clients and servers (#530) | 1 year ago
Max Ryabinin | ae19b65095 | Add position_ids argument to DistributedFalconModel (#525) | 1 year ago
Alexander Borzunov | 1d9401ddce | Update README.md (#520) | 1 year ago
FYY | a2484b3053 | Fix file locks in NFS-mounted directories (#517) | 1 year ago