This website works better with JavaScript
首頁
探索
說明
註冊
登入
AIForce
/
petals
镜像来自
https://github.com/bigscience-workshop/petals.git
關註
5
讚好
0
複刻
0
Files
問題管理
0
Wiki
暫無描述
337
提交歷史
137
Branches
15
版本發佈
分支:
yozh-dev-branch
分支列表
標籤列表
8bit-model
8bit_backward
8bit_blocks
8bit_model_inference
CI
add-sst2-example
amd-gpus
beamsearch
beat-docker-into-submission
bnb-0-41-1
bootstrap-peers
borzunov-patch-1
borzunov-patch-2
borzunov-patch-3
bump
cache
client
client-attempt2
client-convenience
dbaranchuk-patch-1
debug-leak
declare_adapters
deep-prompt-tuning
deep_prompt_inference
demo-1
diff
diff-compression
distributed-deep-ptune
download_8bit_weights
efficient-forward-backward
empty-weights
enable-rebalancing
examples_fix_hivemind
extract-module-container
facelift
fault-tolerant-inference
fix-auth-token
fix-branch-name
fix-cache
fix-ci
fix-convert-8bit
fix-distr-seq-cls
fix-docker
fix-inference-retry
fix-joining-announce
fix-master-ci
fix-nf4-and-dtypes
fix-pb2
fix-protobuf
fix-ptune
fix-readme
fix-rebalancing-issues
fix-requirements
fix-seq-backward-recovery
fix-too-many-open-files
fix3
forward-backward-timeouts
forward_backward
forward_kwargs
friendly-timeout-errors
generation
generation-inference
get_sequence
hf_quantization_integration
hivemind-1.1.4
hivemind-dht-fork-process
hotfix_bnb
inference_chain
instruction-readability-style
investigate-segfault
justheuristic-patch-1
justheuristic-patch-2
justheuristic-patch-3
justheuristic-patch-4
justheuristic-patch-5
lm_head
load-balancing
lora_from_hub
lru
main
main_fix
measure-throughput
measurements
memory_savings
mockup
multiple-experts
no-cpufeature
no_qkv_merge
optimize_seq
partial_rollback
payload-size
petals-readme-title
pip-installable
pip-installable-v2
priority-tasks
processing_attention
prompt-inference
prompt-tuning
ptune-example-personachat
ptune-wip
pytest-verbose
qkv_merge
readme-clarifications
readme-release
remove-remote-block
rename-test-model
repetition-penalty
rpc
rtfd
sequence
server-dtypes
server-increase-startup-timeout
server-logging
server-timeouts
speculative_inference
speculative_test
standardize
step_metadata
support-backend-dtypes
test-push
test-with-jf160m
test_branch
test_main
test_opt_serving
test_set_position
upd-deps
update-bullet-points
update-hivemind
update-model
update-readme-disclaimers-faq
update-readme-pics
update_example_1
vectorized_beam_search
versions
warn-about-6b-instructions
wip_triton
yozh-dev-branch
v2.2.0
v2.1.0
v2.0.1.post2
v2.0.1.post1
v2.0.1
v2.0.0.post3
v2.0.0.post2
v2.0.0.post1
v1.1.5
v1.1.4
v1.1.3
v1.1.2
v1.1.1
v1.1.0
v1.0.0
petals
HTTP
SSH
ZIP
TAR.GZ
Just Heuristic
4748c03ac5
readme
2 年之前
.github
702bb5a2c2
CI: Update deprecated actions, don't measure network RPS (
#215
)
2 年之前
examples
487411e87e
Fix fine-tuning notebooks intros (
#194
)
2 年之前
src
c2e3c13241
benchmark
2 年之前
tests
702bb5a2c2
CI: Update deprecated actions, don't measure network RPS (
#215
)
2 年之前
.gitignore
99059ae667
install script
3 年之前
Dockerfile
34644f13e1
Downgrade CUDA in Docker image to 11.0.3 (
#145
)
2 年之前
LICENSE
4518d65fdd
Add MIT license
2 年之前
README.md
4748c03ac5
readme
2 年之前
pyproject.toml
7bd5916744
Make Petals a pip-installable package (attempt 2) (
#102
)
2 年之前
setup.cfg
6ba63c6cc8
Fix output shape when resuming generation (
#211
)
2 年之前
README.md
debug branch for fast benchmarking; do not use for production