This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-12-24 13:28:13 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
cfa0982aae2c86e5114febb23c4c712bc24d06e2
FastDeploy
/
fastdeploy
/
model_executor
/
layers
History
liyonghua0910
cfa0982aae
[fix] fix ep group all-reduce
2025-09-17 18:05:34 +08:00
..
attention
…
backends
[Feature] refactor metax_gpu attention and moe and remove some useless code (
#3688
)
2025-09-12 14:40:25 +08:00
moe
[fix] fix ep group all-reduce
2025-09-17 18:05:34 +08:00
quantization
[BugFix]Fix load kv cache quant scale (
#4077
)
2025-09-12 17:44:03 +08:00
sample
…
__init__.py
…
activation.py
…
embeddings.py
…
linear.py
…
lm_head.py
…
mtp_linear.py
…
normalization.py
…
rotary_embedding.py
[Feature] refactor metax_gpu attention and moe and remove some useless code (
#3688
)
2025-09-12 14:40:25 +08:00
utils.py
…