This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-10-08 18:11:00 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
fac2f64837572613ebdf1b46603d219c39d68a48
FastDeploy
/
test
History
yzwu
fbdd6b0663
[Iluvatar GPU] Optimze attention and moe performance (
#3234
)
2025-08-08 10:51:24 +08:00
..
ce
/server
[stop_seq] fix out-bound value for stop sequence (
#3216
)
2025-08-07 15:40:21 +08:00
ci_use
[Iluvatar GPU] Optimze attention and moe performance (
#3234
)
2025-08-08 10:51:24 +08:00
entrypoints
/openai
[Bugfix] Fix uninitialized decoded_token and add corresponding unit test. (
#3195
)
2025-08-04 19:23:58 +08:00
graph_optimization
[Executor]Update graph test case and delete test_attention (
#3257
)
2025-08-07 14:05:15 +08:00
layers
[Executor]Update graph test case and delete test_attention (
#3257
)
2025-08-07 14:05:15 +08:00
operators
[New Feature] Support W4Afp8 MoE GroupGemm (
#3171
)
2025-08-06 10:34:05 +08:00
plugins
[plugin] Custom model_runner/model support (
#3186
)
2025-08-04 18:52:39 -07:00
utils
[fix] multi source download (
#3259
)
2025-08-07 19:30:39 +08:00