This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-12-24 13:28:13 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
64d1aa973bc8d1a1bcb364900510393b04069e06
FastDeploy
/
fastdeploy
/
model_executor
/
layers
History
RAM
25a983ba9c
1.fix the bug of draft model with ep 2.fix sampler bug (
#4589
)
2025-10-27 17:47:34 +08:00
..
attention
Support GPT-OSS-BF16 (
#4240
)
2025-10-20 14:44:58 +08:00
backends
[XPU] bind some OPs for VL model with pybind (
#4522
)
2025-10-27 10:50:08 +08:00
moe
【BugFix】fix ep buffer clear (
#4450
)
2025-10-21 10:56:00 +08:00
pool
…
quantization
WINT4/WINT8 dense gemm default use Machete (
#4451
)
2025-10-23 17:57:59 +08:00
sample
1.fix the bug of draft model with ep 2.fix sampler bug (
#4589
)
2025-10-27 17:47:34 +08:00
__init__.py
…
activation.py
…
embeddings.py
…
linear.py
add qwen-2.5-7B-PRM/ernie-rm (
#4319
)
2025-10-20 15:31:03 +08:00
lm_head.py
…
mtp_linear.py
…
normalization.py
…
pooler.py
…
rotary_embedding.py
[Feature] Support Paddle-OCR (
#4396
)
2025-10-24 23:34:30 +08:00
utils.py
…