apps/FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git (synced 2025-12-24 13:28:13 +08:00)
64d1aa973bc8d1a1bcb364900510393b04069e06
FastDeploy/fastdeploy/model_executor/layers/quantization
Sunny-bot1  4ffe41a747  WINT4/WINT8 dense gemm default use Machete (#4451)  2025-10-23 17:57:59 +08:00
..
ops                 WINT4/WINT8 dense gemm default use Machete (#4451)                           2025-10-23 17:57:59 +08:00
__init__.py         …
block_wise_fp8.py   …
kv_cache.py         …
mix_quant.py        …
quant_base.py       …
tensor_wise_fp8.py  …
w4a8.py             …
w4afp8.py           …
w8a8.py             …
weight_only.py      WINT4/WINT8 dense gemm default use Machete (#4451)                           2025-10-23 17:57:59 +08:00
wfp8afp8.py         [BugFix]Fix wfp8afp8 triton moe group_topk renormalized=True (#4449)         2025-10-16 23:17:48 +08:00
wint2.py            …