apps/FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git (synced 2025-10-05 16:48:03 +08:00)
FastDeploy/fastdeploy/model_executor/layers/quantization @ 684703fd72ef7013b7d067ff2ae212f435701fde
Latest commit: 684703fd72 by jiangjiajun, "[LLM] First commit the llm deployment code", 2025-06-09 19:20:15 +08:00
File            Last commit message                          Date
__init__.py     [LLM] First commit the llm deployment code   2025-06-09 19:20:15 +08:00
block_wise.py   [LLM] First commit the llm deployment code   2025-06-09 19:20:15 +08:00
kv_cache.py     [LLM] First commit the llm deployment code   2025-06-09 19:20:15 +08:00
quant_base.py   [LLM] First commit the llm deployment code   2025-06-09 19:20:15 +08:00
w4afp8.py       [LLM] First commit the llm deployment code   2025-06-09 19:20:15 +08:00
w8a8.py         [LLM] First commit the llm deployment code   2025-06-09 19:20:15 +08:00
weight_only.py  [LLM] First commit the llm deployment code   2025-06-09 19:20:15 +08:00
wfp8afp8.py     [LLM] First commit the llm deployment code   2025-06-09 19:20:15 +08:00