apps/FastDeploy
mirror of https://github.com/PaddlePaddle/FastDeploy.git synced 2025-12-24 13:28:13 +08:00
e150a418d44281f4564cf115d1027be2822bd772
FastDeploy/fastdeploy/model_executor/layers
xiaoxiaohehe001 e150a418d4 support moe offline quant (#5142)
2025-11-24 18:59:18 +08:00
Name | Last commit | Date
attention | [PD Disaggregation][XPU] Add XPU support for PD disaggregation (#5113) | 2025-11-21 14:09:01 +08:00
backends | [Others]get_block_shape_and_split_kv_block clean code (#5123) | 2025-11-20 16:40:04 +08:00
moe | support moe offline quant (#5142) | 2025-11-24 18:59:18 +08:00
pool | …
quantization | support moe offline quant (#5142) | 2025-11-24 18:59:18 +08:00
sample | [Feature] ThreadPoolExecutor async fill_token_bitmask (#5083) | 2025-11-19 10:04:16 +08:00
__init__.py | …
activation.py | fix the get_act_fn,_load_st_projector (#4824) | 2025-11-06 16:13:35 +08:00
embeddings.py | [Metax] support default_v1 loader & thinking model (#4956) | 2025-11-12 16:32:26 +08:00
linear.py | [RL]Fix missing is_distributed attribute (#5150) | 2025-11-21 14:14:25 +08:00
lm_head.py | [RL]Resolve shape mismatch problems in RL-related modules (#5032) | 2025-11-19 11:12:48 +08:00
mtp_linear.py | update (#4985) | 2025-11-13 15:58:01 +08:00
normalization.py | [TSP] Support qwen3 moe tsp + cudagraph (#4871) | 2025-11-10 23:37:51 +08:00
pooler.py | fix the get_act_fn,_load_st_projector (#4824) | 2025-11-06 16:13:35 +08:00
rotary_embedding.py | …
utils.py | …