Files
FastDeploy/fastdeploy/model_executor/models
AIbin 41aee08982 【Inference Optimize】Update MergedReplicatedLinear for DSK qkv_a_proj_with_mqa. (#3673)
* support MergedReplicatedLinear

* update MergedReplicatedLinear to support DSK_wint4 V1_load

* update model name

* update linear class

* fix

* fix v0 moe_bias load

---------

Co-authored-by: bukejiyu <52310069+bukejiyu@users.noreply.github.com>
2025-09-04 21:16:05 -07:00
..
2025-08-28 22:53:57 +08:00
2025-09-03 16:05:41 +08:00
2025-08-28 19:42:32 +08:00
2025-09-03 10:54:34 +08:00
2025-09-03 10:54:34 +08:00
2025-09-03 10:54:34 +08:00