apps/FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git (synced 2025-12-24 13:28:13 +08:00)
cead6b26fa2c6b29377695b97dcb21854911fc08
FastDeploy/custom_ops/gpu_ops/speculate_decoding/draft_model
freeliuzc  f44f4bafd1  support mtp in splitewise and scheduler_v1 mode (#4743)  2025-11-03 10:07:15 +08:00
draft_model_postprocess.cu         …
draft_model_preprocess.cu          support mtp in v1_scheduler mode (#3695)  2025-09-04 17:39:59 +08:00
draft_model_set_value_by_flags.cu  …
draft_model_update.cu              [Feature][MTP]support new speculative decoding method named hybrid mtp with ngram (#3610)  2025-08-26 14:29:22 +08:00
eagle_get_hidden_states.cu         rename eagle_get_base_model_hidden_states.cu (#3753)  2025-09-07 10:24:58 +08:00
eagle_get_self_hidden_states.cu    polish code with new pre-commit rule (#2923)  2025-07-19 23:19:27 +08:00
hydra_fetch_hidden_states.cu       …
mtp_save_first_token.cc            support mtp in splitewise and scheduler_v1 mode (#4743)  2025-11-03 10:07:15 +08:00
mtp_step_paddle.cu                 …
ngram_match_mixed.cu               [Feature][MTP]support new speculative decoding method named hybrid mtp with ngram (#3610)  2025-08-26 14:29:22 +08:00