apps/FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git (synced 2025-12-24 13:28:13 +08:00)
65425bf8583cfa4a7c2a94d5132c3fa35792d09b
FastDeploy/custom_ops/gpu_ops/speculate_decoding/draft_model
freeliuzc  52eda7fdb3  [Feature][MTP]support new speculative decoding method named hybrid mtp with ngram (#3610)  2025-08-26 14:29:22 +08:00
draft_model_postprocess.cu  …
draft_model_preprocess.cu  [Feature][MTP]support new speculative decoding method named hybrid mtp with ngram (#3610)  2025-08-26 14:29:22 +08:00
draft_model_set_value_by_flags.cu  …
draft_model_update.cu  [Feature][MTP]support new speculative decoding method named hybrid mtp with ngram (#3610)  2025-08-26 14:29:22 +08:00
eagle_get_base_model_hidden_states.cu  [Feature][MTP]support new speculative decoding method named hybrid mtp with ngram (#3610)  2025-08-26 14:29:22 +08:00
eagle_get_self_hidden_states.cu  polish code with new pre-commit rule (#2923)  2025-07-19 23:19:27 +08:00
hydra_fetch_hidden_states.cu  …
mtp_save_first_token.cc  …
mtp_step_paddle.cu  …
ngram_match_mixed.cu  [Feature][MTP]support new speculative decoding method named hybrid mtp with ngram (#3610)  2025-08-26 14:29:22 +08:00