apps/FastDeploy
Mirror of https://github.com/PaddlePaddle/FastDeploy.git (synced 2025-12-24 13:28:13 +08:00)
4f830aa505a4e8489f8ce9b7dcdb43fb14475bce
FastDeploy/custom_ops/gpu_ops/speculate_decoding
Yuanle Liu
867803ae10
[BugFix] fix speculate_limit_thinking_content_length (#5590)
* fix speculate_limit_thinking_content_length
* update
2025-12-16 04:31:45 -08:00
draft_model
…
ngram_match.cc
…
speculate_calcu_accept_ratio.cu
…
speculate_clear_accept_nums.cu
…
speculate_get_output_padding_offset.cu
…
speculate_get_output_with_topk.cc
…
speculate_get_output.cc
…
speculate_get_padding_offset.cu
Remove CUDA ERROR 9 of inputs of get_padding_offset kernel (#5440)
2025-12-09 14:17:30 +08:00
speculate_get_seq_lens_output.cu
…
speculate_get_token_penalty_multi_scores.cu
…
speculate_limit_thinking_content_length_v1.cu
[BugFix] fix speculate_limit_thinking_content_length (#5590)
2025-12-16 04:31:45 -08:00
speculate_limit_thinking_content_length_v2.cu
[BugFix] fix speculate_limit_thinking_content_length (#5590)
2025-12-16 04:31:45 -08:00
speculate_logprob_utils.cu
…
speculate_msg.h
…
speculate_save_output_with_topk.cc
[BugFix] fix mtp logprob bugs in chunk prefill (#5244)
2025-11-27 11:31:29 +08:00
speculate_save_output.cc
…
speculate_schedule_cache.cu
…
speculate_set_stop_value_multi_seqs.cu
[Feature] support stop_token_ids (#5399)
2025-12-09 17:49:12 +08:00
speculate_set_value_by_flags_and_idx.cu
…
speculate_step_reschedule.cu
…
speculate_step_system_cache.cu
…
speculate_step.cu
…
speculate_update_input_ids_cpu.cc
…
speculate_update.cu
…
speculate_verify.cu
…
top_p_candidates.cu
[Metax] modify wrapSize to WARP_SIZE (#5442)
2025-12-09 01:44:02 -08:00