RAM
775edcc09a
[Executor] Default use CUDAGraph ( #3594 )
...
* add start intercept
* Adjustment GraphOptConfig
* pre-commit
* default use cudagraph
* set default value
* default use cuda graph
* pre-commit
* fix test case bug
* disable rl
* fix moba attention
* only support gpu
* Temporarily disable PD Disaggregation
* set max_num_seqs of test case as 1
* set max_num_seqs and temperature
* fix max_num_batched_tokens bug
* close cuda graph
* success run wint2
* profile run with max_num_batched_tokens
* 1.add c++ memchecker 2.success run wint2
* updatee a800 yaml
* update docs
* 1. delete check 2. fix plas attn test case
* default use use_unique_memory_pool
* add try-except for warmup
* ban mtp, mm, rl
* fix test case mock
* fix ci bug
* fix form_model_get_output_topp0 bug
* fix ci bug
* refine deepseek ci
* refine code
* Disable PD
* fix sot yaml
2025-10-21 14:25:45 +08:00
..
2025-09-26 14:23:29 +08:00
2025-08-20 14:33:54 +08:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-07-19 23:19:27 +08:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-07-19 23:19:27 +08:00
2025-07-19 23:19:27 +08:00
2025-06-29 23:29:37 +00:00
2025-09-16 16:38:36 +08:00
2025-10-21 14:25:45 +08:00
2025-06-29 23:29:37 +00:00
2025-07-19 23:19:27 +08:00
2025-06-29 23:29:37 +00:00
2025-07-19 23:19:27 +08:00
2025-07-19 23:19:27 +08:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-09-16 16:38:36 +08:00
2025-09-16 16:38:36 +08:00
2025-10-21 14:25:45 +08:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-09-16 16:38:36 +08:00
2025-06-29 23:29:37 +00:00
2025-09-16 15:55:12 +08:00
2025-09-16 16:38:36 +08:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-09-16 16:38:36 +08:00
2025-09-16 16:38:36 +08:00
2025-06-29 23:29:37 +00:00
2025-09-16 16:38:36 +08:00
2025-09-16 16:38:36 +08:00
2025-09-16 16:38:36 +08:00
2025-09-16 16:38:36 +08:00
2025-07-15 19:49:01 -07:00
2025-07-15 19:49:01 -07:00
2025-07-15 19:49:01 -07:00
2025-07-15 19:49:01 -07:00
2025-07-15 19:49:01 -07:00
2025-07-15 19:49:01 -07:00
2025-07-15 19:49:01 -07:00
2025-09-26 14:23:29 +08:00
2025-09-26 14:23:29 +08:00
2025-07-15 19:49:01 -07:00
2025-07-15 19:49:01 -07:00
2025-06-29 23:29:37 +00:00
2025-07-15 19:49:01 -07:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-07-15 19:49:01 -07:00
2025-07-15 19:49:01 -07:00
2025-07-15 19:49:01 -07:00
2025-07-15 19:49:01 -07:00
2025-07-15 19:49:01 -07:00
2025-07-15 19:49:01 -07:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-07-19 23:19:27 +08:00
2025-07-19 23:19:27 +08:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-06-29 23:29:37 +00:00
2025-07-19 23:19:27 +08:00
2025-06-29 23:29:37 +00:00
2025-10-21 14:25:45 +08:00
2025-10-14 15:04:06 +08:00