Zero Rains
|
25698d56d1
|
polish code with new pre-commit rule (#2923)
|
2025-07-19 23:19:27 +08:00 |
|
周周周
|
1339e56282
|
[XPU] Remove padding_offsets from get_padding_offset.cu (#2911)
|
2025-07-18 14:16:44 +08:00 |
|
周周周
|
ddb10ac509
|
[Inference, rename] remove padding_offsets from atten use batch_id_per_token (#2880)
* remove padding_offsets from atten
|
2025-07-17 18:41:31 +08:00 |
|
freeliuzc
|
7cdd8d290d
|
[MTP] optimize mtp infer speed (#2840)
Deploy GitHub Pages / deploy (push) Has been cancelled
|
2025-07-14 19:50:22 +08:00 |
|
GoldPancake
|
f7cad30a38
|
[Feature] Add speculative decoding simulation benchmark. (#2751)
* Add speculative decoding simulation benchmark
* Fix the name of the parameter
|
2025-07-09 12:08:43 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|
jiangjiajun
|
684703fd72
|
[LLM] First commit the llm deployment code
|
2025-06-09 19:20:15 +08:00 |
|