Zero Rains
|
0fb37ab7e4
|
update flake8 version to support pre-commit in python3.12 (#3000)
* update flake8 version to support pre-commit in python3.12
* polish code
|
2025-07-24 01:43:31 -07:00 |
|
lifulll
|
2c6a9e887e
|
native top_p_sampling (#2901)
|
2025-07-22 14:09:59 +08:00 |
|
lizexu123
|
67990e0572
|
[Feature] support min_p_sampling (#2872)
Deploy GitHub Pages / deploy (push) Has been cancelled
* Fastdeploy support min_p
* add test_min_p
* fix
* min_p_sampling
* update
* delete vl_gpu_model_runner.py
* fix
* Align usage of min_p with vLLM
* fix
* modified unit test
* fix test_min_sampling
* pre-commit all files
* fix
* fix
* fix
* fix xpu_model_runner.py
|
2025-07-20 23:17:59 -07:00 |
|
Zero Rains
|
25698d56d1
|
polish code with new pre-commit rule (#2923)
|
2025-07-19 23:19:27 +08:00 |
|
ming1753
|
1f15ca21e4
|
[Feature] support prompt repetition_penalty (#2806)
Deploy GitHub Pages / deploy (push) Has been cancelled
|
2025-07-17 12:05:52 +08:00 |
|
freeliuzc
|
7cdd8d290d
|
[MTP] optimize mtp infer speed (#2840)
Deploy GitHub Pages / deploy (push) Has been cancelled
|
2025-07-14 19:50:22 +08:00 |
|
Sunny-bot1
|
240d6236bc
|
[Fix]fix top_k_top_p sampling (#2801)
Deploy GitHub Pages / deploy (push) Has been cancelled
* fix topk-topp
* update
* add base_non_truncated
|
2025-07-10 22:35:10 +08:00 |
|
Sunny-bot1
|
1e2319cbef
|
Rename top_p_sampling to top_k_top_p_sampling (#2791)
|
2025-07-10 00:09:25 -07:00 |
|
Sunny-bot1
|
e45050cae3
|
[Feature] support top_k_top_p sampling (#2753)
* support top_k_top_p sampling
* fix
* add api param
* add api para
* fix
* fix
* fix
* fix
* fix
* fix
* fix
|
2025-07-09 20:58:58 -07:00 |
|
EnflameGCU
|
d0f4d6ba3a
|
[GCU] Support gcu platform (#2702)
baseline: e7fa57ebae
Co-authored-by: yongqiangma <xing.wo@163.com>
|
2025-07-08 13:00:52 +08:00 |
|
liddk1121
|
1b54a2831e
|
Adapt for iluvatar gpu (#2684)
|
2025-07-07 16:53:14 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|
jiangjiajun
|
684703fd72
|
[LLM] First commit the llm deployment code
|
2025-06-09 19:20:15 +08:00 |
|