sg263
|
c71d955e9c
|
[Trace]fix opentelemetry can not work in uvicorn (#2907)
* add opentelemetry
* add opentelemetry
* add opentelemetry on dequeue
* add opentelemetry on dequeue
* add opentelemetry on dequeue
* fix opentelemetry-instrumentation-fastapi
* fix annotation
* fix opentelemetry-bootstrap
* fix opentelemetry-bootstrap
* fix opentelemetry can not work in uvicorn
* remove unless import
* move conf to env
* fix useless commit
---------
Co-authored-by: shige <shige@baidu.com>
|
2025-07-17 23:16:29 +08:00 |
|
sg263
|
0d61c65de1
|
[Trace] Support trace log (#2864)
* add opentelemetry
* add opentelemetry
* add opentelemetry on dequeue
* add opentelemetry on dequeue
* add opentelemetry on dequeue
|
2025-07-16 15:35:44 +08:00 |
|
lddfym
|
94e1a895e3
|
fix spelling error (#2826)
* fix spelling error
* fix scheduler reset error
|
2025-07-14 13:13:08 +08:00 |
|
zhenwenDang
|
87203ec87b
|
After enabling "top_logprobs supports passing 0 and fix max_completion_tokens", an incorrect finish_reason was returned. (#2815)
* /v1/chat/completions endpoint now supports max_completion_tokens and fixes the return value of finish_reason
* top_logprobs supports passing 0
|
2025-07-11 16:53:12 +08:00 |
|
lddfym
|
ec986642df
|
Global scheduler supports configuring hot updates (#2812)
|
2025-07-11 13:39:30 +08:00 |
|
chen
|
823a47e64a
|
[Feature] Support return logprob of generated tokens (#2784)
* online chat support logprobs
* check xpu
* check vl_gpu_model_runner
* only cuda support logprob
* get_worker() check platform
---------
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
|
2025-07-10 15:47:42 +08:00 |
|
Sunny-bot1
|
1107e08cd9
|
[Feature 2.0.2] support top_k_top_p sampling (#2789)
* support top_k_top_p sampling
* fix
* add api param
* add api para
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* change func name
|
2025-07-09 21:01:51 -07:00 |
|
ltd0924
|
87e638498c
|
[RL] update reschedule finish reason (#2709)
|
2025-07-04 13:47:36 +08:00 |
|
Jiang-Jia-Jun
|
05c670e593
|
[Sync] Update to latest code (#2679)
* [Sync] Update to latest code
* Add new code files
* Add new code files
* update code
* Try to fix build.sh
* Try to fix build.sh
* Update code
* Update requirements.txt
* Update code
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com>
|
2025-07-03 15:43:53 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|
jiangjiajun
|
684703fd72
|
[LLM] First commit the llm deployment code
|
2025-06-09 19:20:15 +08:00 |
|