Jiang-Jia-Jun
|
e421d51001
|
[Feature] Support include_stop_str_in_output (#2919)
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com>
|
2025-07-18 19:43:19 +08:00 |
|
chen
|
823a47e64a
|
[Feature] Support return logprob of generated tokens (#2784)
* online chat support logprobs
* check xpu
* check vl_gpu_model_runner
* only cuda support logprob
* get_worker() check platform
---------
Co-authored-by: Jiang-Jia-Jun <163579578+Jiang-Jia-Jun@users.noreply.github.com>
|
2025-07-10 15:47:42 +08:00 |
|
Jiang-Jia-Jun
|
ea787d8f62
|
fix bug. (#2718) (#2720)
Co-authored-by: Ting <wtmlon@foxmail.com>
|
2025-07-05 09:00:01 +08:00 |
|
Ting
|
90ef28d982
|
spec token map lazy. (#2715)
Deploy GitHub Pages / deploy (push) Has been cancelled
|
2025-07-05 00:14:54 +08:00 |
|
Jiang-Jia-Jun
|
05c670e593
|
[Sync] Update to latest code (#2679)
* [Sync] Update to latest code
* Add new code files
* Add new code files
* update code
* Try to fix build.sh
* Try to fix build.sh
* Update code
* Update requirements.txt
* Update code
---------
Co-authored-by: Jiang-Jia-Jun <jiangjiajun@baidu.com>
|
2025-07-03 15:43:53 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|
jiangjiajun
|
684703fd72
|
[LLM] First commit the llm deployment code
|
2025-06-09 19:20:15 +08:00 |
|