yangjianfengo1
|
ba5c2b7e37
|
[Docx] add language (en/cn) switch links (#4470)
* add install docs
* 修改文档
* 修改文档
|
2025-10-17 15:47:41 +08:00 |
|
AIbin
|
a7392a0ff9
|
【Inference Optimize】DeepSeek-V3-model MLA Optimize (#3886)
* support MLA chunk_size auto search & cuda_graph
|
2025-09-11 10:46:09 +08:00 |
|
AIbin
|
316ac546d3
|
update_wint2_doc (#3968)
|
2025-09-08 15:53:09 +08:00 |
|
AIbin
|
54b458fd98
|
[Doc] update wint2 doc (#3819)
* update_wint2_doc
|
2025-09-03 11:27:43 +08:00 |
|
Zero Rains
|
25698d56d1
|
polish code with new pre-commit rule (#2923)
|
2025-07-19 23:19:27 +08:00 |
|
handiz
|
b8a8a19689
|
add wint2 performance (#2673)
|
2025-07-02 17:10:01 +08:00 |
|
AIbin
|
1bb296c5ad
|
update quantization doc (#2659)
|
2025-07-01 15:05:02 +08:00 |
|
Jiang-Jia-Jun
|
92c2cfa2e7
|
Sync v2.0 version of code to github repo
|
2025-06-29 23:29:37 +00:00 |
|