lizexu123
c563eca791
[Feature] support reward model ( #5301 )
...
* Your commit message here
* add test
* update develop
* support reward
* support enable_chunk_prefill
* support bingfa
* support convert is reward
* update test
* delete print
* fix enable_thinking
* add document
* fix place
* fix test
* fix
* support enable_prefix_caching
* add no-enable_prefix-caching test
* fix
* support enable_prefix_caching
* delete print
* fix document
* fix
* fix test
* fix document and delete chinese
* udpate
* enable_thinking
* fix test
2025-12-02 14:55:31 +08:00
Echo-Nie
c18b177f21
fix the get_act_fn,_load_st_projector ( #4824 )
2025-11-06 16:13:35 +08:00
lizexu123
c234b995ab
[Feature] support pooling model dummy_run ( #4345 )
...
* support qwen3-embedding
* fix ci bug
* support pooling dummy_run
* fix
* delete print
* parallel_config.max_model_len
* delete is_pooling_model in dummy_run
* fix
* fd_model
* fix embedding load
* fix
* fix post_process
2025-10-17 13:30:55 +08:00
lizexu123
e8318b7477
[BugFix] fix qwen3-embedding model tp>1 ( #4223 )
...
* support qwen3-embedding
* fix ci bug
* fix
* fix ci bug
* fix ci bug
* fix
* fix qwen3-embedding
* fix
* fix
* fix
2025-09-24 14:13:26 +08:00
lizexu123
c86945ef49
[Feature] support pool ( #3827 )
...
* support pool
* update pooling
* add pooler_config and check
* update
* support AutoWeightsLoader load weight
* fix
* update
* delete print
* update pre-commit
* fix
* fix xpu
* fix ModelRegistry->model_registry
* fix Copilot review
* fix pooler.py
* delete StepPooler
* fix abstract
* fix default_loader_v1
* fix Pre Commit
* support torch qwen3 dense
* add test and fix torch-qwen
* fix
* fix
* adapter ci:
* fix review
* fix pooling_params.py
* fix
* fix tasks.py 2025
* fix print and logger
* Modefy ModelRegistry and delete AutoWeightsLoader
* fix logger
* fix test_embedding
* fix ci bug
* ernie4_5 model_registry
* fix test
* support Qwen3-Embedding-0.6B tp=1 load
* fix extra code
* fix
* delete fix vocab_size
* delete prepare_params_dict
* fix:
2025-09-22 14:09:09 +08:00