This website requires JavaScript.
Explore
Help
Sign In
apps
/
FastDeploy
Watch
1
Star
0
Fork
0
You've already forked FastDeploy
mirror of
https://github.com/PaddlePaddle/FastDeploy.git
synced
2025-12-24 13:28:13 +08:00
Code
Issues
Actions
2
Packages
Projects
Releases
Wiki
Activity
Files
0925d44f182315ab5195f6f6cea6f2ecc506cf73
FastDeploy
/
fastdeploy
/
cache_manager
History
Juncai
0925d44f18
[PD Disaggregation] support different tp_size for prefill and decode (
#5296
)
...
* up * up * up * fix
2025-12-01 17:50:20 +08:00
..
transfer_factory
[PD Disaggregation] support different tp_size for prefill and decode (
#5296
)
2025-12-01 17:50:20 +08:00
__init__.py
…
cache_data.py
[Feature] mm support prefix cache (
#4134
)
2025-10-27 17:39:51 +08:00
cache_messager.py
[PD Disaggregation] support different tp_size for prefill and decode (
#5296
)
2025-12-01 17:50:20 +08:00
cache_metrics.py
[Feature] mm support prefix cache (
#4134
)
2025-10-27 17:39:51 +08:00
cache_transfer_manager.py
[Feature] dyc8 support prefixcache (
#5125
)
2025-11-21 19:46:26 +08:00
multimodal_cache_manager.py
[Feature] mm support prefix cache (
#4134
)
2025-10-27 17:39:51 +08:00
ops.py
dummy import fd (
#5192
)
2025-11-24 20:23:07 +08:00
prefix_cache_manager.py
[Metrics] Update time_to_first_token to include tokenization & queue time, and remove redundant metrics (
#4993
)
2025-11-26 14:42:17 +08:00