Zihan Tang
tzh21
AI & ML interests
None yet
Recent Activity
authored a paper about 1 month ago
xLLM Technical Report authored a paper about 1 month ago
RTPrune: Reading-Twice Inspired Token Pruning for Efficient DeepSeek-OCR Inference authored a paper about 1 month ago
OOCO: Latency-disaggregated Architecture for Online-Offline Co-locate LLM ServingOrganizations
None yet