"TPI-LLM: Serving 70B-Scale LLMs Efficiently on Low-Resource Mobile Devices."

Zonghang Li et al. (2025)

Details and statistics

DOI: 10.1109/TSC.2025.3596892

access: closed

type: Journal Article

metadata version: 2025-11-09