Published several articles on AI technologies, including but not limited to '27 lines of code for LLM inference' and 'Tiling in AI Compilation - From Theory to Hardware Acceleration'.
Research Experience
Played key roles in multiple AI-related projects, including analyzing the C code implementation of the Llama large language model and researching updates to Cerebras programming interfaces.
Background
Focused on research and development in the field of AI, covering areas such as large language model inference, AI compiler technology, distributed AI systems, and more.
Miscellany
Has deep insights into the AI industry, continuously paying attention to the latest trends and technological developments within the field.