Ruidong Zhang

Research and Development Engineer at Microsoft Research Asia

Ruidong Zhang is a research and development engineer in the Systems Group at Microsoft Research Asia (Shanghai). He received his master’s degree from New York University. His work focuses on artificial intelligence systems, and his current research directions include sparse computation and long-context inference for large language models. He aims to optimize the training, prefill, and decoding stages of these models through the joint design of systems and algorithms. He has contributed to the development of Microsoft's Phi-3 series of models, and his recent projects include MInference, an accelerator for long-context inference; LongRoPE, an algorithm for extending context windows; ParrotServe, a distributed LLM serving system; and PIT, a compiler for dynamic sparse operators.
