Software Engineer vLLM
Series A Start-up and leading developer and maintainer of vLLM.
As a Software Engineer specializing in vLLM, you will be at the forefront of innovation and addressing critical challenges in model performance and efficiency. Your expertise in machine learning and high-performance computing will directly influence the development of a cutting-edge software platform, playing a pivotal role in the future of AI deployment and utilization.
Responsibilities:
- Develop robust Python and C++ code, focusing on vLLM systems, high-performance machine learning primitives, performance analysis and modeling, and numerical methods.
- Review code and contribute to the development of best practices within the team.
- Work closely with machine learning teams to optimize neural network performance in our engine.
Requirements:
- Extensive experience in writing high-performance code for GPUs and deep knowledge of GPU hardware.
- BS, MS, or PhD in computer science.
- Proficiency in mathematical software, especially linear algebra or signal processing.
- Expertise in modern C++ and Python.
- Experience with PyTorch.
- Proficiency in tensor computations.
- In-depth understanding of deep neural network models and techniques.