AI Inference and Deployment Founding Engineer
San Francisco – Daly City
Permanent
Equity + Bonus + Benefits
Up to: $200,000
Join my pioneering global technology client, which is at the forefront of innovation and cutting-edge research. They are seeking a talented AI Inference and Deployment Founding Engineer specialising in Generative AI to join their dynamic team in San Francisco – Daly City. This is a unique opportunity to contribute to groundbreaking projects and collaborate with industry experts.
Key Responsibilities:
- AI inference and deployment optimisation
- MLOps management
- Manage software libraries
- Scaling and deploying GPU clusters
- Apply generative AI to text and image processing and/or other NLP tasks
- Collaborate with cross-functional teams to design and execute solutions
Must-Have Qualifications:
- Master's degree in Computer Science, Engineering, Mathematics, or a related discipline.
- Proficiency in one or more general-purpose programming languages, such as Python, Java, C, or C++.
- AI Engineering Expertise: Experience and demonstrated output in personalised, efficient AI engineering, including continual learning, few-shot learning, and domain adaptation, particularly in computer vision.
- Toolset to include: CUDA, ONNX, Kubernetes, Docker, Terraform.
- Generative AI: Experience in generative AI for text and image processing and/or other NLP tasks is highly appreciated.
- Hands-on experience with software libraries and toolboxes such as PyTorch, TensorFlow, and CUDA.
If you are passionate about AI Engineering and meet the experience listed above, please apply for this exciting opportunity. Submit your CV and a cover letter detailing your relevant experience and accomplishments.