Jobs search

AI Inference and Deployment Founding Engineer

Apollo Solutions • united states, united states, us • 2m ago

AI Inference and Deployment Founding Engineer

San Francisco – Daly City

Permanent

Equity + Bonus + Benefits

Up To: $200000

Join my pioneering global technology client that is at the forefront of innovation and cutting-edge research. They are seeking a talented AI Inference and Deployment Founding Engineer specialising in Generative AI to join their dynamic team in San Francisco- Daly City . This is a unique opportunity to contribute to groundbreaking projects and collaborate with industry experts.

Key Responsibilities:

AI inference and deployment optimisation
ML/Ops management
Manage software libraries
Scaling and deploying GPU clusters
Generative AI for text and image processing and/or other AI NLP tasks is highly appreciated.
Collaborate with cross-functional teams to design and execute solutions

Must-Have Qualifications:

Masters in Computer Science, Engineering, Mathematics, or a related discipline.
Proficiency in one or more general-purpose programming languages including Python, Java, C, and C++.
AI Engineering Expertise: Experience and demonstrated output in personalised efficient AI Engineering, including continual learning, few-shot learning, domain adaptation, etc., particularly in Computer Vision.
Tool set to include: Cuda. Onnx, Kubernetes, Docker, Terraform
Generative AI: Experience in generative AI for text and image processing and/or other AI NLP tasks is highly appreciated.
Hands-on experience with software libraries and toolboxes such as PyTorch, TensorFlow, CUDA.

If you are passionate about AI Engineering and meet the experience listed above, please apply for this exciting opportunity. Submit your CV and a cover letter detailing your relevant experience and accomplishments.