Our client is a market leader in their space! They offer the industry's most comprehensive and powerful SaaS defence solution.
Due to an exciting growth phase, they're looking for hire a Principal Software Engineer (LLM Services)
Full-Time
Ideal location would be Newport Beach or Palo Alto.
Hybrid Working (2 days on-site)
About the role: In your role as a Principal Software Engineer, you will focus on leveraging and optimizing Large Language Models (LLMs) along with the implementation of advanced AI technologies.
Responsibilities:
- Development and Optimization of LLMs: Implement and fine-tune state-of-the-art Large Language Models for various applications, focusing on performance and accuracy.
- Evaluating Model Performance: Conduct rigorous evaluations of LLMs, assessing effectiveness, efficiency, and business alignment.
- Integration of Advanced AI Technologies: Implement Retrieval-Augmented Generation (RAG), function calling, and code interpreter technologies to enhance the capabilities of Large Language Models.
- Research and Development: Stay abreast of the latest advancements in machine learning, particularly in LLMs, LLM agents, and large-scale neural network training.
- Data and Model Parallel Training: Utilize data and model parallel training techniques for efficient handling of large-scale models.
- GPU Cluster Management for Training: Oversee extensive training jobs on GPU clusters, ensuring optimal resource utilization for complex tasks.
- Cross-Functional Collaboration and Leadership: Work with ML engineers, data scientists, and product teams, providing guidance and mentorship.
- Documentation and Reporting: Maintain detailed documentation of methodologies, models, and results, communicating findings across the organization.
Requirements:
- Bachelor's degree in Computer Science, Engineering, or related field. Advantage for Master's or PhD in Computer Science, AI, Linguistics, or related fields, with a focus on machine learning and natural language processing.
- Must have working experience within Cyber Security and/or Compliance.
- Experience with LLMs and Python: Extensive experience with python and proficiency in leveraging LLM's.
- Expertise in Parallel Training and GPU Cluster Management: Some background in parallel training methods and managing large-scale training jobs on GPU clusters.
- Analytical and Problem-Solving Skills: Ability to address complex challenges in model training and optimization.
- Leadership and Mentorship Capabilities: Proven leadership in guiding projects and mentoring team members.
- Communication and Collaboration Skills: Effective communication skills for conveying technical concepts and collaborating with cross-functional teams.
- Innovation and Continuous Learning: Passion for staying updated with the latest trends in AI and machine learning.
To apply please contact Shane Donohue at Henderson Scott