Principal LLM Engineer

Henderson Scott • palo alto, ca, us • 7m ago

Our client is a market leader in their space! They offer the industry's most comprehensive and powerful SaaS defence solution.
Due to an exciting growth phase, they're looking for hire a Principal Software Engineer (LLM Services)
Full-Time
Ideal location would be Newport Beach or Palo Alto.
Hybrid Working (2 days on-site)
About the role: In your role as a Principal Software Engineer, you will focus on leveraging and optimizing Large Language Models (LLMs) along with the implementation of advanced AI technologies.
Responsibilities:

Development and Optimization of LLMs: Implement and fine-tune state-of-the-art Large Language Models for various applications, focusing on performance and accuracy.
Evaluating Model Performance: Conduct rigorous evaluations of LLMs, assessing effectiveness, efficiency, and business alignment.
Integration of Advanced AI Technologies: Implement Retrieval-Augmented Generation (RAG), function calling, and code interpreter technologies to enhance the capabilities of Large Language Models.
Research and Development: Stay abreast of the latest advancements in machine learning, particularly in LLMs, LLM agents, and large-scale neural network training.
Data and Model Parallel Training: Utilize data and model parallel training techniques for efficient handling of large-scale models.
GPU Cluster Management for Training: Oversee extensive training jobs on GPU clusters, ensuring optimal resource utilization for complex tasks.
Cross-Functional Collaboration and Leadership: Work with ML engineers, data scientists, and product teams, providing guidance and mentorship.
Documentation and Reporting: Maintain detailed documentation of methodologies, models, and results, communicating findings across the organization.

Requirements:

Bachelor's degree in Computer Science, Engineering, or related field. Advantage for Master's or PhD in Computer Science, AI, Linguistics, or related fields, with a focus on machine learning and natural language processing.
Must have working experience within Cyber Security and/or Compliance.
Experience with LLMs and Python: Extensive experience with python and proficiency in leveraging LLM's.
Expertise in Parallel Training and GPU Cluster Management: Some background in parallel training methods and managing large-scale training jobs on GPU clusters.
Analytical and Problem-Solving Skills: Ability to address complex challenges in model training and optimization.
Leadership and Mentorship Capabilities: Proven leadership in guiding projects and mentoring team members.
Communication and Collaboration Skills: Effective communication skills for conveying technical concepts and collaborating with cross-functional teams.
Innovation and Continuous Learning: Passion for staying updated with the latest trends in AI and machine learning.

To apply please contact Shane Donohue at Henderson Scott