About the Position
My client is an innovative startup creating cutting-edge conversational commerce experiences. Unlike traditional shopping methods that require app downloads or website sign-ups, our platform enables customers to engage and transact directly within the chat thread. Joining our team means becoming part of a VC-backed series A startup on a fast growth trajectory, collaborating with a world-class team, and seizing opportunities for rapid career advancement.
About the Team:
Their founding team comes from high level leadership roles at companies like Meta, Google, Uber and various successful startups.
We are looking for a seasoned AI research engineer to join our team and develop a state-of-the-art sales assistant. You will collaborate with a talented group of research, ML, data, and software engineers to deliver innovative agentic systems.
Responsibilities
- Utilize cutting-edge research to enhance our AI sales assistant.
- Propose and develop new experimental ideas.
- Lead efforts in dataset collection and generation.
- Optimize models and inference processes.
- Work with ML and Data engineers on distributed training scripts.
Requirements
- Minimum of 3 years’ experience with production AI systems.
- At least one publication as the first author.
- Strong knowledge of LLMs, including the ability to build GPT-1 in NumPy and manually perform backpropagation for transformer blocks.
- Solid understanding of ML fundamentals.
- Experience with distributed training.
- Some experience with production inference.
- Proficiency in PyTorch.
Preferred Qualifications
- Experience building agentic systems with task-specific models.
- Experience iterating AI products based on customer feedback.