Senior Data Engineer
Company Overview:
At Roots, our mission is to make work more human. We are developing fully autonomous, AI-powered Digital Coworkers that streamline tedious and repetitive tasks for the enterprise. By tackling core challenges in natural language understanding and computer vision, we are building an automation product that embodies the future of work. Our platform makes automation accessible to everyone, enabling users to generate automations by describing tasks in simple English, while solving complex business problems with enterprise-grade results and performance.
Harnessing the power of AI, machine learning, and analytics on our data is fundamental to our business. Our production processes capture vast amounts of data, which feed directly into our training and analytics pipelines. As this data continues to grow, we need a data engineer to design and build the data architecture that underpins our AI systems and supports our evolving needs for reporting and business analytics.
At Roots, we are committed to building a team of talented individuals who share our love for innovation and problem-solving. We encourage curious minds from a wide array of disciplines and backgrounds to apply.
Responsibilities:
- Collaborate with stakeholders across engineering, research, product, sales, and marketing to capture their data production workflows.
- Design and build ETL pipelines or similar methodologies to merge and refine data into a centralized data lake, optimizing it for analytics, reporting, and model training.
- Implement de-identification techniques to protect sensitive information.
- Build, monitor, and maintain data infrastructure, ensuring reliability, performance, scalability, and cost-effectiveness.
- Ensure the security, integrity, and compliance of data according to industry standards.
Qualifications:
- 4+ years of experience as a data engineer.
- Expertise in ETL orchestration and scheduling tools such as Airflow, Prefect, Luigi, or comparable frameworks.
- Proficiency in Python.
- Experience working with databases, data warehouses, and data lakes.
- Experience with Databricks, Snowflake, or similar data platforms.
- Strong written and verbal communication skills, with a particular emphasis on the written word. We greatly appreciate public articles or blogs that showcase writing skills.
- Ability to work independently, prioritize tasks, and manage multiple projects simultaneously in a fast-paced and dynamic environment.
As a startup, Roots Automation offers a fast-paced environment with ample growth and learning opportunities across multiple disciplines.