About:
We are a Series A startup focused on developing cutting-edge compressed models for devices. Our mission is to bring powerful AI capabilities to edge devices, enabling seamless integration of advanced language models into everyday technology. Join us as we push the boundaries of AI, creating impactful solutions optimized for real-world applications.
Responsibilities:
- Specialize in Google Cloud / AWS tech stacks.
- Familiarity with LLM technologies, particularly with the Transformers library.
- Experience with model compression is a plus.
- Knowledge of model deployment on edge device is a plus.
- Contribute to the development of our SDKs across multiple platforms, including Android, iOS, and Linux.
You may be a good fit if you:
- 2+ years of professional experience.
- Minimum BS/MS in Computer Science.
- Excellent understanding of computer science fundamentals, including data structures, algorithms, and coding.
- Knowledge of operating system internals, compilers, and low-power/mobile optimization.
- Experience with low-level programming in C and frameworks like CUDA, OpenCL.
- Proficiency in multithreading and performance optimization.