Backed by tier-1 AI investors (Andreessen Horowitz), you will join a crack research team (ex-Skolkovo Institute of Science and Technology, Stanford, Google, Intel, Salesforce) to build a multimodal text-to-video application for real-time retail.
They are playing in a $13 trillion industry and are responding to a fashion landscape shaped by a relentless pursuit of speed, creativity, and the continuous renewal of trends. Most brands are hindered by manual workflows and juggle disparate tools. This startup has anticipated a tech shift that will clear lengthy creative bottlenecks and level the playing field for all. Enter Generative AI.
As a senior applied scientist, you will have deep skills in fine-tuning foundational multimodal text-to-video models (or experience fine-tuning existing ones). At the bare minimum, you know how a foundational model works from ideation to implementation! A diffusion background is preferred, but we can riff with you if you come with a GAN background. We're also happy if you've done some cool things with image generation or image editing. We have just raised significant investment from big AI players, with a runway of 7 years and single-digit-million ARR. Just ask about the existing client list.
Offices in NYC & SF!