Are you excited about building the future of multimodal AI content generation?
Would you like to work in a fast-moving research-driven start-up?
Are you passionate about LLMs, NLP, and generative AI?
We’re working with a company at the forefront of video generation, combining AI with graphic design expertise to build a groundbreaking multimodal platform.
Responsibilities:
- Conduct applied research in multimodal models for video and graphic design generation.
- Develop and fine-tune generative AI models, including text-to-video and image generation.
- Collaborate closely with product teams to integrate AI solutions into a user-friendly platform.
- Prototype new features, improve existing ones, and deploy them into the product.
- Continuously assess and iterate on models for better accuracy and performance.
Requirements:
- Strong experience working with vision/ Multimodal models either in academia or Industry.
- Solid understanding and experience working with large Transformer models, such as GPT or LlaMA
- Interest in multimodal models, ideally with experience in graphic design or video content generation.
- Proficiency in machine learning frameworks (e.g., PyTorch, TensorFlow).
- Past publication record in top conferences like CVPR, Neurips, ACL, EMNLP or other venues is required.
Note: this role has 100% remote flexibility, but the team also has an office in San Francisco if hybrid-working is preferred.
Please apply if interested.
Big Cloud is a leading recruiter in the AI & Data Science space. We're lucky enough to recruit the best candidates in partnership with some of the most exciting companies all over the world. We try to reply to all applications, but we’re only human, for now! So, you may only hear from us if you are successful.
Check out our latest vacancies to see what else we’re recruiting for: www.bigcloud.global/find-a-job