Role Description
This role focuses on large-scale world models for temporal reasoning and generation, including video models, multimodal generative models, LLM/VLM/VLA models, and predictive models of traffic participants and scenes. Your work will directly power Waabi World’s ability to model future evolution, synthesize realistic safety-critical scenarios, and provide rich generative priors for downstream planning, testing, and training.
• Conduct fundamental and applied research in generative and predictive world-modeling:
    • Video generation and prediction.
    • Latent diffusion / autoregressive / flow-matching models.
    • Multimodal foundation models for driving scenes.
    • LLM / VLM / VLA methods for scene understanding, reasoning, and control.
    • Generative scenario modeling and controllable simulation.
    • Model distillation.
• Collaborate with engineers to integrate models into large-scale, distributed training and rendering pipelines.
• Publish high-impact research at top conferences (CVPR, ECCV, ICCV, NeurIPS, ICLR, ICRA, SIGGRAPH).
• Mentor junior scientists and interns; foster a culture of scientific rigor and rapid experimentation.
• Stay on top of emerging advances in generative AI, differentiable rendering, knowledge distillation/compression, and robotics.
Qualifications
• Demonstrated technical innovation: You have a Ph.D. in Computer Vision, Machine Learning, Robotics, or a related field, or equivalent research experience pushing the boundaries of a technical field.
• Strong prototyping and implementation: You have expert-level Python and PyTorch (or JAX) skills, strong software-engineering fundamentals, and experience with distributed training.
• Expert domain knowledge: You have built generative or predictive models of the physical world with scale and efficiency in mind for real-world applications.
• Team player: You have worked in a close-knit team of researchers and engineers and have the strong communication skills needed to deliver successful projects.
Requirements
• Bonus: Proven ability to translate research into production-quality code and measurable product impact.
• First-author publications in top-tier venues on topics such as world models, generative simulation, video prediction, diffusion, flow-matching, or foundation models for autonomy.
Benefits
• Competitive compensation and equity awards.
• Health and Wellness benefits encompassing Medical, Dental and Vision coverage (for full-time employees only).
• Unlimited Vacation.
• Flexible hours and Work from Home support.
• Daily drinks, snacks and catered meals (when in office).
• Regularly scheduled team-building activities and social events, held on-site, off-site, and virtually.
• As we grow, this list continues to evolve!