San Jose, US-United States
Posted 13 hours ago
| About The Company This company pioneers short-form video creation and social engagement, boasting a vast, engaged user base. Its platform empowers users with creative tools, filters, and effects. With a diverse content ecosystem, it’s a hub of creativity and expression. The proprietary algorithm ensures personalized content feeds, enhancing user engagement and satisfaction. This company wields significant influence on digital media, making it an invaluable partner for innovative collaborations and marketing endeavors. About the Team We are an applied research team focused on Generative AI and Multimodal Understanding. The group works on advanced generative technologies across image, video, and multimodal systems, enabling scalable and practical AI creation tools. Research areas include generative modeling, image and video synthesis, intelligent editing, and virtual human technologies. The team emphasizes translating cutting-edge research into production-ready, efficient model systems. Role Overview We are looking for Research Engineers / Scientists to design and implement efficient large-scale generative models, with a strong focus on model distillation, compression, and acceleration. The role focuses on transferring capabilities from large foundation models into smaller, more efficient models for scalable training, optimization, and deployment. Work areas include distillation frameworks, model acceleration, and hardware-efficient inference. Responsibilities • Develop efficient algorithms and architectures for large-scale generative and multimodal models • Apply techniques such as distillation, quantization, and efficiency optimization for image, video, and multimodal generation models • Improve efficiency of diffusion and autoregressive generative models • Design scalable training and inference approaches for high-performance generative systems • Build and optimize model acceleration and compression pipelines • Collaborate with research and engineering teams to translate efficient model research into production systems Minimum Qualifications • Bachelor’s degree in Computer Science or related field, or equivalent experience • Strong expertise in efficient model design and acceleration methods • Experience identifying computational bottlenecks and optimizing model performance • Hands-on experience training generative AI or large models • Proficiency with frameworks such as PyTorch or JAX • Strong communication and collaboration skills in fast-paced research environments Bilingual in Mandarin is required Preferred Qualifications • PhD in Generative AI, ML Systems, or related field, or equivalent research experience • Research experience in GenAI, MLSys, or large model systems • Experience in one or more of the following areas: • Image or video generation and editing • Model compression and distillation • Quantization and efficient architectures (MoE, sparse or window attention) • Efficient large model design • Reinforcement learning–based model training (RLHF, DPO, GRPO) Compensation Base salary range for this position in major U.S. markets: $208,800 – $616,000 annually. Total compensation may include performance bonuses, equity, and additional incentives depending on experience and level. Benefits Comprehensive benefits may include medical, dental, vision coverage, retirement plans with company match, paid parental leave, disability coverage, life insurance, paid holidays, sick leave, and paid personal time. Benefits vary by employment type and location. |
Job Features
| Job Category | AI Research |
| Seniority | Senior IC / Tech Lead |
| Base Salary | $208,800 - $616,000 |
| Recruiter | nina.li@ocbridge.ai |
