AI Research Scientist, Computer Vision - Facebook Video Intelligence

Facebook Inc.

Bellevue, United States of America

4 months ago

Role details

Contract type

Permanent contract

Employment type

Full-time (> 32 hours)

Working hours

Regular working hours

Languages

English

Experience level

Junior

Job location

Bellevue, United States of America

Tech stack

Artificial Intelligence

Computer Vision

Computer Programming

Computer Engineering

Core Foundation

Python

Machine Learning

Reinforcement Learning

PyTorch

Large Language Models

Information Technology

Job description

The Video Intelligence team is an applied AI research team within the Facebook pillar. This role is expected to develop advanced video generation and understanding foundation models, enabling innovative AI-driven video creation experiences and enhancing our ability to comprehend video content. The team is responsible for building State-of-the-art GenAI technology to empower video generation and understanding., * Build a variety of multimodal foundation models such as text-to-video generative models, image-to-video generative models, video understanding models, unified native video generative models

Design core foundation model architectures and progressive pre-train
Post-train foundation models using techniques such as Supervised Fine-Tuning (SFT), Reinforcement Learning from Human Feedback (RLHF), Direct Preference Optimization (DPO), and Low-Rank Adaptation (LoRA)
Conduct research to develop SOTA GenAI models for the Facebook family of apps
Collaborate with colleagues from the infrastructure and product teams on launching models

Requirements

Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
PhD in Computer Science, Machine Learning, or a relevant technical field
1+ year of industry experience training multimodal, computer vision, LLM or related AI/ML models
Experience owning and/or driving complex technical projects from end-to-end
Publications at peer-reviewed conferences (e.g. ICLR, NeurIPS, ICML, KDD, CVPR, ICCV, ACL)
Programming experience in Python and hands-on experience with frameworks such as PyTorch
Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment, * First-authored publications at peer-reviewed conferences (e.g. ICLR, NeurIPS, ICML, KDD, CVPR, ICCV, ACL)
Experience collaborating in cross-functional teams, including product, engineering, and research
Experience building text-to-video generative models, image-to-video generative models, video understanding models, and/or unified native video generative models

About the company

Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today-beyond the constraints of screens, the limits of distance, and even the rules of physics.

Role details

Job location

Tech stack

Job description

Requirements

About the company

Apply for this position

Good distractions

Moments

Videos View all