Senior Distributed Systems Engineer – Data Platform

Netflix
- 1 active job (view)
- jobs.netflix.com
Description
At Netflix, we want to entertain the world and are constantly innovating on how entertainment is imagined, created and delivered to a global audience. We currently stream content in more than 30 languages in 190 countries, topping over 220 million paid subscribers and are expanding into new forms of entertainment such as gaming.
The data infrastructure teams at Netflix enable us to leverage data to bring joy to our members in many different ways. We provide centralized data platforms and tools for various business functions at Netflix, so they can utilize our data to make critical data-driven decisions. We do all the heavy lifting to make it easy for our business partners to work with data efficiently, securely, and responsibly. We aspire to lead the industry standard in building a world-class data infrastructure, as Netflix leads the way to be the most popular and pervasive destination for global internet entertainment.
We are looking for distributed systems engineers to help evolve and innovate our infrastructure as we work towards our ambitious goal of 500 million members worldwide. We are committed to building a diverse and inclusive team to bring new perspectives as we solve the next set of challenges. In addition, we are open to remote candidates. We value what you can do, from anywhere in the U.S.
Spotlight on Data Infrastructure Teams:
Big Data Compute
Responsible for providing the cloud-native platform for distributed data processing at Netflix. This team is central to batch data processing in Data Platform. It provides support for Spark, to ETL data into the Petabytes-scale data warehouse and access that data using Spark and Presto/TrinoDB. It also provides sub-second latency for a certain class of queries using Druid. We are looking for exceptional talent with experience in Spark, Presto / Trino, Druid, Iceberg and distributed database systems in general. Roles in this team involve solving super interesting and challenging problems of working with data at scale, building features and performance enhancements and working closely with open source communities to shape the projects and make contributions.
Big Data Orchestration
Offers the platform for scheduling, orchestrating and executing big data jobs and workflows in a self serve manner. These platforms include foundational services that host all ETL and ML workloads running on Big Data Systems at Netflix. These fully distributed systems are constantly evolving for Netflix scale with state of the art technology. We are moving towards event driven and intelligent orchestration which would need minimal user input/intervention.
Big Data Warehouse and Metadata Platform
Responsible for the analytical data infrastructure to organize and auto-optimize hundreds of petabytes of data in AWS S3 using Apache Iceberg format. Offers a Netflix-wide data catalog and schema registry to capture and infer business metadata across all datasets at Netflix. Also offers a data detection framework that could sample, detect, and report violations across all datasets and a pluggable policy engine framework that allows users to customize data policy rules for all datasets.
This would be your dream job if you enjoy:
- Solving real business needs at large scale by applying your software engineering and analytical problem solving skills.
- Architecting and building a robust, scalable, and highly available distributed infrastructure.
- Leading cross-functional initiatives and collaborating with engineers, product managers, and TPM across teams.
- Sharing our experiences with the open source communities and contributing to Netflix OSS.
About you:
- You have 5+ years of experience in building large-scale distributed systems or applications.
- You are proficient in design and development of RESTful web services.
- Experienced building and operating scalable, fault-tolerant, distributed systems
- You are an expert in Java or other object-oriented programming languages. Python or Scala expertise is a plus.
- Multi-threading is a challenge that you are comfortable tackling.
- You have a BS in Computer Science or related field.