The Data
Revolution

Building the Workforce of the Future

"There cannot be equity in society without equity in data collection, curation, and decisions."

Women in Big Data Founders
Read Report

Senior Distributed Systems Engineer – Data Platform

Netflix
Published
May 5, 2022
Location
Remote, United States, United States of America
Category
Job Type

Description

At Netflix, we want to entertain the world and are constantly innovating on how entertainment is imagined, created and delivered to a global audience. We currently stream content in more than 30 languages in 190 countries, topping over 220 million paid subscribers and are expanding into new forms of entertainment such as gaming.
The data infrastructure teams at Netflix enable us to leverage data to bring joy to our members in many different ways. We provide centralized data platforms and tools for various business functions at Netflix, so they can utilize our data to make critical data-driven decisions. We do all the heavy lifting to make it easy for our business partners to work with data efficiently, securely, and responsibly. We aspire to lead the industry standard in building a world-class data infrastructure, as Netflix leads the way to be the most popular and pervasive destination for global internet entertainment.
We are looking for distributed systems engineers to help evolve and innovate our infrastructure as we work towards our ambitious goal of 500 million members worldwide. We are committed to building a diverse and inclusive team to bring new perspectives as we solve the next set of challenges. In addition, we are open to remote candidates. We value what you can do, from anywhere in the U.S.
Spotlight on Data Infrastructure Teams:
Big Data Compute
Responsible for providing the cloud-native platform for distributed data processing at Netflix. This team is central to batch data processing in Data Platform. It provides support for Spark, to ETL data into the Petabytes-scale data warehouse and access that data using Spark and Presto/TrinoDB. It also provides sub-second latency for a certain class of queries using Druid. We are looking for exceptional talent with experience in Spark, Presto / TrinoDruidIceberg and distributed database systems in general. Roles in this team involve solving super interesting and challenging problems of working with data at scale, building features and performance enhancements and working closely with open source communities to shape the projects and make contributions.
Big Data Orchestration
Offers the platform for scheduling, orchestrating and executing big data jobs and workflows in a self serve manner. These platforms include foundational services that host all ETL and ML workloads running on Big Data Systems at Netflix. These fully distributed systems are constantly evolving for Netflix scale with state of the art technology. We are moving towards event driven and intelligent orchestration which would need minimal user input/intervention.
Big Data Warehouse and Metadata Platform
Responsible for the analytical data infrastructure to organize and auto-optimize hundreds of petabytes of data in AWS S3 using Apache Iceberg format. Offers a Netflix-wide data catalog and schema registry to capture and infer business metadata across all datasets at Netflix. Also offers a data detection framework that could sample, detect, and report violations across all datasets and a pluggable policy engine framework that allows users to customize data policy rules for all datasets.

This would be your dream job if you enjoy:

  • Solving real business needs at large scale by applying your software engineering and analytical problem solving skills.
  • Architecting and building a robust, scalable, and highly available distributed infrastructure.
  • Leading cross-functional initiatives and collaborating with engineers, product managers, and TPM across teams.
  • Sharing our experiences with the open source communities and contributing to Netflix OSS.

About you:

  • You have 5+ years of experience in building large-scale distributed systems or applications.
  • You are proficient in design and development of RESTful web services.
  • Experienced building and operating scalable, fault-tolerant, distributed systems
  • You are an expert in Java or other object-oriented programming languages. Python or Scala expertise is a plus.
  • Multi-threading is a challenge that you are comfortable tackling.
  • You have a BS in Computer Science or related field.
Apply
Drop files here browse files ...

Related Jobs

Data Scientist - REMOTE   REMOTE, United States of America new
June 24, 2022
Global Director, Research Data & Analytics   London, Chicago, New York, Boston, Dallas, Atlanta, DC, United States of America new
June 16, 2022
April 19, 2021
Full-Stack Developer, Cloud   Montreal, Canada
April 19, 2021
Product Marketing Manager   Vancouver, Remote, Canada
April 19, 2021
Are you sure you want to delete this file?
/