Software Engineer — Data Infrastructure

We’re on a mission to democratize AI by building the definitive AI data development platform. The AI landscape has changed dramatically from 2016, when Snorkel started as a research project in the Stanford AI Lab, to the generative AI breakthroughs of today. But one thing has remained constant: the data you use to build AI is the key to achieving differentiation, high performance, and production-ready systems. We work with some of the world’s largest organizations to empower scientists, engineers, financial experts, product creators, journalists, and more to build custom AI with their data faster than ever before. Excited to help us redefine how AI is built? Apply to be the newest Snorkeler!

As a Software Engineer on our Enterprise & Data Infra team, you’ll focus on the Data Infra pillar, designing and building scalable services, APIs, and libraries that power data workloads across Snorkel’s enterprise platforms. You’ll tackle challenges involving structured and unstructured data, multiple storage types (hot/warm/cold), and various deployment models (public cloud/private cloud/on-prem). If you enjoy architecting performant data infrastructure solutions and working with modern data technologies, this is a unique opportunity to drive impactful innovation in the AI/ML space.

Main Responsibilities

Data Infrastructure Development

Design, build, and maintain scalable services, APIs, and libraries for managing data workloads on Snorkel’s enterprise platforms.
Implement secure, access-controlled, and governance-enabled storage solutions that meet diverse needs for structured/unstructured data.
Integrate and support data ingress/egress with popular data providers (e.g., AWS, Databricks, Snowflake).

Scalability & Performance

Work closely with cross-functional engineering teams to define workload and performance requirements for different storage tiers (hot, warm, cold).
Architect flexible solutions that can be deployed in various environments, including public and private clouds, as well as on-premises.

Collaboration & Operations

Collaborate with enterprise customers to understand their use cases, translate them into engineering specifications, and deliver high-quality solutions.
Participate in an on-call rotation to troubleshoot and resolve production issues.
Work a hybrid schedule of three days per week in our Redwood City HQ or SF office.

Minimum Qualifications

Bachelor’s degree in Computer Science or related field, or equivalent professional experience
2+ years of experience in software development, preferably in distributed systems or cloud-native applications
Strong Python development and debugging skills
Excellent communication skills and a track record of cross-functional collaboration

Preferred Skills

Experience with storage infrastructure or storage technologies
Familiarity with cloud storage solutions (e.g., S3, GCS)
Familiarity with databases (e.g., Postgres) and ORMs
Ability to own problems end-to-end and learn new domains or technologies quickly
Ability to design and integrate storage access controls with IAM systems
[Nice to have] Experience with Kubernetes

The salary range for this position based in the San Francisco Bay Area is $110,000 - $200,000. All offers include equity compensation in the form of employee stock options.

Be Your Best At Snorkel

Snorkel AI is on a mission to make machine learning practical for everyone, and it starts with building a team that welcomes, represents and gives opportunity to all. We work at the frontier of AI and software engineering, and believe that underrepresented communities need to play a part in shaping the future of these fields. At Snorkel AI, we actively work to create an environment that values end-to-end ownership, diverse forms of impact, and opportunities for personal growth.

Snorkelers are supported by an amazing team and an amazing set of benefits. For full-time employees, we offer comprehensive medical, dental, and vision plans for Snorkelers and their families, plus a yearly wellness stipend. Our 401k program lets Snorkelers plan for their future, and our parental leave program lets new parents take up to 20 weeks of paid time off. Learn more about these and other benefits, such as our workstation setup allowance, on our Careers page.

Snorkel AI is proud to be an Equal Employment Opportunity employer and is committed to building a team that represents a variety of backgrounds, perspectives, and skills. Snorkel AI embraces diversity and provides equal employment opportunities to all employees and applicants for employment. Snorkel AI prohibits discrimination and harassment of any type on the basis of race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local law. All employment is decided on the basis of qualifications, performance, merit, and business need.

We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
