Member of Technical Staff (Data Engineer)
We are seeking highly skilled Data Engineers to join our team. In this role, you will take ownership of the entire data lifecycle, collaborating closely with researchers and engineers to ensure that data is reliable, accessible, and optimized for model training and evaluation. Additionally, you will have the opportunity to explore innovative data augmentation techniques and gain firsthand experience in how data is used to develop cutting-edge multimodal foundation models.
Ideal Experience:
- Experience pre-processing datasets for AI training.
- Proven experience in data engineering with a strong background in building and managing scalable data pipelines.
- Experience working with one or more modalities other than text.
- Proficiency in Python.
- Up-to-date knowledge of state-of-the-art techniques for preparing language model training data.
- Experience with cloud platforms (e.g., AWS, Azure, Google Cloud) and data-related services (e.g., S3, BigQuery, Redshift).
- Familiarity with machine learning workflows, particularly in the context of model training and evaluation.
- Preferred: experience with containerization technologies (e.g., Docker) and orchestration tools (e.g., Kubernetes).
Why Reka?
Reka's mission is to build useful multimodal artificial intelligence and use it to empower organisations and businesses. We are a globally distributed foundation model startup, headquartered in the San Francisco Bay Area, California. Embracing a remote-first approach, our team brings together top talent from around the world. Our founding team, along with many of our team members, has contributed to many of the breakthroughs in AI over the past decade.
- Opportunity to work with a collaborative, mission-driven team on cutting-edge AI technology.
- Open and inclusive work environment.
- 4 weeks of paid leave.
- Visa support (such as H-1B and OPT transfer for US employees).
- Healthcare benefits, including vision and dental.