Build Scalable Classifier Infrastructure: Design and build scalable pipelines for training and evaluating many novel classifiers.
Train and Evaluate Classifiers: Train and evaluate classifiers for distillation detection and other key GDM security priorities.
Own Classifier Deployment and Integration: Collaborate with model training, infrastructure, and deployment teams to deploy classifiers into our production models.
Qualifications
Ph.D. in Computer Science or a related quantitative field, or a B.S./M.S. in a similar field with 2+ years of relevant industry experience.
Demonstrated research or product expertise in machine learning, with a focus on classifier training, classifier evaluation and model evaluation.
Deep expertise in training and evaluating a wide range of classifier architectures.
Proven experience building and scaling ML training and evaluation infrastructure.
Experience building data pipelines using LLM autoraters.
Strong understanding of model distillation, model stealing, and other capability extraction techniques.
Strong software engineering skills and experience with ML frameworks like JAX, PyTorch, or TensorFlow.
A track record of landing research impact or shipping production ML systems in a multi-team environment.
Benefits
The US base salary range for this full-time position is between $166,000 - $244,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.