




Summary: Seeking experienced machine learning engineers and researchers to design high-quality evaluation suites for advanced ML systems, focusing on translating practical research into structured benchmarks. Highlights: 1. Collaborate with a leading AI research lab 2. Design and write detailed evaluation suites for ML engineering tasks 3. Focus on practical ML research and engineering workflows **Engagement Type:** Independent Contractor **Work Mode:** Fully Remote **Contract Type:** Project\-Based (Short\-Term, Extendable) **Role Overview:** collaborating with a leading AI research lab to support the evaluation of advanced machine learning systems. We are seeking experienced machine learning engineers and researchers to contribute to the design of high\-quality evaluation suites that measure AI performance on real\-world machine learning engineering tasks. The work focuses on translating practical ML research and engineering workflows into structured benchmarks for frontier models. This is a project\-based, remote opportunity suited for experts with hands\-on ML research experience. **Key responsibilities** * Design and write detailed evaluation suites for machine learning engineering tasks * Assess AI\-generated solutions across areas such as model training, debugging, optimization, and experimentation **Ideal qualifications** * 3\+ years of experience in machine learning engineering or applied ML research * Hands\-on experience with model development, experimentation, and evaluation * Background in ML research (industry lab or academic setting strongly preferred) * Strong ability to reason about ML system design choices and tradeoffs * Clear written communication and high attention to technical detail We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request. **Contract and Payment Terms** ------------------------------ * You will be engaged as an independent contractor. * This is a fully remote role that can be completed on your own schedule. * Projects can be extended, shortened, or concluded early depending on needs and performance. * Your work at will not involve access to confidential or proprietary information from any employer, client, or institution. * Payments are weekly on Stripe or Wise based on services rendered.


