ML/AI Platform Engineer
SquarePeg
Software Engineering, Data Science
United States
Posted on Nov 7, 2024
About The ML/AI Platform Team
The ML/AI Platform team designs and manages end-to-end solutions to support the entire lifecycle of machine learning and generative AI workflows. This includes everything from setting up standardized development environments to managing model training, deployment, monitoring, and integrating tools for efficient ML/AI operations. As a Principal Engineer on this impactful team, you will play a key role in democratizing scalable, accessible, and cost-effective ML/AI solutions, shaping the future of machine learning within the organization.
Your Key Responsibilities
The target compensation for this role includes:
The ML/AI Platform team designs and manages end-to-end solutions to support the entire lifecycle of machine learning and generative AI workflows. This includes everything from setting up standardized development environments to managing model training, deployment, monitoring, and integrating tools for efficient ML/AI operations. As a Principal Engineer on this impactful team, you will play a key role in democratizing scalable, accessible, and cost-effective ML/AI solutions, shaping the future of machine learning within the organization.
Your Key Responsibilities
- Develop AI/ML Frameworks: Create reusable frameworks to streamline AI/ML model development, testing, deployment, and monitoring, while championing engineering best practices.
- Collaborate on Cloud Strategy: Work with Cloud Engineering on roadmaps and technical designs that support our scalable infrastructure and efficient data processing.
- Innovate for Excellence: Lead efforts to implement state-of-the-art technologies that enhance availability, scalability, and operational efficiency.
- Drive Cross-Functional Impact: Engage closely with ML, Tech Delivery, Business Operations, and Product teams to understand needs, optimize processes, and deliver value incrementally.
- Lead in AI Ethics and Compliance: Ensure our AI platform is designed with responsible AI principles, focusing on privacy compliance and trustworthiness.
- Mentor and Grow Talent: Educate and mentor ML and Data Engineers on the latest tools and technologies, leading by example and fostering a collaborative team culture.
- Recruit and Onboard: Conduct technical interviews with well-defined standards and support onboarding for junior engineers and interns.
- Customer Empathy: Anticipate and understand customer needs, delivering user-centered and impactful solutions.
- Engineering Excellence: Bring leadership and innovation to build exceptional products that delight our customers and drive the business forward.
- One Team Approach: Work seamlessly across functions, fostering open communication and shared goals.
- Owner’s Mindset: Take proactive responsibility for your domain, striving for solutions that benefit the company as a whole.
- Bachelor’s in Computer Science, Mathematics, or equivalent experience (Master’s/PhD preferred)
- 8+ years (Bachelor’s), 6+ years (Master’s), or 4+ years (PhD) experience with ML platforms, distributed systems, or high-scale deployment
- Proficiency in Python, Java, or C++, and experience deploying on cloud platforms (AWS, Azure, GCP)
- Expertise in production-quality code, code standards, and SDLC best practices (CI/CD, testing, debugging, and deployment)
- Leadership experience in cross-functional initiatives and scaling large foundational models, including experience with distributed training for LLMs
The target compensation for this role includes:
- Base Salary: Ranging from $202,500 to $417,000 annually
- Annual Variable Bonus: This may include cash and equity, along with eligibility for a sign-on RSU grant