Staff Platform Engineer, AI/ML Infrastructure

Staff Platform Engineer, AI/ML Infrastructure Department:AI Software & Operations Role Summary The Staff Platform Engineer, AI/ML Infrastructure will provide technical leadership for thecloud platforms, deployment systems, and operational foundations that power enterprise-scalegenerative AI applications. This role will define and evolve the infrastructure architecture for AI/ML platforms running across AWS,Kubernetes, serverless, and containerized environments. The engineer will lead platform standards forreliability, scalability, observability, CI/CD, security, and developer enablement, while partnering closelywith software engineering, AI engineering, security, and operations teams. The ideal candidate combines deep hands-on cloud engineering experience with staff-level technicalinfluence. They are comfortable designing infrastructure patterns, writing infrastructure-as-code,improving delivery pipelines, mentoring engineers, and making architectural decisions that raise theoperational maturity of AI platforms across multiple teams. Key Responsibilities Define and drive the technical strategy for AI/ML platform infrastructure supporting generative AIapplications, LLM integrations, model routing, and enterprise AI services. Architect, build, and operate scalable cloud platforms using AWS services such as EKS, ECSFargate, Lambda, DynamoDB, S3, OpenSearch, Secrets Manager, CloudWatch, ALB, and MWAA. Establish reusable infrastructure patterns using CloudFormation, Helm, and Terraform to supportreliable multi-environment and multi-region deployments. Lead CI/CD architecture using GitHub Actions, reusable workflows, OIDC-based AWSauthentication, automated quality gates, deployment promotion, and environment approvals. Design and improve observability across AI platforms, including CloudWatch dashboards, logs,alarms, Prometheus/Grafana, OpenSearch, Langfuse, and LLM-specific operational metrics. Build platform capabilities for GenAI workloads, including model availability monitoring. Partner with software engineering teams to improve deployment reliability, rollback strategies,health checks, autoscaling, load testing, and runtime performance. Define and enforce security and compliance practices for infrastructure, including IAM permissionboundaries, Secrets Manager usage, secret scanning, audit logging, tagging standards, andchange-management controls. Provide technical leadership for cost optimization, capacity planning, environment standardization,and operational resilience across development, test, production, and sandbox environments. Mentor engineers, review architecture and infrastructure designs, and influence platformengineering practices across teams. Basic Qualifications Bachelor’s degree in Computer Science, Engineering, Information Technology, or a relatedtechnical field, or equivalent practical experience. 7+ years of experience in DevOps, platform engineering, cloud infrastructure, site reliabilityengineering, or software engineering roles. Strong hands-on experience with AWS/Azure/GCP infrastructure and services, including container,serverless, networking, storage, observability, and security services. Experience designing and operating production systems on Kubernetes, ECS/Fargate, orcomparable container orchestration platforms. Proficiency with infrastructure-as-code, especially CloudFormation, Terraform, Helm, or similartooling. Strong CI/CD experience with GitHub Actions or similar platforms, including reusable workflows,automated testing, deployment gates, and cloud authentication. Experience building and operating observability solutions using CloudWatch, Prometheus/Grafana,OpenSearch, or similar tools. Strong understanding of cloud security practices, IAM, secrets management, least-privilegeaccess, audit logging, and compliance requirements. Experience supporting distributed systems, microservices, APIs, asynchronous workloads, andmulti-environment deployments. Demonstrated ability to lead technical design, mentor engineers, and influence engineeringpractices across teams. Preferred Qualifications Experience supporting AI/ML or generative AI platforms, including LLM gateways, model routing,prompt observability, token metering, or model failover. Experience operating platforms in regulated enterprise environments, ideally healthcare,pharmaceutical, finance, or life sciences. Experience with multi-account, multi-region AWS architectures and enterprise governancepatterns. Experience with cost optimization, autoscaling strategies, capacity planning, and cloud budgetmonitoring. Experience with load testing and performance validation using tools such as Locust or comparableframeworks. Strong Python or scripting skills for platform automation, operational tooling, and CI/CD extensions. Ability to communicate complex technical decisions clearly to engineering, security, operations,and leadership audiences. Technical Environment This role works across a modern AI platform ecosystem including: Cloud: AWS EKS, ECS Fargate, Lambda, DynamoDB, S3, OpenSearch, CloudWatch, SecretsManager, ALB, VPC, IAM Infrastructure-as-Code: CloudFormation, Helm, Terraform CI/CD: GitHub Actions, reusable workflows, OIDC federation, environment approvals, automatedrelease promotion AI/ML Platform: AWS Bedrock, Azure OpenAI, LiteLLM, Langfuse Observability: CloudWatch dashboards and alarms, Prometheus, Grafana, OpenSearch, Langfuse,custom metrics Security & Governance: IAM permission boundaries, secret scanning, audit logging, taggingcompliance, change-management automation Engineering Practices: Docker, Python, pre-commit, automated testing, load testing, code qualitygates, monorepo service standards Leadership Expectations As a J090 Staff-level engineer, this role is expected to operate beyond individual delivery. The engineerwill identify systemic platform gaps, define technical direction, create reusable standards, and raiseengineering maturity across multiple teams. Success in this role requires strong judgment, ownership, and communication. The engineer should beable to balance hands-on implementation with architectural leadership, guide teams through ambiguoustechnical decisions, and build platform capabilities that make AI product teams faster, safer, and morereliable. Work location assignment : Remote The annual base salary for this position ranges from €65.250,00 to €108.750,00. This salary range applies to the location France - Rives de Paris. We also offer a range of benefits and programs to meet colleagues’ needs. Benefits vary by location and can include health care coverage, retirement savings plans, insurance benefits, an Employee Assistance Program, wellness benefits and more. Additional details about total compensation and benefits will be provided during the hiring process. Pfizer compensation structures and benefit packages are aligned based on the location of hire. Final compensation will be determined based on the successful candidate’s relevant skills, experience, and qualifications, in accordance with pay equity principles and applicable employment laws.This role is posted in multiple locations. If you are applying for the role in an secondary job posting location where pay transparency regulations apply, your Talent Advisor will share the local pay information with you during the first interview.Pfizer is an equal opportunity employer and complies with all applicable equal employment opportunity legislation in each jurisdiction in which it operates. Égalité des chances & Emploi Nous croyons que des équipes diversifiées et inclusives sont essentielles à la réussite d'une entreprise. En tant qu'employeur, Pfizer s'engage à valoriser la diversité et l’inclusion sous toutes ses formes. Cette diversité se reflète également à travers les patients et les communautés que nous servons. Ensemble, continuons à bâtir une culture qui encourage, soutient et responsabilise nos employés. Handicap & Inclusion Notre mission est de libérer le potentiel de nos collaborateurs et nous sommes fiers d'être un employeur inclusif pour les personnes handicapées, garantissant ainsi l'égalité des chances en matière d'emploi pour tous les candidats. Nous vous encourageons à donner le meilleur de vous-même en sachant que nous apporterons tous les ajustements raisonnables pour soutenir votre candidature et votre carrière future. Votre expérience avec Pfizer commence ici ! Pfizer endeavors to make accessible to all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process and/or interviewing, please email disabilityrecruitment@pfizer.com. This is to be used solely for accommodation requests with respect to the accessibility of our website, online application process and/or interviewing. Requests for any other reason will not be returned.Pour mieux comprendre les usages autorisés et interdits de l’intelligence artificielle tout au long du processus de recrutement, nous vous invitons à consulter nos bonnes pratiques dédiées à l’utilisation de l’IA par les candidats sur Pfizer Careers. Information & Business Tech

Back to blog

Common Interview Questions And Answers

1. HOW DO YOU PLAN YOUR DAY?

This is what this question poses: When do you focus and start working seriously? What are the hours you work optimally? Are you a night owl? A morning bird? Remote teams can be made up of people working on different shifts and around the world, so you won't necessarily be stuck in the 9-5 schedule if it's not for you...

2. HOW DO YOU USE THE DIFFERENT COMMUNICATION TOOLS IN DIFFERENT SITUATIONS?

When you're working on a remote team, there's no way to chat in the hallway between meetings or catch up on the latest project during an office carpool. Therefore, virtual communication will be absolutely essential to get your work done...

3. WHAT IS "WORKING REMOTE" REALLY FOR YOU?

Many people want to work remotely because of the flexibility it allows. You can work anywhere and at any time of the day...

4. WHAT DO YOU NEED IN YOUR PHYSICAL WORKSPACE TO SUCCEED IN YOUR WORK?

With this question, companies are looking to see what equipment they may need to provide you with and to verify how aware you are of what remote working could mean for you physically and logistically...

5. HOW DO YOU PROCESS INFORMATION?

Several years ago, I was working in a team to plan a big event. My supervisor made us all work as a team before the big day. One of our activities has been to find out how each of us processes information...

6. HOW DO YOU MANAGE THE CALENDAR AND THE PROGRAM? WHICH APPLICATIONS / SYSTEM DO YOU USE?

Or you may receive even more specific questions, such as: What's on your calendar? Do you plan blocks of time to do certain types of work? Do you have an open calendar that everyone can see?...

7. HOW DO YOU ORGANIZE FILES, LINKS, AND TABS ON YOUR COMPUTER?

Just like your schedule, how you track files and other information is very important. After all, everything is digital!...

8. HOW TO PRIORITIZE WORK?

The day I watched Marie Forleo's film separating the important from the urgent, my life changed. Not all remote jobs start fast, but most of them are...

9. HOW DO YOU PREPARE FOR A MEETING AND PREPARE A MEETING? WHAT DO YOU SEE HAPPENING DURING THE MEETING?

Just as communication is essential when working remotely, so is organization. Because you won't have those opportunities in the elevator or a casual conversation in the lunchroom, you should take advantage of the little time you have in a video or phone conference...

10. HOW DO YOU USE TECHNOLOGY ON A DAILY BASIS, IN YOUR WORK AND FOR YOUR PLEASURE?

This is a great question because it shows your comfort level with technology, which is very important for a remote worker because you will be working with technology over time...