8194460 Job Detail
M

Site Reliability Engineer - Chicago/US

at Meta IT North America

Desired Skills

About Job

Develop and maintain advanced telemetry and automation tools for monitoring and managing global platform health. Actively participate in on-call rotations, swiftly diagnosing and resolving system issues and escalations from the customer support team (this is not a customer-facing role). Implement automated solutions for incident response, system optimization, and reliability improvement. Provide operational support for backend services and Kafka producers/consumers written in Python running on ECS. Full-Stack Troubleshooting: Support, debug, and enhance the entire application stack, from our React.js frontend to our Python backend services (Flask, Litestar, Celery, ESK, MSK) Hands-on experience building and/or supporting applications written with React.js. Must have professional experience building and/or supporting applications with React.js. Effectively troubleshoot issues between the frontend UI and backend APIs.

Requirements

Minimum 3 years of experience with Python
Experience with Icinga2, Prometheus, or Splunk a plus
Experience with AWS a plus
Solid understanding of functional programming, object oriented programming and computer science foundations
Good understanding of backend and server side components
Ability to work on-call rotation for support with global team members on a semi-frequent basis
Proven and strong communication skills
Must be self-directed, flexible and have the ability to prioritize and handle multiple projects simultaneously
Experience working in an Agile environment a plus

Additional Instructions

Develop and maintain advanced telemetry and automation tools for monitoring and managing global platform health.
Actively participate in on-call rotations, swiftly diagnosing and resolving system issues and escalations from the customer support team (this is not a customer-facing role).
Implement automated solutions for incident response, system optimization, and reliability improvement.
Provide operational support for backend services and Kafka producers/consumers written in Python running on ECS.
Full-Stack Troubleshooting: Support, debug, and enhance the entire application stack, from our React.js frontend to our Python backend services (Flask, Litestar, Celery, ESK, MSK)
Hands-on experience building and/or supporting applications written with React.js. Must have professional experience building and/or supporting applications with React.js. Effectively troubleshoot issues between the frontend UI and backend APIs.

Perks and Benefits

We offer autonomy, clear goals and a dynamic and challenging environment, where professionals have the opportunity to interact with different technologies, participate in all types of projects, bring new ideas and work from anywhere in Brazil and (why not?) anywhere in the world. In addition, we are one of the best companies to work for in Brazil according to Great Place to Work and one of the 10 fastest growing technology companies in the country for 3 consecutive years, according to Anuário Informático Hoje.
M

Meta IT North America

-

Details

Job Type
Remote
Preferred location
Belarus
Apply Before
Jan 19, 2026
Apply To Job