We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results
New

Software Engineer LMTS (Site Reliability Engineering)

salesforce.com, inc.
parental leave, 401(k)
United States, Massachusetts, Boston
Jun 13, 2025

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.

Job Category

Software Engineering

Job Details

About Salesforce

We're Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too - driving your performance and career growth, charting new paths, and improving the state of the world. If you believe in business as the greatest platform for change and in companies doing well and doing good - you've come to the right place.

This candidate must be a U.S. citizen (U.S. born or naturalized) operating on U.S. Soil who does not hold dual citizenship with the ability to meet customer and government screening standards applicable to this role.
This position requires onsite presence in either Boston, San Francisco or Bellevue offices.

As a Software Engineer in Site Reliability Engineering (SRE) at MuleSoft, you will be part of a high-impact team focused on architecting, building, and scaling the infrastructure, tools, and platforms that improve the resiliency, reliability, performance, and scalability of distributed systems running on the MuleSoft Anypoint Platform. This is a software engineering-driven role, where you'll write production-grade code to automate operations, enhance observability, and strengthen service resilience-especially in high-security environments, including FedRAMP, Protected B, among others.

Your work will span the entire stack: from shaping engineering practices and building proactive failure-prevention mechanisms to streamlining deployment pipelines and improving the end-to-end reliability of mission-critical services. As stewards of observability, incident management, release automation, and reliability engineering, our team's mission is to embed resiliency and reliability into every layer of the system and consistently exceed industry standards for uptime, latency, and performance.

What You'll Be Doing

  • Engineering Resiliency and Reliability: Design and develop systems, libraries, and tools that strengthen the resiliency and reliability of distributed services running on the MuleSoft Anypoint Platform.
  • Observability by Design: Develop and extend monitoring, logging, and alerting capabilities using industry-standard observability platforms (e.g., metrics, tracing, and log aggregation tools) to ensure issues are detected and diagnosed before they impact customers.
  • Automation at Scale: Write production-grade code in Python, Go, or similar languages to automate operational tasks, scale deployment pipelines, and implement self-healing systems.
  • Incident Response & Prevention: Participate in on-call rotations, drive root cause analysis, and deliver software-based solutions that prevent recurrence and reduce meantime to recovery (MTTR).
  • Platform and Infrastructure Development: Build internal platforms, shared APIs, and systems that enhance developer velocity while improving overall system resilience and operability.
  • CI/CD and Deployment Engineering: Optimize and evolve our CI/CD pipelines using Jenkins, Spinnaker, and infrastructure-as-code tools such as Terraform and Kubernetes to enable safe and frequent delivery.
  • Security and Compliance as Code: Develop and maintain automated solutions to meet FedRAMP, Protected B, and other regulatory requirements-integrating security and compliance directly into deployment workflows.
  • Collaborative Reliability Advocacy: Work closely with product engineers, platform teams, and security stakeholders to influence architectural decisions and bake reliability into all layers of the stack.
  • Runbooks and Design Documentation: Create and maintain high-quality documentation for systems, processes, and playbooks to promote operational excellence and team scalability.

Requirements:

  • 8+ years of experience in Software Engineering, SRE, or DevOps roles, with a strong focus on building resilient, scalable, and highly available systems.
  • Proven proficiency in Java, Python, Go, Bash, with experience writing production-quality, maintainable, and testable code for infrastructure and platform automation.
  • Hands-on experience with infrastructure as code, CI/CD pipelines, and deployment automation using tools like Terraform, Jenkins, and Spinnaker.
  • Proven experience architecting, developing, and operating systems in cloud-native environments (AWS) and managing containerized workloads with Kubernetes.
  • Strong understanding of observability engineering, including instrumentation, metrics, logging, and distributed tracing-experience with OpenTelemetry, Grafana, Splunk, Sumo Logic, or similar platforms.
  • Solid knowledge of distributed systems, network protocols (TCP/IP, DNS, HTTP, TLS), and API design standards (REST, RAML, OAS).
  • Demonstrated ability to diagnose complex system issues, design for fault tolerance and high availability, and continuously improve reliability through software.
  • Familiarity with compliance-bound environments, including FedRAMP, Protected B, or similar, and experience incorporating security and compliance into engineering workflows.
  • A passion for engineering reliability through software-you drive automation, eliminate toil, and foster a culture of operational excellence.
  • A related technical degree required.

Preferred:

  • Experience with chaos engineering, fault injection, or reliability gamedays to proactively validate system resilience and recovery readiness.
  • Background in platform-as-a-service (PaaS), internal developer tooling, or building self-service infrastructure that accelerates engineering productivity.
  • Prior experience operating in hybrid or multi-cloud environments, with a focus on portability, automation, and infrastructure standardization.

This candidate must be a U.S. citizen (U.S. born or naturalized) operating on U.S. Soil who does not hold dual citizenship with the ability to meet customer and government screening standards applicable to this role, including a Criminal Justice Information Services screening with fingerprint scan. Due to the citizenship requirements for this role, which supports U.S. federal, state, and/or local government customers, citizenship will be verified through two of the following REAL ID Act documents: U.S. Passport, Passport Card, REAL Driver's License, Global Entry Card, U.S. Government CAC/PIV. You agree to complete a Minimum Background Investigation (MBI) for a Moderate Public Trust position with the U.S. federal government and gain other clearances as deemed appropriate for the role.

Benefits & Perks
Check out our benefits site which explains our various benefits, including wellbeing reimbursement, generous parental leave, adoption assistance, fertility benefits, and more.

Salesforce Information
Check out our Salesforce Engineering Site.

Accommodations

If you require assistance due to a disability applying for open positions please submit a request via this Accommodations Request Form.

Posting Statement

Salesforce is an equal opportunity employer and maintains a policy of non-discrimination with all employees and applicants for employment. What does that mean exactly? It means that at Salesforce, we believe in equality for all. And we believe we can lead the path to equality in part by creating a workplace that's inclusive, and free from discrimination. Know your rights: workplace discrimination is illegal. Any employee or potential employee will be assessed on the basis of merit, competence and qualifications - without regard to race, religion, color, national origin, sex, sexual orientation, gender expression or identity, transgender status, age, disability, veteran or marital status, political viewpoint, or other classifications protected by law. This policy applies to current and prospective employees, no matter where they are in their Salesforce employment journey. It also applies to recruiting, hiring, job assignment, compensation, promotion, benefits, training, assessment of job performance, discipline, termination, and everything in between. Recruiting, hiring, and promotion decisions at Salesforce are fair and based on merit. The same goes for compensation, benefits, promotions, transfers, reduction in workforce, recall, training, and education.

In the United States, compensation offered will be determined by factors such as location, job level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, and benefits. Salesforce offers a variety of benefits to help you live well including: time off programs, medical, dental, vision, mental health support, paid parental leave, life and disability insurance, 401(k), and an employee stock purchasing program. More details about company benefits can be found at the following link: https://www.salesforcebenefits.com.Pursuant to the San Francisco Fair Chance Ordinance and the Los Angeles Fair Chance Initiative for Hiring, Salesforce will consider for employment qualified applicants with arrest and conviction records. For Washington-based roles, the base salary hiring range for this position is $184,000 to $253,000. For California-based roles, the base salary hiring range for this position is $200,800 to $276,100. For Massachusetts based roles, the base salary hiring range for this position is $184,000 to $253,000.
Applied = 0

(web-696f97f645-4mdcj)