Siemens Careers

Reliability Engineer - Senior

Charlotte, North Carolina; Plano, Texas
Research & Development

Job Description

Division: Digital Factory
Business Unit: Product Lifecycle Management-PLM
Requisition Number: 228530
Primary Location: United States-North Carolina-Charlotte
Other Locations: United States-Texas-Plano
Assignment Category: Part-time regular
Experience Level: Mid level
Education Required Level: Bachelor's Degree
Travel Required: 15%

Division Description:

Siemens Digital Factory offers a comprehensive portfolio of seamlessly integrated hardware, software, and technology-based services to support manufacturing companies worldwide. Siemens PLM Software, a Plano, Texas-based business unit of the Digital Factory Division, is a leading global provider of product lifecycle management (PLM) and manufacturing operations management (MOM) software, systems, and services, with over nine million licensed seats and more than 77,000 customers worldwide.


Job Description:

Job Overview:
From digitalization to automation, we're changing the cities you live in and the places you work. Being part of Siemens PL lets you solve complex challenges every day. We are looking for a Principal Site Reliability/Chaos Engineer with a strong operations and software engineering background to help shape our cloud resiliency and make it best in class. In this role, you will inject failure and disruption into our systems to fortify the availability of our service offerings. Your goal will be to uncover weaknesses by running resource-, state-, and network-intensive attacks in controlled experiments.


A successful candidate will be an actively curious self-starter with a solid understanding of operational best practices and the ability to run multiple large initiatives simultaneously. The candidate will also be experienced with agile development methodologies and able to drive chaos engineering principles throughout Siemens PL.


Responsibilities Include:
• Develop effective tooling, alerts, and responses to identify and address reliability risks
• Participate in the on-call rotation with development and operations teams
• Engage with product engineering teams to triage production outages and carry forward action items to improve ongoing reliability
• Define and evangelize cloud-related optimizations and best practices, including Chaos Engineering, to improve reliability and performance
• Promote and define Chaos Engineering tooling, services, and metrics


Minimum Job Qualifications:
• Ability to identify the root causes of instability in a high-traffic, large-scale distributed system
• Experience with configuration and troubleshooting of Linux, Java, Tomcat, and other middleware technologies
• Understanding of large-scale, complex systems from a reliability perspective
• Scripting abilities in Python, Perl, or JVM-based languages
• Passion for resolving reliability issues and identifying strategies to mitigate them going forward


Preferred Skills:
• Experience with cloud computing platforms (particularly AWS)
• Deep network analysis experience
• Strong Linux system-level analysis capabilities



Equal Employment Opportunity Statement
Siemens is an Equal Opportunity and Affirmative Action Employer encouraging diversity in the workplace. All qualified applicants will receive consideration for employment without regard to their race, color, creed, religion, national origin, citizenship status, ancestry, sex, age, physical or mental disability, marital status, family responsibilities, pregnancy, genetic information, sexual orientation, gender expression, gender identity, transgender, sex stereotyping, protected veteran or military status, and other categories protected by federal, state or local law.

EEO is the Law
Applicants and employees are protected under Federal law from discrimination.

Pay Transparency Non-Discrimination Provision
Siemens follows Executive Order 11246, including the Pay Transparency Nondiscrimination Provision.