Join us as a Software Chaos Engineer at Barclays, where you'll spearhead the evolution of our digital landscape, driving innovation and excellence. You'll harness cutting-edge technology to revolutionize our digital offerings, ensuring unapparelled customer experiences. The ideal candidate will have deep expertise in cloud-native technologies, distributed streaming platforms, Kubernetes orchestration, and multi-region architecture patterns.
To be a successful Software Chaos Engineer, you should have experience with:
- Design and execute hypothesis‑driven chaos engineering experiments using tools such as Gremlin and/or Harness Chaos Engineering, across Kubernetes, cloud, and application layers.
- Perform controlled fault‑injection scenarios, including infrastructure, network, application, and platform failures, to validate system availability, recovery behavior, and operational resilience.
- Validate system resilience and dependency behavior through infrastructure, network, and downstream service failure scenarios, identifying systemic weaknesses and recovery gaps.
- Design, develop, and maintain chaos test scenarios, ensuring repeatability, observability, and measurable outcomes aligned to resilience objectives.
- Implement PACT consumer‑driven contract testing for microservices and API‑based systems to reduce integration risk and improve deployment confidence.
- Design, maintain, and verify consumer and provider contracts, ensuring backward compatibility and safe evolution of services.
- Embed contract testing and chaos experiments into CI/CD pipelines as quality and resilience gates, preventing regressions from reaching higher environments.
- Develop and maintain automation frameworks and utilities using Python to support resilience testing, test orchestration, and reporting.
- Analyse test outcomes, identify failure patterns, and work closely with engineering teams to remediate issues and improve platform reliability.
- Apply strong working knowledge of Git, DevOps tooling, Kubernetes, and public cloud platforms (AWS preferred) to support scalable and automated resilience testing.
- Deliver software using Agile methodologies, actively participating in sprint planning, reviews, retrospectives, and continuous improvement.
- Hands‑on experience with: Docker Desktop for local containerized development, IntelliJ IDEA or equivalent enterprise IDEs and Secure source control, branching strategies, and automated quality gates.
- Demonstrates strong familiarity with Event‑Driven Architecture (EDA) concepts and patterns, with exposure to industry‑standard messaging and streaming platforms, supporting solutions designed for high throughput, low latency, and resilience.
- Drive a test‑first, quality‑driven engineering culture with hands‑on experience.
- .Contract Testing (PACT) for consumer‑provider assurance.
- Unit Testing (JUnit) and integration testing.
- Performance & load testing (JMeter).
- Mutation testing to validate test effectiveness.
- Ensure test coverage and automation are embedded into CI/CD pipelines, not treated as post‑delivery activities.
Some other highly valued skills may include:
- Contribute as a hands‑on Individual Contributor Engineer (ICE), owning end‑to‑end delivery of chaos and contract testing capabilities with minimal supervision.
- Work effectively within Agile/Scrum delivery models, using Jira for backlog management, delivery tracking, and sprint execution.
- Apply strong analytical and troubleshooting skills to diagnose and resolve complex issues in distributed, event‑driven systems.
- Demonstrate excellent verbal and written communication, producing clear architecture designs, decision records, and technical documentation.
- Collaborate closely with cross‑functional teams (engineering, DevOps, security, architecture) to deliver integrated, enterprise‑ready solutions.
- Exhibit strong ownership, self‑drive, and innovation mindset, staying current with cloud‑native technologies and modern engineering best practices.
You may be assessed on key critical skills relevant for success in role, such as risk and controls, change and transformation, business acumen, strategic thinking and digital and technology, as well as job-specific technical skills.
This role is based out of Bengaluru.
Purpose of the role
To design, develop and improve software, utilising various engineering methodologies, that provides business, platform, and technology capabilities for our customers and colleagues.
Accountabilities
- Development and delivery of high-quality software solutions by using industry aligned programming languages, frameworks, and tools. Ensuring that code is scalable, maintainable, and optimized for performance.
- Cross-functional collaboration with product managers, designers, and other engineers to define software requirements, devise solution strategies, and ensure seamless integration and alignment with business objectives.
- Collaboration with peers, participate in code reviews, and promote a culture of code quality and knowledge sharing.
- Stay informed of industry technology trends and innovations and actively contribute to the organization’s technology communities to foster a culture of technical excellence and growth.
- Adherence to secure coding practices to mitigate vulnerabilities, protect sensitive data, and ensure secure software solutions.
- Implementation of effective unit testing practices to ensure proper code design, readability, and reliability.
Analyst Expectations
- To perform prescribed activities in a timely manner and to a high standard consistently driving continuous improvement.
- Requires in-depth technical knowledge and experience in their assigned area of expertise
- Thorough understanding of the underlying principles and concepts within the area of expertise
- They lead and supervise a team, guiding and supporting professional development, allocating work requirements and coordinating team resources.
- If the position has leadership responsibilities, People Leaders are expected to demonstrate a clear set of leadership behaviours to create an environment for colleagues to thrive and deliver to a consistently excellent standard. The four LEAD behaviours are: L – Listen and be authentic, E – Energise and inspire, A – Align across the enterprise, D – Develop others.
- OR for an individual contributor, they develop technical expertise in work area, acting as an advisor where appropriate.
- Will have an impact on the work of related teams within the area.
- Partner with other functions and business areas.
- Takes responsibility for end results of a team’s operational processing and activities.
- Escalate breaches of policies / procedure appropriately.
- Take responsibility for embedding new policies/ procedures adopted due to risk mitigation.
- Advise and influence decision making within own area of expertise.
- Take ownership for managing risk and strengthening controls in relation to the work you own or contribute to. Deliver your work and areas of responsibility in line with relevant rules, regulation and codes of conduct.
- Maintain and continually build an understanding of how own sub-function integrates with function, alongside knowledge of the organisations products, services and processes within the function.
- Demonstrate understanding of how areas coordinate and contribute to the achievement of the objectives of the organisation sub-function.
- Make evaluative judgements based on the analysis of factual information, paying attention to detail.
- Resolve problems by identifying and selecting solutions through the application of acquired technical experience and will be guided by precedents.
- Guide and persuade team members and communicate complex / sensitive information.
- Act as contact point for stakeholders outside of the immediate function, while building a network of contacts outside team and external to the organisation.
All colleagues will be expected to demonstrate the Barclays Values of Respect, Integrity, Service, Excellence and Stewardship – our moral compass, helping us do what we believe is right. They will also be expected to demonstrate the Barclays Mindset – to Empower, Challenge and Drive – the operating manual for how we behave.