Join us as a Software Engineer -Observability at Barclays, where you'll spearhead the evolution of our digital landscape, driving innovation and excellence. You'll harness cutting-edge technology to revolutionize our digital offerings, ensuring unapparelled customer experiences. The ideal candidate will have deep expertise in cloud-native technologies, distributed streaming platforms, Kubernetes orchestration, and multi-region architecture patterns.
To be a successful Software Engineer -Observability, you should have experience with:
- Design, implement, and integrate end‑to‑end observability solutions by instrumenting applications and platforms with Metrics, Events, Logs, and Traces (MELTs), ensuring consistent visibility across distributed systems.
- Integrate application using Open Telemetry (OTEL), with strong preference for the Grafana suite, enabling unified monitoring, visualization, and operational insight.
- Develop and maintain high‑quality dashboards that provide actionable insights into system health, performance, capacity, and business‑critical metrics, tailored for engineering, operations, and senior stakeholders.
- Define and implement alerts, notifications, and escalation mechanisms using metrics and logs.
- Apply observability best practices to Kafka‑based event‑driven systems, Integrate Kafka clients (producers and consumers), including monitoring broker health, consumer lag, throughput, latency, error rates, and topic‑level behavior.
- Implement observability for NoSQL data stores (MongoDB preferred), capturing performance metrics, query behavior, resource utilization, and failure patterns to support proactive issue identification.
- Work within cloud‑native environments (AWS preferred) to implement observability across compute, storage, networking, and managed services, ensuring alignment with Barclays security and resilience standards.
Some other highly valued skills may include (Mandatory):
- Contribute as a Individual Contributor (ICE), owning end‑to‑end technical delivery of observability capabilities, from design and implementation through to production readiness and ongoing optimization, with minimal supervision and strong engineering accountability.
- Take full ownership of assigned observability solutions, demonstrating accountability for design quality, operational stability, performance, security, and alignment with Barclays engineering standards.
- Operate effectively within Agile / Scrum delivery models, actively participating in sprint planning, refinement, reviews, and retrospectives, with practical experience using Jira for backlog management, tracking, and delivery transparency.
- Demonstrate excellent verbal and written communication skills, with the ability to clearly articulate technical concepts to both technical and non‑technical stakeholders.
- Collaborate effectively with cross‑functional teams, including application engineers, DevOps, SRE, security, platform, and architecture teams, to deliver cohesive and well‑integrated solutions.
- Exhibit a strong sense of ownership and self‑motivation, proactively identifying risks, gaps, and improvement opportunities, and driving them to resolution without the need for close oversight.
- Demonstrate a passion for innovation and continuous learning, staying current with cloud‑native technologies, observability tooling, and industry best practices, and applying relevant advancements to improve platform reliability and efficiency.
Deploy and operate applications and monitoring components on Kubernetes, with hands‑on experience observing containerized workloads, pods, services, and cluster‑level behavior.
- Integrate observability into CI/CD and DevOps pipelines using Git and DevOps toolchains, enabling early detection of issues through automated checks, deployment health validation, and post‑release monitoring.
- Apply strong understanding of microservices architecture, event‑driven design, and distributed systems principles to ensure observability is designed‑in rather than retrofitted.
- Demonstrate hands‑on understanding of stream‑processing frameworks such as Apache Flink, with the ability to observe, monitor, and troubleshoot stateful stream‑processing jobs, including latency, back‑pressure, checkpointing, and failure recovery.
- Implement and operate observability using the Grafana LGTM stack, including: Loki, Tempo, Mimir.
- Support production systems, contributing to incident response, root‑cause analysis, and continuous improvement through enhanced instrumentation and operational feedback loops.
- Hands‑on experience with: Docker Desktop for local containerized development, IntelliJ IDEA or equivalent enterprise IDEs and Secure source control, branching strategies, and automated quality gates.
You may be assessed on key critical skills relevant for success in role, such as risk and controls, change and transformation, business acumen, strategic thinking and digital and technology, as well as job-specific technical skills.
This role is based out of Bengaluru.
Purpose of the role
To design, develop and improve software, utilising various engineering methodologies, that provides business, platform, and technology capabilities for our customers and colleagues.
Accountabilities
- Development and delivery of high-quality software solutions by using industry aligned programming languages, frameworks, and tools. Ensuring that code is scalable, maintainable, and optimized for performance.
- Cross-functional collaboration with product managers, designers, and other engineers to define software requirements, devise solution strategies, and ensure seamless integration and alignment with business objectives.
- Collaboration with peers, participate in code reviews, and promote a culture of code quality and knowledge sharing.
- Stay informed of industry technology trends and innovations and actively contribute to the organization’s technology communities to foster a culture of technical excellence and growth.
- Adherence to secure coding practices to mitigate vulnerabilities, protect sensitive data, and ensure secure software solutions.
- Implementation of effective unit testing practices to ensure proper code design, readability, and reliability.
Analyst Expectations
- To perform prescribed activities in a timely manner and to a high standard consistently driving continuous improvement.
- Requires in-depth technical knowledge and experience in their assigned area of expertise
- Thorough understanding of the underlying principles and concepts within the area of expertise
- They lead and supervise a team, guiding and supporting professional development, allocating work requirements and coordinating team resources.
- If the position has leadership responsibilities, People Leaders are expected to demonstrate a clear set of leadership behaviours to create an environment for colleagues to thrive and deliver to a consistently excellent standard. The four LEAD behaviours are: L – Listen and be authentic, E – Energise and inspire, A – Align across the enterprise, D – Develop others.
- OR for an individual contributor, they develop technical expertise in work area, acting as an advisor where appropriate.
- Will have an impact on the work of related teams within the area.
- Partner with other functions and business areas.
- Takes responsibility for end results of a team’s operational processing and activities.
- Escalate breaches of policies / procedure appropriately.
- Take responsibility for embedding new policies/ procedures adopted due to risk mitigation.
- Advise and influence decision making within own area of expertise.
- Take ownership for managing risk and strengthening controls in relation to the work you own or contribute to. Deliver your work and areas of responsibility in line with relevant rules, regulation and codes of conduct.
- Maintain and continually build an understanding of how own sub-function integrates with function, alongside knowledge of the organisations products, services and processes within the function.
- Demonstrate understanding of how areas coordinate and contribute to the achievement of the objectives of the organisation sub-function.
- Make evaluative judgements based on the analysis of factual information, paying attention to detail.
- Resolve problems by identifying and selecting solutions through the application of acquired technical experience and will be guided by precedents.
- Guide and persuade team members and communicate complex / sensitive information.
- Act as contact point for stakeholders outside of the immediate function, while building a network of contacts outside team and external to the organisation.
All colleagues will be expected to demonstrate the Barclays Values of Respect, Integrity, Service, Excellence and Stewardship – our moral compass, helping us do what we believe is right. They will also be expected to demonstrate the Barclays Mindset – to Empower, Challenge and Drive – the operating manual for how we behave.