Kubernetes Engineer
Big Bank Funding. FinTech Thinking.
Our technology teams in the UK work closely with HSBC’s global businesses to help design and build digital services that allow our millions of customers around the world to bank quickly, simply and securely. We also run and manage the IT infrastructure, data centres and core banking systems that power the world’s leading international bank.
Our multi-disciplined teams include DevOps engineers, IT architects, front and back end developers, infrastructure specialists and cyber experts, as well as project and programme managers.
We work in small, agile DevOps teams with colleagues around the world.
Following extensive investment across our Technology and Digital domains, we are currently seeking an experienced Kubernetes Engineer to join HSBC Technology.
Overview:
The Internal Kubernetes Platform (IKP) is a Containers as a Service (CaaS) cloud service model that allows users to deploy and manage containers, applications and clusters through container-based virtualization.
IKP provides an easy way to set up a container workload. Orchestration, which automates key IT functions, is an essential quality of CaaS technology. CaaS is often considered to be a subset of IaaS (Infrastructure as a Service), but it uses containers as its fundamental resource rather than bare metal systems or virtual machines.
IKP is a hosting platform used by our internal application development teams to host their non-production environments. It also hosts the production environments that deliver services to HSBC customers, and some of these applications are business-critical systems (Tier-0).
Role Responsibilities
The Internal Kubernetes Platform (IKP) team follows a Site Reliability Engineering (SRE) model, and this role will work as an Engineer for IKP. The platform is based on GKE Kubernetes clusters, and the role also covers the management of bundled services such as Istio and Prometheus. Members of the SRE team are expected to work closely with L1 support, Services Engineering, and the IKP Core team. Team members should address incidents and resolve issues, while striving to improve monitoring and build automation.
Objectives of this Role
Run the IKP clusters by monitoring availability and taking a holistic view of system health
Build tools and automation to manage platform infrastructure and services
Improve reliability, quality, and time to upgrade cluster and service versions
Measure and optimize system performance and resource utilization, and plan for future capacity
Build dashboards and visualizations to graph system health
Define system alerts and automate responses where possible
Provide operational support and engineering for multiple software development teams
Work with the senior team members on driving the platform forward to align with the Bank’s 2027 goals
Work closely with application teams that consume the IKP hosting platform; note that the IKP team is not responsible for the application development of services deployed onto IKP
Daily and Monthly Responsibilities
Gather and analyze metrics from cluster components and services to assist in performance tuning and fault finding
Partner with Core Engineering and Services Engineering teams to improve services through rigorous testing and release procedures
Participate in system design consulting, platform management, and capacity planning
Create sustainable systems and services through automation and uplifts
Balance feature development speed and reliability with well-defined service level objectives
Confirm the health of clusters to assist developers when they have issues deploying a new workload
Be proactive and identify opportunities where you and the team can improve the customer service, reliability, and scalability of IKP