MUFG Union Bank Jobs

Mobile mufg Logo

Job Information

MUFG Kubernetes Site Reliability Engineer - VP in London, United Kingdom

Do you want your voice heard and your actions to count?

Discover your opportunity with Mitsubishi UFJ Financial Group (MUFG), the 7th largest financial group in the world. Across the globe, we’re 160,000 colleagues, striving to make a difference for every client, organization, and community we serve. We stand for our values, building long-term relationships, serving society, and fostering shared and sustainable growth for a better world.

With a vision to be the world’s most trusted financial group, it’s part of our culture to put people first, listen to new and diverse ideas and collaborate toward greater innovation, speed and agility. This means investing in talent, technologies, and tools that empower you to own your career.

Join MUFG, where being inspired is expected and making a meaningful impact is rewarded.


  • Site Reliability Engineering are responsible for delivering continuous improvement, automation and self-service offerings to operational teams across Bank EMEA and Securities International






  • Responsible for the reliability and efficiency of infrastructure through the delivery of common, repeatable tools and processes that greatly reduce the amount of toil operations must perform

  • Member of L3 Engineering team providing subject matter expertise and ultimate escalation



  • Develop software to make infrastructure services self-managing and self-service dashboards.

  • Deliver continuous service improvement by developing Infrastructure as Code

  • Eliminate manual, repetitive, automatable, tactical tasks that are devoid from value

  • Improve system performance, make effective use of resources, distribute load and reduce latency

  • Identify SLO’s (Service Level Objectives) to meet availability and latency objectives

  • Develop pro-active monitoring solutions that alert on symptoms and not just on outages

  • Perform detailed root cause analysis (RCA’s) on incidents and outages to prevent future

  • Partner with development teams to improve services via rigorous testing and release procedures

  • Identify technical debt and partner with application teams to build remediation plans

  • Develop standard operational procedures and produce effective documentation

  • Analyse workloads and devise suitable cloud migration strategies where appropriate

  • Ensure all project / investment workloads are delivered according to plans and budget defined

  • Liaise with Infrastructure Control and IT Risk teams to satisfy internal and external audit requests

  • Deputise for team lead when required to do so and act-up accordingly

  • Identify cost saving and optimisation opportunities across the group

  • Build strong working relationships across the organisation

  • Adhere to the core values of the bank


  • Perform daily health and compliance checks for all systems as required

  • Ensure all systems are backed up successfully and any issues are promptly resolved

  • Validate monitoring alerts and batch job failures are detected promptly and satisfactorily resolved

  • Ensure sufficient capacity is available to accommodate drive growth

  • Respond to emails sent to the team distribution list / mailboxes in a timely manner

  • Handle incidents and requests with efficiency and a “customer first” mindset

  • Maintain infrastructure in a highly available, reliable, secure and performant manner

  • General Server / Database / Virtualisation Administration maintenance activities

  • Provide technical support to application support and development teams

  • Provide consultancy to application support and development teams

  • Take part in On-Call & weekend work rotation; triaging and addressing production issues as they arise



  • Exceptional skills in Docker/Kubernetes deployment and configuration, scaling and management of containerized applications.

  • Excellent skills in managing, performance optimisation of complex Prometheus, Influxdb and Grafana monitoring stack.

  • Excellent skills in writing/maintaining Grafana Dashboard using PromQL, InfluxQL/Flux.

  • Experience in distributed technologies like Rook, Ceph, Noobaa, Trino, MariaDB Xpand, Dremio, Kibana, KX platform

  • Experience in CI/CD/CT platforms like Git, Ansible, Terraform and TeamCity

  • Serena Deployment Automation (SDA) and Jenkins

  • “Infrastructure as Code” Principles and practices.

  • “Continuous Integration (CI) and Continuous Development (CD)” Principles and practices

  • Agile, Site Reliability Engineering (SRE) and DevOps Principles and practices

  • Scripting and programming languages such as PowerShell, Python, Bash and C#

  • Fluent in Backup and Recovery processes and procedures

  • Advanced knowledge of Clustering, High-Availability, Replication and Disaster Recovery techniques

  • Ability to tune Network, Storage, Server and Virtualisation layers for optimal performance and reliability

  • Excellent Performance Tuning skills, in-depth knowledge of system internals

  • Ability to interpret and implement CIS security hardening recommendations in a controlled manner

  • Acute awareness of Security and Auditing requirements in a regulated environment

Highly Desirable:

  • RHEL, Oracle Linux, Oracle Solaris and related technologies

  • Microsoft Windows Server and related technologies

  • Microsoft SQL Server, Oracle, Sybase ASE, MongoDB and Snowflake

  • Active Directory, LDAP and Kerberos

  • IBM Tivoli / Netcool

  • Nutanix HCI and VMWare ESX

  • Networking Protocols (TCP/IP, DNS, DHCP, VLAN’s)

  • Cloud computing - IaaS, PaaS and SaaS offerings across Azure, AWS, GCP and Oracle

  • Knowledge of data security governance and regulations such as GDPR and SOX


  • Dell EMC PowerStore (SAN) and Isilon (NAS)

  • Rubrik, EMC Networker, Data Domain and IBM Tivoli Storage Manager

  • CyberArk

  • Splunk

  • Qualys

  • Cisco Tetration

  • ServiceNow

  • JIRA and Confluence


  • Excellent communication and interpersonal skills

  • Ability to handle pressure during outages and systematically resolve issues

  • Excellent problem-solving skills

  • Results driven, with a strong sense of accountability

  • A proactive, motivated approach

  • The ability to operate with urgency and prioritise work accordingly

  • A structured and logical approach to work

  • Attention to detail and accuracy

  • Ability to perform well in a pressurised environment

  • Ability to manage constructive conflict effectively

  • The ability to manage large workloads and tight deadlines

  • Able to communicate complex technical concepts to non-technical persons at all levels


The role holder will be assessed in accordance with their employing entity’s performance framework and process with relevant input obtained from the dual hatting entity as relevant.

As duties and responsibilities change, the job description will be reviewed and emended in consultation with the role holder. The role holder will carry out other duties as are within the scope, spirit and purpose of the role as requested by their line manager or Department Head.


  • The role holder will have responsibilities for both MUFG Bank and MUFG Securities EMEA plc.

  • The role holder will be required to perform their duties and responsibilities on an entity neutral basis, without favour.

  • The role holder is required to follow regulatory requirements applicable to ensure each business is appropriately supported and to maintain the legal entity integrity of each of MUFG Bank and MUS.

  • Working terms are dictated by functional mandates, the terms of the Dual-Hat Arrangement Agreement in place between MUFG Bank and MUFG Securities EMEA plc and any other relevant agreements entered into between MUFG Bank and MUFG Securities EMEA plc.

  • The role holder will have responsibility for identifying and resolving where there may be a difference or conflict in needs between MUFG Bank and MUFG Securities EMEA plc, escalating to their manager where required.

We are open to considering flexible working requests in line with organisational requirements.

MUFG is committed to embracing diversity and building an inclusive culture where all employees are valued, respected and their opinions count. We support the principles of equality, diversity and inclusion in recruitment and employment, and oppose all forms of discrimination on the grounds of age, sex, gender, sexual orientation, disability, pregnancy and maternity, race, gender reassignment, religion or belief and marriage or civil partnership.

We make our recruitment decisions in a non-discriminatory manner in accordance with our commitment to identifying the right skills for the right role and our obligations under the law.

At MUFG, our colleagues are our greatest assets. Our Culture Principles provide a roadmap for how each of our colleagues must think and act to become more client-obsessed, inclusive and innovative. They reflect who we are, who we want to be and what we expect from one another. We are excited to see you take the next step in exploring a career with us and encourage you to spend more time reviewing them!

Our Culture Principles

  • Client Centric

  • People Focused

  • Listen Up. Speak Up.

  • Innovate & Simplify

  • Own & Execute