Director FHP Site Reliability Engineering
Company: Wells Fargo
Location: Irving
Posted on: January 25, 2023
Job Description:
About this role: Wells Fargo is seeking a Technology Director to
lead the stability and sustainability of our Technology
Infrastructure (TI) Foundational Hosting Platforms (FHP). This role
will design, create and manage the Site Reliability Engineering
(SRE) practice integrating compute, storage, mainframe, database,
middleware and application hosting activities and functions.
In this role, you will:
- Manage a team of engineering managers and engineering
leads
- Provide oversight of the software craftsmanship, security,
availability, resilience, and scalability of solutions developed by
the teams or third-party providers
- Identify financial management and strategic resourcing needs
- Champion TI's assessment to baseline the SRE environment and
organization
- Create strategies to align development and operations through
shared goals and balance between functional and nonfunctional
requirements
- Support FHP's development and operations teams in defining each
stakeholder's service-level indicators (SLIs) that reflect
reliability (e.g., availability, response time, latency,
throughput)
- Lead the definition of how service-level objectives (SLOs) are
calculated (formula), including how data is collected, aggregated,
analyzed, and reported
- Lead FHP's SRE team resource building (acquire, develop,
manage) through the forming, storming, norming, performing, and
adjourning stages
- Build and organize the team with a mix of established domain
knowledge and fresh viewpoints: a broad talent mix that is small,
fast, and nimble, with authority and reduced bureaucracy
- Develop practical SRE principles, using automation to scale with
load and to balance operational toil against improvements
- Define and measure production availability, accounting for known
downtime and service-level outages
- Introduce continuous improvement for infrastructure services
through integration, automation, standardization, and development
of tools
- Provide support for SRE-onboarded applications by identifying
systemic issues, conducting blameless postmortems and root cause
analysis, and remediating issues strategically, leveraging the
Product Operating Model
Required Qualifications:
- 8+ years of Technology Strategic Leadership experience, or
equivalent demonstrated through one or a combination of the
following: work experience, training, military experience,
education
- 4+ years of Management experience
Desired Qualifications:
- BS degree or higher in Computer Science, Engineering or related
field
- 3+ years of Incident Management System experience
- 2+ years of Configuration Management Tools experience
- 5+ years of experience in/as DevOps/Site Reliability
Engineer
- Working knowledge of Cloud, API and NoSQL databases
- 5+ years of build-deploy automation and configuration
experience within Linux and Unix environments
- Application development and implementation experience and
understanding
- Excellent verbal, written, and interpersonal communication
skills
- Experience with operating systems and platforms, such as AWS
Lambda, EC2, Linux, Windows, Kubernetes, GCP, Pivotal Cloud Foundry,
Azure
- Experience with automation and CI/CD frameworks, such as
Jenkins, GitLab, Artifactory, Ansible, Puppet, Apigee
- Experience with programming languages, such as Python, Java,
.NET, JavaScript, Go, C/C++
- Working knowledge of the SRE toolchain: AIOps, BigPanda, Amelia,
API Automation, Gremlin, ServiceNow, PagerDuty, Symphony/Slack, New
Relic
- Experience with Observability / Monitoring technologies
including one or more of the following is desired: Elastic
Stack/ELK, Grafana, Prometheus, AppDynamics, Splunk, Kafka,
Datadog, CloudWatch
We Value Diversity
At Wells Fargo, we believe
in diversity, equity and inclusion in the workplace; accordingly,
we welcome applications for employment from all qualified
candidates, regardless of race, color, gender, national origin,
religion, age, sexual orientation, gender identity, gender
expression, genetic information, individuals with disabilities,
pregnancy, marital status, status as a protected veteran or any
other status protected by applicable law.
Employees support our
focus on building strong customer relationships balanced with a
strong risk-mitigating and compliance-driven culture which firmly
establishes those disciplines as critical to the success of our
customers and company. They are accountable for execution of all
applicable risk programs (Credit, Market, Financial Crimes,
Operational, Regulatory Compliance), which includes effectively
following and adhering to applicable Wells Fargo policies and
procedures, appropriately fulfilling risk and compliance
obligations, timely and effective escalation and remediation of
issues, and making sound risk decisions. There is emphasis on
proactive monitoring, governance, risk identification and
escalation, as well as making sound risk decisions commensurate
with the business unit's risk appetite and all risk and compliance
program requirements.
Candidates applying to job openings posted in the US: All qualified
applicants will receive consideration for
employment without regard to race, color, religion, age, sex,
sexual orientation, gender identity, national origin, disability,
or status as a protected veteran.
Candidates applying to job openings posted in Canada: Applications
for employment are
encouraged from all qualified candidates, including women, persons
with disabilities, aboriginal peoples and visible minorities.
Accommodation for applicants with disabilities is available upon
request in connection with the recruitment process.
Keywords: Wells Fargo, Irving, Director FHP Site Reliability Engineering, Executive, Irving, Texas