Data Center Lead Engineer
-
Other
Job Description
Responsibilities
Take operational ownership of the New Jersey/Texas data center
infrastructure and its associated components and services (Wintel servers,
Citrix, Linux, Storage, backups, etc.).
Take
ownership of our cloud operations in AWS and Azure.
Participate in enterprise level projects from a data center and
cloud perspective.
Perform necessary analysis of the data center infrastructure and
contribute to the design of the architecture, integration and installation of
the DC infrastructure.
Perform necessary analysis of the cloud infrastructure and
contribute to the design of the architecture, integration and implementation.
Manage relationships and work with 3rd party vendors to
escalate issues when appropriate.
Ensure capacity planning is in place and reviewed on a regular
basis with the Team Leader and Fareportal Management. Coordinate and work collaboratively with
other departments in order to understand and meet business requirements.
Create and maintain documentation for all Fareportal
infrastructure environments, including data center, cloud, on premise,
etc.
Contribute to disaster recovery plans and initiates routine review
and testing of DR plans where applicable.
Participate in evaluation of 3rd party vendor
proposals.
Take proactive approach in provisioning extra capacity to meet
needs or determines more analysis is required to determine types of usage.
Desire to work in a fast-paced, entrepreneurial environment
Required Skills: (Hard Skills ex. Coding
languages, GDS systems etc)
Strong troubleshooting skills
Capacity planning
Solution designing
Project deployment and time management
Strong knowledge of data center
technologies (Wintel servers, Linux servers, Citrix, Storage, backups, etc)
Implement,
secure, scale and troubleshoot a global, multi-cloud cloud platform and
products.
Strong cloud operations
experience in AWS and/or Azure
Stay current
on technologies and best practice designs, proper implementations,
high-performance optimizations, and self-service troubleshooting skills.
As a team,
create runbooks, process, policies and “Infrastructure as Code” recipes to
build scalable, fault-tolerant, and resilient networks for product and
operations teams.
Accelerate
the transformation from datacenter-centric design patterns to immutable,
automated, highly-available and a zero-trust cloud-based infrastructure.
With the
team, design cloud-native privileged account management solutions in the cloud
for key management, service account and secrets management, rotation and event
response.
Implement and monitor the security controls and performance
metrics of the cloud network layer, including resiliency checks and failure
prediction.
Experience with automation tools such as Chef, Puppet,
Ansible, SaltStack, etc.
Previous participation in bug-hunting, pen tests,
vulnerability assessments a plus
Good working
knowledge of networking a plus
Willingness to be part of an on-call rotation when needed.
Job Competencies: (Soft
skills ex. Communication skills, behavioral, leadership skills etc)
Strong verbal and written communication
skills. Must be able to communicate with a wide variety of audiences, both
business and technical as well as executive through team levels;
Establishes and maintains effective
management and peer-level relationships;
Ability to manage cross-functional, matrixed
organizations
Strong program/project management skills;
ability to prioritize tasks and work with multiple priorities;
Proven analytical, problem-solving and
research skills; attention to detail;
Ability to work in fast paced, highly
flexible environment
Strong organization and planning skills.
Take operational ownership of the New Jersey/Texas data center
infrastructure and its associated components and services (Wintel servers,
Citrix, Linux, Storage, backups, etc.).
Take
ownership of our cloud operations in AWS and Azure.
Participate in enterprise level projects from a data center and
cloud perspective.
Perform necessary analysis of the data center infrastructure and
contribute to the design of the architecture, integration and installation of
the DC infrastructure.
Perform necessary analysis of the cloud infrastructure and
contribute to the design of the architecture, integration and implementation.
Manage relationships and work with 3rd party vendors to
escalate issues when appropriate.
Ensure capacity planning is in place and reviewed on a regular
basis with the Team Leader and Fareportal Management. Coordinate and work collaboratively with
other departments in order to understand and meet business requirements.
Create and maintain documentation for all Fareportal
infrastructure environments, including data center, cloud, on premise,
etc.
Contribute to disaster recovery plans and initiates routine review
and testing of DR plans where applicable.
Participate in evaluation of 3rd party vendor
proposals.
Take proactive approach in provisioning extra capacity to meet
needs or determines more analysis is required to determine types of usage.
Desire to work in a fast-paced, entrepreneurial environment
Required Skills: (Hard Skills ex. Coding
languages, GDS systems etc)
Strong troubleshooting skills
Capacity planning
Solution designing
Project deployment and time management
Strong knowledge of data center
technologies (Wintel servers, Linux servers, Citrix, Storage, backups, etc)
Implement,
secure, scale and troubleshoot a global, multi-cloud cloud platform and
products.
Strong cloud operations
experience in AWS and/or Azure
Stay current
on technologies and best practice designs, proper implementations,
high-performance optimizations, and self-service troubleshooting skills.
As a team,
create runbooks, process, policies and “Infrastructure as Code” recipes to
build scalable, fault-tolerant, and resilient networks for product and
operations teams.
Accelerate
the transformation from datacenter-centric design patterns to immutable,
automated, highly-available and a zero-trust cloud-based infrastructure.
With the
team, design cloud-native privileged account management solutions in the cloud
for key management, service account and secrets management, rotation and event
response.
Implement and monitor the security controls and performance
metrics of the cloud network layer, including resiliency checks and failure
prediction.
Experience with automation tools such as Chef, Puppet,
Ansible, SaltStack, etc.
Previous participation in bug-hunting, pen tests,
vulnerability assessments a plus
Good working
knowledge of networking a plus
Willingness to be part of an on-call rotation when needed.
Job Competencies: (Soft
skills ex. Communication skills, behavioral, leadership skills etc)
Strong verbal and written communication
skills. Must be able to communicate with a wide variety of audiences, both
business and technical as well as executive through team levels;
Establishes and maintains effective
management and peer-level relationships;
Ability to manage cross-functional, matrixed
organizations
Strong program/project management skills;
ability to prioritize tasks and work with multiple priorities;
Proven analytical, problem-solving and
research skills; attention to detail;
Ability to work in fast paced, highly
flexible environment
Strong organization and planning skills.