Senior Engineer - Cloud Solutions and Operations
Dialog
2024-10-28
Job Purpose:
We are seeking a highly skilled and motivated Senior Engineer - Cloud Solutions and Operations to join our dynamic team. This role is pivotal in ensuring the seamless operation of our cloud solutions, focusing on implementation, optimization, and information security best practices. The ideal candidate will provide cross-functional support for troubleshooting and problem-solving across various business applications and solutions.
The Job:
Conduct regular health checks on all critical applications deployed across public cloud, private cloud, and on-premises systems, focusing on uptime, stability, and performance.
Collaborate with the in-house application operations team, vendors, and hardware support teams to address issues promptly and effectively.
Proactively identify and resolve issues affecting system, application, and database performance.
Provide advice and guidance on improvements to maximize the availability of critical business systems.
Participate in system requirements gathering and design reviews, and raise concerns regarding functionality, performance, and reliability.
Coordinate with development teams in a DevOps model to foster a continuous integration environment.
Support users of cloud services in resolving failures, performance issues, and connectivity problems.
Conduct problem management reviews to identify known issues and implement solutions.
Participate in daily huddles to identify improvement opportunities.
Use scripting (PowerShell, Perl) to enhance cloud operations.
Propose and create system design models, specifications, diagrams, and charts to guide development teams.
Improve team efficiency and performance through automation of processes where feasible.
Create efficient automated deployment workflows to support development through testing and staging.
Maintain advanced knowledge of Linux OS and troubleshoot issues in cloud solutions and virtual environments using native or third-party tools.
Review dashboards, system/application logs, and custom monitoring tools regularly.
Design and implement disaster recovery processes.
Support the company’s growth into public cloud services and maintain operational processes, tools, and documentation.
Collaborate with Security Operations and Information Security teams to manage security in public cloud infrastructure.
Manage and support public cloud production, testing, and development environments.
Drive automation initiatives to streamline administration and prevent recurring issues.
Keep stakeholders updated on relevant areas and new technologies.
Provide technical expertise and advice regarding the development and execution of in-house applications for cloud readiness.
Collaborate with developers, management, and vendors to ensure secure application deployment and achievement of business objectives.