- Facilitate process hazard analysis (PHA, HAZOPs, what-if/checklist, FMEA, and human factors
- Support asset integrity, ensure equipment is designed to code, inspected, tested, maintained
- Manage change program, review changes, assess safety impacts, update safeguards
- – Design reliability architecture for distributed services, ensuring fault tolerance.
- – Build monitoring and observability systems to enhance operational awareness.
- – Lead incident response efforts, driving improvements to reduce service disruptions.
- – Design and implement secure, automated AWS or GCP cloud infrastructures.
- – Develop and maintain CI/CD pipelines to streamline build, test, and deployment.
- – Monitor and troubleshoot data infrastructure to ensure high availability and performance.