Location: Palo Alto, CA
Responsibilities:
- Develop and manage a 24x7 technical organization
- Implement and expand the monitoring tools for rapid identification and resolution of network/application service issues
- Develop code to automate repetitive tasks to allow the team to scale with Facebook's rapid growth
- Identify and triage all outage related events
- Facilitate communication, coordinate escalation, work with subject matter
- Troubleshoot issues with hardware, software, applications and network
- Automate and streamline processes
- Track issues, run reports and escalate issues
Requirements:
- Management and technical experience with the desire to roll up your sleeves when required to get the job done
- Develop and document repeatable operational procedures
- Planning and executing routine network maintenances with a premium on minimizing risk and application downtime
- Certification such as CCNA, RHCT or equivalent experience
- Working knowledge of TCP/IP, Linux and LAMP
- Experience working with network management systems and other industry standard NOC tools
- Excellent verbal and written communication skills
- Prior experience working in a NOC environment required
No comments:
Post a Comment