Social Discovery Group (SDG) is the 3rd largest social discovery company in the world, uniting 60+ brands with 500 million users. We solve the problems of loneliness, isolation, and disconnection by transforming virtual intimacy into the new normal. Our portfolio includes online communication platforms focusing on AI, game mechanics, and video streaming - Dating.com, DateMyAge, Cupid Media, Dil Mil, Kiseki, and others.
SDG invests in IT startups around the world. Our investments include Open AI, Patreon, Flo, Clubhouse, Woebot, Flure, Astry, Coursera, Academia.edu, and many others.
We bring together a team of like-minded people and IT professionals specializing in the creation and development of globally impactful social discovery products. Our international team of 1200 professionals and digital nomads works all over the world.
Our teams of digital nomads work remotely from Cyprus, Malta, the USA, Armenia, Georgia, Kazakhstan, Montenegro, Poland, Latvia, Serbia, Spain, Portugal, UAE, Israel, Turkey, Thailand, Indonesia, Japan, Hong Kong, Australia and many other locations.
In August 2024, we achieved Great Place to Work US Certification™! This achievement reflects our core belief that a truly exceptional workplace is built on trust, pride, and camaraderie—not just great perks.
We are looking for a Head of IT Monitoring Team to lead two teams—24/7 Duty Admins (L1) and Technical Monitoring Specialists—and to design, develop, implement, and operate a comprehensive monitoring service that ensures stability, performance, and security of our IT infrastructure and products.
Your main tasks will be:
Provide strategic leadership, set team goals aligned with company objectives, and own the roadmap for advancing monitoring capabilities.
Build, operate, and evolve the monitoring stack (Zabbix, Grafana, Prometheus and others) with strong support for microservices and cloud monitoring (AWS CloudWatch / Azure Monitor / Google Cloud Monitoring).
Ensure timely detection and resolution of alerts, increasing the share of incidents resolved by the L1 duty team without escalation; establish procedures based on ITIL and manage SLAs.
Collaborate with IT/product teams to smoothly transition new monitoring solutions into production, and maintain clear operational documentation and runbooks.
Develop people: upskill teammates, define a transparent career ladder, and prepare regular reports with operational metrics and team results.
We expect from you:
Proven leadership running monitoring/observability teams in companies with high-loaded web systems.
Strong knowledge of monitoring protocols, tools (Zabbix, Grafana, Prometheus), methodologies, and best practices; proficiency in monitoring microservices.
Hands-on experience with RCA practices for critical events and with cloud monitoring (CloudWatch, Azure Monitor, Google Cloud Monitoring).
Excellent communication skills and responsibility; experience building teams, developing people, and giving regular feedback; English B2+.
Nice to have: ITIL Foundation certification; familiarity with AIOps and AI-driven monitoring; full-stack development experience to build internal tools, integrations, and dashboards.
What do we offer:
Sounds good? Join us now!
Sounds good? Join us now!