De belangrijkste informatie in één oogopslagType dienstverband | Hybrid, Voltijd |
---|
Soort contract | Ongelimiteerd |
---|
Werkmodel | Kantoor aan huis niet mogelijk |
---|
Bedrijf | Uniper |
---|
Taak-ID | 88606 |
---|
Contact us | career@uniper.energy |
---|
Our Platform Engineering Team in Düsseldorf is looking for YOU! Your responsibilitiesPosition Summary: You will be the technical lead for site reliability for our algorithmic trading and other key platforms—owning reliability, performance, and operational excellence end-to-end. you’ll set the technical direction, drive standards, mentor engineers, and partner with quants, traders, development, and other teams to deliver geo-redundant, containerized, and compliant trading systems with near-zero downtime. Your Responsibilities: - Reliability ownership: Define and drive SLOs/SLIs, error budgets, and golden signals for latency-sensitive algo-trading services; lead incident response and postmortems with a blameless culture.
- Production architecture: Design and evolve geo-redundant, active-active/active-passive topologies across regions and availability zones, including failover, data replication, and disaster recovery (RTO/RPO).
- Kubernetes at scale: Architect, harden, and operate AKS-based multi-cluster environments (multi-tenant, multi-region), including networking, security, autoscaling, node pools, and upgrade strategies.
- Infrastructure as Code: Own Terraform blueprints and Ansible automations for everything from base images to cluster add-ons, ensuring idempotent, policy-guarded, and auditable changes.
- Automation & Efficiency: Build progressive delivery (blue/green, canary) pipelines with gated rollouts and automated rollback for trading microservices, adapters, market data, and execution gateways.
- Observability & performance: Implement end-to-end tracing (OpenTelemetry), metrics, logs, and synthetic probes; lead capacity planning, perf tests, and p99/p999 latency optimization.
- Runtime safety: Enforce runtime security, secrets management, image hygiene, and compliance controls integrated “shift-left” into build and deploy workflows.
- Algo-trading runtime: Operate and optimize Deltix-based components (Timebase DB, Ember, Strategy Server) in containerized, high-availability setups. Own the corresponding Helm charts.
- Collaboration & leadership: Mentor SREs/DevOps/Developers, guide design reviews, and align with Platform, Security, and Trading stakeholders on priorities and roadmaps.
- Innovation: Promote a culture of innovation by staying up to date with new technologies and integrating useful advancements into the commercial area.
Your profileYour profile: - A degree in Computer Science, Mathematics, Engineering or other related discipline
Experience: - 10+ years in SRE/Platform/Infrastructure roles
- Hands-on experience running complex, low-latency algo-trading or market-facing systems in production
- 3+ years of experience as a DevOps/SRE with a clear observability focus
- 3+ years of experience as Software Developer
- Expert with Kubernetes (AKS preferred), including cluster lifecycle, networking (CNI, Ingress, eBPF), HPA/VPA, node autoscaling, PodDisruptionBudgets, and surge/zero-downtime upgrades
- Deep Azure experience: VNet design, Private Link/Endpoints, peering, routing, Managed Identity/Entra ID, Key Vault, Storage, Azure Monitor/Log Analytics, Front Door/Traffic Manager, Load Balancers, App Gateway, API Management
- Terraform (expert): modular design, state management, workspaces, policies (OPA/Sentinel), and pipeline integration
- Containers & supply chain: Docker/OCI, image scanning/signing, SBOMs, and build reproducibility
- Observability: Prometheus, Grafana, alerting design; OpenTelemetry tracing; log pipelines and retention strategies
- Deltix (required): hands-on operating and tuning Deltix components (e.g., TimeBase/QuantOffice/Ember) in containerized, HA contexts
- Strong networking (L4/L7, TLS/mTLS, DNS, BGP basics), Linux internals, and performance tuning for low-latency services
- Proven track record of geo-redundant architectures, DR planning/testing
- Experience with market data distribution (multicast/unicast), FIX/OUCH/ITCH, and exchange connectivity
- Fluency in GitHub Actions or similar CI/CD and at least one programming language (e.g., Python or C#) for tooling and diagnostics
- Excellent communication; ability to lead through influence
- Fluent in English; German advantageous
|