
Training Wrap-Up

Training Summary

Congratulations on completing the xScaler Observability Platform Training! The sections below summarise what you covered over the two days.


What You Learned

Day 1

Session 1 — Platform Introduction

  • xScaler two-tier architecture: control plane + edge data plane
  • Component roles: portal-api, agent-api, Envoy, proxy-auth, xMetrics, xLogs, xTraces
  • Authentication: Cognito → JWT (30-min TTL) for users, SHA-256 API keys for collectors
  • Multi-tenant isolation via X-Scope-OrgID header
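The authentication and isolation points above can be illustrated with a minimal sketch. It assumes that "SHA-256 API keys" means the platform stores only a SHA-256 digest of each collector key, and it uses a hypothetical in-memory `KEY_STORE` and hypothetical function names; it is not the actual proxy-auth implementation. It shows the two ideas from this session: an unknown key is denied, and the tenant header is always set by the auth layer rather than taken from the client.

```python
import hashlib
import hmac
from typing import Optional

# Hypothetical key store: SHA-256 digest of the API key -> tenant ID.
# Only the digest is kept, never the plaintext key (an assumption).
KEY_STORE = {
    hashlib.sha256(b"example-collector-key").hexdigest(): "tenant-a",
}


def authorize(api_key: str) -> Optional[str]:
    """Return the tenant ID for a known key, or None to deny."""
    digest = hashlib.sha256(api_key.encode("utf-8")).hexdigest()
    for stored_digest, tenant in KEY_STORE.items():
        # Constant-time comparison avoids leaking digest prefixes.
        if hmac.compare_digest(digest, stored_digest):
            return tenant
    return None


def build_upstream_headers(client_headers: dict, api_key: str) -> Optional[dict]:
    """Drop any client-supplied X-Scope-OrgID and set it from the key lookup."""
    tenant = authorize(api_key)
    if tenant is None:
        return None  # deny: nothing is forwarded
    headers = {
        name: value
        for name, value in client_headers.items()
        if name.lower() != "x-scope-orgid"
    }
    headers["X-Scope-OrgID"] = tenant
    return headers


if __name__ == "__main__":
    spoofed = {"X-Scope-OrgID": "tenant-b", "Content-Type": "application/x-protobuf"}
    print(build_upstream_headers(spoofed, "example-collector-key"))
    print(build_upstream_headers(spoofed, "wrong-key"))
```

In the usage at the bottom, the client tries to claim `tenant-b`, but the forwarded header carries the tenant derived from the key.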

Session 2 — OpenTelemetry Fundamentals

  • OTLP protocol, OTel Collector pipeline: receivers → processors → exporters
  • Agent Mode vs Gateway Mode deployment patterns
  • OpAMP: remote config push for OTel agents
  • memory_limiter and batch processor best practices
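As a reminder of the processor ordering discussed in this session, the sketch below checks one pipeline definition. It follows the general OpenTelemetry Collector guidance that `memory_limiter` should be the first processor and `batch` should come after it; the function name and the dictionary layout are illustrative assumptions, not part of the xScaler tooling.

```python
def check_processor_order(pipeline: dict) -> list:
    """Return a list of warnings for a single pipeline definition."""
    processors = pipeline.get("processors", [])
    warnings = []

    if "memory_limiter" not in processors:
        warnings.append("memory_limiter is missing")
    elif processors[0] != "memory_limiter":
        warnings.append("memory_limiter should be the first processor")

    if "batch" not in processors:
        warnings.append("batch is missing")
    elif (
        "memory_limiter" in processors
        and processors.index("batch") < processors.index("memory_limiter")
    ):
        warnings.append("batch should come after memory_limiter")

    return warnings


if __name__ == "__main__":
    # Pipelines shaped like the receivers -> processors -> exporters model.
    good = {
        "receivers": ["otlp"],
        "processors": ["memory_limiter", "batch"],
        "exporters": ["otlphttp"],
    }
    bad = {
        "receivers": ["otlp"],
        "processors": ["batch", "memory_limiter"],
        "exporters": ["otlphttp"],
    }
    print(check_processor_order(good))
    print(check_processor_order(bad))
```

A check like this could run in CI against your collector configuration before it is rolled out to agents.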

Session 3 — Data Collection Architecture

  • Prometheus scrape (pull) vs OTLP push collection models
  • Envoy ext_authz pattern and the four-listener architecture
  • Cardinality management: what to avoid in metric labels
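To make the cardinality point concrete, here is a rough worst-case estimate of how many time series one metric can produce: the product of the number of distinct values of each label. The label names and counts are made-up examples, and real series counts are usually lower because not every combination occurs.

```python
from math import prod


def estimate_series(label_values: dict) -> int:
    """Worst-case series count for one metric: product of distinct values per label."""
    return prod(label_values.values())


# Hypothetical label sets for a single request-count metric.
bounded = {"service": 20, "status_code": 5, "region": 3}
unbounded = {**bounded, "user_id": 10_000}

if __name__ == "__main__":
    print(estimate_series(bounded))    # 20 * 5 * 3 = 300
    print(estimate_series(unbounded))  # 300 * 10_000 = 3000000
```

Adding a single unbounded label such as a user ID multiplies the series count by the number of users, which is why identifiers of that kind are best kept out of metric labels.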

Day 2

Session 4 — Tenant Setup and Agent Deployment

  • Tenant lifecycle: create, key management, usage monitoring
  • OpAMP enrollment: xse_ token → xag_ per-agent key exchange
  • Config templates with ${secret:NAME} KMS-encrypted references
  • Label-selector assignments and priority-based config routing
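The last point, label-selector assignments with priority-based routing, can be sketched as follows. This is a simplified model, not the agent-api implementation: the `Assignment` class, the exact-match selector semantics, and the rule that the highest priority number wins are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Assignment:
    template: str   # name of the config template to deliver
    selector: dict  # labels an agent must carry to match
    priority: int   # assumption: higher number wins


def matches(selector: dict, agent_labels: dict) -> bool:
    """An agent matches when every selector label equals the agent's label."""
    return all(agent_labels.get(key) == value for key, value in selector.items())


def pick_template(assignments: list, agent_labels: dict) -> Optional[str]:
    """Return the template of the highest-priority matching assignment."""
    candidates = [a for a in assignments if matches(a.selector, agent_labels)]
    if not candidates:
        return None
    return max(candidates, key=lambda a: a.priority).template


if __name__ == "__main__":
    assignments = [
        Assignment("default", {}, 0),  # empty selector matches every agent
        Assignment("prod-base", {"env": "prod"}, 10),
        Assignment("prod-eu", {"env": "prod", "region": "eu"}, 20),
    ]
    print(pick_template(assignments, {"env": "prod", "region": "eu"}))
    print(pick_template(assignments, {"env": "dev"}))
```

An empty selector acts as a catch-all here, so an agent with no more specific match still receives a baseline configuration.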

Session 5 — Grafana Integration

  • Grafana's role: pure visualisation, not storage
  • Three datasources: Prometheus (xMetrics), xLogs, xTraces
  • Cross-signal correlation: trace → log → metric
  • Managed Grafana vs self-managed options

Session 6 — Dashboards, APM and Alerting

  • Four golden signals dashboards: latency, traffic, errors, saturation
  • Distributed tracing with TraceQL and the service map
  • Alert rules, contact points, notification policies, silences

Session 7 — Hands-On Lab

  • Complete end-to-end workflow: tenant → API key → push data → dashboard → alert

Key Concepts Reference

Concept | Quick Summary
--- | ---
Control Plane | portal-api + portal-web + agent-api; manages configuration, not data
Data Plane | Envoy + proxy-auth + xMetrics/xLogs/xTraces; handles all telemetry
X-Scope-OrgID | The tenant namespace header; set by proxy-auth, never trusted from the client
ext_authz | Envoy delegates auth to proxy-auth via gRPC before forwarding any request
fail-closed | If proxy-auth is unavailable, ALL requests are denied (security by default)
OpAMP | WebSocket protocol for pushing OTel config to agents
xse_ token | Fleet enrollment token, shared among all agents in a group
xag_ key | Per-agent API key, created during enrollment and unique to each agent
${secret:NAME} | Config template placeholder resolved via AWS KMS at delivery time
NOTIFY/LISTEN | PostgreSQL mechanism for near-real-time config push to agent-api
xMetrics | Multi-tenant metrics: multitenancy_enabled: true, port 9009
xLogs | Multi-tenant logs: auth_enabled: true, HTTP 3100, gRPC 9095
xTraces | Multi-tenant traces: multitenancy_enabled: true, HTTP 3200
usage-sync | Polls xMetrics every 60s for usage, then writes PostgreSQL rollup tables
Four golden signals | Latency, Traffic, Errors, Saturation; the foundation of SRE alerting
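The `${secret:NAME}` entry above is easiest to remember with a small example. The sketch below substitutes placeholders in a config template at render time. A plain dictionary stands in for the KMS-backed secret store, and the `render` function and the pattern for valid secret names are assumptions for illustration only.

```python
import re

# Assumption: secret names consist of letters, digits, and underscores.
PLACEHOLDER = re.compile(r"\$\{secret:([A-Za-z0-9_]+)\}")


def render(template: str, secrets: dict) -> str:
    """Replace each ${secret:NAME} with its value; fail on unknown names."""

    def replace(match):
        name = match.group(1)
        if name not in secrets:
            # Refuse to deliver a config with an unresolved placeholder.
            raise KeyError(f"unknown secret: {name}")
        return secrets[name]

    return PLACEHOLDER.sub(replace, template)


if __name__ == "__main__":
    template = "Authorization: Bearer ${secret:API_KEY}"
    # Stand-in for values that would be decrypted at delivery time.
    print(render(template, {"API_KEY": "abc123"}))
```

Because substitution happens only when the config is rendered for delivery, the stored template never contains the secret value itself.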

Next Steps

Immediate (This Week)

  • Review your organisation's current observability tooling
  • Identify 2-3 services as pilot candidates for xScaler instrumentation
  • Create a production tenant structure (one tenant per environment)
  • Test OTel collector configuration against your first service
  • Schedule a follow-up session with your xScaler customer success manager

Short-Term (30 Days)

  • Deploy OTel DaemonSet to first Kubernetes cluster
  • Create fleet enrollment token and enrol first agents
  • Configure Grafana datasources and build service dashboards
  • Create alert rules for four golden signals
  • Document your tenant naming convention in your team wiki

Long-Term (90 Days)

  • Complete fleet rollout across all production clusters
  • Review cardinality monthly and check series count trends
  • Instrument all services with OTel SDK for traces
  • Establish SLO dashboards for critical services
  • Train additional team members using this training site

Additional Resources

xScaler Documentation

External Resources

Support Channels

  • xScaler Portal: https://portal.xscalerlabs.com
  • Support tickets: via the portal's Support menu
  • Status page: https://status.xscalerlabs.com

Feedback

Please complete the training feedback form provided by your instructor. Your feedback helps us improve the training programme for future participants.


You're Ready!

You now have the knowledge to deploy, configure, and operate the xScaler Observability Platform in production. Start with a single team's services, prove the value, and expand from there. The platform scales from a single DaemonSet to thousands of agents across multiple regions.

