# Monitoring

# Monitoring

Monitoring ensures your Curiosity Workspace environment is healthy, performant, and secure. Effective monitoring covers:

  • availability (is the workspace reachable?)
  • performance (latency, throughput)
  • ingestion and background jobs (success/failure, duration)
  • search/index health
  • security signals (auth failures, token usage, admin actions)

# What to monitor (baseline)

  • Service health
    • uptime, restarts, error rates
  • Ingestion
    • last successful run time
    • items processed per run
    • failure rate and top failure reasons
  • Indexing
    • queue depth / progress (if available)
    • completion status for rebuilds
  • Query performance
    • slow queries (graph and search)
    • timeouts and resource saturation

# Logs and audit trails

Recommended log categories:

  • ingestion connector logs
  • endpoint invocation logs
  • admin configuration changes
  • authentication and authorization events

# Alerting

Start with a small set of high-signal alerts:

  • workspace unavailable
  • repeated ingestion failures
  • index rebuild stuck or failed
  • elevated error rate or latency regression

# Next steps