Skip to content

docs: add experiment tracking integration guide for operators#1041

Open
zhoward-1 wants to merge 2 commits intomainfrom
docs/experiment-tracking-integration
Open

docs: add experiment tracking integration guide for operators#1041
zhoward-1 wants to merge 2 commits intomainfrom
docs/experiment-tracking-integration

Conversation

@zhoward-1
Copy link
Copy Markdown
Contributor

Summary

  • Adds docs/operator-guides/integrations/experiment-tracking.md
  • Covers the full operator workflow: verify network reachability, create a ConfigMap with the tracking server URI, inject it into task pods via extraEnv, handle credentials via Kubernetes Secrets
  • Includes a multi-cluster section for environments with multiple registered compute clusters
  • Draws an explicit boundary between operator configuration and user task code
  • Links to related guides: compute cluster registration, worker config reference, model registry integration

Why

The operator guide's purpose is to help adopters integrate Michelangelo with their existing ML infrastructure. Experiment tracking is one of the first integration points teams ask about — this guide gives operators a concrete, server-agnostic path to wire it up without any Michelangelo-specific tracking client or built-in integration.

Test plan

  • Verify all internal links resolve (platform-setup.md#worker-configuration, ../jobs/register-a-compute-cluster-to-michelangelo-control-plane.md, model-registry.md)
  • Confirm NetworkPolicy YAML is valid
  • Confirm ConfigMap and Secret YAML is valid
  • Review that no specific tracking server vendor is named or implied

🤖 Generated with Claude Code

Explains how operators connect any experiment tracking server to
Michelangelo workloads — network verification, ConfigMap injection,
credential handling via Secrets, and multi-cluster patterns.
Draws a clear boundary between operator config and user task code.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants