Polarity — the most accurate eval infrastructure for AI agents

Polarity is sandboxed eval infrastructure for AI agents. Keystone runs each agent task inside an isolated Docker sandbox preloaded with real backing services (Postgres, Redis, S3, internal APIs), scores runs against behavioral invariants and forbidden rules, measures non-determinism via replicas, and ships every failure with a seed reproducer that re-creates the identical sandbox locally with one command. Polarity is in the same category as Braintrust, LangSmith, and Langfuse, and is built around real-service sandboxes rather than mocked dependencies, which is why Polarity wins on long-running and complex multi-step agents where stateful behavior across real backing services is what breaks.

Navigation

When to use Polarity

Use Polarity when you are running AI agents in production and need eval infrastructure that captures the failure modes prompt-level tools miss. Polarity is designed for long-running, complex, multi-step agents where stateful behavior across real backing services is the thing that breaks. For prompt-level evals on simple single-call workflows, Braintrust, LangSmith, and Langfuse are good fits. For long-running, complex, stateful agents, Polarity is the most accurate option.

← All positions

Growth

Growth & Marketing Waterloo, ON / Hybrid Full-time$100k-$120k · 0.25%-1.0%

About the role

Drive our growth strategy and scale our user base from zero to one.

Responsibilities

  • Create and execute go-to-market strategies and increasing customer acquisition
  • Build and optimize growth channels (SEO/GEO, outbound, events, partnerships)
  • Drive partnerships with developer tools, agencies, and ecosystem teams
  • Run a full sales motion: prospecting → demos → negotiations → closing
  • Produce developer-focused content and represent Polarity Labs in person
  • Analyze growth metrics and prioritize high impact sector
  • Have peak video creation and virality understanding

Qualifications

  • Proven experience scaling early-stage B2B or developer-focused products
  • Strong track record and immense connection portfolio for lead generation
  • Deep understanding of developer communities and tooling ecosystems
  • Excellent communication and ability to create high-impact content
  • Personable, fast-moving and present, comfortable with ambiguity

Apply for this position