Skip to content

Product Team Integration Spec

General Augment is the agent backend for your app.

This spec is for product, engineering, and partnership teams evaluating General Augment for an existing application. It explains the value, the integration shape, the main product capabilities, and the proof gates to capture before real users rely on the agent.

Most product teams can call a model API for a demo. The hard part is turning that demo into a production feature with app identity, user memory, tool permissions, approvals, traces, usage limits, cost controls, channels, support workflows, and security review.

General Augment supplies that missing backend layer. Your team keeps the product, UX, users, app database, business logic, and customer relationship. General Augment provides the managed agent layer around it:

  • an Open Responses-compatible /v1/responses backend API for in-app chat and backend workflows;
  • durable per-user memory and profile context;
  • governed tools generated from app APIs or connected through MCP;
  • human approval gates for sensitive writes;
  • identity linking across app users and messaging users;
  • Telegram, WhatsApp, SMS, and in-app delivery paths;
  • usage, traces, audit rows, support bundles, and cost metadata;
  • rate limits, budget gates, and plan usage controls;
  • GA SDKs, CLI, raw HTTP examples, local mock testing, and dashboard setup surfaces.

The result is not another chatbot. It is the best agent control plane a product team can adopt once and reuse across product chat, backend automations, support workflows, messaging channels, app-owned scheduled or event-driven jobs, and General Augment-managed scheduled jobs.

“Open Responses-compatible” means General Augment accepts a Responses-style request and returns a Responses-shaped object on POST /v1/responses; it does not mean every OpenAI API, SDK, model name, hosted file/vector feature, or account-level workflow is implemented by General Augment.

StatusMeaning for an app team
GA-readyBuild against this now for production traffic after project-specific smoke, trace, budget, and support evidence pass.
GA-managedThe capability is available with General Augment-managed setup, quota, channel, or operations support.
RoadmapUse the app-owned fallback until the public product surface is promoted.
Enterprise termsCustomer-specific commercial, security, support, residency, retention, or regulated-data commitments belong in the accepted customer agreement.
CapabilityCurrent maturityWhat a new AI app team should do
Backend agent turns with POST /v1/responsesGA-readyCall from trusted backend code with a project-scoped key, stable user, metadata, and an idempotency key. Treat it as Responses-style compatibility, not broad OpenAI account compatibility.
Raw HTTP integration pathGA-readyUse curl or generated backend clients when an SDK is unavailable, especially for mobile apps that must keep General Augment keys server-side.
Stable user scoping and explicit memory lifecycleGA-readyUse the same app user id for /v1/responses plus memory store, search, profile, delete, and purge calls. Prove isolation for at least two users before launch.
Local mock and hosted verificationGA-readyUse genaug mock for offline contract tests, then preserve genaug smoke, genaug verify, and genaug onboarding verify --json evidence for the hosted project.
Technical metering, rate limits, and budget gatesGA-readyHandle stable 402 and 429 responses in app code and reconcile usage from returned metadata and project usage surfaces.
TypeScript, Python, and CLI package distributionPartially registry-readynpm packages @general-augment/sdk and @general-augment/local-imessage are visible at 0.1.1; PyPI currently exposes 0.1.0 for general-augment-sdk and general-augment-cli while source packages target 0.1.1. Use repo-local packages or raw HTTP until PyPI 0.1.1 is visible.
OpenAPI/MCP-generated tools and delegated executionGA-managedStart with app-owned execution or low-risk read tools. Delegate writes after tool review, credentials, identity links, allowlists, approval UX, and audit evidence are ready.
Human approval gatesGA-managedChannel and dashboard/admin approval flows exist. Apps can list pending approvals, approve/deny through admin APIs, and receive signed approval lifecycle webhooks when a project callback URL is configured.
Approval webhooks to the app backendGA-managedConfigure approval_webhook_url for signed approval.pending, approval.approved, approval.denied, and approval.expired callbacks. Polling/listing approvals remains the fallback and reconciliation path.
App-owned scheduled or event-driven jobsGA-readyRun the schedule in the app backend or existing job system, then call /v1/responses with app-scheduled context and the same stable user id.
Managed scheduled-job lifecycle in General AugmentGA-readyUse the authenticated admin API, genaug jobs, or SDK helpers for the self-serve scheduled-job create/list/pause/resume/delete surface, validation dispatch, and retry history.
Self-serve scheduled-job dashboard UIRoadmapUse the API, CLI, or SDK lifecycle while the dashboard management view catches up.
Telegram, WhatsApp, SMS, and identity linkingGA-managedUse channels included in the launch plan, link channel users back to app users before app-specific tools run, and verify provider webhooks per channel.
Self-serve billing checkout and customer portalGA-managedTechnical usage, credit, and plan gates exist. Stripe-configured tenants can open hosted Build/Pro/Team Checkout, paid credit top-up Checkout, and Customer Portal sessions. Webhooks can activate plans, record invoice/payment-failure events, and create subscription_included or paid_top_up credit grants. Automatic off-session auto top-up charges require tenant opt-in.
Production billing terms, overages, and support tierEnterprise termsAgree commercial terms, included usage, overages, payment path, support owner, and rate window in the customer launch packet.
Healthcare, PHI, HIPAA mode, DPA/BAA, residency, retention, or certification claimsEnterprise termsHealth and care examples are product examples. HIPAA mode is a technical guardrail; regulated commitments are made through signed customer terms.
Production SLA and formal support commitmentsEnterprise termssupport@generalaugment.com is the GA support path. Uptime, response-time, incident, deletion, or backup-destruction SLAs are signed customer terms.
The app keepsGeneral Augment provides
Product UX and mobile/web surfacesAgent backend API and managed turn execution
App users, auth, roles, and account modelProject-scoped user resolution and memory scoping
App database and business rulesAgent memory, tool context, and response orchestration
App-owned secrets and user OAuth where preferredCredential vault and auth proxy for delegated tools
Sensitive side-effect UX where preferredApproval flows for delegated actions
App support and customer success workflowResponse IDs, trace IDs, usage, logs, audit rows, support bundles
App billing and packagingAgent-turn metering, budget gates, plan limits, usage exports
Existing API surfaceOpenAPI-to-tool generation and MCP tool connections

Add a chat or assistant surface inside an existing product without building the agent platform from scratch. The app calls POST /v1/responses from trusted backend code with a stable app user id, then renders assistant output in the product UI.

Strong first experiences include support triage, onboarding help, account summaries, structured extraction, next-action recommendations, and schema-validated drafts.

Your backend can call the same agent from server jobs, workflow handlers, event processors, or admin tools. This adds AI reasoning to existing product workflows without exposing API keys to browser or mobile clients.

General Augment can store, search, profile, delete, and purge scoped user memory. The same stable user id should be sent on /v1/responses and explicit memory APIs so memory, traces, approvals, and future channel links stay isolated per app user.

General Augment turns app capabilities into governed agent tools. Teams can import an OpenAPI spec, connect MCP servers, or enable built-ins. Tool execution is controlled by project allowlists, risk metadata, credentials, identity checks, audit rows, and approval policy.

API operationDefault behavior
GETRead tool, can auto-execute
POSTWrite tool, approval required
PUT / PATCHWrite tool, approval required
DELETEDestructive tool, disabled by default

Sensitive delegated actions can pause before side effects run. Approval rows are scoped to project, user, session, and tool call. Users can approve through supported channels or dashboard/admin flows, depending on the surface.

When approval_webhook_url is configured, General Augment enqueues signed outbound approval events for pending, approved, denied, and expired approvals. Events include a stable event_id, the approval id, project id, user id, session id, tool id, redacted input summary, status, and timestamps. Receivers should verify X-General-Augment-Signature, use event_id for idempotency, and still reconcile with the approval list/admin APIs when needed.

The same project agent can serve product backend calls, in-app chat, Telegram bots, WhatsApp and SMS webhooks, app-owned scheduled workflows, and General Augment-managed scheduled jobs that call the same Responses path. For scheduled work, teams can either keep scheduling app-owned or let General Augment own the generic scheduled-job lifecycle through authenticated API, CLI, and SDK surfaces.

General Augment returns response IDs, request IDs, trace IDs, actual model metadata, token usage, latency, and cost metadata when available. Project APIs and dashboard surfaces expose logs, traces, usage, observability status, support bundles, and PII-redacted tool-call audit rows.

General Augment uses agent-turn metering plus rate and budget controls, including API key rate limits, per-user burst limits, daily plan limits, project/user/feature LLM budget caps, and stable 402 / 429 error reasons.

These controls are the foundation of the customer billing lifecycle. Usage rollups, daily gates, budget caps, threshold notifications, hosted checkout, customer portal, credit grants, reservations, settlement, and billing exports are GA billing controls. Customer-specific rates, purchase orders, and support terms belong in the customer launch packet.

Project API keys stay server-side. Browser and mobile clients call the app backend, and the backend calls General Augment. For delegated tools, credentials are resolved server-side through the auth proxy or app backend; the model sees sanitized tool schemas and sanitized tool results, not raw provider tokens.

Browser or mobile app
-> app backend
-> POST /v1/responses with project key, stable user id, input, metadata
-> General Augment project agent
-> response output, trace id, usage, cost, tool/audit metadata
-> app backend
-> app UI or app workflow

When tools are delegated:

General Augment agent
-> allowed tool schema
-> approval or credential check when required
-> auth proxy or app MCP/OpenAPI backend
-> sanitized tool result
-> trace, usage, and audit rows
  1. Create a General Augment project from dashboard onboarding, genaug projects create, genaug integrate <openapi-url> --auto-deploy, or genaug deploy.
  2. Create project-scoped API keys for production, staging, development, and CI. Store raw keys only in backend secret storage.
  3. Add the first backend call to the Open Responses-compatible POST /v1/responses API with model, user, input, metadata, and an idempotency key for retry-safe operations. The request is Responses-style and portable for the supported v1 fields, but teams should not assume account-level OpenAI platform compatibility or provider model selection.
  4. Store response id, request id, trace id, stable app user id, feature/source metadata, model, token, latency, and cost fields when present.
  5. Test offline with genaug mock --host 127.0.0.1 --port 8787 --quiet.
  6. Add explicit memory for durable user facts and use the same app user id for memory store/search/profile/delete/purge APIs.
  7. Connect app APIs as tools with genaug integrate or MCP, then review generated risk and approval defaults before production.
  8. Use approval gates or app-owned execution for sensitive writes.
  9. Link channel identities before app-specific tools operate over Telegram, WhatsApp, or SMS.
  10. Run genaug smoke, genaug verify, and genaug onboarding verify --json before launch. Treat project_key_execution=PASS as the strongest project-key proof.
Product categoryWhat General Augment unlocks
Productivity appsPlanning, calendar/email assistants, reminders, meeting prep, follow-up drafting
Support and CRMTicket triage, account summaries, next-action recommendations, email drafts, traceable support evidence
Health and care appsIntake summaries, care-plan guidance, user preference memory, appointment workflows, approval-first writes, and regulated launch packets for PHI, HIPAA, BAA, residency, or production compliance claims
Fintech and operations appsWorkflow copilots, exception triage, policy-aware summaries, approval-controlled actions
Marketplaces and commerceBuyer/seller support, order summaries, resolution proposals, feature-level budget caps
Consumer appsIn-app assistants, personalized onboarding, durable preferences, channel companions
Developer tools and SaaSAPI documentation agents, setup copilots, tenant health checks, usage explanations

Capture evidence that:

  • the project exists and has the expected slug/name;
  • environment keys are separated and server-side only;
  • the app backend calls /v1/responses with a project-scoped key;
  • project_key_execution=PASS appears in verification output;
  • response id, request id, trace id, and idempotency key are stored in app logs;
  • local mock contract tests pass in CI;
  • memory behavior is proven for the app users in scope;
  • generated tools are reviewed, allowlisted, and governed;
  • write tools have approval UX or app-owned execution;
  • destructive tools are disabled unless explicitly accepted;
  • channel identity links are tested before channel users rely on app-specific tools;
  • 402 and 429 responses are handled in app code;
  • dashboard project, tools, integration, analytics, users, and observability surfaces open for the same project;
  • returned trace IDs can be retrieved through trace or support-bundle surfaces;
  • commercial, compliance, residency, retention, and support terms are accepted for the planned traffic and data type.

The GA-ready core is a backend-owned app integration that calls /v1/responses with a project-scoped key, stable user id, trace metadata, budget handling, local mock tests, and hosted verification evidence.

GA-managed surfaces include generated tools, delegated tool execution, channels, approval workflows, and signed approval lifecycle webhooks. GA-ready scheduled-job management is available through API/CLI/SDK surfaces. Roadmap or Enterprise-term surfaces include dashboard scheduled-job management, customer-selectable residency, DPA/BAA workflows, compliance attestations, and formal SLA/support commitments. Use the availability matrix above when deciding whether a partner can launch now, should keep a capability app-owned, or should attach Enterprise terms.

Pick one narrow app surface, keep sensitive writes app-owned for V1, call /v1/responses from the backend with stable user ids and trace metadata, add explicit memory only for durable facts, use local mock tests in CI, run hosted smoke and onboarding verification, then review traces, usage, costs, and support evidence after internal dogfood traffic.

Useful next reads: Quickstart, Add chat to your app, Integration examples, Connect your API, SDK reference, Security, Compliance and Security Pack, and Status and Readiness.