Skip to main content

Collect Module API Routes

Step-by-step tutorials for common API-based pipeline management scenarios.

Use cases

ScenarioDescriptionGuide
New column in the sourceAdd a column to an incremental pipeline and backfill historical dataCase 1
Duplicate recordsMigrate from incremental to incremental_with_qualifyCase 2
Job failing with OOMIncrease memory for a job that fails due to lack of resourcesCase 3
Customize table namesCustomize raw and qualify table names in SnowflakeCase 4

Quick concepts

Pipeline and jobs

Pipeline (postgres_vendas)
├── Job 0: clientes
├── Job 1: pedidos
└── Job 2: produtos

A pipeline contains one or more jobs. Each job synchronizes one table.

Job ID

The jobId follows this format: {pipeline_id}-{job_index}.

Example: f47ac10b-58cc-4372-a567-0e02b2c3d479-0 (first job)

Note: in API responses, job_id may appear normalized with _ (underscores), but when calling the endpoints you should use - (hyphens).

Synchronization modes (JDBC)

ModeBehavior
full_loadExtracts everything on every run
incrementalExtracts only new or changed records
incremental_with_qualifyIncremental extraction plus primary-key deduplication

Main endpoints

ActionEndpoint
List pipelinesGET /platform/pipelines
Inspect a JDBC jobGET /platform/jobs/jdbc/{jobId}
Add columnsPATCH /platform/jobs/{jobId}/input
Migrate sync modePOST /platform/jobs/jdbc/{jobId}/sync-mode
Change job memoryPUT /platform/jobs/{jobId}/memory
Change pipeline memoryPUT /platform/pipeline/{pipelineId}/memory
Reset statePOST /platform/jobs/{jobId}/reset-state
Execute pipelinePOST /platform/pipeline/execute
Monitor runsGET /platform/pipeline/{pipelineId}/runs

Full reference

For detailed endpoint documentation, see the API reference.