Docs

Connectors & Integration

DataFlow AI connects everything. Whether it's an on-prem legacy PostgreSQL database, a SaaS API, or an Amazon S3 bucket, our plug-and-play connector framework handles synchronization resiliently.

300+ Prebuilt Integrations

No need to write fragile extraction scripts. Connect to Salesforce, Stripe, Shopify, REST APIs, or MongoDB instantly via OAuth or API keys. Fully compatible with Airbyte paradigms.

Real-Time CDC Streams

For transactional databases (PostgreSQL, MySQL), we utilize logical replication (Debezium + Kafka) to stream row-level mutations precisely as they happen, ensuring immediate analytics availability.

Ingestion Modes

  • Full Snapshot (Batch): Performs an initial `SELECT *` dump. Best for static lookup tables or slowly changing dimensions (SCD Type 1).
  • Incremental (Cursor/Watermark): Tracks state (e.g., `updated_at > '2026-03-21'`) to only pull newly modified records, saving compute costs drastically.
  • Log-Based CDC (Streaming): Subscribes directly to the Write-Ahead Log (WAL) of the origin database. Replicates strict ACID transactions in sub-second time without querying the main table.

Destinations (Sinks)

Our architecture defaults to Delta Lake / Apache Iceberg standard formats. You can effortlessly sink transformed data into:

Snowflake
Databricks
BigQuery
Redshift
DuckDB
AWS S3
PostgreSQL
Redis
← Back to Main App

© 2026 DataFlow AI Docs