Connectors & Integration
DataFlow AI connects to on-prem databases such as a legacy PostgreSQL instance, SaaS APIs, and object stores such as an Amazon S3 bucket. Its plug-and-play connector framework keeps each source synchronized with its destination.
300+ Prebuilt Integrations
No need to write fragile extraction scripts. Connect to Salesforce, Stripe, Shopify, MongoDB, or generic REST APIs by authenticating with OAuth or API keys. The connector model is compatible with Airbyte conventions.
Real-Time CDC Streams
For transactional databases (PostgreSQL, MySQL), we use log-based change data capture (CDC) with Debezium and Kafka to stream row-level changes (inserts, updates, deletes) as they are committed, so they reach analytics with minimal delay.
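To make the row-level change stream concrete, here is a minimal sketch of how a consumer might apply Debezium-style change events to a replica. It assumes the event envelope has already been unwrapped to its `op`, `before`, and `after` fields, and that rows have a primary key column named `id`. The function name `apply_change_event` and the sample rows are hypothetical and are not part of DataFlow AI.

```python
from typing import Any, Dict

Row = Dict[str, Any]


def apply_change_event(replica: Dict[Any, Row], event: Dict[str, Any], key: str = "id") -> None:
    """Apply one Debezium-style change event to an in-memory replica keyed by `key`.

    Assumption: `event` is the unwrapped envelope with "op", "before", "after".
    """
    op = event["op"]
    if op in ("c", "r", "u"):
        # create, snapshot read, and update all carry the new row state in "after"
        row = event["after"]
        replica[row[key]] = row
    elif op == "d":
        # delete carries the last known row state in "before"
        replica.pop(event["before"][key], None)
    else:
        raise ValueError(f"unsupported op: {op!r}")


# Hypothetical sample events, in commit order
events = [
    {"op": "c", "before": None, "after": {"id": 1, "email": "a@example.com"}},
    {"op": "u", "before": {"id": 1, "email": "a@example.com"},
     "after": {"id": 1, "email": "b@example.com"}},
    {"op": "c", "before": None, "after": {"id": 2, "email": "c@example.com"}},
    {"op": "d", "before": {"id": 2, "email": "c@example.com"}, "after": None},
]

replica: Dict[Any, Row] = {}
for event in events:
    apply_change_event(replica, event)

print(replica)  # {1: {'id': 1, 'email': 'b@example.com'}}
```

Because every event carries the full row state, replaying the events in commit order reproduces the source table without ever querying it.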
Ingestion Modes
- Full Snapshot (Batch): Performs an initial `SELECT *` dump. Best for static lookup tables or slowly changing dimensions (SCD Type 1).
- Incremental (Cursor/Watermark): Tracks state (e.g., `updated_at > '2026-03-21'`) and pulls only records modified since the last run, which costs far less compute than repeating a full snapshot.
- Log-Based CDC (Streaming): Subscribes directly to the transaction log of the origin database (the Write-Ahead Log in PostgreSQL, the binary log in MySQL). Replicates committed transactions in commit order with sub-second latency, without querying the source tables.
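The incremental mode above can be sketched as follows. This is an illustration only: it uses an in-memory SQLite database from the Python standard library as a stand-in for the source, and the `customers` table, its columns, and the `pull_incremental` helper are hypothetical names chosen for the example.

```python
import sqlite3
from typing import List, Optional, Tuple


def pull_incremental(
    conn: sqlite3.Connection, watermark: Optional[str]
) -> Tuple[List[tuple], Optional[str]]:
    """Return rows modified after `watermark`, plus the advanced watermark."""
    if watermark is None:
        # No saved state yet: behaves like a full snapshot
        query = "SELECT id, name, updated_at FROM customers ORDER BY updated_at, id"
        rows = conn.execute(query).fetchall()
    else:
        query = (
            "SELECT id, name, updated_at FROM customers "
            "WHERE updated_at > ? ORDER BY updated_at, id"
        )
        rows = conn.execute(query, (watermark,)).fetchall()
    # Advance the watermark to the newest updated_at seen, if any rows came back
    new_watermark = rows[-1][2] if rows else watermark
    return rows, new_watermark


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, updated_at TEXT)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?, ?)",
    [(1, "Ada", "2026-03-20"), (2, "Grace", "2026-03-22")],
)

# First run: only the row modified after the saved watermark is pulled
first_rows, watermark = pull_incremental(conn, "2026-03-21")

# A later update at the source is picked up by the next run
conn.execute("UPDATE customers SET name = 'Ada L.', updated_at = '2026-03-23' WHERE id = 1")
second_rows, watermark = pull_incremental(conn, watermark)

print(first_rows)   # [(2, 'Grace', '2026-03-22')]
print(second_rows)  # [(1, 'Ada L.', '2026-03-23')]
```

Two limitations of this approach are worth noting. A strict `>` comparison can miss a row that is committed later with the same timestamp as the saved watermark, and rows deleted at the source never appear in the query results. Log-based CDC avoids both problems.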
Destinations (Sinks)
By default, data is stored in the open Delta Lake or Apache Iceberg table formats. Transformed data can be written to any of the following destinations:
- Snowflake
- Databricks
- BigQuery
- Redshift
- DuckDB
- AWS S3
- PostgreSQL
- Redis
© 2026 DataFlow AI Docs