Staging / Ingestion Module

Connect to any data source — databases, files, NoSQL, APIs, Python classes, streaming — and load data in batch, delta, CDC, or near real-time mode. All configured visually, no code.

Load modes

  • Batch Loading

    Full extracts on schedule. Configurable parallelism and load windows.

  • Delta Loads

    Load only changed records based on watermarks, timestamps, or sequence numbers.

  • CDC Loads

    Change Data Capture from transactional databases — capture inserts, updates, and deletes.

  • Near Real Time

    Micro-batch and streaming ingestion for operational data with low-latency requirements.

Source connectors

  • Databases

    Any JDBC-compliant database: Oracle, SQL Server, PostgreSQL, MySQL, SAP HANA, Exasol, and more.

  • Files

    CSV, Excel, JSON, XML, Parquet, ORC from local storage, SFTP, S3, Azure Blob, or Google Cloud Storage.

  • NoSQL

    MongoDB, Cosmos DB, Cassandra and other document and key-value stores.

  • Python Classes

    Write custom ingestion logic in Python and register it as a reusable source connector.

  • Compatible with other ingestion tools

    Works alongside Fivetran, Airbyte, dbt, and other ingestion or transformation tools in your stack.

See the ingestion module live

20-minute demo. We'll connect to a source and show a full staging-to-vault pipeline.