Staging / Ingestion Module
Connect to any data source — databases, files, NoSQL, APIs, Python classes, streaming — and load data in batch, delta, CDC, or near real-time mode. All configured visually, no code.
Load modes
-
Batch Loading
Full extracts on schedule. Configurable parallelism and load windows.
-
Delta Loads
Load only changed records based on watermarks, timestamps, or sequence numbers.
-
CDC Loads
Change Data Capture from transactional databases — capture inserts, updates, and deletes.
-
Near Real Time
Micro-batch and streaming ingestion for operational data with low-latency requirements.
Source connectors
-
Databases
Any JDBC-compliant database: Oracle, SQL Server, PostgreSQL, MySQL, SAP HANA, Exasol, and more.
-
Files
CSV, Excel, JSON, XML, Parquet, ORC from local storage, SFTP, S3, Azure Blob, or Google Cloud Storage.
-
NoSQL
MongoDB, Cosmos DB, Cassandra and other document and key-value stores.
-
Python Classes
Write custom ingestion logic in Python and register it as a reusable source connector.
-
Compatible with other ingestion tools
Works alongside Fivetran, Airbyte, dbt, and other ingestion or transformation tools in your stack.
See the ingestion module live
20-minute demo. We'll connect to a source and show a full staging-to-vault pipeline.