DATABRICKS CERTIFIED · AWS SOLUTIONS ARCHITECT ASSOCIATE · AZURE · GCP

Single-node analytics.
No Spark. No cluster bill.

172 million rows per second. 5 queries. 167M rows. Under 1 second — on a single machine. I connect directly to your S3 bucket, Azure Blob container, or GCS bucket and deliver results — no cluster, no vendor lock-in, no $15k/month bill.

Get in Touch · See the Benchmark →
☁ AWS S3 + Athena
☁ Azure Blob Storage
☁ GCP Cloud Storage
🦆 DuckDB httpfs / azure / gcs
// Sovereign Benchmark · April 2026
167 Million Rows. 4 Years.
971 Milliseconds.

48 Apache Parquet files — every NYC Yellow Cab trip from January 2022 through December 2025. Five analytical queries. Cold NVMe. One consumer workstation. No JVM. No executor heap. No cluster. No warmup.

167,858,646
rows scanned
971ms
wall time · 5 queries
172M/s
rows/sec · cold NVMe
25×
faster than Spark
(50GB RAM heap, pre-warmed)
Query · Description · Time
Q1 — Row Count COUNT(*) per year across all 4 years 29ms
Q2 — Fare & Distance YoY 6 aggregates (avg fare, distance, tip, total, passengers, trips) per year · 143M filtered rows · full column decompression 401ms
Q3 — Monthly Pivot Trip volume by year×month — 48 cells · native DuckDB PIVOT · 2025 the busiest of the four years 312ms
Q4 — Payment Type Shift Cash collapse: 19.6% → 9.6% · Credit card peak 2023 at 77.9% · full 4-year scan 158ms
Q5 — CBD Congestion Fee 2025-only schema column · NYC congestion pricing live Jan 2025 · 72.8% of trips charged · $25.03M captured 64ms
Total 167,858,646 rows · 48 Parquet files · ~685M total row-scans across all 5 queries combined 971ms
Hardware: Intel Ultra 7 265KF (20-core) · NVIDIA RTX 3060 12GB · NVMe SSD · Arch Linux · 64GB RAM
Stack: DuckDB 1.4.4 vectorized columnar execution · zero Python · zero JVM · cold NVMe reads · one process
Spark baseline: Apache Spark on identical hardware, 50 GB executor heap pre-warmed in RAM — 6.88M rows/sec. DuckDB delivers 172M rows/sec from cold NVMe — 25× faster with no heap, no warmup, no cluster.
What "~685M row-scans" means: The dataset has 167,858,646 unique rows. Most of the 5 queries independently scan all 4 years (Q5 touches only the 2025 files), so DuckDB physically decompressed and processed roughly 685M row reads total across the run. The 172M rows/sec figure uses unique rows ÷ wall time — the conservative number. The engine is doing significantly more work than that implies.
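Both throughput figures follow from simple division; the arithmetic checks out in any DuckDB shell (the 685M scan count is the measured total from the run above):

```sql
-- Conservative figure: unique rows ÷ wall time
SELECT 167858646 / 0.971;              -- ≈ 172.9M rows/sec
-- Effective figure: total row-scans ÷ wall time
SELECT 685000000 / 0.971;              -- ≈ 705M rows/sec
-- Speedup vs the measured Spark baseline (6.88M rows/sec)
SELECT (167858646 / 0.971) / 6880000;  -- ≈ 25×
```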

Important: these files were on local NVMe — not S3. S3 throughput is 100–500 MB/s vs NVMe at 3–7 GB/s. Expect 5–20× slower over object storage. See the S3 vs NVMe section below for the full honest breakdown and what to do about it.
// How It Scales

DuckDB is CPU + NVMe bound. The workstation number is measured and verified. Bare-metal and cloud NVMe figures are informed estimates based on published hardware specs — not yet run.

Configuration · Hardware · DuckDB Throughput · Est. Monthly Cost
Workstation (measured ✓) Intel Ultra 7 265KF · NVMe · 64GB RAM 172M rows/sec · 971ms $0 (owned)
Hetzner AX102 bare-metal AMD Ryzen 9 9950X · NVMe · 192GB RAM ~200–250M rows/sec (est.) ~$250/mo
AWS i4i.4xlarge Intel Xeon · 3.75TB local NVMe SSD · 128GB RAM ~250–350M rows/sec (est.) ~$900/mo on-demand
Databricks cluster Spark · JVM heap · DBU licensing · S3 ~6–40M rows/sec (measured Spark baseline: 6.88M) $5,000–$25,000/mo

Workstation result is verified — measured 2026-04-08 on local NVMe, cold reads, no warmup. Hetzner and AWS i4i estimates based on published NVMe throughput specs vs measured workstation baseline. Spark baseline of 6.88M rows/sec measured on identical workstation hardware, 50GB heap pre-warmed.

🏆
Databricks Certified
Associate Developer · Spark · Scala · Verify ↗
☁️
AWS Solutions Architect
Associate — Amazon Web Services · Verify ↗
🔷
Azure Data Platform
Blob Storage · ADLS Gen2 · Synapse
🌐
Google Cloud Platform
GCS · BigQuery · Dataflow
// LIVE DATA — NYC YELLOW CAB 2022–2025

The data. All of it. Interactive.

Queried live from 48 Parquet files by DuckDB. 167,858,646 rows. Four charts. All computed on a single workstation in 971ms.

Data: NYC TLC Trip Record Data (open dataset) · Engine: DuckDB 1.4.4 · Hardware: Intel Ultra 7 265KF · NVMe

// HOW IT WORKS

Your data. My compute. No cluster tax.

You grant scoped read access to your cloud storage. I run DuckDB on a high-performance node, deliver results as Parquet, CSV, or a live dashboard — then recommend the right long-term architecture for your data volume and budget.

YOUR DATA (stays in your cloud)
    AWS S3        s3://your-bucket/data/*.parquet
    Azure Blob    abfss://container@account.dfs.core.windows.net/
    GCP Storage   gs://your-bucket/data/*.parquet
          │
          │ scoped read-only credentials
          ▼
COMPUTE NODE (bare-metal NVMe or NVMe cloud instance)
DuckDB + httpfs / azure / gcs extension
┌──────────────────────────────────────────────────────┐
│ LOAD httpfs; SET s3_region='us-east-1';              │
│ SET s3_access_key_id='...';                          │
│                                                      │
│ SELECT month, SUM(revenue), AVG(margin)              │
│ FROM read_parquet('s3://your-bucket/sales/*.parquet')│
│ GROUP BY month ORDER BY month;                       │
└──────────────────────────────────────────────────────┘
          │
          │ results
          ▼
DELIVERABLES
├── results.parquet  → written back to your bucket
├── report.csv       → emailed / shared directly
└── dashboard        → Streamlit app, hosted anywhere
Honest take on S3 as a data lake
S3 object storage tops out at roughly 100–500 MB/s per connection. A local NVMe drive delivers 3,000–7,000 MB/s — a 10–50× I/O advantage. DuckDB's httpfs extension reads Parquet metadata to apply projection and predicate pushdown, fetching only the byte ranges a query actually needs — that helps significantly, but physics still wins: if your dataset is large and your queries are wide, S3 is the bottleneck, not DuckDB.

The 172M rows/sec benchmark ran against local NVMe. Against S3 with the same hardware and the same queries, expect 5–20× slower depending on file sizes, network, and how selective your queries are.

That said — you still save money over Databricks. Even at 5× slower, a $200/month bare-metal NVMe node running DuckDB beats a $5,000/month Databricks cluster reading the same S3 data. The recommendation depends on your data volume and query patterns.
Recommendation: transition hot data to NVMe
For recurring analytical workloads — daily reports, ML feature pipelines, compliance queries — the right architecture is: S3/Blob/GCS for cold archival storage, NVMe for hot analytical compute. Sync your hot dataset once to a bare-metal or NVMe cloud instance, process at full speed, write results back to object storage. You get S3 durability for your source data and NVMe throughput for your queries.

Bare-metal options worth knowing:
· Hetzner AX102 — 192GB RAM, NVMe, dedicated, ~$250/month (exceptional value)
· AWS i4i instances — up to 30TB local NVMe SSD, purpose-built for analytics
· GCP Local SSD — up to 9TB, 2.4M IOPS, available on most instance families
· Azure Lsv3 — NVMe local storage, built for storage-intensive workloads
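The sync-once pattern above takes only a few lines of DuckDB SQL. A minimal sketch — the bucket layout and local paths here are hypothetical placeholders, not from the benchmark:

```sql
-- One-time sync: pull the hot dataset from S3 down to local NVMe
LOAD httpfs;
SET s3_region = 'us-east-1';
COPY (SELECT * FROM read_parquet('s3://your-bucket/sales/2025-*.parquet'))
    TO '/nvme/hot/sales_2025.parquet' (FORMAT PARQUET);

-- Every subsequent query reads from NVMe at full throughput
SELECT month, SUM(revenue) AS revenue
FROM read_parquet('/nvme/hot/sales_2025.parquet')
GROUP BY month;
```

Source data stays durable in S3; only the working set lives on NVMe, and results can be copied back to the bucket the same way.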

I can audit your current workload and tell you exactly which tier makes sense.
// WHY IT MATTERS

Your Databricks bill is optional.

Most analytics workloads at mid-market companies fit on a single modern server. DuckDB processes columnar Parquet in-process — no serialization, no cluster coordination, no $15k/month bill.

Capability · Databricks / Spark · DuckDB (single node)
685M row scan + 5 queries $8–40 cluster cost per run $0 — local process
Read from S3 / Azure / GCP Yes (cluster required) Yes — httpfs / azure / gcs extension, no cluster
Monthly platform cost $3,000–$25,000+ $20–200 (VPS or local workstation)
Time to first query 5–15 min (cluster startup) < 1 second
SQL compatibility Spark SQL (Hive-compatible dialect) Standard SQL + PIVOT, ASOF JOIN, LIST aggregation, UNPIVOT
Python / Streamlit integration PySpark (heavyweight) Native Python API — duckdb.query(sql).df()
Operational complexity Cluster mgmt, DBUs, autoscale, networking Zero — one binary, embed anywhere
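The ASOF JOIN in the table above deserves a concrete sketch — matching each event to the most recent record at or before it, a pattern that takes window-function gymnastics in Spark SQL. The tables here are hypothetical, purely for illustration:

```sql
-- Hypothetical tables: pair each trade with the latest price at or before it
CREATE TABLE prices (symbol VARCHAR, ts TIMESTAMP, price DOUBLE);
CREATE TABLE trades (symbol VARCHAR, ts TIMESTAMP, qty INTEGER);

SELECT t.symbol, t.ts, t.qty, p.price
FROM trades t
ASOF JOIN prices p
    ON t.symbol = p.symbol AND t.ts >= p.ts;
```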
// ALSO BUILT BY SCOTT BAKER

Inventor & Creator — skr8tr

skr8tr is a sovereign, masterless distributed systems framework built on post-quantum cryptography. Every node authenticates via ML-DSA-65 signed tokens — no certificate authorities, no central broker, no single point of failure. Commands propagate across a UDP mesh, each packet carrying a post-quantum signature verified on arrival. Designed from first principles for air-gapped, regulated, and adversarial environments where you cannot afford to trust the network.

// THE CODE WE ACTUALLY RAN

Plain SQL. Verifiable results.

Five queries. Real data from the NYC TLC open dataset. No proprietary runtime. Run it yourself.

Read directly from S3 — no download
LOAD httpfs;
SET s3_region='us-east-1';
SET s3_access_key_id='...';
SET s3_secret_access_key='...';

SELECT year,
       COUNT(*) AS trips,
       ROUND(AVG(fare_amount), 2) AS avg_fare
FROM read_parquet('s3://your-bucket/taxi/yellow/2024-*.parquet')
GROUP BY year
ORDER BY year;
Q3 — Monthly trend pivot (DuckDB-native PIVOT)
PIVOT (
    SELECT year,
           MONTH(tpep_pickup_datetime) AS month,
           COUNT(*) AS trips
    FROM all_years
    GROUP BY year, month
) ON year USING SUM(trips)
GROUP BY month
ORDER BY month;
-- Native PIVOT — no Spark workarounds, no conditional aggregation hacks

Full benchmark: scripts/benchmark_nyc_4year.sh

// WHAT I DO

Data engineering that ships.

Remote consulting. Fixed-scope engagements. AWS, Azure, and GCP. Results you can verify.

🔍 Cost Audit

Review your Databricks / Synapse / BigQuery spend. Identify workloads that move to DuckDB on a single node. Written cost-reduction plan within 48 hours.

🦆 DuckDB Pipeline Build

End-to-end columnar pipeline — ingest from S3, Azure Blob, or GCP Storage, transform with DuckDB SQL, write results back to your bucket. Reproducible scripts you own.

📊 Streamlit Dashboard

Interactive analytics dashboard backed by DuckDB. Reads live from your cloud storage. Deployed on a $20/month VPS or your existing infra. Zero cluster dependency.

☁️ AWS Data Architecture

S3 + Glue + Athena + DuckDB hybrid pipelines. Leveraging AWS Solutions Architect experience to build data lakes that scale without surprise bills.

🔷 Azure Data Engineering

ADLS Gen2, Azure Blob, Synapse, and DuckDB integration. Migrate heavyweight Spark jobs to single-node DuckDB where the data volume allows it.

🌐 GCP / BigQuery Consulting

GCP Cloud Storage + DuckDB pipelines. BigQuery cost reduction — identify queries that run faster and cheaper outside BigQuery on a local DuckDB node.

🏗️ Spark Migration

Port SparkSQL jobs to standard DuckDB SQL. Remove cluster dependency for batch workloads. Works with AWS EMR, Azure HDInsight, and GCP Dataproc migrations.

📐 Architecture Review

Async review of your current data pipelines — any cloud. Written recommendations with specific, actionable improvements. Delivered in 48 hours.

⚡ Bare-Metal / NVMe Migration

Audit your S3 data lake for workloads that belong on NVMe. Design the hot/cold split — NVMe for compute, object storage for archival. Get DuckDB throughput you can actually feel.

🔐 Post-Quantum Secured Deliverables

Every file I deliver — Parquet, CSV, report — is signed with ML-DSA-65 (NIST FIPS 204), the post-quantum digital signature standard. You get the data file, a detached .sig file, and a one-command verifier. Run it against my public key and know the file is authentic and untampered.

Designed for healthcare, finance, and government workloads that need provable chain-of-custody today and quantum resistance tomorrow.

↓ Download duckpqc.pub — Scott Baker's ML-DSA-65 public key

Ready to cut your data costs?

All engagements start with a free 30-minute scoping call.
I respond to all serious inquiries within 24 hours.

scott@duckdatamaster.guru

GitHub: NixOSDude/DuckDB_Master