What is Tabular?

Tabular is a cloud-native warehouse and automation platform. It’s one central store for all your analytic data that can be used anywhere …and Tabular optimizes itself to save on your cloud bill.

Schedule a Demo

"Tabular is the platform we’ve always wanted as experts that pioneered big data architecture in the cloud — it combines the freedom to use any query engine or compute framework with the guarantees, reliability, and security of a traditional data warehouse."

Self-optimizing

Tabular continuously optimizes your data to keep S3 costs low and make your queries run faster. Its automated analysis will tune your tables as data changes or new query patterns show up. Using the optimal settings for each table, Tabular automatically clusters and compacts your tables to keep them at top performance and to free you from worrying about file sizes.

Secure by design

Tabular secures tables in your S3 bucket using role-based access control policies, just like traditional data warehouses. Use any engine or framework and be confident your data is safe. All data accesses and access control decisions are easy to understand and audit using log tables in your warehouse. Tabular gives you clarity into where your data is flowing and why.

Tabular warehouses also keep you up-to-date with best practices so you’re prepared for disasters and protected against malicious actors.

Painless data infrastructure

Stop wasting energy self-hosting infrastructure and maintaining tables. Tabular handles data retention, routine maintenance, and cleanup in your warehouse automatically. Tabular warehouses also come with modern catalog features that make life easier, like dropped table recovery.

Best of all, it takes just a few minutes to set up a warehouse and connect to the query engine of your choice — including Athena, Starburst Galaxy, Snowflake, Trino, and Apache Spark.

Built on open standards

Tabular is built on open standards to ensure your data is secure while being universally and easily accessible.

Access is managed using OAuth2 and OpenID Connect so your data is safe but easy to connect to when you need it.

Tables are powered by Apache Iceberg, the modern foundation for cloud data warehouses that makes data problems disappear — Iceberg just works the way you expect.

Frequently Asked Questions (FAQ)

Does Tabular work with my current data tools?

Yes! Tabular can be used side-by-side with Hive-like tables in your existing infrastructure. Tabular tables can be used seamlessly with your existing data catalog.

Does Tabular support streaming data?

Yes! You can use Tabular with AWS Kinesis and Apache Flink to deliver data to your warehouse with exactly-once semantics.

What is Apache Iceberg?

Iceberg is an open table format developed by Netflix and Apple that was carefully designed to make data practitioners more productive while solving the need for reliable transactions, query performance, and scalability up to 10s or 100s of petabytes.

Iceberg is supported by nearly every big data processing framework and is being adopted by SQL data warehouses because of its unique emphasis on SQL behavior and reliability. Wide adoption has made Iceberg the open standard for cloud-native analytic tables.

Can I use Tabular with . . . ?

Because Tabular is based on Iceberg, warehouses can be used with nearly any compute engine.

  • Spark
  • Trino
  • Flink
  • Hive
  • Starburst Galaxy
  • AWS Athena
  • AWS Kinesis
  • AWS EMR
  • … and more!

And with Tabular, your access control policies are applied consistently across all those engines.


If you’d like to find out whether Tabular works with a specific engine or version, please schedule a demo.

How is Tabular deployed?

Tabular is a SaaS service, currently supporting AWS.

During setup, you create an IAM role for Tabular to control access to your data. Tabular uses that role to control file-level access and universally enforce your RBAC policy.

The deployment model is isolated, auditable, and painless.

Isolated

Tabular runs in a separate AWS account.

  • No “black box” code or containers run in customer AWS accounts
  • You own and control the S3 buckets where data and metadata is stored
  • Access is strictly limited to reads, writes, and soft deletes in specific buckets
  • Tabular accesses S3 from within AWS, without going over the public internet
  • Tabular maintenance services never persist data outside of your buckets
  • Tabular follows best practices from AWS for external access

Auditable

All accesses are logged and are available for inspection.

  • S3 access logs show all data access
  • Tabular audit tables show all authorization requests, by user
  • All accesses use the same authorization path, including Tabular services

Painless

Exceptional data infrastructure, set up in minutes.