Introducing FlexQuery

Save money, improve performance, and ensure data governance with GoodData’s in-memory, 
metadata-driven analytics layer.

Request a demo Live demo + Q&A
FlexQuery is a versatile analytics engine built on Apache Arrow. It ingests data in batch or real time, federates across sources, computes in memory, and performs pre- and post-processing transformations for fast, cost-effective analytics.
Save money
  • Includes caching engine and controls by data source, user, or workspace.
  • Reduces burden and cost of source cloud data warehouses.
  • Provides levers for performance vs. cost trade-offs to meet needs.
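
The caching levers listed above can be pictured with a small sketch. This is not FlexQuery's API: the `ResultCache` class, its `set_ttl` method, and the workspace names are hypothetical, chosen only to show how a per-workspace time-to-live trades freshness against the number of billed queries that reach the source warehouse.

```python
import time
from typing import Any, Callable, Dict, Tuple


class ResultCache:
    """Toy result-set cache keyed by (workspace, query) with a per-workspace TTL.

    Hypothetical illustration only; not how FlexQuery is implemented.
    """

    def __init__(self, default_ttl: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.default_ttl = default_ttl
        self.clock = clock
        self.ttl_by_workspace: Dict[str, float] = {}
        self._entries: Dict[Tuple[str, str], Tuple[float, Any]] = {}
        self.source_hits = 0  # number of queries that reached the source

    def set_ttl(self, workspace: str, ttl: float) -> None:
        """Set how long results stay fresh for one workspace (0 = never cache)."""
        self.ttl_by_workspace[workspace] = ttl

    def get(self, workspace: str, query: str,
            run_on_source: Callable[[str], Any]) -> Any:
        key = (workspace, query)
        ttl = self.ttl_by_workspace.get(workspace, self.default_ttl)
        now = self.clock()
        entry = self._entries.get(key)
        if entry is not None and now - entry[0] < ttl:
            return entry[1]  # fresh enough: serve from cache
        self.source_hits += 1  # stale or missing: query the source
        result = run_on_source(query)
        self._entries[key] = (now, result)
        return result


# A fake clock keeps the example deterministic.
fake_now = [0.0]
cache = ResultCache(default_ttl=60.0, clock=lambda: fake_now[0])
cache.set_ttl("finance", 0.0)  # finance always queries the source directly


def run_on_source(query: str) -> str:
    return f"rows for {query}"


for _ in range(3):
    cache.get("marketing", "SELECT 1", run_on_source)  # 1 miss, then 2 hits
    cache.get("finance", "SELECT 1", run_on_source)    # 3 misses

hits_before_expiry = cache.source_hits

fake_now[0] = 61.0  # the marketing entry is now older than its 60-second TTL
cache.get("marketing", "SELECT 1", run_on_source)
hits_after_expiry = cache.source_hits
```

Six requests reach the source only four times in this sketch. Raising the TTL cuts source queries further at the price of staler results, and a TTL of zero behaves like a direct query.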

Try our FlexCache calculator

Improve performance
  • Processes large datasets and runs complex algorithms with efficiency.
  • Supports real-time analytics by balancing distributed and consolidated data.
  • Built on Apache Arrow for speedy, analytical processing.
Ensure governance
  • Integrates a metrics layer within the analytics lake’s architecture.
  • Enables the seamless federation of diverse data sources.
  • Enhances data discoverability while maintaining data integrity.

FlexQuery features

The latest FlexQuery enhancements enable near-real-time analytics, with 
additional features coming soon.

  • Query acceleration and result-set caching: Improved query performance and responsiveness.
  • Real-time data updates: Live connections and data streaming capabilities.
  • Direct query and granular cache control: Control what data is cached and how often.
  • SQL and FlightRPC interface: Access and manipulate distributed data.
  • Materialization and pre-aggregation of data: Create materialized datasets on the fly via SQL.
  • Headless BI engine: Integrate BI components without UI limitations.
  • Data source federation (coming soon): Merge materialized datasets from multiple sources.
  • AI-driven intelligent pre-caching (coming soon): Let AI identify and schedule which data to cache.
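
To make the materialization and pre-aggregation feature concrete, here is a minimal sketch of the general technique. It uses Python's built-in `sqlite3` module as a stand-in engine. The `orders` table, its columns, and the `revenue_by_region_month` name are invented for illustration, and FlexQuery's own SQL dialect and interface may differ.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE orders (region TEXT, order_date TEXT, amount REAL);
    INSERT INTO orders VALUES
        ('EMEA', '2024-01-05', 100.0),
        ('EMEA', '2024-01-20', 50.0),
        ('APAC', '2024-01-11', 75.0),
        ('APAC', '2024-02-02', 25.0);

    -- Materialize the aggregate once so later reads skip the raw rows.
    CREATE TABLE revenue_by_region_month AS
    SELECT region,
           substr(order_date, 1, 7) AS month,
           SUM(amount)              AS revenue,
           COUNT(*)                 AS order_count
    FROM orders
    GROUP BY region, substr(order_date, 1, 7);
    """
)

# Dashboards can now read the small pre-aggregated table directly.
summary = conn.execute(
    "SELECT region, month, revenue FROM revenue_by_region_month "
    "ORDER BY region, month"
).fetchall()

for region, month, revenue in summary:
    print(region, month, revenue)
```

The design point is that the expensive `GROUP BY` runs once, when the dataset is materialized, rather than on every dashboard load.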

Dive deeper into the GoodData platform

FlexQuery, built on open standards

Apache Arrow is a cutting-edge, in-memory, open-source project that supports diverse data formats.

  • Columnar storage format, enabling the efficient handling and processing of large volumes of data.
  • Optimized for live, high-speed analytics and machine learning rather than just data storage.
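
The benefit of a columnar layout can be shown without any Arrow dependency. The sketch below is an analogy built on Python's standard `array` module, not Arrow's actual API or memory format, and the sample records are invented. It stores the same data row by row and column by column, then computes the same aggregate from each.

```python
from array import array

# Row-oriented: each record is a separate object holding every field.
records = [
    {"region": "EMEA", "amount": 100.0},
    {"region": "APAC", "amount": 75.0},
    {"region": "EMEA", "amount": 50.0},
]

# Summing one field means visiting every record and picking that field out.
row_total = sum(record["amount"] for record in records)

# Column-oriented: each field becomes one sequence, and the numeric column
# is a single contiguous buffer of 8-byte floats.
columns = {
    "region": [record["region"] for record in records],
    "amount": array("d", (record["amount"] for record in records)),
}

# The aggregate scans only the column it needs.
col_total = sum(columns["amount"])

print(row_total, col_total)
```

Analytical queries usually touch a few columns across many rows, which is why a layout that keeps each column contiguous suits them.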

Learn more about the Apache Arrow project

Common questions

FlexQuery is GoodData’s in-memory, metadata-driven analytics service layer that improves performance and scalability while controlling cloud data warehouse costs. It is built on open standards and technologies such as Apache Arrow, Iceberg, and DuckDB. Using FlexQuery, analytics and business intelligence engineers federate, transform, and enrich data for dashboards, custom applications, and AI/ML use cases.

FlexQuery is the part of GoodData that makes reports really fast and makes your data developers more productive while saving money.

Yes, FlexQuery is the underlying caching and analytics layer that powers every GoodData environment. FlexQuery is available free to all GoodData customers.

At GoodData, we are committed to using open-source standards and technologies as core components of our analytics platform, and FlexQuery follows that commitment. We do this for several reasons:

  • Open-source technologies are widely adopted and understood by our customers and the broader data community.
  • Skills for open-source technologies are easier to acquire than skills for proprietary commercial BI technologies.
  • Customers get increased portability and less vendor lock-in with open-source technologies.
  • GoodData’s development teams can do more, faster, by embracing open source, so our customers receive more innovation.

Business intelligence platforms are a prime driver of cloud data warehouse costs. By using cutting-edge caching and aggregate-awareness technology, FlexQuery dramatically reduces the number and complexity of queries processed by your data warehouse. With FlexQuery, we see a 55% average reduction in BI’s contribution to data warehouse spend.

‘Analytics lake’ is the term we use to describe the combination of analytics storage, processing, semantics, and visualization offered by the GoodData platform. Unlike a data warehouse or data lake, which are structured for data retention and system-of-record reporting, the analytics lake exists to provide a single interface for all relevant analytics objects — including data, transformations, semantic information, AI/ML processing, and visualizations/reports — via common interfaces such as REST APIs, Python libraries, and SQL endpoints.

Simply put, a data warehouse or data lake stores your data in structured or unstructured formats independent of any downstream uses and serves as a source for the analytics lake, which stores and provides the processes or objects necessary to turn that data into user-facing analytics.

Please see our technical documentation about how FlexCache works here. You may also be interested in this article.
