Base vocabulary
Analytical database - Postgres database where Data sources, Datamarts, Rollups and ModelClass tables are stored.
Transactional database - Postgres database (previously MariaDB in the older infrastructure) where all other tables are stored (Master data, Company parameters, Condition records, Price lists, Quotes, Agreements, Rebates …)
What is Query API?
Available since 14.0 Caribou Lou release
Query API is a Groovy API available in all logics since 14.0 release to query the data from Pricefx. It was introduced to provide a unified access to data in Pricefx and long term wise it should be the only API for querying data.
The main driver is to support JOIN of data from two or more tables to utilize the performance of the database engine since joining data using various techniques in groovy logic in the memory of the backend was not that efficient. It can also save the number of queries made to the database.
Main features:
Supports JOIN of tables
Provide a single unified interface for querying data for both transactional as well as analytical database
Works natively with field names (analytical tables) or field labels (master and transactional tables)
Data is provided in a stream (supported in the input generation mode)
Aggregation of the data
Advanced expressions
Allows to use column aliases to retrieve column under different name
Use of joins can be a significant performance improvement. But at the same time it can also cause a huge harm if not used properly. Therefore a knowledge of how databases work is required.
Note: Joining data between analytical database and transactional database (other modules) is not possible (e.g. a product extensions with a data source). You can join tables only within the same database.
Simple Query
The main entrance point for use of QueryAPI is a method api.queryApi()
that returns a QueryAPI
interface. This interface provides access to a set of methods which you will need for querying.
Let’s look at this query which reads a single row with sku = “MB-0001” from the Product master table:
def qapi = api.queryApi() def p = qapi.tables().products() return qapi.source(p, p.sku().equal("MB-0001")) .stream { it.collect { it } }
Line 1: the QueryAPI reference is stored in q
variable for later use.
Line 3: the reference to product master table is stored in q
variable.
Line 5: the source()
method says to query table t
Line 5-8: filter expression that defines which rows to return in the result
Line 8: QueryAPI provides the result always as a stream (ResultStream
interface). So it is up to you how you will consume each individual row. In our example, all rows are collected to a List.
Let’s look on each part in the next sections.
Tables
Reference to a given table is made using a method (e.g. products()) available under QueryApi.tables()
. All those methods return Table
interface. These references are used to query the desired table.
For example:
def tables = api.queryApi().tables() def tProduct = tables.products() def tCosts = tables.productExtension("Costs") def tCountries = tables.companyParameterTable("Countries")
The table objects are used to reference columns.
Supported transactional tables:
Product
Product extension
Customer
Customer extension
Seller
Seller extension
Company parameters
Condition records
Pricelist items
Price grid items
Calculation grid items
Quote line items
Contract line items
Rebate line items
Rebate records
Compensation line items
Compensation records
Supported analytical tables:
Analytical tables are currently not supported, the plan is to support these in 15.0 release:
Data sources
Datamarts
Rollups
Model Class Tables
Columns
Columns are fields referenced from table objects. For transactional tables, those can be:
Attribute fields
System fields
Attribute fields can be used in the QueryAPI only if you configure a name and data type for that attribute field. Attribute fields are accessed as a property by the configured label, e.g. px.Cost,
cp.Country
. Therefore it is not recommended to configure attribute labels with spaces, since it makes the code worse readable, however such column names are supported. Attribute fields cannot be accessed by their system field name, e.g. p.attribute3
will not work.
System fields can be accessed using methods in the given table interface, e.g. p.sku()
, c.customerId()
, p.lastUpdateDate()
.
If you want to retrieve the column under a different name, you can use an alias method as()
, eg.:
px.Cost.as("StandardCost")
p.sku().as("Product")
Data types
In transactional database, the Groovy data type for an attribute field is determined by it’s configured Type:
String → java.lang.String
Integer → java.lang.Long
Real → java.math.BigDecimal
Date → java.time.LocalDate
Timestamp → org.joda.time.DateTime
Boolean → java.lang.Boolean
Link → java.lang.String
Entity reference → java.lang.String
QueryAPI converts the value using CAST in SQL. So if there is an issue with the data integrity (e.g. integration loaded a string value in an attribute field of type Integer) then the whole query call crashes.
Unity stores empty values of attribute fields in transactional database as an empty string. Due to the CAST above, QueryAPI performs a special CASE-WHEN check for empty string values in order to not cause crashes for empty values. E.g.:
CASE
WHEN attribute1 = '' THEN NULL
ELSE CAST(attribute1 AS LocalDate)
END
This check cause index that has attribute1
not to be used.
The check is omitted if the attribute field is marked with Required flag. The expression will be come simple cast:
CAST(attribute1 AS LocalDate)
So if you can it is recommended to mark attribute fields with Required flag whenever possible. But it has to be ensured (e.g. in integration or by boundCall) that a value cannot be empty.
In analytical database, the Groovy data type for attribute field is determined by it’s configured Data Type:
Text → java.lang.String
Integer → java.lang.Long
Money → java.math.BigDecimal
Number → java.math.BigDecimal
Date → java.time.LocalDate
DateTime → org.joda.time.DateTime
Boolean → java.lang.Boolean
Currency → java.lang.String
Quantity → java.math.BigDecimal
Source table
Source table is spcified using source()
method is telling which table will be queried. If you are familiar with SQL language, you can look at this method as “FROM” clause in SQL.
The first parameter is a mandatory one and accepts a reference to a table, the other parameters are optional. The second parameter specifies the list of columns to be returned. The third parameter is an expression that defines filter on the returned rows. So it is like a “WHERE” clause in SQL language. Since existing Filter
class was not sufficient for the purpose of QueryAPI, a new interface Expression
has been introduced.
Example:
qapi.source(t1, [t1.sku(), t1.Brand])
Returns sku and Brand fields from the table t1. No row filtering.
Example:
qapi.source(t1, [t1.sku(), t1.Brand], t1.sku().equal("MB-0001") )
Returns sku
and Brand
fields from the table t1
and filters the rows to return row with sku = “MB-0001”.
Result stream
QueryAPI provides the result always as a stream via stream(closure)
method. The closure receives ResultStream
(which extends Iterator
), as a parameter. So it is up to you to iterate over rows and tell how to consume each individual row. The processing of the row should be fast, because it is blocking the database connection. On the other hand collecting big amount of records to memory using collect { it }
and processing it later may lead to OutOfMemory exception. So the approach should be wisely evaluated for a good compromise.
Expressions
There are quite many methods available, please refer to the Javadoc.
Joining tables
Adding columns from another table is done using these methods:
leftOuterJoin()
- result contains all records from source table and matching records from the joining tableinnerJoin()
- result containes only matching records from source and joining table
Right outer join method is not implemented since the same effect can be done usingleftOuterJoin()
.
Multiple joins can be done.
The methods takes following parameters:
reference to a table
closure that returns the list of columns to be added to the result
closure that returns the joining condition as a boolean expression
It is strongly recommended that the joining conditions use all fields of the database primary index.
Example:
def date = new Date() def q = api.queryApi() def t1 = q.tables().products() def t2 = q.tables().productExtensionRows("Cost") return q.source(t1, [t1.sku(), t1.ProductGroup]) .leftOuterJoin(t2, [t2.Cost, t2.Currency], { cols -> q.exprs().and( t2.sku().equal(cols.sku), t2.ValidFrom.lessOrEqual(date), t2.ValidTo.greaterOrEqual(date), ) } ) .stream { it.collect { it } }
The query returns all product records and their cost and currency from the product extension Cost. Or null for cost and currency if the product extension record does not exist.
Removing columns
If you had to add columns via source()
, leftOuterJoin
or innerJoin
in order to make the joining condition but you do not need those in the result, you can do this using removeColumns()
method. The method takes a closure as the argument and should return a list of columns to be removed.
Example:
.removeColumns { cols -> [cols.ValidFrom, cols.ValidTo] }
Retaining columns
If you had to add columns via source()
, leftOuterJoin
or innerJoin
in order to make the joining condition but you do not need those in the result, you can do this using retainColumns()
method. The method takes a closure as the argument and should return a list of columns that should be present in the result.
Example:
.retainColumns { cols -> [cols.sku, cols.Cost] }
Additional filtering
If you made several joins using leftOuterJoin
or innerJoin
you can apply additional filtering of the result using filter()
method. The method takes a closure as the argument and should return an expression which if valuated to true, telss if the row should be present in the result.
Example:
.filter { cols -> cols.Cost.isNotNull() }
Aggregations
Example:
def qapi = api.queryApi() def t1 = qapi.tables().productExtension("Competition") def exprs = qapi.exprs() return qapi.source(t1, [t1.sku(), t1.jm Price]) .aggregateBy({ cols -> [cols.sku] }, { cols -> [cols.sku, exprs.min(cols.Price).as("MinPrice")] }) .stream { it.collect { it } }
Returns lowest competitor’s price for each SKU.
Sorting
To sort the result rows is done using sortBy()
call. The method takes a closure as an argument. The closure should return a list of orders. The orders can be one of the following:
Orders.ascNullsFirst()
Orders.ascNullsLast()
Orders.descNullsFirst()
Orders.descNullsLast()
Example:
def qapi = api.queryApi() def t1 = qapi.tables().companyParameterRows("ProductHierarchy") def orders = qapi.orders() return qapi.source(t1, [t1.ProductGroup, t1.ProductLine]) .sortBy { cols -> [orders.ascNullsLast(cols.ProductGroup), orders.ascNullsLast(cols.ProductLine)] } .stream { it.collect { it } }
Will sort the result rows by ProductGroup and then by ProductLine.
Row limit
The limit of the number of rows being returned is set by take(maxResults)
method called at the end of the pipeline and just before the stream()
call.
Example:
def qapi = api.queryApi() def p = qapi.tables().products() return qapi.source(p) .take(10) .stream { it.collect { it } }
Will return the first 10 rows.
Use of take()
is recommended when debugging the queries.
Tracing
The QueryAPI builds a SQL (for analytical database) or HQL (for transactional database) which is then executed by a database engine. To check the final SQL/HQL query built by QueryAPI, you can call the method traceQuery()
at the end of the pipeline before the stream()
call.
def qapi = api.queryApi() def p = qapi.tables().products() return qapi.source(p) .traceQuery() .stream { it.collect { it } }
It will return the final HQL/SQL query which may help you to better understand how the data will be retrieved.
Performance, Important Prerequisites
Specify number of columns
Join criteria on all key fields
Fast processing of stream row
Avoid loading big amount of rows to memory
Transactional tables:
Analytical tables:
Studio support
REST API
Currently, thre is no REST API for queries using QueryAPI. But you can create a logic that returns the data using QueryAPI since every generic logic can be executed using formulamanager.execute endpoint.
See more Custom REST API Service Using Logics
Troubleshooting
Future vision
Following methods are deprecated as of 14.0 since they are supported by the QueryAPI:
api.getItemCompleteCalculationResults()
(performance)api.productExtension()
(performance)api.customerExtension()
(performance)
More tables will be supported by the QueryAPI in the future. Therefore these methods that may be deprecated in the future:
DatamartContext.executeQuery()
DatamartContext.executeSqlQuery()
api.find()
api.findLookupTable()
api.findLookupTableValues()
api.priceGrids()
api.pricelists()
api.findApprovedPricelistItems()
api.findDataload()
api.findCalculatedFieldSets()