Skip to main content
Version: Next

Getting Started with Data Modeling in Synmetrix

Synmetrix employs the Cube data model framework to turn raw data into strategic business insights and to pre-aggregate data for optimal querying efficiency. This data model is accessible through Synmetrix's querying API, enabling users to perform a wide array of analytical queries without needing to modify the data model directly.

Consider an example using a users table with the following structure:

idpayingcitycompany_name
1trueSan FranciscoPied Piper
2truePalo AltoRaviga
3trueRedwoodAviato
4falseMountain ViewBream-Hall
5falseSanta CruzHooli

From this data, we seek answers to several questions regarding our user base:

  • Total number of users
  • Number of paying users
  • Percentage of paying users compared to the overall user count
  • Distribution of users, regardless of payment status, across different cities and companies

In Synmetrix, leveraging the Cube data model allows us to avoid crafting individual SQL queries for each question, facilitating the creation of organized and reusable SQL queries.

1. Creating a Cube in Synmetrix

Cubes in Synmetrix, structured around the Cube framework, serve to organize data entities and their interrelations. Typically, a cube corresponds to a database table, such as users, orders, or products. We define a base table for each cube using the sql_table parameter. For our case, the base table is the users table.

YAML

cubes:
- name: Users
sql_table: users

JavaScript

cube(`Users`, {
sql_table: `users`,
});

2. Integrating Measures and Dimensions

After defining the base table, the next step involves adding measures and dimensions to the cube. Measures refer to quantitative data, like sales volume or user count, while dimensions categorize data, such as by city or company name.

Let's add our first measure and dimensions to illustrate:

YAML

cubes:
- name: Users
sql: SELECT * FROM users

measures:
- name: TotalCount
sql: id
type: count

dimensions:
- name: City
sql: city
type: string

- name: CompanyName
sql: company_name
type: string

JavaScript

cube(`Users`, {
sql: `SELECT * FROM users`,

measures: {
TotalCount: {
sql: `id`,
type: `count`,
},
},

dimensions: {
City: {
sql: `city`,
type: `string`,
},

CompanyName: {
sql: `company_name`,
type: `string`,
},
},
});

3. Applying Filters to Measures

To discern the number of paying users, Synmetrix introduces measure filters, allowing for specific data segmentation:

YAML

cubes:
- name: Users

measures:
- name: PayingUsersCount
sql: id
type: count
filters:
- sql: "{CUBE}.paying = true"

# Other definitions...

JavaScript

cube(`Users`, {
measures: {
PayingUsersCount: {
sql: `id`,
type: `count`,
filters: [{ sql: `${CUBE}.paying = true` }],
},
},

// Other definitions...
});

4. Crafting Calculated Measures

For inquiries like the percentage of paying users relative to the total, Synmetrix simplifies the creation of calculated measures. This enables the comparison of PayingUsersCount to TotalCount:

YAML

cubes:
- name: Users

measures:
- name: PayingUsersPercentage
sql: "100.0 * {PayingUsersCount} / NULLIF({TotalCount}, 0)"
type: number
format: percent

# Additional configurations...

JavaScript

cube(`Users`, {
measures: {
PayingUsersPercentage: {
sql: `100.0 * ${PayingUsersCount} / NULLIF(${TotalCount}, 0)`,
type: `number`,
format: `percent`,
},
},

// Additional configurations...
});

In this approach, Synmetrix not only streamlines the process of data modeling but also enhances the analytical capabilities available to end-users, making it easier to derive meaningful insights from complex data sets.

5. Next Steps

  1. Data models documentation
  2. Data models reference
  3. Query format