Creating datasets for Trickest Solutions is available exclusively to Enterprise users. If you are interested in learning more about the Enterprise Edition, please contact us.

Overview

Datasets are the structured storage layer for your Solution results. They define the schema that determines how data is organized, queried, and displayed in Insights. A well-designed dataset schema is critical for effective data analysis and change tracking.
Building a Complete Solution? If you’re creating a custom solution from scratch, see the Custom Solutions guide for an end-to-end tutorial that covers solution creation, datasets, and workflow building.

Understanding Dataset Fields

Each dataset field (key) consists of several components:
Component   | Purpose                                                | Options
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
Default     | Marks primary/key fields that identify unique records | Toggle on/off
Icon        | Visual identifier for the field type                  | Flag, text, database, chart, etc.
Key Name    | Field identifier used in queries and exports          | Lowercase with underscores (e.g., hostname, vulnerability_id)
Type        | Data type for validation and formatting               | text, int, data, bool, uuid, datetime
Description | Human-readable explanation of the field               | Helps users understand field purpose

Field Types

text      - String values (URLs, hostnames, descriptions)
int       - Integer numbers (ports, counts, severity scores)
data      - Binary or large text data (raw responses, files)
bool      - True/false values (is_active, has_vulnerability)
uuid      - Unique identifiers (record IDs, correlation IDs)
datetime  - Timestamps (discovered_at, last_seen, scanned_at)
Default fields serve as the primary key for your dataset. Mark fields that uniquely identify records (e.g., endpoint_url + http_method for APIs, or hostname for assets). At least one field must be marked as default.

Creating a Dataset

1

Navigate to Insights

Open your Solution and go to the Insights tab.
You can create a dataset immediately after creating the solution.
2

Click 'Create Dataset'

If this is your first dataset, you’ll see an empty state with a Create Dataset button.
3

Name the Dataset

Choose a descriptive name that reflects the data being stored (e.g., “API Endpoints”, “Discovered Assets”, “Vulnerabilities”).
4

Define Fields

Click Add key to add fields one by one. Configure each field’s icon, name, type, and description.
5

Mark Default Fields

Toggle Default on for fields that form the primary key. At least one field must be marked as default.
6

Validate and Create

Ensure your schema is valid (no duplicate key names, at least one default field). Click Create Dataset.
Schema Validation: The system validates your schema before creation. Common errors include:
  • Missing default fields
  • Duplicate key names
  • Invalid characters in key names (use lowercase, numbers, underscores only)
  • Missing required fields (icon, type, description)

Example: API Endpoints Dataset

Here’s a complete example of a dataset schema for tracking API endpoints:
Default | Icon | Key Name          | Type     | Description
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
   ✓    |  🔗  | url               | text     | Full endpoint URL
   ✓    |  🔨  | method            | text     | HTTP method (GET, POST, etc.)
   ✓    |  📖  | body_parameters   | data     | Request body parameters
        |  📝  | api_title         | text     | API title from documentation
        |  📍  | source            | text     | Discovery source URL
        |  📄  | content_type      | text     | Request payload format
        |  🌐  | hostname          | text     | URL hostname
        |  🏠  | domain_name       | text     | Registered domain
Key Design Choices:
  • Primary Key: url + method + body_parameters uniquely identify each endpoint variant
  • Metadata Fields: api_title, source, content_type provide context
  • Hierarchical Data: hostname and domain_name enable domain-level filtering
  • Data Type: body_parameters uses data type to store complex structures

Creating Datasets via API

You can create datasets programmatically using the Trickest API.

Getting Your Vault UUID

First, retrieve your vault UUID by calling the user info endpoint:
curl -X GET https://api.trickest.io/hive/v1/users/me/ \
  -H "Authorization: Token YOUR_API_TOKEN"
The response includes vault_info.id, which is your {vault_uuid}:
{
  "profile": {
    "vault_info": {
      "id": "b12a3fda-1161-4bd2-9549-d6bda39d59b0",
      "name": "your-vault-name",
      ...
    }
  }
}
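
If you have jq installed, you can pull the vault UUID out of that response directly. This is a convenience sketch, not a required step:
# Print only the vault UUID from the /users/me/ response (requires jq)
curl -s -X GET https://api.trickest.io/hive/v1/users/me/ \
  -H "Authorization: Token YOUR_API_TOKEN" \
  | jq -r '.profile.vault_info.id'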

API Endpoint

POST https://api.trickest.io/solutions/v1/{vault_uuid}/dataset
Replace {vault_uuid} with your vault UUID from the /users/me/ endpoint and {solution_id} with your Solution ID (found in the URL when viewing your solution).

Request Example

curl -X POST https://api.trickest.io/solutions/v1/{vault_uuid}/dataset \
  -H "Content-Type: application/json" \
  -H "Authorization: Token YOUR_API_TOKEN" \
  -d '{
    "solution": "{solution_id}",
    "name": "API Endpoints",
    "schema": {
      "fields": [
        {
          "name": "url",
          "description": "Full endpoint URL",
          "is_key": true,
          "icon": "square-arrow-out-up-right",
          "type": "text"
        },
        {
          "name": "api_title",
          "description": "The title of the API according to the documentation if available",
          "is_key": false,
          "icon": "letter-text",
          "type": "text"
        },
        {
          "name": "source",
          "description": "The source where the endpoint was discovered",
          "is_key": false,
          "icon": "anchor",
          "type": "text"
        },
        {
          "name": "method",
          "description": "The HTTP method used to call the endpoint",
          "is_key": true,
          "icon": "axe",
          "type": "text"
        },
        {
          "name": "body_parameters",
          "description": "Request body parameters associated with the endpoint",
          "is_key": true,
          "icon": "book-open",
          "type": "data"
        },
        {
          "name": "content_type",
          "description": "The format of the request payload",
          "is_key": false,
          "icon": "code-xml",
          "type": "text"
        },
        {
          "name": "hostname",
          "description": "The hostname where the URL is located",
          "is_key": false,
          "icon": "arrow-big-right-dash",
          "type": "text"
        },
        {
          "name": "domain_name",
          "description": "The registered domain associated with the hostname",
          "is_key": false,
          "icon": "arrow-big-right",
          "type": "text"
        }
      ]
    }
  }'

Field Properties

Property    | Required | Description
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
name        | Yes      | Field identifier (lowercase with underscores)
description | Yes      | Human-readable explanation
is_key      | Yes      | true for primary key fields, false otherwise
icon        | Yes      | Icon identifier for visual representation
type        | Yes      | Data type: text, int, data, bool, uuid, or datetime
At least one field must have is_key: true to serve as the primary key. Primary key fields uniquely identify records and enable change tracking.
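Putting these properties together, a single field entry in the schema looks like the following (a minimal sketch; the icon name is illustrative):
{
  "name": "discovered_at",
  "description": "Timestamp of when the record was first collected",
  "is_key": false,
  "icon": "calendar",
  "type": "datetime"
}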

Dataset Design Best Practices

Use Semantic Naming

Name fields clearly: hostname not h, vulnerability_severity not sev. This improves query readability and team understanding.

Include Timestamps

Always add discovered_at, last_seen, or scanned_at fields to track when data was collected. Essential for change detection.

Add Status Fields

Include a _status field (text) to track the record lifecycle: new, active, removed, resurfaced. Enables powerful filtering in Insights.

Plan for Correlation

Add correlation fields (IDs, hostnames, IPs) that allow joining with other datasets or external systems.
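The schema fragment below combines these practices in one place. It is a hedged sketch: the field and icon names are examples rather than requirements.
"fields": [
  {"name": "hostname",      "description": "Asset hostname, also used to correlate with other datasets", "is_key": true,  "icon": "globe",    "type": "text"},
  {"name": "discovered_at", "description": "When the record was first collected",                        "is_key": false, "icon": "calendar", "type": "datetime"},
  {"name": "last_seen",     "description": "Most recent time the record was observed",                   "is_key": false, "icon": "clock",    "type": "datetime"},
  {"name": "_status",       "description": "Record lifecycle: new, active, removed, or resurfaced",      "is_key": false, "icon": "flag",     "type": "text"}
]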

Common Schema Patterns

Web Assets:
url (text, key) + hostname (text) + ip_address (text) + status_code (int)
Vulnerabilities:
cve_id (text, key) + affected_asset (text, key) + severity (text) + cvss_score (int)
Subdomains:
subdomain (text, key) + ip_addresses (data) + discovered_at (datetime) + _status (text)
Network Services:
ip_address (text, key) + port (int, key) + service_name (text) + banner (data)
API Endpoints:
url (text, key) + method (text, key) + body_parameters (data, key) + content_type (text)
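As a concrete illustration, the Network Services pattern could be created through the API roughly as follows. This is a sketch; the icon names and descriptions are placeholders to adjust for your own data:
curl -X POST https://api.trickest.io/solutions/v1/{vault_uuid}/dataset \
  -H "Content-Type: application/json" \
  -H "Authorization: Token YOUR_API_TOKEN" \
  -d '{
    "solution": "{solution_id}",
    "name": "Network Services",
    "schema": {
      "fields": [
        {"name": "ip_address",   "description": "IP address hosting the service", "is_key": true,  "icon": "globe",    "type": "text"},
        {"name": "port",         "description": "Port the service listens on",    "is_key": true,  "icon": "hash",     "type": "int"},
        {"name": "service_name", "description": "Detected service name",          "is_key": false, "icon": "server",   "type": "text"},
        {"name": "banner",       "description": "Raw service banner",             "is_key": false, "icon": "database", "type": "data"}
      ]
    }
  }'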

Data Quality Best Practices

  • Validation: Add validation nodes in your workflow to check data format before dataset insertion
  • Deduplication: Remove duplicates at every merge point to keep datasets clean
  • Normalization: Standardize formats (lowercase domains, trim whitespace, consistent date formats); see the sketch after this list
  • Error Handling: Log errors to separate outputs for debugging without breaking the workflow
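A minimal shell sketch of the deduplication and normalization steps, assuming a newline-delimited hostname list as input:
# Lowercase, trim surrounding whitespace, and drop duplicate lines
# before the results are inserted into the dataset.
tr '[:upper:]' '[:lower:]' < raw_hostnames.txt \
  | sed 's/^[[:space:]]*//;s/[[:space:]]*$//' \
  | sort -u > clean_hostnames.txt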

Connecting Workflow Outputs to Datasets

After creating your dataset, you need to connect your workflow outputs to populate it with data.
1

Select Output Node

In the workflow Builder, identify which node produces the final results you want in your dataset.
2

Configure Dataset Connection

In the node settings, select your target dataset from the dropdown.
3

Map Fields

Ensure output field names match dataset key names exactly. If they don't match, use transformation scripts or the Transform Data module (see the sketch after these steps).
4

Test

Run the workflow and verify data appears correctly in the dataset with proper field mapping.
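For the field-mapping step, if a node emits JSON whose keys don't match the dataset schema, a small transformation can rename them before insertion. A minimal jq sketch, assuming the node emits one JSON object per line and uses hypothetical fields host and first_seen that should map to the dataset keys hostname and discovered_at:
# Rename mismatched output fields to the dataset key names (hypothetical field names)
jq -c '{hostname: .host, discovered_at: .first_seen}' node_output.jsonl > dataset_ready.jsonl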

Troubleshooting

Issue: Dataset creation fails with schema validation errors.
Common Errors:
  • Invalid field name: Use lowercase, numbers, underscores only
  • Duplicate key: Each field name must be unique
  • No default field: At least one field must be marked as default (or is_key: true in API)
  • Missing required fields: All fields need icon, type, and description
Solutions:
  • Rename fields to follow naming conventions: api_endpoint not API-Endpoint
  • Remove or rename duplicate keys
  • Toggle Default on for primary key fields
  • Fill in all field properties before creating
Issue: No data appears in the dataset after a workflow run.
Possible Causes:
  • Workflow run incomplete
  • Output nodes not connected to dataset
  • Field mapping mismatch between node output and dataset schema
  • Data filtered out by queries
Solutions:
  • Verify run completed successfully in Run tab
  • Check node-to-dataset connections in Builder
  • Review field names match exactly (case-sensitive)
  • Remove filters to see all data
Issue: Data doesn't display correctly in Insights.
Cause: Output data type doesn't match dataset field type.
Solutions:
  • Use transformation scripts to convert types (string to int, date parsing)
  • Update dataset schema to match actual data types
  • Add validation nodes to catch type errors before dataset insertion

Next Steps
