timechart

Table of contents

Description
Version
Syntax
Notes
Limitations
Examples
Example 1: Count events by hour
Example 2: Count events by minute with zero-filled results
Example 3: Calculate average CPU usage by minute
Example 4: Calculate average CPU usage by second and region
Example 5: Count events by second and region with zero-filled results
Example 6: Using the limit parameter with count() function
Example 7: Using limit=0 with count() to show all values
Example 8: Using useother=false with count() function
Example 9: Using limit with useother parameter and avg() function
Example 10: Handling null values in the "by" field

Description

The timechart command creates a time-based aggregation of data. It groups data by time intervals and optionally by a field, then applies an aggregation function to each group. The results are returned in an unpivoted format with separate rows for each time-field combination.
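The grouping described above can be pictured with a short Python sketch. It is an illustration only, not the engine's implementation: the helper names `floor_to_span` and `timechart_count` and the sample events are hypothetical, and only fixed-width spans (such as seconds, minutes, or hours) are handled.

```python
from collections import Counter
from datetime import datetime, timedelta

EPOCH = datetime(1970, 1, 1)

def floor_to_span(ts, span):
    """Round a timestamp down to the start of its span-sized bucket."""
    return EPOCH + ((ts - EPOCH) // span) * span

def timechart_count(events, span, by):
    """Count events per (bucket, by-value) pair; return unpivoted rows."""
    counts = Counter(
        (floor_to_span(e["@timestamp"], span), e[by]) for e in events
    )
    return sorted((bucket, value, n) for (bucket, value), n in counts.items())

# Hypothetical sample data, not taken from the events dataset.
events = [
    {"@timestamp": datetime(2024, 7, 1, 0, 0, 10), "host": "web-01"},
    {"@timestamp": datetime(2024, 7, 1, 0, 0, 45), "host": "web-01"},
    {"@timestamp": datetime(2024, 7, 1, 0, 1, 5), "host": "web-02"},
]

rows = timechart_count(events, timedelta(minutes=1), "host")
for row in rows:
    print(row)
```

Each output tuple corresponds to one row of the unpivoted result: the bucket start, the "by" value, and the aggregate for that pair.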

Version

3.3.0

Syntax

timechart [span=<time_interval>] [limit=<number>] [useother=<boolean>] <aggregation_function> [by <field>]

Parameters:

span: optional. Specifies the time interval for grouping data.
- Default: 1m (1 minute)
- Available time units:
  - millisecond (ms)
  - second (s)
  - minute (m, case sensitive)
  - hour (h)
  - day (d)
  - week (w)
  - month (M, case sensitive)
  - quarter (q)
  - year (y)
limit: optional. Specifies the maximum number of distinct values to display when using the "by" clause.
- Default: 10
- When there are more distinct values than the limit, the additional values are grouped into an "OTHER" category unless useother is set to false.
- The top N values are determined by summing the aggregation values across all time intervals for each distinct field value. The N values with the highest sums are displayed individually, and the rest are grouped into the "OTHER" category.
- Set to 0 to show all distinct values without any limit (when limit=0, useother is automatically set to false).
- The span, limit, and useother parameters can be specified in any order before the aggregation function.
- Only applies when using the "by" clause to group results.
useother: optional. Controls whether to create an "OTHER" category for values beyond the limit.
- Default: true
- When set to false, only the top N values (based on limit) are shown, without an "OTHER" category.
- When set to true, values beyond the limit are grouped into an "OTHER" category.
- Only applies when using the "by" clause and when there are more distinct values than the limit.
aggregation_function: mandatory. The aggregation function to apply to each time bucket.
- Currently, only a single aggregation function is supported.
- Available functions: All aggregation functions supported by the stats command are supported.
by: optional. Groups the results by the specified field in addition to time intervals.
- If not specified, the aggregation is performed across all documents in each time interval.
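The case-sensitive unit suffixes for span are easy to mix up, so here is a minimal Python sketch of how a span string could be split into a number and a unit. The `parse_span` helper is hypothetical and is not part of PPL. It matches every suffix exactly as listed above, which is stricter than the documentation requires for units other than m and M.

```python
import re

# Unit suffixes as listed in the span parameter description.
SPAN_UNITS = {
    "ms": "millisecond", "s": "second", "m": "minute", "h": "hour",
    "d": "day", "w": "week", "M": "month", "q": "quarter", "y": "year",
}

def parse_span(span):
    """Split a span such as '5m' into (5, 'minute'); reject unknown units."""
    match = re.fullmatch(r"(\d+)([A-Za-z]+)", span)
    if not match or match.group(2) not in SPAN_UNITS:
        raise ValueError(f"unsupported span: {span!r}")
    return int(match.group(1)), SPAN_UNITS[match.group(2)]

print(parse_span("1m"))  # lowercase m is minute
print(parse_span("1M"))  # uppercase M is month
```

The practical point is that span=1m and span=1M name very different bucket sizes.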

Notes

The timechart command requires a timestamp field named @timestamp in the data.
Results are returned in an unpivoted format with separate rows for each time-field combination that has data.
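For aggregations such as avg(), only combinations with actual data are included in the results; empty combinations are omitted rather than shown as null or zero (see Example 4). For count(), the examples below show missing combinations as rows with a count of 0 (see Examples 2, 5, and 6).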
The "top N" values for the limit parameter are selected based on the sum of values across all time intervals for each distinct field value.
When using the limit parameter, values beyond the limit are grouped into an "OTHER" category (unless useother=false).
Examples 6 and 7 use different datasets: Example 6 uses the events dataset with fewer hosts for simplicity, while Example 7 uses the events_many_hosts dataset with 11 distinct hosts.
Null values: Documents with null values in the "by" field are treated as a separate category and appear as null in the results.
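The top-N selection described in these notes can be sketched in Python as follows. This is a reading of the documented behavior, not the engine's code: the `apply_limit` helper and the sample rows are hypothetical, tie-breaking between equal sums is unspecified here, and combining the grouped values by summing them into "OTHER" is an assumption.

```python
from collections import defaultdict

def apply_limit(rows, limit=10, useother=True):
    """rows: (timestamp, by_value, aggregate) tuples in unpivoted form."""
    if limit == 0:
        return list(rows)  # limit=0 shows every value and implies no OTHER
    # Rank each by-value by the sum of its aggregates over all intervals.
    totals = defaultdict(float)
    for _, value, agg in rows:
        totals[value] += agg
    top = set(sorted(totals, key=totals.get, reverse=True)[:limit])
    kept, other = [], defaultdict(float)
    for ts, value, agg in rows:
        if value in top:
            kept.append((ts, value, agg))
        elif useother:
            other[ts] += agg  # assumption: grouped values are summed
    kept.extend((ts, "OTHER", total) for ts, total in other.items())
    return sorted(kept)

# Hypothetical rows: web-01 totals 9, web-02 totals 3, db-01 and db-02 total 2 each.
rows = [
    ("00:00", "web-01", 5), ("00:00", "web-02", 3),
    ("00:00", "db-01", 1), ("00:00", "db-02", 2),
    ("00:01", "web-01", 4), ("00:01", "db-01", 1),
]

with_other = apply_limit(rows, limit=2)
without_other = apply_limit(rows, limit=2, useother=False)
print(with_other)
print(without_other)
```

With limit=2, web-01 and web-02 are kept and the two db hosts are folded into "OTHER"; with useother=False the db hosts are dropped instead.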

Limitations

Only a single aggregation function is supported per timechart command.
The bins parameter and other bin options are not supported since the bin command is not implemented yet. Use the span parameter to control time intervals.

Examples

Example 1: Count events by hour

This example counts events for each hour and groups them by host.

PPL query:

PPL> source=events | timechart span=1h count() by host

Result:

+---------------------+--------+-------+
| @timestamp          | host   | count |
+---------------------+--------+-------+
| 2024-07-01 00:00:00 | db-01  | 1     |
| 2024-07-01 00:00:00 | web-01 | 2     |
| 2024-07-01 00:00:00 | web-02 | 2     |
+---------------------+--------+-------+

Example 2: Count events by minute with zero-filled results

This example counts events for each minute and groups them by host, showing zero values for time-host combinations with no data.

PPL query:

PPL> source=events | timechart span=1m count() by host

Result:

+---------------------+--------+-------+
| @timestamp          | host   | count |
+---------------------+--------+-------+
| 2024-07-01 00:00:00 | web-01 | 1     |
| 2024-07-01 00:00:00 | web-02 | 0     |
| 2024-07-01 00:00:00 | db-01  | 0     |
| 2024-07-01 00:01:00 | web-01 | 0     |
| 2024-07-01 00:01:00 | web-02 | 1     |
| 2024-07-01 00:01:00 | db-01  | 0     |
| 2024-07-01 00:02:00 | web-01 | 1     |
| 2024-07-01 00:02:00 | web-02 | 0     |
| 2024-07-01 00:02:00 | db-01  | 0     |
| 2024-07-01 00:03:00 | web-01 | 0     |
| 2024-07-01 00:03:00 | web-02 | 0     |
| 2024-07-01 00:03:00 | db-01  | 1     |
| 2024-07-01 00:04:00 | web-01 | 0     |
| 2024-07-01 00:04:00 | web-02 | 1     |
| 2024-07-01 00:04:00 | db-01  | 0     |
+---------------------+--------+-------+
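The zero-filled shape of this result can be reproduced with a small Python sketch. The `zero_fill` helper and its input rows are hypothetical. The sketch also sorts the output, which is an assumption and does not match the row order shown in the table above.

```python
from itertools import product

def zero_fill(rows):
    """rows: (timestamp, by_value, count) tuples; add 0 rows for missing pairs."""
    counts = {(ts, value): n for ts, value, n in rows}
    timestamps = sorted({ts for ts, _ in counts})
    values = sorted({value for _, value in counts})
    # Emit one row for every timestamp/value pair, defaulting to 0.
    return [(ts, v, counts.get((ts, v), 0)) for ts, v in product(timestamps, values)]

# Hypothetical sparse counts: each host appears in only one interval.
sparse = [("00:00", "web-01", 1), ("00:01", "web-02", 1)]
filled = zero_fill(sparse)
for row in filled:
    print(row)
```

Only timestamps and values that occur somewhere in the input are combined; intervals with no events at all are not invented.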

Example 3: Calculate average CPU usage by minute

This example calculates the average CPU usage for each minute without grouping by any field.

PPL query:

PPL> source=events | timechart span=1m avg(cpu_usage)

Result:

+---------------------+------------------+
| @timestamp          | avg(cpu_usage)   |
+---------------------+------------------+
| 2024-07-01 00:00:00 | 45.2             |
| 2024-07-01 00:01:00 | 38.7             |
| 2024-07-01 00:02:00 | 55.3             |
| 2024-07-01 00:03:00 | 42.1             |
| 2024-07-01 00:04:00 | 41.8             |
+---------------------+------------------+

Example 4: Calculate average CPU usage by second and region

This example calculates the average CPU usage for each second and groups the results by region.

PPL query:

PPL> source=events | timechart span=1s avg(cpu_usage) by region

Result:

+---------------------+---------+------------------+
| @timestamp          | region  | avg(cpu_usage)   |
+---------------------+---------+------------------+
| 2024-07-01 00:00:00 | us-east | 45.2             |
| 2024-07-01 00:01:00 | us-west | 38.7             |
| 2024-07-01 00:02:00 | us-east | 55.3             |
| 2024-07-01 00:03:00 | eu-west | 42.1             |
| 2024-07-01 00:04:00 | us-west | 41.8             |
+---------------------+---------+------------------+

Example 5: Count events by second and region with zero-filled results

This example counts events for each second and groups them by region, showing zero values for time-region combinations with no data.

PPL query:

PPL> source=events | timechart span=1s count() by region

Result:

+---------------------+---------+-------+
| @timestamp          | region  | count |
+---------------------+---------+-------+
| 2024-07-01 00:00:00 | us-east | 1     |
| 2024-07-01 00:00:00 | us-west | 0     |
| 2024-07-01 00:00:00 | eu-west | 0     |
| 2024-07-01 00:01:00 | us-east | 0     |
| 2024-07-01 00:01:00 | us-west | 1     |
| 2024-07-01 00:01:00 | eu-west | 0     |
| 2024-07-01 00:02:00 | us-east | 1     |
| 2024-07-01 00:02:00 | us-west | 0     |
| 2024-07-01 00:02:00 | eu-west | 0     |
| 2024-07-01 00:03:00 | us-east | 0     |
| 2024-07-01 00:03:00 | us-west | 0     |
| 2024-07-01 00:03:00 | eu-west | 1     |
| 2024-07-01 00:04:00 | us-east | 0     |
| 2024-07-01 00:04:00 | us-west | 1     |
| 2024-07-01 00:04:00 | eu-west | 0     |
+---------------------+---------+-------+

Example 6: Using the limit parameter with count() function

When there are many distinct values in the "by" field, the timechart command will display the top values based on the limit parameter and group the rest into an "OTHER" category. This query will display the top 2 hosts with the highest count values, and group the remaining hosts into an "OTHER" category.

PPL query:

PPL> source=events | timechart span=1m limit=2 count() by host

Result:

+---------------------+--------+-------+
| @timestamp          | host   | count |
+---------------------+--------+-------+
| 2024-07-01 00:00:00 | web-01 | 1     |
| 2024-07-01 00:00:00 | web-02 | 0     |
| 2024-07-01 00:00:00 | OTHER  | 0     |
| 2024-07-01 00:01:00 | web-01 | 0     |
| 2024-07-01 00:01:00 | web-02 | 1     |
| 2024-07-01 00:01:00 | OTHER  | 0     |
| 2024-07-01 00:02:00 | web-01 | 1     |
| 2024-07-01 00:02:00 | web-02 | 0     |
| 2024-07-01 00:02:00 | OTHER  | 0     |
| 2024-07-01 00:03:00 | web-01 | 0     |
| 2024-07-01 00:03:00 | web-02 | 0     |
| 2024-07-01 00:03:00 | OTHER  | 1     |
| 2024-07-01 00:04:00 | web-01 | 0     |
| 2024-07-01 00:04:00 | web-02 | 1     |
| 2024-07-01 00:04:00 | OTHER  | 0     |
+---------------------+--------+-------+

Example 7: Using limit=0 with count() to show all values

To display all distinct values without any limit, set limit=0:

PPL query:

PPL> source=events_many_hosts | timechart span=1h limit=0 count() by host

Result:

+---------------------+--------+-------+
| @timestamp          | host   | count |
+---------------------+--------+-------+
| 2024-07-01 00:00:00 | web-01 | 1     |
| 2024-07-01 00:00:00 | web-02 | 1     |
| 2024-07-01 00:00:00 | web-03 | 1     |
| 2024-07-01 00:00:00 | web-04 | 1     |
| 2024-07-01 00:00:00 | web-05 | 1     |
| 2024-07-01 00:00:00 | web-06 | 1     |
| 2024-07-01 00:00:00 | web-07 | 1     |
| 2024-07-01 00:00:00 | web-08 | 1     |
| 2024-07-01 00:00:00 | web-09 | 1     |
| 2024-07-01 00:00:00 | web-10 | 1     |
| 2024-07-01 00:00:00 | web-11 | 1     |
+---------------------+--------+-------+

This shows all 11 hosts as separate rows without an "OTHER" category.

Example 8: Using useother=false with count() function

This example keeps the default limit of 10 and sets useother=false, so only the top 10 of the 11 hosts are shown and no "OTHER" category is added:

PPL query:

PPL> source=events_many_hosts | timechart span=1h useother=false count() by host

Result:

+---------------------+--------+-------+
| @timestamp          | host   | count |
+---------------------+--------+-------+
| 2024-07-01 00:00:00 | web-01 | 1     |
| 2024-07-01 00:00:00 | web-02 | 1     |
| 2024-07-01 00:00:00 | web-03 | 1     |
| 2024-07-01 00:00:00 | web-04 | 1     |
| 2024-07-01 00:00:00 | web-05 | 1     |
| 2024-07-01 00:00:00 | web-06 | 1     |
| 2024-07-01 00:00:00 | web-07 | 1     |
| 2024-07-01 00:00:00 | web-08 | 1     |
| 2024-07-01 00:00:00 | web-09 | 1     |
| 2024-07-01 00:00:00 | web-10 | 1     |
+---------------------+--------+-------+

Example 9: Using limit with useother parameter and avg() function

Limit to top 3 hosts with OTHER category (default useother=true):

PPL query:

PPL> source=events_many_hosts | timechart span=1h limit=3 avg(cpu_usage) by host

Result:

+---------------------+--------+------------------+
| @timestamp          | host   | avg(cpu_usage)   |
+---------------------+--------+------------------+
| 2024-07-01 00:00:00 | web-03 | 55.3             |
| 2024-07-01 00:00:00 | web-07 | 48.6             |
| 2024-07-01 00:00:00 | web-09 | 67.8             |
| 2024-07-01 00:00:00 | OTHER  | 330.4            |
+---------------------+--------+------------------+

Limit to top 3 hosts without OTHER category (useother=false):

PPL query:

PPL> source=events_many_hosts | timechart span=1h limit=3 useother=false avg(cpu_usage) by host

Result:

+---------------------+--------+------------------+
| @timestamp          | host   | avg(cpu_usage)   |
+---------------------+--------+------------------+
| 2024-07-01 00:00:00 | web-03 | 55.3             |
| 2024-07-01 00:00:00 | web-07 | 48.6             |
| 2024-07-01 00:00:00 | web-09 | 67.8             |
+---------------------+--------+------------------+

Example 10: Handling null values in the "by" field

This example shows how null values in the "by" field are treated as a separate category. The dataset events_null has 1 entry that does not have a host field.

PPL query:

PPL> source=events_null | timechart span=1h count() by host

Result:

+---------------------+--------+-------+
| @timestamp          | host   | count |
+---------------------+--------+-------+
| 2024-07-01 00:00:00 | db-01  | 1     |
| 2024-07-01 00:00:00 | web-01 | 2     |
| 2024-07-01 00:00:00 | web-02 | 2     |
| 2024-07-01 00:00:00 | null   | 1     |
+---------------------+--------+-------+
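As a rough analogy in Python, grouping documents on a field that is sometimes absent naturally produces a separate group for the missing value. The documents below are hypothetical and are not the events_null dataset.

```python
from collections import Counter

# Hypothetical documents; the last one has no host field at all.
docs = [{"host": "web-01"}, {"host": "web-01"}, {"host": "db-01"}, {}]

# dict.get returns None for a missing key, so None becomes its own group.
counts = Counter(doc.get("host") for doc in docs)

for host, n in counts.items():
    print("null" if host is None else host, n)
```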
