What’s New

    While 3.0 is still a ways off, we’ll be pushing some of the new features into a new branch of the repo. Some are in progress and other features are planned. If you have any features that you want to see, let us know.

    • Distributed Queries - Based on the great work of Turn, we have a distributed query layer that splits queries among multiple TSDs for greater throughput.

    • Query Caching - Improve queries with time-sharded caching of results.

    • Improved Expressions - Perform group by, downsampling and arithmetic modifications in any order. Potentially support UDFs as well.

    • Anomaly Processing/Forecasting - Integrate with modeling libraries (such as EGADs) for deeper time series analysis.
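
As a rough illustration of the time-sharded caching idea above, the sketch below splits a query range into fixed-width shards and caches each shard separately, so overlapping queries reuse shards that were already fetched. This is not the OpenTSDB implementation: the `ShardedQueryCache` class, the `fetch` callback and the one-hour shard width are all assumptions made for the example.

```python
from typing import Callable, Dict, List, Tuple

Point = Tuple[int, float]  # (Unix timestamp in seconds, value)


def shard_starts(start: int, end: int, shard_seconds: int) -> List[int]:
    """Return the aligned start time of every shard overlapping [start, end)."""
    first = start - (start % shard_seconds)
    return list(range(first, end, shard_seconds))


class ShardedQueryCache:
    """Hypothetical cache that stores query results one time shard at a time."""

    def __init__(self, fetch: Callable[[int, int], List[Point]],
                 shard_seconds: int = 3600) -> None:
        self._fetch = fetch                  # assumed backend call: fetch(start, end)
        self._shard_seconds = shard_seconds  # assumed shard width of one hour
        self._shards: Dict[int, List[Point]] = {}
        self.misses = 0                      # number of shards fetched from the backend

    def query(self, start: int, end: int) -> List[Point]:
        """Return points in [start, end), fetching only shards not yet cached."""
        result: List[Point] = []
        for shard in shard_starts(start, end, self._shard_seconds):
            if shard not in self._shards:
                self.misses += 1
                self._shards[shard] = self._fetch(shard, shard + self._shard_seconds)
            # Trim the first and last shards to the requested range.
            result.extend(p for p in self._shards[shard] if start <= p[0] < end)
        return result
```

A second query that falls inside an already cached range is answered without calling `fetch` again. A real cache would also need expiry for shards that can still receive new data.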

    2.4

    • Rollup/Pre-Aggregates - Support for storing and querying time-based rolled up data and/or pre-aggregated values.

    • Distributed Percentile - Store histograms (or sketches) for calculating proper percentiles over multiple sources.

    • Expressions - Query time computations using time series data. For example, dividing one metric by another.

    • Graphite Style Functions - Additional filtering and mutation of data at query time using Graphite style functions.

    • Calendar Based Downsampling - The ability to align downsampled data on Gregorian calendar boundaries.

    • Bigtable Support - Run TSDB in the cloud using Google’s hosted Bigtable service.

    • Cassandra Support - Support for running OpenTSDB on legacy Cassandra clusters.

    • Write Filters - Block or allow time series or UID assignments based on plugins or whitelists.

    • New Aggregators - None for returning raw data. First and Last to return the first or last data points during downsampling.

    • Startup Plugins - APIs to help with service discovery on TSD startup.

    • Example Java API usage classes.
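
To show why the distributed percentile feature above stores histograms rather than precomputed percentiles, here is a minimal sketch that merges per-source histograms and reads a percentile from the merged counts. The bucket layout (upper bound in milliseconds mapped to a count) and the function names are assumptions for illustration, not OpenTSDB's storage format or API.

```python
from collections import Counter
from typing import Dict, Iterable

Histogram = Dict[int, int]  # bucket upper bound (ms) -> number of observations


def merge_histograms(histograms: Iterable[Histogram]) -> Histogram:
    """Sum the bucket counts of several histograms that share bucket bounds."""
    merged: Counter = Counter()
    for histogram in histograms:
        merged.update(histogram)  # Counter.update adds counts per key
    return dict(merged)


def percentile(histogram: Histogram, pct: int) -> int:
    """Return the upper bound of the bucket that contains the pct-th percentile."""
    total = sum(histogram.values())
    if total == 0:
        raise ValueError("empty histogram")
    running = 0
    for bound in sorted(histogram):
        running += histogram[bound]
        # Integer comparison avoids floating point rounding at bucket edges.
        if running * 100 >= pct * total:
            return bound
    return max(histogram)


# Two hypothetical hosts with very different latency distributions.
host_a = {10: 90, 100: 10}
host_b = {10: 10, 100: 50, 1000: 40}
merged = merge_histograms([host_a, host_b])
```

With these inputs the 95th percentile of `host_a` alone falls in the 100 ms bucket, while the 95th percentile of the merged histogram falls in the 1000 ms bucket. Averaging the two per-host percentiles would give neither value, which is why the counts are merged first.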

    2.2

    • Appends - Support writing all data points for an hour in a single column. This avoids the need for TSD compactions and reduces network traffic at query time.

    • Random Metric UIDs - Enables better distribution of writes when creating new metrics

    • Storage Exception Plugin - Enables various handling of data points when HBase is unavailable

    • Secure AsyncHBase - Access HBase clusters requiring Kerberos or simple authentication along with optional encryption.

    • Fill Policy - Enable emitting NaNs or Nulls via the JSON query endpoint when data points are “missing”

    • Count and Percentiles - New aggregator functions

    • More Stats - Gives greater insight into query performance via the query stats endpoint and new stats for threads, region clients and the JVM

    • Annotations - Scan for multiple annotations only via the /api/annotations endpoint

    • Query Filters - New filters for flexibility including case (in)sensitive literals, wildcards and regular expressions.

    • Override Tag Widths - You can now override tag widths in the config instead of having to recompile the code.

    • Compaction Tuning - New parameters allow for tuning the TSD compaction process.

    • Delete Data And UIDs - Allow for deleting data at query time as well as removing UIDs from the system.

    • Synchronous Writing - The HTTP Put API now supports synchronous writing to make sure data is flushed to HBase.

    • Query Stats - Query details are now logged that include timing statistics. A new endpoint also shows running and completed queries.

    • Downsampling - Timestamps are now aligned on modulus boundaries, reducing the need for interpolation across series.

    • Last Data Point API - Query for the last data point for specific time series within a certain time window

    • FSCK - An updated FSCK utility that iterates over the main data table, finding and fixing errors

    • Read/Write Modes - Block assigning UIDs on individual TSDs for backup clusters
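
The downsampling and fill policy items above can be sketched together. The function below snaps every timestamp to a modulus boundary (`ts - ts % interval`), averages the values in each bucket, and optionally emits NaN for buckets with no data. It is a simplified illustration with assumed names and an assumed averaging function, not the TSD's own downsampler.

```python
import math
from typing import Dict, List, Tuple

Point = Tuple[int, float]  # (Unix timestamp in seconds, value)


def downsample_avg(points: List[Point], interval: int, start: int, end: int,
                   fill_nan: bool = True) -> List[Point]:
    """Average points into buckets aligned on multiples of `interval` seconds."""
    buckets: Dict[int, List[float]] = {}
    for ts, value in points:
        if start <= ts < end:
            # Modulus alignment: every series lands on the same bucket timestamps.
            buckets.setdefault(ts - ts % interval, []).append(value)

    out: List[Point] = []
    first = start - start % interval
    for bucket_ts in range(first, end, interval):
        values = buckets.get(bucket_ts)
        if values:
            out.append((bucket_ts, sum(values) / len(values)))
        elif fill_nan:
            # Fill policy: mark the missing bucket instead of silently skipping it.
            out.append((bucket_ts, math.nan))
    return out
```

Because two series with different raw timestamps both end up on the same bucket timestamps, they can be aggregated point by point without interpolating between them.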

    2.0

    • Lock-less UID Assignment - Drastically improves write speed when storing new metrics, tag names, or values

    • RESTful API - Provides access to all of OpenTSDB’s features as well as offering new options, defaulting to JSON

    • Cross Origin Resource Sharing - For the API so you can make AJAX calls easily

    • Store Data Via HTTP - Write data points over HTTP as an alternative to Telnet

    • Configuration File - A key/value file shared by the TSD and command line tools

    • Pluggable Serializers - Enable different inputs and outputs for the API

    • Annotations - Record meta data about specific time series or data points

    • Meta Data - Record meta data for each time series, metric, tag name, or tag value

    • Trees - Flatten metric and tag combinations into a single name for navigation or usage with different tools

    • Search Plugins - Send meta data to search engines to delve into your data and figure out what’s in your database

    • Real-Time Publishing Plugin - Send data to external systems as it arrives at your TSD

    • Ingest Plugins - Accept data points in different formats

    • Millisecond Resolution - Optionally store data with millisecond precision

    • Variable Length Encoding - Use less storage space for smaller integer values

    • Non-Interpolating Aggregation Functions - For situations where you require raw data

    • Rate Counter Calculations - Handle roll-over and anomaly suppression
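
The rate counter item above can be illustrated with a short sketch. It computes per-second rates from a monotonically increasing counter, treats a drop in value as a roll-over, and zeroes any rate above a threshold on the assumption that the counter was reset. The parameter names, the default maximum, and the roll-over formula (`counter_max - previous + current`) are assumptions chosen for this example.

```python
from typing import List, Tuple

Point = Tuple[int, int]  # (Unix timestamp in seconds, counter value)


def counter_rates(points: List[Point], counter_max: int = 2**63 - 1,
                  reset_value: float = 0) -> List[Tuple[int, float]]:
    """Return (timestamp, per-second rate) for each consecutive pair of points."""
    rates: List[Tuple[int, float]] = []
    for (t0, v0), (t1, v1) in zip(points, points[1:]):
        dt = t1 - t0
        if dt <= 0:
            raise ValueError("timestamps must be strictly increasing")
        delta = v1 - v0
        if delta < 0:
            # Roll-over: assume the counter wrapped at counter_max.
            delta = counter_max - v0 + v1
        rate = delta / dt
        if reset_value > 0 and rate > reset_value:
            # Anomaly suppression: an implausibly large rate is treated as a reset.
            rate = 0.0
        rates.append((t1, rate))
    return rates
```

For an 8-bit counter (`counter_max=255`) that goes from 250 to 5 over ten seconds, the sketch reports a rate of 1.0 per second instead of a negative value. With a `reset_value` set, a restart of the monitored process that drops the counter to near zero yields 0.0 rather than a huge spike.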

    Thank you to everyone who has contributed to 2.4. Help us out by sharing your ideas and code at