@influxdb - usenix...influxdb u ser v a l u e time? 1 9 / 6 influxdb 2 1 / 6 9. influxdb time series...

@InfluxDBDavid Norton (@dgnorton)

david@influxdb.com

1 / 69

Instrumenting a Data Center

2 / 69

3 / 69

4 / 69

The problem:Efficiently monitor hundreds or

thousands of servers

5 / 69

The solution:Automate it!

6 / 69

Automate what?data collection, storage, analysis, &

alerting

7 / 69

Easy!Write a few scripts & store the data in

8 / 69

1 server

100 measurements

8,640 per day (once every 10s)

365 days

= 315 million records (points) per year9 / 69

10 servers

100 measurements per server

365 days

= 3.2 billion records (points) per year10 / 69

2,000 servers

200 measurements per server

365 days

= 2.5 trillion records (points) per year

11 / 69

Time series database is ideal for thistype of data & workload

12 / 69

"time series"

13 / 69

Values and time stampsValue

Time?14 / 69

Values and time stamps

time value20150422 5:00 PM20150422 6:00 PM

10,020,0

15 / 69

CPU usage every 10 seconds...

16 / 69

Cumulative steps taken...

Count Value

17 / 69

Happy InfluxDB users...

TimeInfluxDB

18 / 69

Time series database

Count Value

TimeInfluxDB

Time?19 / 69

20 / 69

InfluxDB

21 / 69

InfluxDB time series database

22 / 69

no external dependencies

23 / 69

distributed & scalable

24 / 69

easy to install, use, & maintain

25 / 69

open source (MIT license)

26 / 69

open source (MIT license)

written in Go27 / 69

Features SQLlike query language

HTTP(S) API for writes & queries

Supports other protocols (collectd, graphite,opentsdb)

Automated data retention policies

Aggregate data onthefly

28 / 69

Data model...

29 / 69

influx d

30 / 69

influx d

31 / 69

influx d

32 / 69

influx d

measurements

33 / 69

influx d

measurements

country ='DEU'

region ='neast'

34 / 69

influx d

country ='DEU'

region ='west'

series = measurement + unique tag set

35 / 69

influx d

country ='DEU'

region ='west'

series = measurement + unique tag set

country ='DEU'

region ='east'

36 / 69

influx d

Points: values in a series

country ='DEU'

region ='east'

+time value20150422 5:00 PM20150422 5:01 PM

10,020,0

country ='USA'

region ='east'

+time value20150422 5:00 PM20150422 5:01 PM

14,038,0

37 / 69

What's indexed?

38 / 69

What's indexed?Metadata:

Measurement names

39 / 69

Measurement names

Tag keys & values

40 / 69

Measurement names

Tag keys & values

Field keys

41 / 69

Measurement names

Tag keys & values

Field keys

Field values by time*

WHERE time > now() 1m

42 / 69

Metadata index is held inmemory atrun time

43 / 69

What's not indexed?

44 / 69

What's not indexed? Field values by value

WHERE value > 2.718

45 / 69

What's not indexed? Field values by value

WHERE value > 2.718

Metadata by time

SHOW MEASUREMENTS WHERE time > now() - 1h

46 / 69

Data retention

47 / 69

48 / 69

InfluxDB hasRetention Policies

49 / 69

How do they work?

50 / 69

All data is written to aretention policy

51 / 69

Each retention policy has aduration specified(between 1 hour and infinite)

(between 1 hour and infinite)

52 / 69

raw(1 day)

cpu_1h(1 week)

shards shards

SELECT mean(value) INTO cpu_1h.cpu FROM raw.cpu WHERE time > now() 1w GROUP BY time(1h)

53 / 69

How to automatedownsampling?

54 / 69

Continuous QueriesStored in the database and run periodically in

the background

55 / 69

raw(1 day)

cpu_1h(1 week)

shards shards

CREATE CONTINUOUS QUERY mycq ON mydbCREATE CONTINUOUS QUERY mycq ON mydb BEGIN BEGIN SELECT mean(value) SELECT mean(value) INTO cpu_1h.cpu INTO cpu_1h.cpu FROM raw.cpu FROM raw.cpu GROUP BY time(1h) GROUP BY time(1h) END END

56 / 69

Replication is handled throughRetention Policies

57 / 69

FunctionsAggregations:

count, distinct, mean, median, sum

58 / 69

Selectors:

bottom, first, last, max, min, percentile, top

59 / 69

Selectors:

bottom, first, last, max, min, percentile, top

Transformations:

ceiling, derivative, difference, floor, histogram,stddev

stddev60 / 69

Data ingestion

61 / 69

InfluxDB supports: collectd

openTSDB

graphite

62 / 69

Telegraf http://github.com/influxdb/telegraf

Open Source (MIT License)

Also written in Go

Plugin based (~30 currently and growing)

Cross platform

63 / 69

Client libraries

Client libraries Go

Python

etc. (http://github.com/influxdb)

64 / 69

HTTP(S) APIWrite data via HTTP using a simpletextbased protocol

cpu,host=serverA value=10.0 1234567890

65 / 69

Visualization

66 / 69

Grafana

67 / 69

Chronograf

68 / 69

Thank You!@InfluxDB

David Norton (@dgnorton)

david@influxdb.com

69 / 69

@influxdb - usenix...influxdb u ser v a l u e time? 1 9 / 6 influxdb 2 1 / 6 9. influxdb time series...

Documents

time series database (influxdb) - university of...

influxdb: como monitorar milhares de dados por segundo em...

vs. influxdb for time-series data...benchmarking timescaledb...

custom devops monitoring system in melon (with influxdb +...

development with ansible, influxdb, and grafana …€¦ ·...

incident response platform - securitylearningacademy.com ·...

using prometheus with influxdb for metrics storage -...

metric extraction with sensu check output influxdb and...

time series data with influxdb

measure your app internals with influxdb and symfony2

graphing performance with collectd influxdb grafana

introduction to influxdb, an open source distributed time...

company in...

vs. influxdb for time-series data - outfluxdata.com ·...

influxdb の概要 - sonots #tokyoinfluxdb

influxdb for monitoring data - indico...influxdb setup /...

influxdb ver0.9.5#yjdsw3

real-time monitoring with grafana, statsd and influxdb

monitoring postgresql - p2d2 · influxdb & grafana –...

time series database, influxdb & php