You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
TL;DR: Store LRU cache is no longer leaking, Upgraded Thanos UI to Prometheus 2.9, Fixed auto-downsampling, Moved to Go 1.12.5 and more.
@@ -105,28 +109,28 @@ Using cadvisor `container_memory_usage_bytes` metric could be misleading e.g: ht
105
109
New options:
106
110
107
111
New Store flags:
108
-
112
+
109
113
* `--store.grpc.series-sample-limit` limits the amount of samples that might be retrieved on a single Series() call. By default it is 0. Consider enabling it by setting it to more than 0 if you are running on limited resources.
110
114
* `--store.grpc.series-max-concurrency` limits the number of concurrent Series() calls in Thanos Store. By default it is 20. Considering making it lower or bigger depending on the scale of your deployment.
111
115
112
116
New Store metrics:
113
-
117
+
114
118
* `thanos_bucket_store_queries_dropped_total` shows how many queries were dropped due to the samples limit;
115
119
* `thanos_bucket_store_queries_concurrent_max` is a constant metric which shows how many Series() calls can concurrently be executed by Thanos Store;
116
120
* `thanos_bucket_store_queries_in_flight` shows how many queries are currently "in flight" i.e. they are being executed;
117
121
* `thanos_bucket_store_gate_duration_seconds` shows how many seconds it took for queries to pass through the gate in both cases - when that fails and when it does not.
118
-
122
+
119
123
New Store tracing span:
120
124
* `store_query_gate_ismyturn` shows how long it took for a query to pass (or not) through the gate.
121
-
122
-
-[#1016](https://github.com/improbable-eng/thanos/pull/1016) Added option for another DNS resolver (miekg/dns client).
125
+
126
+
-[#1016](https://github.com/improbable-eng/thanos/pull/1016) Added option for another DNS resolver (miekg/dns client).
123
127
Note that this is required to have SRV resolution working on [Golang 1.11+ with KubeDNS below v1.14](https://github.com/golang/go/issues/27546)
124
128
125
129
New Querier and Ruler flag: `-- store.sd-dns-resolver` which allows to specify resolver to use. Either `golang` or `miekgdns`
126
-
130
+
127
131
-[#986](https://github.com/improbable-eng/thanos/pull/986) Allow to save some startup & sync time in store gateway as it is no longer needed to compute index-cache from block index on its own for larger blocks.
128
-
The store Gateway still can do it, but it first checks bucket if there is index-cached uploaded already.
129
-
In the same time, compactor precomputes the index cache file on every compaction.
132
+
The store Gateway still can do it, but it first checks bucket if there is index-cached uploaded already.
133
+
In the same time, compactor precomputes the index cache file on every compaction.
130
134
131
135
New Compactor flag: `--index.generate-missing-cache-file` was added to allow quicker addition of index cache files. If enabled it precomputes missing files on compactor startup. Note that it will take time and it's only one-off step per bucket.
132
136
@@ -143,31 +147,31 @@ Note that this is required to have SRV resolution working on [Golang 1.11+ with
143
147
-[#1021](https://github.com/improbable-eng/thanos/pull/1021) Query API `series` now supports POST method.
144
148
-[#939](https://github.com/improbable-eng/thanos/pull/939) Query API `query_range` now supports POST method.
145
149
146
-
### Changed
150
+
### Changed
147
151
148
152
-[#970](https://github.com/improbable-eng/thanos/pull/970) Deprecated `partial_response_disabled` proto field. Added `partial_response_strategy` instead. Both in gRPC and Query API.
149
153
No `PartialResponseStrategy` field for `RuleGroups` by default means `abort` strategy (old PartialResponse disabled) as this is recommended option for Rules and alerts.
150
154
151
155
Metrics:
152
-
156
+
153
157
* Added `thanos_rule_evaluation_with_warnings_total` to Ruler.
154
158
* DNS `thanos_ruler_query_apis*` are now `thanos_ruler_query_apis_*` for consistency.
155
159
* DNS `thanos_querier_store_apis*` are now `thanos_querier_store_apis__*` for consistency.
156
160
* Query Gate `thanos_bucket_store_series*` are now `thanos_bucket_store_series_*` for consistency.
157
161
* Most of thanos ruler metris related to rule manager has `strategy` label.
158
-
162
+
159
163
Ruler tracing spans:
160
-
164
+
161
165
*`/rule_instant_query HTTP[client]` is now `/rule_instant_query_part_resp_abort HTTP[client]"` if request is for abort strategy.
162
-
166
+
163
167
-[#1009](https://github.com/improbable-eng/thanos/pull/1009): Upgraded Prometheus (~v2.7.0-rc.0 to v2.8.1) and TSDB (`v0.4.0` to `v0.6.1`) deps.
164
-
168
+
165
169
Changes that affects Thanos:
166
-
* query:
167
-
*[ENHANCEMENT] In histogram_quantile merge buckets with equivalent le values. #5158.
168
-
*[ENHANCEMENT] Show list of offending labels in the error message in many-to-many scenarios. #5189
170
+
* query:
171
+
*[ENHANCEMENT] In histogram_quantile merge buckets with equivalent le values. #5158.
172
+
*[ENHANCEMENT] Show list of offending labels in the error message in many-to-many scenarios. #5189
169
173
*[BUGFIX] Fix panic when aggregator param is not a literal. #5290
170
-
* ruler:
174
+
* ruler:
171
175
*[ENHANCEMENT] Reduce time that Alertmanagers are in flux when reloaded. #5126
172
176
*[BUGFIX] prometheus_rule_group_last_evaluation_timestamp_seconds is now a unix timestamp. #5186
173
177
*[BUGFIX] prometheus_rule_group_last_duration_seconds now reports seconds instead of nanoseconds. Fixes our [issue #1027](https://github.com/improbable-eng/thanos/issues/1027)
@@ -179,26 +183,26 @@ Note that this is required to have SRV resolution working on [Golang 1.11+ with
179
183
*[CHANGE]*breaking* Renamed flag `--sync-delay` to `--consistency-delay`[#1053](https://github.com/improbable-eng/thanos/pull/1053)
180
184
181
185
For ruler essentially whole TSDB CHANGELOG applies beween v0.4.0-v0.6.1: https://github.com/prometheus/tsdb/blob/master/CHANGELOG.md
182
-
186
+
183
187
Note that this was added on TSDB and Prometheus: [FEATURE] Time-ovelapping blocks are now allowed. #370
184
188
Whoever due to nature of Thanos compaction (distributed systems), for safety reason this is disabled for Thanos compactor for now.
185
189
186
190
-[#868](https://github.com/improbable-eng/thanos/pull/868) Go has been updated to 1.12.
187
-
-[#1055](https://github.com/improbable-eng/thanos/pull/1055) Gossip flags are now disabled by default and deprecated.
191
+
-[#1055](https://github.com/improbable-eng/thanos/pull/1055) Gossip flags are now disabled by default and deprecated.
188
192
-[#964](https://github.com/improbable-eng/thanos/pull/964) repair: Repair process now sorts the series and labels within block.
189
193
-[#1073](https://github.com/improbable-eng/thanos/pull/1073) Store: index cache for requests. It now calculates the size properly (includes slice header), has anti-deadlock safeguard and reports more metrics.
190
194
191
195
### Fixed
192
196
193
197
-[#921](https://github.com/improbable-eng/thanos/pull/921)`thanos_objstore_bucket_last_successful_upload_time` now does not appear when no blocks have been uploaded so far.
194
-
-[#966](https://github.com/improbable-eng/thanos/pull/966) Bucket: verify no longer warns about overlapping blocks, that overlap `0s`
198
+
-[#966](https://github.com/improbable-eng/thanos/pull/966) Bucket: verify no longer warns about overlapping blocks, that overlap `0s`
195
199
-[#848](https://github.com/improbable-eng/thanos/pull/848) Compact: now correctly works with time series with duplicate labels.
196
200
-[#894](https://github.com/improbable-eng/thanos/pull/894) Thanos Rule: UI now correctly shows evaluation time.
197
201
-[#865](https://github.com/improbable-eng/thanos/pull/865) Query: now properly parses DNS SRV Service Discovery.
198
202
-[#889](https://github.com/improbable-eng/thanos/pull/889) Store: added safeguard against merging posting groups segfault
199
203
-[#941](https://github.com/improbable-eng/thanos/pull/941) Sidecar: added better handling of intermediate restarts.
200
204
-[#933](https://github.com/improbable-eng/thanos/pull/933) Query: Fixed 30 seconds lag of adding new store to query.
201
-
-[#962](https://github.com/improbable-eng/thanos/pull/962) Sidecar: Make config reloader file writes atomic.
205
+
-[#962](https://github.com/improbable-eng/thanos/pull/962) Sidecar: Make config reloader file writes atomic.
202
206
-[#982](https://github.com/improbable-eng/thanos/pull/982) Query: now advertises Min & Max Time accordingly to the nodes.
203
207
-[#1041](https://github.com/improbable-eng/thanos/issues/1038) Ruler is now able to return long time range queries.
204
208
-[#904](https://github.com/improbable-eng/thanos/pull/904) Compact: Skip compaction for blocks with no samples.
0 commit comments