Summary and count cause performance issues on large datasets #37

@markbrough

Description

With very large datasets (e.g. 13m rows), summary and count appear to significantly slow down the response:

babbage/babbage/cube.py, lines 89 to 96 at 9416105:

# Count
count = count_results(self, prep(cuts,
                                 drilldowns=drilldowns,
                                 columns=[1])[0])
# Summary
summary = first_result(self, prep(cuts,
                                  aggregates=aggregates)[0].limit(1))

Without generating the summary and count, the response is returned 2-3 times faster.

It would be useful to make returning these properties optional, e.g. by adding an optional &simple parameter to the request; a rough sketch of that idea follows below.
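
As a minimal sketch only (the method signature and the simple keyword here are assumptions for illustration, not babbage's actual API), the quoted block could be guarded by an opt-out flag threaded down from the request layer:

# Sketch: assumes the quoted block lives inside the cube's aggregate method and
# that a hypothetical `simple` keyword is passed down when the client sends
# &simple on the query string.
def aggregate(self, aggregates=None, drilldowns=None, cuts=None,
              order=None, page=None, page_size=None, simple=False):
    ...
    count, summary = None, None
    if not simple:
        # Count (skipped entirely when simple=True)
        count = count_results(self, prep(cuts,
                                         drilldowns=drilldowns,
                                         columns=[1])[0])
        # Summary (skipped entirely when simple=True)
        summary = first_result(self, prep(cuts,
                                          aggregates=aggregates)[0].limit(1))
    ...

The endpoint could then leave the corresponding fields out of the response whenever the flag is set, so the two extra queries are never issued against the large table.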
