Python SDK (v.1.2.4) has a kind of “memory leak” when query view with “include_docs” set to True.
I have about 20M docs in my couchbase (one server 30GB of RAM for my bucket) and view that contains about 15M of docs.
I’m trying to iterate via my index to do some aggregation. I’m using “include_docs” option to get docs. But python script memory usage constantly grows (1G, 2G, 3G, etc).
def run(server, bucket): cb = couchbase.Couchbase.connect(host=server, bucket=bucket, timeout=100) for item in cb.query('myhouse', 'houses', streaming=True, include_docs=True): pass def main(): # .... parsing arguments run(args.server, args.bucket)
(Couchbase Server version - 2.5)