More powerful XDCR replication filtering

lbertrand · October 22, 2015, 8:14am

Hi,

I have requirements to be able to replicate data between different cluster - but I need a more advanced filtering than just filtering on id as per the current available choice in Couchbase 4.

Basically, to choose if the document need to be replicated or not is not only based on the document id itself, but some other information specific to our application. Also, we may want to filter part of the document and merge it back inside the other cluster, so it is like a partial replication…

Our current approach is to use the kafka connector to access the replication log of the cluster and put logic around reading / writing to kafka to handle our case for filtering / partial replication. But this leads to handling concurrent modification ourselves and also avoid feedback loop by adding metadata into our document, like a logical clock, … We feel that this information is already somewhere in the metadata of the document that Couchbase hold to manage its own replication protocol…

What will you guys recommend doing? Could we. somehow, plug into the XDCR protocol by just extending the logic for filtering / stripping information from document and let Couchabse handle the nitty gritty of cross cluster replication? Is there a way to access the metadata of a document and extend it to add our own, so we do not have to wrap our original document into one with metadata to handle our replication.

Any idea and proposal?
Thanks.

anil · October 23, 2015, 7:09am

Yes, you’re right with 4.0 you can only do per document key based filtering with cross datacenter replication. However the advanced filtering you’re looking for such as filtering based-of document content is in our roadmap. It will be great to sit with you and understand more about your requirements. Can you please connect with me at anil@couchbase.com.

minouti_27 · March 17, 2016, 6:13am

Hi,
Just wanted to know if we do have advanced filtering in place based on a non-id field in a document. We are also looking for a similar replication from a Couchbase bucket to Elastic search, but where no of fields and filter param both are customizable.

anil · April 28, 2016, 5:44pm

Hi, Value based advanced filtering in XDCR is feature which we currently don’t have but it is in our future release roadmap.

Anil Kumar