I am using Couchbase to load some master data from SQL Server database. One of the issues, I am facing is that the fragmentation goes very high after the data is loaded. I am not sure about the specifics of the data load here since I do not own that process but its something that runs from .Net Code. After this load is complete, I see that the disk space used by Couchbase goes from 21 GB to 154 GB (numbers based on Web Console and I assume Console only shows data size not indexes).
As soon as data load is complete, I run compaction on the bucket and it goes back down to ~ original size. This indicates to me that data volume is not increasing on a net basis.
I am looking for some suggestions to explore on how to perform data load in Couchbase in an efficient manner to avoid fragmentation. Are there any guidelines?