In this powerful machine, `pmap` reports a total memory usage of `bitcoind` of about 14e9. It seems that 8 GB of RAM is at least in part responsible for the bottleneck, no matter what you put in `dbcache`.
Your
chainstate directory will grow to about 12 GB, which means anything less than that largely increases disk activity.