Oh, it can render arbitrary scenes (so overlapping, yes), but I haven't specified a data structure that supports that yet so yeah, you're seeing a height map.
The renderer works with vertical columns of voxels of the same color. So if I want a bunch of green voxels stacked on top of each other I give the renderer a green column however many voxels high.