It's called a voxel engine, meaning that the world is made up of voxels, or volumetric pixels. Here, every location on the 2D height map is treated as a voxel with a specific height. Basically, the renderer casts rays from the player out across the map, samples the points each ray passes over, and draws each point on screen according to its height.
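To make the idea concrete, here is a minimal sketch of that ray-casting loop in Python. It is only an illustration under assumptions of my own, not the engine's actual code: the function name `render_columns`, the parameters `cam_height`, `horizon`, `scale`, and `fov`, the simple perspective formula, and the choice to wrap around at the map edges are all hypothetical. For each screen column it marches one ray away from the player, projects the height found at each step, and remembers the highest screen row the terrain reaches.

```python
import math


def render_columns(height_map, px, py, angle, cam_height=50.0, horizon=30,
                   scale=40.0, max_dist=20, screen_w=8, screen_h=40,
                   fov=math.pi / 2):
    """Return, for each screen column, the topmost row covered by terrain.

    Row 0 is the top of the screen; a value of screen_h means the column
    shows no terrain at all. All parameter names are illustrative.
    """
    rows = len(height_map)
    cols = len(height_map[0])
    top = [screen_h] * screen_w

    for col in range(screen_w):
        # One ray per screen column, spread evenly across the field of view.
        ray_angle = angle - fov / 2 + fov * (col + 0.5) / screen_w
        dx, dy = math.cos(ray_angle), math.sin(ray_angle)

        # March from near to far, sampling the height map at each step.
        for dist in range(1, max_dist + 1):
            mx = math.floor(px + dx * dist) % cols  # wrap at map edges (assumption)
            my = math.floor(py + dy * dist) % rows
            h = height_map[my][mx]

            # Perspective projection: nearer and taller points land higher up.
            screen_y = max(0, int((cam_height - h) / dist * scale + horizon))

            # Only terrain that rises above what is already drawn is visible.
            if screen_y < top[col]:
                top[col] = screen_y

    return top
```

A real renderer would fill each column with the map's color from the new `screen_y` down to the previous `top[col]` instead of just recording the row, but the near-to-far march and the per-column "highest row so far" check are the core of the technique.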