I'm currently experiencing a slight chicken and egg problem with my implementation of all this memory stuff in my kernel.
Just after coming from GRUB, I enable Paging. I need to do this because I'm using the Higher-Half-Trick.
The page aligned space for this process is reserved by a placement allocator. So far so good.
Then my physical memory manager is set up. It's just a Bitmap which keeps track of any allocated and free frames. I didn't spend any more time for this because I just want it to work for now... Speed improvements are later tasks.

After that my kernel heap is initialized because it relies on Paging. It has an initial size which is configurable and will be expanded dynamically if needed. Means when no more space is available at least one frame is requested and mapped into the heap space.
The problem is that sometimes I need to get page aligned memory for all later page directories I set up for other threads/processes. Getting my heap to do this is a hard task I think.
I thought about taking full 4KB pages directly from the physical memory manager. But where do I map them into my virtual address space to modify them afterwards?
I could take another region for this. Let's say I use a 512MB section above my heap for this, but this would limit my flexibility. After creating/modifying them I could unmap them again. When there is need to modify the particular address space of a thread/process I would need to map it again into the kernel directory search for existing page tables inside it and map them too to some location. That's probably even more work than getting my heap to support page aligned memory allocation, would always destroy almost my complete TLB because I need to flush it after unmapping and takes much time.
How do you do this in your operating systems?