Mapping all PM to VM (long mode).
- prinzrainer
- Posts: 13
- Joined: Sun Nov 22, 2009 11:36 pm
Mapping all PM to VM (long mode).
Is it okay to map all the physical memory to all the process address space? Assume that PM mappings are typed as write back. Suppose a user process request a memory so that kernel would allocate it a memory and it on the process space and return the address of allocation to the caller, also with a type of write back. So there are now 2 different addresses in same space having same PM pointed. Then the user process read that memory, performing a cache line fill. What will happen if a code modifies the data using the PM mappings (instead of using the address being returned upon request), will it still have a write hit (as it has been cached during read) even if they don't have the same virtual address? Also, does mapping the whole PM to VM has a bad performance impact since it would require a lots of translation tables, filling the TLB with useless translation(increasing the tlb miss)?
Re: Mapping all PM to VM (long mode).
The situation when several linear addresses mapped to same physical is called memory aliasing.What will happen if a code modifies the data using the PM mappings (instead of using the address being returned upon request), will it still have a write hit (as it has been cached during read) even if they don't have the same virtual address?
Usually it is used to share R/O data or code between several processes. This way only one physical page could be mapped to several processes.
Answer to your question depends on particual hardware CPU you gonna use. Of the CPU has physically indexed caches no performance penalty will occur.
If the CPU has linearly indexed caches - performance penaly will happen, sometimes even very significant.
Most of the CPUs on the marked have physically indexed caches , not all of them. For example - AMD processors are linearly indexed.
Stanislav
Re: Mapping all PM to VM (long mode).
Right. From the software point of view, loads and stores are all matched up using physical addresses. The hardware has to make it right, or it is buggy (these sorts of bugs are probably the most common!)stlw wrote:Answer to your question depends on particual hardware CPU you gonna use. If the CPU has physically indexed caches no performance penalty will occur.What will happen if a code modifies the data using the PM mappings (instead of using the address being returned upon request), will it still have a write hit (as it has been cached during read) even if they don't have the same virtual address?
If the CPU has linearly indexed caches - performance penaly will happen, sometimes even very significant.
Most of the CPUs on the marked have physically indexed caches , not all of them. For example - AMD processors are linearly indexed.
Of course, it may not be fast...
Even processors with physically indexed caches often use one or two bits of the virtual address. That's why it is best not to alias below, say, 1 MB offset.
- prinzrainer
- Posts: 13
- Joined: Sun Nov 22, 2009 11:36 pm
Re: Mapping all PM to VM (long mode).
so mapping the whole PM is not a good thing?
Re: Mapping all PM to VM (long mode).
You can map all of physical memory, no problem. Just don't map it all with slightly different virtual addresses for different processes.
For example:
kernel gets VM=PM
user gets VM=PM+4K (say to allow for some kernel space)
That would alias every address at the 4K level, which would probably be a huge performance hit.
For example:
kernel gets VM=PM
user gets VM=PM+4K (say to allow for some kernel space)
That would alias every address at the 4K level, which would probably be a huge performance hit.
- prinzrainer
- Posts: 13
- Joined: Sun Nov 22, 2009 11:36 pm
Re: Mapping all PM to VM (long mode).
What does VM=PM+4K mean? My kernel doesn't have it's own address space. Calling kernel functions is only a matter of jumping from user to kernel(which is in the PM mappings).kernel gets VM=PM
user gets VM=PM+4K (say to allow for some kernel space)
Well my purpose for mapping PM to VM is to ease the complexity of the overall system. Here's a sample memory map of my proccess virtual space.
+------------------------+
;------------------------;
;-----USER CODE------;
;-----AND DATA ------;
;------------------------;
;........................; <--128TiB (start of code and data)
; - - - - - - - - - - - - - ;
;-USERMODE SHARED-;
;- LIBRARY/MODULE -;
; - - - - - - - - - - - - - ;
;........................;<--124TiB (start of shared usermode libs)
;~~~~~~~~~~~~~~~~~;
;------------------------;<--MAX_PM (end of PM mapping)
; / / / / / / / / / / /;
; / / / / / / / / / / /;<--16MiB(kernel base)
; / / / / / / / / / / /;
+------------------------+<<--PM is mapped at the bottom of all the process address space)
So, there would be some parts of memory between 0 to
MAX_PM that would be aliased to 124TiB above..
Also, I am using 1GiB pages in mapping the PM, would there be any changes in how cpu cache 1GB pages from 4KB or 2MB pages?