Limit of IPI sending?

WindowsNT · Post by **WindowsNT** » Fri May 22, 2015 7:28 am

I send an IPI from a CPU to another, that's fine. Real mode.
But as I saw, I can only send 2 IPIs one after another.

The third one is not generating an interrupt to the target CPU.

What could be the problem?

Code: Select all

	
        SendIPIF: ; bl = APIC ID, ECX = IPI. Called with BL = 1 and IPI = 0x0F1 to generate INT 0xF1 to target CPU.
		PUSHAD

		; Write it to 0x310
		MOV EDI,[DS:LocalApic] ; default 0xFEE00000
		ADD EDI,0x310
		MOV EDX,[FS:EDI] ; FS is an unreal mode seg 
		AND EDX,0xFFFFFF
		XOR EAX,EAX
		MOV AL,BL
		SHL EAX,24
		OR EDX,EAX
		MOV [FS:EDI],EDX
		
		; Write it to 0x300
		MOV EDI,[DS:LocalApic]
		ADD EDI,0x300
		MOV [FS:EDI],ECX

		; Verify it got delivered
		.Verify:
                PAUSE
		MOV EAX,[FS:EDI];
		SHR EAX,12
		TEST EAX,1
		JNZ .Verify

		; Write to 0xB0 (EOI)
		MOV EDI,[DS:LocalApic]
		ADD EDI,0xB0
		MOV dword [FS:EDI],0
		
		POPAD
		RETF

Update: If I have some spin loop before the call, only 1 IPI can be sent.
Is there some limitations on spinloops related to IPIs?

Brendan · Post by **Brendan** » Fri May 22, 2015 10:40 am

Hi,

This looks a bit backwards to me. Typically you'd:

Make sure the CPU has finished sending the previous IPI using the "Delivery Status" flag (with special attention to race conditions - you don't want to see "Delivery Status = idle" and be interrupted by something that sends an IPI and causes the flag to become set before you've sent your IPI)
Write to the ICR; making sure that you set the high dword first and then the low dword. Note: There's no need to read the old high dword from the ICR (e.g. just do "ICR.high = dest << 24;" and don't bother with "ICR.high = (dest << 24;) | (ICR.high & 0x00FFFFFF);").

Never send EOI when sending an IPI. The CPU that receives the IPI does EOI after it has finished handling the IPI.

When the other CPU's local APIC receives the IPI, it sets the corresponding flag in its IRR register and starts trying to deliver the interrupt to the CPU (according to interrupt priortity rules, if the CPU has interrupts enabled, etc). If/when the local APIC delivers the interrupt to CPU it clears that flag in the IRR and sets the flag in the ISR. The EOI clears the flag in the ISR.

If the other CPU's local APIC receives a second IPI for the same interrupt vector while the corresponding flag in the IRR is already set, then the second IPI is ignored.

This means that you have to wait until after the first IPI has been delivered to the CPU (and not just received by the CPU's local APIC) before you send a second IPI for the same interrupt vector. In practice there's no way to do that; so the interrupt handler has to set some sort of "interrupt being serviced" flag in memory (e.g. a volatile variable or something), and the sending CPU has to wait until this variable has been modified by the other CPUs interrupt handler before it can send another IPI for the same interrupt vector.

More specifically; in practice typically you've got some other data to go along with the IPI (e.g. an address to invalidate) and/or multiple CPUs may send the IPI to the same CPU at the same time; and you have to use some sort of lock where you acquire the lock and send the IPI, then wait until the other CPU does something to indicate it has finished handling your IPI before you release the lock.

For an example, multi-CPU TLB shootdown code might look more like this:

Code: Select all

sendTLBinvalidation:
    acquire spinlock for the interrupt vector
    set the address to invalidate in a global variable
    set "number of CPUs still handling IPI" counter to the number of CPUs that will be receiving the IPI
    wait until "Delivery Status" flag says local APIC is idle (and disable IRQs to avoid race conditions)
    send the IPI (then re-enable IRQs)
    wait for "number of CPUs still handling IPI" counter to become zero
    release the spinlock for the interrupt vector

Code: Select all

TLBinvalidationIPIhandler:
    get the address to invalidate from the global variable
    invalidate that TLB entry
    atomically decrement the "number of CPUs still handling IPI" counter
    send EOI to local APIC

Note that for decent scalability you're probably going to want to use multiple interrupt vectors for IPIs (each with their own "data" variables, their own "number of CPUs still handling IPI" counters and their own locks).

Also, for convenience (especially for the less frequent uses of IPIs) you can generalise it. For example, you can have a routine that takes an "address of code I want remote CPUs to execute" input parameter that gets stored in a global variable, where other CPU's IPI handler executes whatever you like.

Cheers,

Brendan

WindowsNT · Post by **WindowsNT** » Fri May 22, 2015 3:39 pm

I was already doing most of the stuff you are suggesting, but this was the final clue:

>>
Never send EOI when sending an IPI. The CPU that receives the IPI does EOI after it has finished handling the IPI.
>>

This means that my handler should set the EOI, not the sender.

You 're the best. Thanks a lot.

OSDev.org

Limit of IPI sending?

Limit of IPI sending?

Re: Limit of IPI sending?

Re: Limit of IPI sending?