[erlang-questions] "processor affinity" for processes
Zvi
exta7@REDACTED
Mon Jan 14 21:08:59 CET 2008
Kenneth Lundin wrote:
>
> See questions below.
>
> On Jan 10, 2008 3:24 PM, Zvi <exta7@REDACTED> wrote:
>>
>> Another usefull feature will be specifying CPU Affinity for ports and
>> external processes launched via os:cmd/1 .
>
> This might be a potentially interesting feature, but what knowledge,
> research or measurements do
> you have which says that you get better performance with that feature?
>
Kenneth,
I'll post here some snippets from my presentation "Migration to Multi-Core"
(I prepared it for one-day seminar, maybe I'll clean it up and put on the
web some day):
----------------------------------------
Why mess with CPU Affinity?
- Legacy Code Migration
Usually OS handles CPU Affinity automatically very good.
So why mess with it?
* When migrating old applications to multi-core CPU, their performance may
start degrading. This happens because OS moving application’s threads and
interrupt routines between cores.
* By fixing application’s process and threads to CPU #0, application will
start behave the same way, as on uniprocessor machine.
----------------------------------------
Why mess with CPU Affinity?
- Real-Time and Determinism
* By setting CPU affinity for our application’s thread - we get
deterministic execution times, needed for Real-Time.
* OS will no more move our threads between cores.
* For example, let’s say we developing video surveillance system using
dual-core CPU.
We have two threads:
#1 – receiving video frames from the Matrox Frame Grabber
#2 – transmitting the received frames via TCP/IP to the Server
By setting affinity of the receiving thread on CPU #0 and of the
transmitting thread on CPU #1 – we getting deterministic execution times.
------------------------------------------
Why mess with CPU Affinity?
- Memory Affinity
Cache locality
* There are some multi-core CPUs with separate L1 caches (for example:
Pentium-D, some Intel’s quad-core CPUs).
* If OS moving thread from one core to another, the data is missing on the
2nd core, so it must fetch data from RAM, thus we get performance
degradation.
NUMA
* On the NUMA, each memory bank connect directly to single CPU. When
accessing RAM from another CPU it will be slower.
* By setting thread’s CPU affinity and allocating memory on the faster RAM
bank, we getting performance improvement.
Kenneth Lundin wrote:
>
>> My current workarround is to launch script whcih temporary changing CPU
>> affinity of the executable.
>
> Can you explain more about this?
> A workaround for what?
> What do you want to achieve with explicit spec of CPU affinity?
>
> Suppose you already use all processor cores or processors inside the
> Erlang VM, how do
> you then know if it pays off to specify a specific CPU for the external
> program?
>
There are also need to control number of CPUs/cores Erlang VM using, then
the rest we can use for ports or another non-Erlang external processes.
for example, I have 8 cores server, when I running Erlang VM it shows smp:8.
Now let's say I want Erlang VM only use 1 core and control long-running
computations/encoding on the other cores.
In Windows I can do it using this command:
%% running mp3 encoding on CPU #3
Cmd = "ffmpeg -y -i input.wav output.mp3".
AffinityMask = "100".%% this means CPU #3
os:cmd("process.exe -a \""++Cmd++"\" "++AffinityMask).
I hope this clear the things.
Best Regards,
Zvi
--
View this message in context: http://www.nabble.com/%22processor-affinity%22-for-processes-tp9956818p14812184.html
Sent from the Erlang Questions mailing list archive at Nabble.com.
More information about the erlang-questions
mailing list