Commit 8270137a authored Dec 14, 2010 by Christoph Lameter Committed by Tejun Heo Dec 18, 2010

cpuops: Use cmpxchg for xchg to avoid lock semantics



Use cmpxchg instead of xchg to implement this_cpu_xchg.

xchg with a memory operand always implies LOCK and therefore always
incurs the LOCK overhead. cmpxchg without an explicit LOCK prefix
does not.
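The idea of building an exchange out of compare-and-exchange can be
sketched in portable C11 as below. This is only an illustration of the
retry loop, not the code from this commit: xchg_via_cmpxchg is a
hypothetical helper name, and C11 atomics on x86 normally compile to a
LOCK-prefixed cmpxchg, whereas the per-cpu version relies on an unlocked
cmpxchg, which is sufficient there because the data is only touched by
the owning CPU.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>

/*
 * Hypothetical helper (not from the commit): store new_val into *ptr and
 * return the value that was there before, using only compare-and-exchange.
 */
static long xchg_via_cmpxchg(_Atomic long *ptr, long new_val)
{
    assert(ptr != NULL);

    /* Read the current value as the first guess for the comparison. */
    long old = atomic_load_explicit(ptr, memory_order_relaxed);

    /*
     * Try to replace old with new_val. On failure, old is refreshed with
     * the value currently in *ptr, and the loop retries with it.
     */
    while (!atomic_compare_exchange_weak_explicit(ptr, &old, new_val,
                                                  memory_order_relaxed,
                                                  memory_order_relaxed)) {
        /* nothing to do: old already holds the latest value */
    }

    return old;
}
```

The loop normally succeeds on the first attempt when nothing else
modifies the variable between the load and the compare-and-exchange.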

Baselines:

xchg()		= 18 cycles (no segment prefix, LOCK semantics)
__this_cpu_xchg	= 1 cycle

(__this_cpu_xchg was simulated using this_cpu_read/write, i.e. two
prefixes. It looks like the CPU can use loop optimization to get rid
of most of the overhead.)

Cycles before:

this_cpu_xchg	 = 37 cycles (segment prefix and LOCK (implied by xchg))

After:

this_cpu_xchg	= 11 cycles (using cmpxchg without LOCK semantics)

Signed-off-by: Christoph Lameter <cl@linux.com>
Signed-off-by: Tejun Heo <tj@kernel.org>

parent 7296e08a
