'Why does memcpy cause compilers to seemingly forget about strict aliasing?
Consider the following C++ code:
std::uint32_t func(std::uint32_t* p1, std::uint64_t* p2) {
*p2 = *p1;
return *p1;
}
Compiling with -O3
yields the following disassembly on Clang (GCC is similar):
func(unsigned int*, unsigned long*):
mov eax, dword ptr [rdi]
mov qword ptr [rsi], rax
ret
(Live demo: https://godbolt.org/z/vKPvT7o51)
Due to the strict aliasing rule, the compiler assumes p1
and p2
don't point to the same memory location, and thus there is no need to reload *p1
from memory for the return statement. So far so good, optimisations are working as I expect.
Now consider a similar piece of code, where the copy is done via memcpy()
:
std::uint32_t func(std::uint32_t* p1, std::uint64_t* p2) {
std::memcpy(p2, p1, sizeof(std::uint32_t));
return *p1;
}
Disassembly (identical on Clang and GCC):
func(unsigned int*, unsigned long*):
mov eax, dword ptr [rdi]
mov dword ptr [rsi], eax
mov eax, dword ptr [rdi]
ret
(Live demo: https://godbolt.org/z/rszEEc1Gr)
This time, *p1
is reloaded from memory, as if writing via p2
could have modified the memory at p1
- i.e. p1
and p2
could be aliased.
I find this to be strange, because:
- The compiler already knows that
p1
andp2
do not alias due to the strict aliasing rule, as demonstrated in the first example. - (Even if the above point is untrue)
memcpy()
has the requirement that its arguments cannot alias, otherwise it's UB. I presume that compilers are aware of this restriction and take advantage of it for optimisation purposes (otherwise you'd end up with a slowermemcpy()
).
Therefore it seems as though the use of memcpy()
is causing the compiler to forget about these aliasing restrictions.
What is going on here?
- Is it simply a missed optimisation (or bug?) by the compiler?
- Is it behaviour somehow required by the Standard?
- Is it UB and thus we can't reason about the generated code?
On the point of UB, one line of possible reasoning is that the code is UB because the types of the source and destination are mismatched, which is likely not allowed (possibly violates the strict aliasing rule?). However, I do not believe this is true, because memcpy()
copies at the byte level and it doesn't care what the types of its argument pointers are. p1
and p2
could just as well point to objects of the same type, but have been reinterpret_cast
'd into different pointer types (which I believe is OK, as long as those pointers are not dereferenced, which they are not in this example).
One could then argue that if p1
and p2
could have been reinterpret_cast
'd from any memory location, then they could alias, and thus the compiler must reload *p1
from memory. However, the restriction that memcpy()
's source and destination cannot overlap is still present, so the compiler should still know that p1
and p2
cannot alias.
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
Solution | Source |
---|