movabsq $cosf, %rax movaps %xmm6, %xmm0 callq *%raxWhereas as some usual non-jitted code calling cos looks like this:
call cosf (0140C1E9F4h)It's unfortunate that the jitted code requires two instructions to do what the normal code can do in one, and that the call is now indirect.
After asking on the LLVM IRC channel, I think I understand why.
The reason for this is that LLVM conservatively (and correctly in my tests) assumes that cosf is > 2^32 bytes away from the location of the call instruction, and so cannot be jumped to directly with the call instruction, as it uses 32 bit relative offsets.
See http://eli.thegreenplace.net/2012/01/03/understanding-the-x64-code-models/ for more information.
This could be improved in LLVM in the following way (I'm sure there are other possibilities as well)
Try and allocate code for the Jitted code to go around the rest of the executable code. This seems to be around 0140C1E9F4h in my tests, which is about 5GB through the address space.
If the jitted code is succesfully allocated near e.g. the cos call, then emit a direct call instruction.
Allocating memory at a specific address from the kernel seems to be possible with mmap (and the Windows equivalent) although I haven't tried it.
Edit, 7th May 2014: On Windows, VirtualAlloc takes as its first parameter the address at which to allocate memory, which indicates that allocating at certain addresses should be possible.