memcpy implementation in c overlap

How to implement overlap-checking memcpy in C?

The only portable way to determine if two memory ranges overlap is:

int overlap_p(void *a, void *b, size_t n)
{
    char *x = a, *y = b;
    size_t i;
    /* == is defined for any two pointers, unlike < and > */
    for (i = 0; i < n; i++) if (x+i == y || y+i == x) return 1;
    return 0;
}

This is because comparing pointers with the relational operators (<, <=, >, >=) is undefined behavior unless both point into the same array or object. In reality, the comparison does work on most real-world implementations, so you could do something like:

int overlap_p(void *a, void *b, size_t n)
{
    char *x = a, *y =  b;
    return (x<=y && x+n>y) || (y<=x && y+n>x);
}

I hope I got that logic right; you should check it. You can simplify it even more if you want to assume you can take differences of arbitrary pointers.
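
As a rough sketch of that simplification, and of how it could back an overlap-checking copy: the version below converts both pointers to uintptr_t instead of subtracting them directly. The names overlap_flat and memcpy_checked are made up for this example, and the whole thing assumes a flat address space where the integer value of a pointer is its address, which the C standard does not promise.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Returns 1 if [a, a+n) and [b, b+n) share at least one byte, else 0.
 * Assumption: converting a pointer to uintptr_t yields a flat address. */
int overlap_flat(const void *a, const void *b, size_t n)
{
    uintptr_t x = (uintptr_t)a, y = (uintptr_t)b;
    uintptr_t d = (x > y) ? x - y : y - x;   /* distance between the starts */
    return d < n;                            /* always 0 when n == 0 */
}

/* Hypothetical wrapper: refuses to copy overlapping ranges.
 * Returns NULL on overlap, dst otherwise. */
void *memcpy_checked(void *dst, const void *src, size_t n)
{
    if (overlap_flat(dst, src, n))
        return NULL;
    return memcpy(dst, src, n);
}
```

Two equal-length ranges overlap exactly when their start addresses are less than n bytes apart, which is why a single distance comparison is enough here.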

Answer from R.. GitHub STOP HELPING ICE on Stack Overflow

2 of 3

What you want to check is the position in memory of the source relative to the destination:

If the source is ahead of the destination (i.e. source < destination), then you should start from the end. If the source is after the destination, you start from the beginning. If they are equal, you don't have to do anything (trivial case).

Here are some crude ASCII drawings to visualize the problem.

|_;_;_;_;_;_|          (source)
      |_;_;_;_;_;_|    (destination)
            >-----^    start from the end to shift the values to the right

      |_;_;_;_;_;_|    (source)
|_;_;_;_;_;_|          (destination)
^-----<                 start from the beginning to shift the values to the left

Following a very accurate comment below, I should add that you can use the difference of the pointers (destination - source), but to be on the safe side cast those pointers to char * beforehand.
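
A minimal sketch of that rule, for illustration only. The name copy_overlap_safe is made up, and the s < d comparison is only strictly defined when both pointers point into the same array, as the first answer notes.

```c
#include <stddef.h>

/* Hypothetical memmove-like copy that picks its direction from the
 * relative position of source and destination. */
void *copy_overlap_safe(void *dst, const void *src, size_t n)
{
    unsigned char *d = dst;
    const unsigned char *s = src;

    if (d == s)
        return dst;                      /* trivial case: nothing to do */

    if (s < d) {
        /* source is ahead of destination: start from the end */
        for (size_t i = n; i > 0; i--)
            d[i - 1] = s[i - 1];
    } else {
        /* source is after destination: start from the beginning */
        for (size_t i = 0; i < n; i++)
            d[i] = s[i];
    }
    return dst;
}
```

Either direction is correct when the ranges do not overlap at all, so no separate overlap test is needed before choosing one.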

In your current setting, I don't think that you can check if the operation will fail. Your memcpy prototype prevents you from doing any form of checking for that, and with the rule given above for deciding how to copy, the operation will succeed (outside of any other considerations, like prior memory corruption or invalid pointers).

Stack Exchange

Memory overlap in C - CS50 Stack Exchange

Top answer

1 of 5

I believe you mean memmove, which takes care of overlapping memory, as opposed to memset. But what is memory overlapping anyway?

Suppose we have an array of 5 chars, where each char is one byte long:

+++++++++++++++++++++++++++++++
| 'a' | 'b' | 'c' | 'd' | 'e' |
+++++++++++++++++++++++++++++++
 0x100 0x101 0x102 0x103 0x104

Now, according to the man page of memcpy, it takes 3 arguments: a pointer to the destination block of memory, a pointer to the source block of memory, and the number of bytes to be copied.

What if the destination is 0x102, the source is 0x100 and the size is 3? Memory overlapping happens here. With a simple front-to-back copy, 0x100 would be copied into 0x102, 0x101 would be copied into 0x103 and 0x102 would be copied into 0x104.

Notice that we first copied into 0x102 and then copied from 0x102, which means the value originally in 0x102 was lost: we overwrote it before we read it. So we would end up with something like

+++++++++++++++++++++++++++++++
| 'a' | 'b' | 'a' | 'b' | 'a' |
+++++++++++++++++++++++++++++++
 0x100 0x101 0x102 0x103 0x104

instead of

+++++++++++++++++++++++++++++++
| 'a' | 'b' | 'a' | 'b' | 'c' |
+++++++++++++++++++++++++++++++
 0x100 0x101 0x102 0x103 0x104
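
The two tables can be reproduced with a small sketch. forward_copy is a made-up helper that copies front to back one byte at a time. It stands in for the simple copy described above and is not how any particular memcpy is implemented (calling the real memcpy on overlapping ranges is undefined behavior, so the result could be anything).

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: plain front-to-back byte copy. */
void *forward_copy(void *dst, const void *src, size_t n)
{
    unsigned char *d = dst;
    const unsigned char *s = src;
    for (size_t i = 0; i < n; i++)
        d[i] = s[i];
    return dst;
}

/* Runs the same overlapping copy twice on 'a','b','c','d','e':
 * once with forward_copy (result in naive), once with memmove
 * (result in moved). The two output arrays must not overlap. */
void overlap_demo(char naive[5], char moved[5])
{
    const char init[5] = {'a', 'b', 'c', 'd', 'e'};
    memcpy(naive, init, 5);
    memcpy(moved, init, 5);
    forward_copy(naive + 2, naive, 3);   /* naive becomes a b a b a */
    memmove(moved + 2, moved, 3);        /* moved becomes a b a b c */
}
```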

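How does a function like memmove take care of memory overlapping? According to its man page, the copy takes place as though the bytes were first copied into a temporary array that does not overlap either block and then copied from that array into the destination, as opposed to a function like memcpy, which copies directly from the source block to the destination block. Real implementations usually avoid the temporary array and just pick a safe copy direction, but the result is the same.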
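
Taken literally, the temporary-array description could look like the sketch below. memmove_via_temp is a made-up name. Unlike the real memmove, this version can fail, because it has to allocate the temporary buffer.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical, literal version of the "temporary array" description.
 * Returns dst on success, NULL if the temporary buffer cannot be allocated. */
void *memmove_via_temp(void *dst, const void *src, size_t n)
{
    if (n == 0)
        return dst;

    unsigned char *tmp = malloc(n);
    if (tmp == NULL)
        return NULL;

    memcpy(tmp, src, n);   /* tmp is a fresh block, so it overlaps neither range */
    memcpy(dst, tmp, n);
    free(tmp);
    return dst;
}
```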

2 of 5

Let's see:

memset: sets a memory segment to a constant value, so, there is no "overlapping" possible here, because there is just a unique, contiguous, memory segment to "set".

memcpy: you are reading from one memory segment and copying it to another memory segment. If the two segments share any bytes, they overlap. Imagine one memory segment starts at address 0x51 and the other starts at address 0x70, and you try to copy 50 bytes from 0x51 to 0x70. After 0x1F (31) bytes, a simple front-to-back copy will start reading from address 0x70, which it has already overwritten, and copying to address 0x8F. This is most likely not what you wanted to do.
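
The arithmetic in that example can be checked with a tiny sketch. bytes_before_self_read is a made-up helper that works on plain integer addresses and assumes a simple front-to-back copy.

```c
#include <stddef.h>

/* Hypothetical helper. For a front-to-back copy of n bytes from address
 * src to a higher address dst, returns how many bytes are copied before
 * the read position reaches dst, i.e. before the copy starts reading
 * bytes it has already written. Returns n if that never happens. */
size_t bytes_before_self_read(unsigned long src, unsigned long dst, size_t n)
{
    if (dst <= src || dst - src >= n)
        return n;
    return (size_t)(dst - src);
}
```

With src = 0x51, dst = 0x70 and n = 50, this gives 0x1F: at that point the copy reads from 0x51 + 0x1F = 0x70 and writes to 0x70 + 0x1F = 0x8F.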

At a lower level, in assembly, you should be able to find several ways of doing this, including MMX, SSE2 and other SIMD instructions. If you download glibc source code (https://www.gnu.org/software/libc/download.html), you will see some implementations done in assembly.

C is a "high-level" language, but it is quite close to assembly: you can get the memory address of variables and even of functions. That makes it quite powerful and lets you do all kinds of things, like reading or writing an array past its "official" end (the OS will typically stop you once you try to access memory outside your process's memory). So yes, memory overlapping is totally possible in C. Something like this would create two potentially overlapping memory "segments" (actually the same segment, which I am manually dividing and assigning to two pointers).

This is a funny-behaving program. It is definitely and intentionally buggy, just to show what kind of odd things can happen if the memory does overlap with memcpy:

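#include <stdio.h>
#include <stdlib.h>
#include <string.h>

int main(void)
{
        char *a, *b;
        a = calloc(100, sizeof(char));   /* zero-filled, so every byte read below is initialized */
        if (a == NULL)
                return 1;
        b = a + 25;                      /* b points into the same block as a */
        strcpy(a, "This is just a test");
        strcpy(b, "And this is another test, longer test string.");
        printf("a: %s\nb: %s\n", a, b);
        printf("Now I am copying b into a, let's see what happens...\n");
        memcpy(a, b, 75);                /* deliberate bug: the ranges overlap (undefined behavior) */
        printf("a: %s\nb: %s\n", a, b);
        free(a);
        return 0;
}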

Save it to a .c file, like test.c, and compile it using gcc, like this:

gcc -O0 -o test test.c

Run it, then compile it again like this:

gcc -O2 -o test test.c

It will (most likely) behave differently.

Try replacing memcpy with strncpy and see what happens.

I hope the example is useful.
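
For comparison, here is a sketch of the same buffer layout with memmove in place of memcpy, which makes the overlapping copy well defined. The function name overlap_fixed is made up for this example, and it uses calloc so that every byte being moved is initialized.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical variant of the test program above, using memmove.
 * Returns a 100-byte heap buffer (caller frees it) or NULL on failure. */
char *overlap_fixed(void)
{
    char *a = calloc(100, sizeof(char));
    if (a == NULL)
        return NULL;

    char *b = a + 25;                    /* same block, 25 bytes in */
    strcpy(a, "This is just a test");
    strcpy(b, "And this is another test, longer test string.");

    memmove(a, b, 75);                   /* overlap is allowed with memmove */
    return a;                            /* a now starts with the second string */
}
```

With memmove the result should not depend on the optimization level, since the behavior is defined by the standard.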

Discussions

c - Meaning of overlapping when using memcpy - Stack Overflow

But it's not typically implemented that way in practice. ... @urvi_189 sounds like copying with a reverse iterator has not yet been invented [case 2 in Jabberwocky's answer]. ... The main difference between memcpy and memmove is that memcpy copies directly between the two blocks, while memmove behaves as if it first took a separate copy of the source. Due to this, overlapping ... More on stackoverflow.com

stackoverflow.com

What is memcpy in c?

I always thought of it as a fairly ... it to the destination. This prevents problems if the source and destination blocks overlap each other. More on experts-exchange.com

experts-exchange.com

September 8, 2022

How does memmove allow the copying to be done in a non-destructive manner unlike memcpy?

Your question is a bit hard to understand. A naive memmove operation could copy the source to some temp buffer and then copy that temp buffer to dest. Of course, you can make it far more optimized, but creating a safe and correct memmove is not that hard. More on reddit.com

r/C_Programming

May 14, 2023

c - memcpy() vs memmove() - Stack Overflow

In general, memcpy is implemented in a simple (but fast) manner. Simplistically, it just loops over the data (in order), copying from one location to the other. This can result in the source being overwritten while it's being read. memmove does more work to ensure it handles the overlap correctly. More on stackoverflow.com

stackoverflow.com
